CN114276460B - Modified GPR75 and use thereof - Google Patents
Modified GPR75 and use thereof Download PDFInfo
- Publication number
- CN114276460B CN114276460B CN202210205737.1A CN202210205737A CN114276460B CN 114276460 B CN114276460 B CN 114276460B CN 202210205737 A CN202210205737 A CN 202210205737A CN 114276460 B CN114276460 B CN 114276460B
- Authority
- CN
- China
- Prior art keywords
- gpr75
- modified
- protein
- amino acid
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 101001069607 Homo sapiens Probable G-protein coupled receptor 75 Proteins 0.000 title claims abstract description 140
- 102100033860 Probable G-protein coupled receptor 75 Human genes 0.000 title claims abstract description 138
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 49
- 230000000694 effects Effects 0.000 claims abstract description 29
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 21
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 20
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 20
- 101001034831 Homo sapiens Interferon-induced transmembrane protein 5 Proteins 0.000 claims abstract description 19
- 102100039731 Interferon-induced transmembrane protein 5 Human genes 0.000 claims abstract description 19
- 238000004458 analytical method Methods 0.000 claims abstract description 15
- 229920001184 polypeptide Polymers 0.000 claims abstract description 10
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 10
- 238000009510 drug design Methods 0.000 claims abstract description 8
- 102100039705 Beta-2 adrenergic receptor Human genes 0.000 claims abstract description 6
- 108010014499 beta-2 Adrenergic Receptors Proteins 0.000 claims abstract description 5
- 150000003384 small molecules Chemical class 0.000 claims abstract description 5
- 238000007877 drug screening Methods 0.000 claims abstract description 4
- 238000012216 screening Methods 0.000 claims abstract description 4
- 210000004899 c-terminal region Anatomy 0.000 claims abstract description 3
- 210000004027 cell Anatomy 0.000 claims description 44
- 230000004927 fusion Effects 0.000 claims description 19
- 108091033319 polynucleotide Proteins 0.000 claims description 19
- 102000040430 polynucleotide Human genes 0.000 claims description 19
- 239000002157 polynucleotide Substances 0.000 claims description 18
- 239000013604 expression vector Substances 0.000 claims description 8
- 238000012916 structural analysis Methods 0.000 claims description 8
- 239000002773 nucleotide Substances 0.000 claims description 5
- 125000003729 nucleotide group Chemical group 0.000 claims description 5
- 201000010099 disease Diseases 0.000 claims description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 2
- 238000002372 labelling Methods 0.000 claims description 2
- 238000002405 diagnostic procedure Methods 0.000 claims 1
- 102000034285 signal transducing proteins Human genes 0.000 claims 1
- 108091006024 signal transducing proteins Proteins 0.000 claims 1
- 150000007523 nucleic acids Chemical class 0.000 abstract description 10
- 102000039446 nucleic acids Human genes 0.000 abstract description 9
- 108020004707 nucleic acids Proteins 0.000 abstract description 9
- 230000012743 protein tagging Effects 0.000 abstract 1
- 108090000623 proteins and genes Proteins 0.000 description 58
- 102000004169 proteins and genes Human genes 0.000 description 52
- 239000003446 ligand Substances 0.000 description 30
- 229940024606 amino acid Drugs 0.000 description 27
- 102000005962 receptors Human genes 0.000 description 25
- 108020003175 receptors Proteins 0.000 description 25
- 150000001413 amino acids Chemical class 0.000 description 24
- 102000003688 G-Protein-Coupled Receptors Human genes 0.000 description 21
- 108090000045 G-Protein-Coupled Receptors Proteins 0.000 description 21
- 230000014509 gene expression Effects 0.000 description 21
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 18
- 108010076504 Protein Sorting Signals Proteins 0.000 description 16
- 239000000872 buffer Substances 0.000 description 12
- 238000003776 cleavage reaction Methods 0.000 description 12
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 12
- 230000007017 scission Effects 0.000 description 12
- 238000012360 testing method Methods 0.000 description 12
- 108091005804 Peptidases Proteins 0.000 description 11
- 239000004365 Protease Substances 0.000 description 11
- 230000006870 function Effects 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 10
- 238000006467 substitution reaction Methods 0.000 description 10
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 9
- 239000007983 Tris buffer Substances 0.000 description 9
- 239000011780 sodium chloride Substances 0.000 description 9
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 9
- 241000701447 unidentified baculovirus Species 0.000 description 9
- 108091006068 Gq proteins Proteins 0.000 description 8
- 102000052606 Gq-G11 GTP-Binding Protein alpha Subunits Human genes 0.000 description 8
- 239000000556 agonist Substances 0.000 description 8
- 238000000746 purification Methods 0.000 description 8
- NNDIXBJHNLFJJP-UHFFFAOYSA-N 20-Hydroxyeicosatetraenoic acid Chemical compound OCCCCCC=CCC=CCC=CCC=CCCCC(O)=O NNDIXBJHNLFJJP-UHFFFAOYSA-N 0.000 description 7
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 7
- 239000002609 medium Substances 0.000 description 7
- 238000000034 method Methods 0.000 description 7
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- NRHMKIHPTBHXPF-TUJRSCDTSA-M sodium cholate Chemical compound [Na+].C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC([O-])=O)C)[C@@]2(C)[C@@H](O)C1 NRHMKIHPTBHXPF-TUJRSCDTSA-M 0.000 description 7
- 241000238631 Hexapoda Species 0.000 description 6
- 208000008589 Obesity Diseases 0.000 description 6
- 238000010521 absorption reaction Methods 0.000 description 6
- 125000000539 amino acid group Chemical group 0.000 description 6
- 210000000170 cell membrane Anatomy 0.000 description 6
- 238000009396 hybridization Methods 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 235000020824 obesity Nutrition 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 5
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 5
- 239000013078 crystal Substances 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 238000001493 electron microscopy Methods 0.000 description 5
- 230000003834 intracellular effect Effects 0.000 description 5
- 238000011160 research Methods 0.000 description 5
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 5
- 238000001262 western blot Methods 0.000 description 5
- 108020004414 DNA Proteins 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 239000005557 antagonist Substances 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 238000001641 gel filtration chromatography Methods 0.000 description 4
- 238000005457 optimization Methods 0.000 description 4
- 239000002245 particle Substances 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 108090000250 sortase A Proteins 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 108010011170 Ala-Trp-Arg-His-Pro-Gln-Phe-Gly-Gly Proteins 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 description 3
- 238000001042 affinity chromatography Methods 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 102000014974 beta2-adrenergic receptor activity proteins Human genes 0.000 description 3
- 108040006828 beta2-adrenergic receptor activity proteins Proteins 0.000 description 3
- 239000007853 buffer solution Substances 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 238000009509 drug development Methods 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 230000007062 hydrolysis Effects 0.000 description 3
- 238000006460 hydrolysis reaction Methods 0.000 description 3
- 239000003112 inhibitor Substances 0.000 description 3
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 239000002547 new drug Substances 0.000 description 3
- 239000004031 partial agonist Substances 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000000717 retained effect Effects 0.000 description 3
- 229940126586 small molecule drug Drugs 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- NCYCYZXNIZJOKI-IOUUIBBYSA-N 11-cis-retinal Chemical compound O=C/C=C(\C)/C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C NCYCYZXNIZJOKI-IOUUIBBYSA-N 0.000 description 2
- 241001112741 Bacillaceae Species 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 102100032367 C-C motif chemokine 5 Human genes 0.000 description 2
- 108010055166 Chemokine CCL5 Proteins 0.000 description 2
- 102000001327 Chemokine CCL5 Human genes 0.000 description 2
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 2
- 241000588921 Enterobacteriaceae Species 0.000 description 2
- 108091006027 G proteins Proteins 0.000 description 2
- 102000027587 GPCRs class F Human genes 0.000 description 2
- 108091008884 GPCRs class F Proteins 0.000 description 2
- 101150014295 GPR75 gene Proteins 0.000 description 2
- 102000030782 GTP binding Human genes 0.000 description 2
- 108091000058 GTP-Binding Proteins 0.000 description 2
- 229930182566 Gentamicin Natural products 0.000 description 2
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 241000606768 Haemophilus influenzae Species 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- NTYJJOPFIAHURM-UHFFFAOYSA-N Histamine Chemical compound NCCC1=CN=CN1 NTYJJOPFIAHURM-UHFFFAOYSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101000797762 Homo sapiens C-C motif chemokine 5 Proteins 0.000 description 2
- 101100449398 Homo sapiens GPR75 gene Proteins 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 2
- 102000007474 Multiprotein Complexes Human genes 0.000 description 2
- 108010085220 Multiprotein Complexes Proteins 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- 102100040756 Rhodopsin Human genes 0.000 description 2
- 108090000820 Rhodopsin Proteins 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- 108010086019 Secretin Proteins 0.000 description 2
- 102100037505 Secretin Human genes 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- 241000194017 Streptococcus Species 0.000 description 2
- 239000004098 Tetracycline Substances 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- VYFYYTLLBUKUHU-UHFFFAOYSA-N dopamine Chemical compound NCCC1=CC=C(O)C(O)=C1 VYFYYTLLBUKUHU-UHFFFAOYSA-N 0.000 description 2
- 230000002124 endocrine Effects 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 238000001215 fluorescent labelling Methods 0.000 description 2
- 230000008014 freezing Effects 0.000 description 2
- 238000007710 freezing Methods 0.000 description 2
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 229960002518 gentamicin Drugs 0.000 description 2
- 229930195712 glutamate Natural products 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 229940047650 haemophilus influenzae Drugs 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- 230000003301 hydrolyzing effect Effects 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 238000000386 microscopy Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 229960002101 secretin Drugs 0.000 description 2
- OWMZNFCDEHGFEP-NFBCVYDUSA-N secretin human Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(N)=O)[C@@H](C)O)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)C1=CC=CC=C1 OWMZNFCDEHGFEP-NFBCVYDUSA-N 0.000 description 2
- 239000013049 sediment Substances 0.000 description 2
- 238000012772 sequence design Methods 0.000 description 2
- QZAYGJVTTNCVMB-UHFFFAOYSA-N serotonin Chemical compound C1=C(O)C=C2C(CCN)=CNC2=C1 QZAYGJVTTNCVMB-UHFFFAOYSA-N 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 229960002180 tetracycline Drugs 0.000 description 2
- 229930101283 tetracycline Natural products 0.000 description 2
- 235000019364 tetracycline Nutrition 0.000 description 2
- 150000003522 tetracyclines Chemical class 0.000 description 2
- OGNSCSPNOLGXSM-UHFFFAOYSA-N (+/-)-DABA Natural products NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 1
- WWUZIQQURGPMPG-UHFFFAOYSA-N (-)-D-erythro-Sphingosine Natural products CCCCCCCCCCCCCC=CC(O)C(N)CO WWUZIQQURGPMPG-UHFFFAOYSA-N 0.000 description 1
- SFLSHLFXELFNJZ-QMMMGPOBSA-N (-)-norepinephrine Chemical compound NC[C@H](O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-QMMMGPOBSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- MMWCIQZXVOZEGG-UHFFFAOYSA-N 1,4,5-IP3 Natural products OC1C(O)C(OP(O)(O)=O)C(OP(O)(O)=O)C(O)C1OP(O)(O)=O MMWCIQZXVOZEGG-UHFFFAOYSA-N 0.000 description 1
- MEDBIJOVZJEMBI-YOEHRIQHSA-N 1-[(2S)-2-(dimethylamino)-3-(4-hydroxyphenyl)propyl]-3-[(2S)-1-thiophen-3-ylpropan-2-yl]urea Chemical compound C[C@@H](CC1=CSC=C1)NC(=O)NC[C@H](CC1=CC=C(O)C=C1)N(C)C MEDBIJOVZJEMBI-YOEHRIQHSA-N 0.000 description 1
- WRGQSWVCFNIUNZ-GDCKJWNLSA-N 1-oleoyl-sn-glycerol 3-phosphate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](O)COP(O)(O)=O WRGQSWVCFNIUNZ-GDCKJWNLSA-N 0.000 description 1
- 101150090986 1.5 gene Proteins 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 102100026041 Acrosin Human genes 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 102000003916 Arrestin Human genes 0.000 description 1
- 108090000328 Arrestin Proteins 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- 102000017917 Atypical chemokine receptor Human genes 0.000 description 1
- 108060003357 Atypical chemokine receptor Proteins 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 101710152983 Beta-2 adrenergic receptor Proteins 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- BGIRVSMUAJMGOK-FXQIFTODSA-N Cys-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N BGIRVSMUAJMGOK-FXQIFTODSA-N 0.000 description 1
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 1
- JTEGHEWKBCTIAL-IXOXFDKPSA-N Cys-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N)O JTEGHEWKBCTIAL-IXOXFDKPSA-N 0.000 description 1
- MMWCIQZXVOZEGG-XJTPDSDZSA-N D-myo-Inositol 1,4,5-trisphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@@H](O)[C@@H]1OP(O)(O)=O MMWCIQZXVOZEGG-XJTPDSDZSA-N 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 102000027582 GPCRs class B Human genes 0.000 description 1
- 108091008883 GPCRs class B Proteins 0.000 description 1
- 240000004859 Gamochaeta purpurea Species 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- JNEITCMDYWKPIW-GUBZILKMSA-N Gln-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JNEITCMDYWKPIW-GUBZILKMSA-N 0.000 description 1
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 1
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- 101800000224 Glucagon-like peptide 1 Proteins 0.000 description 1
- 102400000322 Glucagon-like peptide 1 Human genes 0.000 description 1
- DTHNMHAUYICORS-KTKZVXAJSA-N Glucagon-like peptide 1 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1N=CNC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 DTHNMHAUYICORS-KTKZVXAJSA-N 0.000 description 1
- 102000018899 Glutamate Receptors Human genes 0.000 description 1
- 108010027915 Glutamate Receptors Proteins 0.000 description 1
- 102000005720 Glutathione transferase Human genes 0.000 description 1
- 108010070675 Glutathione transferase Proteins 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 1
- QTMKFZAYZKBFRC-BZSNNMDCSA-N His-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O QTMKFZAYZKBFRC-BZSNNMDCSA-N 0.000 description 1
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- 101000959437 Homo sapiens Beta-2 adrenergic receptor Proteins 0.000 description 1
- 101000976075 Homo sapiens Insulin Proteins 0.000 description 1
- 101001002657 Homo sapiens Interleukin-2 Proteins 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- MTONDYJJCIBZTK-PEDHHIEDSA-N Ile-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)O)N MTONDYJJCIBZTK-PEDHHIEDSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 102000004195 Isomerases Human genes 0.000 description 1
- 108090000769 Isomerases Proteins 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 239000012880 LB liquid culture medium Substances 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- BCUVPZLLSRMPJL-XIRDDKMYSA-N Leu-Trp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N BCUVPZLLSRMPJL-XIRDDKMYSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- KSFQPRLZAUXXPT-GARJFASQSA-N Lys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)C(=O)O KSFQPRLZAUXXPT-GARJFASQSA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- LQTGGXSOMDSWTQ-UNQGMJICSA-N Met-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCSC)N)O LQTGGXSOMDSWTQ-UNQGMJICSA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 102400001103 Neurotensin Human genes 0.000 description 1
- 101800001814 Neurotensin Proteins 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 102000012547 Olfactory receptors Human genes 0.000 description 1
- 108050002069 Olfactory receptors Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 108010019160 Pancreatin Proteins 0.000 description 1
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 1
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- QRUOLOPKCOEZKU-HJWJTTGWSA-N Phe-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N QRUOLOPKCOEZKU-HJWJTTGWSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 1
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- 101710186274 Soluble cytochrome b562 Proteins 0.000 description 1
- 108010056088 Somatostatin Proteins 0.000 description 1
- 102000005157 Somatostatin Human genes 0.000 description 1
- 241000519995 Stachys sylvatica Species 0.000 description 1
- 239000012505 Superdex™ Substances 0.000 description 1
- 108010076818 TEV protease Proteins 0.000 description 1
- 102000003141 Tachykinin Human genes 0.000 description 1
- 102000002933 Thioredoxin Human genes 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- UGFSAPWZBROURT-IXOXFDKPSA-N Thr-Phe-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N)O UGFSAPWZBROURT-IXOXFDKPSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- 101150003530 Tsnax gene Proteins 0.000 description 1
- 102000014384 Type C Phospholipases Human genes 0.000 description 1
- 108010079194 Type C Phospholipases Proteins 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- MMWCIQZXVOZEGG-HOZKJCLWSA-N [(1S,2R,3S,4S,5R,6S)-2,3,5-trihydroxy-4,6-diphosphonooxycyclohexyl] dihydrogen phosphate Chemical compound O[C@H]1[C@@H](O)[C@H](OP(O)(O)=O)[C@@H](OP(O)(O)=O)[C@H](O)[C@H]1OP(O)(O)=O MMWCIQZXVOZEGG-HOZKJCLWSA-N 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000001800 adrenalinergic effect Effects 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- AWUCVROLDVIAJX-UHFFFAOYSA-N alpha-glycerophosphate Natural products OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 229940009098 aspartate Drugs 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 229940121657 clinical drug Drugs 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000037011 constitutive activity Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000002050 diffraction method Methods 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 229960003638 dopamine Drugs 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 239000000890 drug combination Substances 0.000 description 1
- 150000002066 eicosanoids Chemical class 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 210000001723 extracellular space Anatomy 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 239000012520 frozen sample Substances 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 1
- 230000014101 glucose homeostasis Effects 0.000 description 1
- 230000002641 glycemic effect Effects 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 238000013238 high-fat diet model Methods 0.000 description 1
- 229960001340 histamine Drugs 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 102000057802 human GPR75 Human genes 0.000 description 1
- 102000055277 human IL2 Human genes 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 229940042040 innovative drug Drugs 0.000 description 1
- PBGKTOXHQIOBKM-FHFVDXKLSA-N insulin (human) Chemical group C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H]1CSSC[C@H]2C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3C=CC(O)=CC=3)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3NC=NC=3)NC(=O)[C@H](CO)NC(=O)CNC1=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O)=O)CSSC[C@@H](C(N2)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](NC(=O)CN)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C=CC=CC=1)C(C)C)C1=CN=CN1 PBGKTOXHQIOBKM-FHFVDXKLSA-N 0.000 description 1
- 230000003914 insulin secretion Effects 0.000 description 1
- 210000003093 intracellular space Anatomy 0.000 description 1
- 229940125425 inverse agonist Drugs 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 102000006240 membrane receptors Human genes 0.000 description 1
- 108020004084 membrane receptors Proteins 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 102000051367 mu Opioid Receptors Human genes 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- PCJGZPGTCUMMOT-ISULXFBGSA-N neurotensin Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 PCJGZPGTCUMMOT-ISULXFBGSA-N 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- SFLSHLFXELFNJZ-UHFFFAOYSA-N norepinephrine Natural products NCC(O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-UHFFFAOYSA-N 0.000 description 1
- 229960002748 norepinephrine Drugs 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 235000019645 odor Nutrition 0.000 description 1
- 239000000014 opioid analgesic Substances 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 229940055695 pancreatin Drugs 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 239000003488 releasing hormone Substances 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000009738 saturating Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 229940076279 serotonin Drugs 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 238000007873 sieving Methods 0.000 description 1
- 230000007781 signaling event Effects 0.000 description 1
- -1 small molecule compounds Chemical class 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- NHXLMOGPVYXJNR-ATOGVRKGSA-N somatostatin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CSSC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1)[C@@H](C)O)NC(=O)CNC(=O)[C@H](C)N)C(O)=O)=O)[C@H](O)C)C1=CC=CC=C1 NHXLMOGPVYXJNR-ATOGVRKGSA-N 0.000 description 1
- 229960000553 somatostatin Drugs 0.000 description 1
- WWUZIQQURGPMPG-KRWOKUGFSA-N sphingosine Chemical compound CCCCCCCCCCCCC\C=C\[C@@H](O)[C@@H](N)CO WWUZIQQURGPMPG-KRWOKUGFSA-N 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 108060008037 tachykinin Proteins 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 102000027257 transmembrane receptors Human genes 0.000 description 1
- 108091008578 transmembrane receptors Proteins 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 108020001612 μ-opioid receptors Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/72—Receptors; Cell surface antigens; Cell surface determinants for hormones
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Zoology (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Wood Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- Biotechnology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Microbiology (AREA)
- Endocrinology (AREA)
- Cell Biology (AREA)
- Immunology (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The invention discloses a modified GPR75 and application thereof. The present invention provides a modified GPR75 comprising: a first domain comprising an amino acid sequence derived from a β 2 adrenergic receptor; and a second domain that is a domain in which the random sequence between the fifth and sixth transmembrane helices and the N-terminal, C-terminal random sequence are deleted in wild-type GPR75 and linked between the fifth and sixth transmembrane helices by an amino acid sequence derived from a BRIL fusion protein. The modified GPR75 provided by the invention can be used for GPR75 structure analysis, fluorescent molecular marking, phosphorylated polypeptide or signal protein fusion, GPR75 activity analysis, nucleic acid coding small molecule library screening, computer-aided drug design and drug screening.
Description
Technical Field
The invention belongs to the technical field of biology, and particularly relates to a modified GPR75 and application thereof.
Background
G protein-coupled receptors are the largest class of cell membrane receptors in the human body. The completion of the human genome project provides a basis for analyzing the distribution, sequence and function of the family members1. The human body has over 800G protein-coupled receptor members, of which there are about 370 non-olfactory G protein-coupled receptors and over 400 olfactory receptors. These G protein-coupled receptors are classified into six subfamilies, the rhodopsin family (rhodopsin family), the adhesion family (secretin family), and the secretin receptor family (secretin fa)mill), glutamate receptor family (glutamate family), Frizzled family (Frizzled family), and taste family (Tasted family). The different G protein-coupled receptors are involved in mediating a series of important biological functions of organisms, ranging from chemical perception recognition (vision, smell, taste) to endocrine molecule-related regulation2. The functional importance stems from the fact that receptors of this family are capable of recognizing a wide variety of ligands, common ligands including monoamines (dopamine, norepinephrine, serotonin, histamine), amino acid transmitters (glutamate, γ -aminobutyric acid), polypeptides (tachykinins, neurotensin, somatostatin, pancreatin, glucagon-like peptide-1, endocrine releasing factor), lipid derivatives (lysophosphatidic acid, sphingosine phosphate, eicosanoids), odors, and the like. Statistics indicate that about 35% of the total number of FDA-approved clinical drugs on the market target about 135 different G protein-coupled receptors3. In ongoing clinical studies, more than 20% of the tested drugs target G protein-coupled receptors4. Nevertheless, about 50% of non-olfactory G protein-coupled receptors may be potential disease treatment targets, and there is no relevant new drug clinical test yet, which is yet to be further developed and researched. Therefore, the value and potential of the G protein coupled receptor family members in the field of new drug development can be seen.
GPR75 (G protein-coupled receptor 75) belongs to a member of the G protein-coupled receptor family, and its endogenous agonist ligand includes the metabolite 20-HETE5And the chemokine CCL5/RANTES6. GPR75 has an expression profile in a large number of cell types, wherein GPR75 expressed in islets, through activation of CCL5, regulates insulin release and is involved in regulating glucose homeostasis in humans6. In addition, big data research published in the journal of science indicates that GPR75 gene is involved in regulating mouse obesity and is a clinically important obesity treatment target7. The human GPR75 gene contains 540 amino acids and has the typical 7-transmembrane characteristic of a G protein-coupled receptor family member8The C-terminus of the receptor has a random coil sequence of about 140 amino acids in length. Due to its comparison withLong random coil, and lack of aspartate/arginine/tyrosine (DRY) motifs, are classified as Atypical Chemokine Receptors (ACRs)9. Activation of the GPR75 receptor on the cell membrane by CCL5 results in intracellular phospholipase C mediated IP3 (inositol triphosphate) and Ca2+Up regulation of concentration10。
In 2017, the Nobel chemical prize awards three scientists who make remarkable contribution in the development process of the cryoelectron microscope technology11This marks the introduction of the field of structural biology into a new era12. In the same year, cryoelectron microscopy was applied to the structural analysis of receptor-G protein complexes. The subsequent 4 years, about 45 independent receptor-G protein complexes of high resolution structure were resolved13。
Structure-based drug design is an innovation over traditional new drug development protocols14, 15. The field of G protein-coupled receptor structure research has changed from the everlasting to the ground over the last 20 years. In 2000, the structure of the first high-resolution G protein-coupled receptor-rhodopsin was analyzed, which lays the foundation for the structural and functional research in this field16. In 2007, professor Brian Kobilka and its collaborators used antibodies to stabilize receptor conformation or fused T4 lysozyme17, 18The high-resolution crystal structure of the beta 2 adrenergic receptor was successfully resolved. In 2011, the crystal structure of the ternary complex of agonist-beta 2 adrenergic receptor-Gs protein is further analyzed by professor of Brian Kobilka19He therefore also shared the nobel prize of chemistry 2012 with the his mentor Robert j. Lefkowitz. In the era of traditional protein crystallography, about 50 independent G protein-coupled receptors were resolved in crystal structure. The structural information with high resolution lays a foundation for the structure-based drug design. One of the most notable examples is that professor Brian Kobilka and its co-workers use the high resolution crystal structure of mu opioid receptor to molecularly dock over 300 ten thousand small molecule compounds, ultimately resulting in opioid analgesic PZM21 with lower side effects20。
In the era of AI enabling innovative drug development, how to utilize sequence information of proteins to perform structure analysis work and design and screen active small molecule drugs based on computer-aided drug design (CADD) or molecular structure-based drug design (SBDD) thinking is urgently needed.
In order to develop targeted drug molecules against the GPR75 receptor, we needed to perform structural analysis work on the GPR75 receptor. In recent years, a mainstream structural research means in the field is to analyze the activated state structure of a receptor by forming a ternary complex from an agonist-G protein coupled receptor-G protein. The GPR75 receptor has been reported to play an important role in controlling obesity in humans, with human GPR75 receptor truncations showing a lower proportion of obesity. Knockout of GPR75 in mice can significantly inhibit obesity and enhance glycemic control in high fat diet model mice7. Inhibition of GPR75 receptor activity has been suggested as a strategy for clinical management of obesity. To be able to target the GPR75 receptor and develop a molecule of clinical value, it is necessary to use inhibitors. Therefore, it is very challenging to obtain a receptor structure in an inactivated state and develop a specific receptor activity inhibitor against it.
Disclosure of Invention
Problems to be solved by the invention
Since the clinically valuable molecule of the GPR75 receptor requires inhibitors and currently lacks its inactive receptor structure, the present invention provides a modification of GPR 75.
Means for solving the problems
A first aspect of the invention provides a modified GPR75 comprising:
a first domain comprising an amino acid sequence derived from a β 2 adrenergic receptor; and (c) and (d),
a second domain that is a domain in which the random sequence between the fifth and sixth transmembrane helices and the N-terminal, C-terminal random sequence are deleted in wild-type GPR75 and linked between the fifth and sixth transmembrane helices by an amino acid sequence derived from a BRIL fusion protein.
In some embodiments of the invention, the first domain comprises the amino acid sequence shown as SEQ ID NO. 4 or an amino acid sequence having at least 80% homology to the amino acid sequence shown as SEQ ID NO. 4.
In some embodiments of the invention, the second domain comprises the amino acid sequence shown as SEQ ID NO. 3 or an amino acid sequence having at least 80% homology to the amino acid sequence shown as SEQ ID NO. 3.
In some embodiments of the invention, the modified GPR75 comprises one or more of the following sequences:
(i) an amino acid sequence as shown in SEQ ID NO. 5;
(ii) an amino acid sequence having at least 80%, 82%, 85%, 87%, 90%, 92%, 95%, 96%, 97%, 98% or 99% identity to the amino acid sequence shown in SEQ ID NO. 5 and which retains the function of a binding specific ligand of the amino acid sequence shown in SEQ ID NO. 5;
(iii) an amino acid sequence in which 1 or more amino acid residues are added, substituted, deleted or inserted in the amino acid sequence shown in SEQ ID NO. 5 and which retains the function of a binding-specific ligand of the amino acid sequence shown in SEQ ID NO. 5; alternatively, the first and second electrodes may be,
(iv) an amino acid sequence encoded by a nucleotide sequence that hybridizes under stringent conditions to a polynucleotide sequence encoding the amino acid sequence set forth in SEQ ID NO. 5 and which retains the function of binding to a specific ligand of the amino acid sequence set forth in SEQ ID NO. 5, said stringent conditions being medium stringency conditions, medium-high stringency conditions, high stringency conditions or very high stringency conditions.
In some embodiments of the invention, the modified GPR75 further comprises a tag, a protease cleavage site, a signal peptide, a peptide linker, or any combination thereof.
In some embodiments of the invention, the modified GPR75 comprises a tag at its N-terminus and/or C-terminus.
In some embodiments of the invention, the modified GPR75 comprises a signal peptide at its N-terminus.
In some embodiments of the invention, the protease cleavage site is located between two adjacent elements; the element is selected from the group consisting of a first domain, a second domain, a tag, a signal peptide, and a peptide linker.
In some embodiments of the invention, the modified GPR75 comprises one or more of the following sequences:
(i) an amino acid sequence as shown in SEQ ID NO. 13;
(ii) an amino acid sequence having at least 80%, 82%, 85%, 87%, 90%, 92%, 95%, 96%, 97%, 98% or 99% identity to the amino acid sequence shown in SEQ ID NO. 13 and which retains the function of a binding specific ligand of the amino acid sequence shown in SEQ ID NO. 13;
(iii) an amino acid sequence in which 1 or more amino acid residues are added, substituted, deleted or inserted in the amino acid sequence shown in SEQ ID NO. 13 and which retains the function of a binding-specific ligand of the amino acid sequence shown in SEQ ID NO. 13; alternatively, the first and second electrodes may be,
(iv) an amino acid sequence encoded by a nucleotide sequence that hybridizes under stringent conditions to a polynucleotide sequence encoding the amino acid sequence set forth in SEQ ID NO. 13 and which retains the function of binding to a specific ligand of the amino acid sequence set forth in SEQ ID NO. 13, said stringent conditions being medium stringency conditions, medium-high stringency conditions, high stringency conditions or very high stringency conditions.
In a second aspect the invention provides a polynucleotide encoding a modified GPR75 according to the first aspect of the invention.
In a third aspect, the present invention provides an expression vector comprising a polynucleotide according to the second aspect of the invention.
In a fourth aspect, the present invention provides a host cell comprising an expression vector according to the third aspect of the invention.
A fifth aspect of the invention provides the use of a modified GPR75 according to the first aspect of the invention, a polynucleotide according to the second aspect of the invention, an expression vector according to the third aspect of the invention or a host cell according to the fourth aspect of the invention for use in GPR75 structural analysis, fluorescent molecular labeling, fusion of phosphorylated polypeptide or signal protein, GPR75 activity analysis, nucleic acid encoding small molecule library screening, computer-assisted drug design and drug screening.
ADVANTAGEOUS EFFECTS OF INVENTION
According to the invention, ideas such as molecular structure prediction, molecular design and the like are innovatively combined, a protein sequence which is originally not suitable for the non-activated state structure research is cut off and modified by fusion protein, and a 24-amino-acid beta 2 adrenergic receptor sequence is fused at the N end, so that the stability and the expression quantity of the receptor are improved. In addition, the design of a Flag tag sequence, a Strep tag sequence and a 6 XHis tag sequence is respectively carried out at the N end and the C end of the receptor, so that the receptor expressed by heterologous expression can be conveniently purified. Finally, the invention carries out multi-enzyme cleavage site (3C/TEV) design and Sortase A fusion sequence design on the protein sequence, can conveniently carry out reverse purification, matrix fixation, site-specific fluorescence labeling and the like on the purified protein according to different schemes in the later period as required, and the modified sequence can be conveniently applied to a nucleic acid-coded small molecule drug library, a drug combination experiment and the like. In addition, compared with wild GPR75, the modified GPR75 provided by the invention has better stability and higher expression level.
Drawings
FIGS. 1A and 1B show the prediction model of GPR75 protein structure. FIG. 1A shows the AlphaFold2 and RosettAFold prediction model of full-length GPR75, and FIG. 1B shows the predicted transmembrane region of GPR 75. It can be seen from FIG. 1A that GPR75 has a long random coil structure and is not suitable for direct structural biology studies.
FIG. 2 shows the GPR75 protein engineering concept. First, we used the prediction of AlphaFold2 to truncate the GPR75 transmembrane region and insert the BRIL fusion protein at a third intracellular helical position. Subsequently, we performed stepwise optimization of the fusion sites and selected the best fusion site as the final version of the fusion protein.
FIG. 3 shows the optimization of the fusion site of GPR75 protein.
FIG. 4 is a graph showing the difference in the effect of the optimization of the fusion site of GPR75 protein.
FIG. 5 shows the difference in the expression levels of wild-type GPR75 and modified GPR75 as detected using a western blot. The results of Western blot of GPR75 wild type and modified GPR75 are shown in the figure, lane 1 is GPR75 wild type, lane 2 is GPR75 modified GPR, and the expression level of the modified GPR75 is obviously improved according to the results of Western blot. The modified GPR75 has higher protein expression level. Wild type GPR75 was expressed in very low amounts and could hardly be detected under the same expression conditions.
FIG. 6 is a gel filtration chromatography UV-280 absorption peak alignment of modified GPR75 fused to BRIL and truncated GPR75 (no BRIL fusion and no deletion of the N-terminal random sequence of wild type GPR75) in an example of the invention. In the figure, the solid UV-280 absorption peak is the BRIL-fused, modified GPR75, and the dashed UV-280 absorption peak is truncated GPR 75.
FIGS. 7A and 7B are SDS-PAGE gels (FIG. 7A) and photographs of frozen data (FIG. 7B) of modified GPR75 in accordance with an embodiment of the present invention. The purified modified GPR75 has high purity, and the data particles of the frozen sample have good dispersity.
FIGS. 8A and 8B are background activity assays for modified GPR75 in test examples of the present invention. The ligand-binding-free modified GPR75 was able to accelerate GTP hydrolysis activity of Gq protein, while the ligand 20-HETE showed an effect of inhibiting 75 protein activity (fig. 8A). In the experiment, the IC of 20-HETE was measured50Approximately 2nM (FIG. 8B).
FIGS. 9A-9C illustrate the preparation of complexes of modified GPR75 and fragment of anti-BRIL fab in test examples of the present invention and the analysis of the freezing data. The effect of co-migration of engineered GPR75 with anti-BRIL fab fragments on molecular sieves (FIG. 9A), and SDS-PAGE further demonstrated their ability to form stable complexes (FIG. 9B). Cryo-electron microscopy two-dimensional classification of single particles revealed features of receptor and complex with fab fragment against BRIL (fig. 9C).
Detailed Description
In order that the invention may be more readily understood, certain technical and scientific terms are specifically defined below. Unless otherwise defined herein, all other technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
In the present specification, the numerical range represented by "a numerical value to B numerical value" means a range including the end point numerical value A, B.
In the present specification, the term "substantially" or "substantially" means that the standard deviation from the theoretical model or theoretical data is within 5%, preferably 3%, and more preferably 1%.
In the present specification, the meaning of "may" includes both the meaning of performing a certain process and the meaning of not performing a certain process.
In this specification, "optional" or "optionally" means that the subsequently described event or circumstance may or may not occur, and that the description includes instances where the event occurs and instances where it does not.
Reference in the specification to "some specific/preferred embodiments," "other specific/preferred embodiments," "embodiments," and so forth, means that a particular element (e.g., feature, structure, property, and/or characteristic) described in connection with the embodiment is included in at least one embodiment described herein, and may or may not be present in other embodiments. In addition, it is to be understood that the described elements may be combined in any suitable manner in the various embodiments.
According to the present invention, the terms "polypeptide", "protein", "peptide" are used interchangeably herein to refer to a polymeric form of amino acids of any length, and may include coded and non-coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having a similar peptide backbone.
According to the present invention, the terms "nucleic acid molecule", "polynucleotide", "polynucleic acid", "nucleic acid" are used interchangeably and refer to a polymeric form of nucleotides of any length, whether deoxyribonucleotides or ribonucleotides, or analogues thereof. The polynucleotide may have any three-dimensional structure and may perform any known or unknown function. Non-limiting examples of polynucleotides include genes, gene fragments, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, control regions, isolated RNA of any sequence, nucleic acid probes, and primers. The nucleic acid molecule may be linear or circular.
According to the present invention, the term "G protein-coupled receptor" or "GPCR" or "GPR" refers to a transmembrane receptor capable of transmitting a signal from outside the cell to inside the cell via the G protein pathway and/or the arrestin pathway. Hundreds of such receptors are known in the art; see, e.g., Fredriksson et al, mol. pharmacol.63: 1256-: 4903 and 4908(2003), each of which is incorporated herein by reference. G protein-coupled receptors are polypeptides sharing a common structural motif, with 7 regions between 22 to 24 hydrophobic amino acids that form 7 alpha helices, each spanning the cell membrane. Each span, i.e., transmembrane-1 (TM1), transmembrane-2 (TM2), etc., is identified by number and may also be referred to as a first transmembrane helix, a second transmembrane helix, etc., in the present invention. The transmembrane helices are also connected by regions of amino acids between transmembrane-2 and transmembrane-3, transmembrane-4 and transmembrane-5, and transmembrane-6 and transmembrane-7, outside or "extracellular" side of the cell membrane, which regions are referred to as "extracellular" regions 1, 2 and 3, respectively (EC1, EC2 and EC 3). The transmembrane helices are also connected by regions of amino acids between transmembrane-1 and transmembrane-2, transmembrane-3 and transmembrane-4, and transmembrane-5 and transmembrane-6, inside or on the "intracellular" side of the cell membrane, which regions are referred to as "intracellular" regions 1, 2 and 3, respectively (IC1, IC2 and IC 3). The "carboxy" ("C") terminus of the receptor is located in the intracellular space within the cell, and the "amino" ("N") terminus of the receptor is located in the extracellular space outside the cell. Any of the above regions can be readily identified by analysis of the primary amino acid sequence of the GPCR.
According to the present invention, the term "ligand" or "receptor ligand" means a molecule that specifically binds to a GPCR either intracellularly or extracellularly. The ligand may be, without limiting purpose, a protein, a (poly) peptide, a lipid, a small molecule, a protein scaffold, an antibody fragment, a nucleic acid, a carbohydrate. The ligands may be synthetic or naturally occurring. The term "ligand" includes "natural ligands," which are endogenous, natural ligands of a natural GPCR. In most cases, the ligand is a "modulator" that increases or decreases the intracellular response when contacted with (e.g., binds to) a GPCR expressed by the cell. Examples of ligands as modulators include agonists, partial agonists, inverse agonists, and antagonists. Wherein "agonist" refers to a ligand that increases the signaling activity of a receptor by binding to the receptor. Full agonists are able to maximally stimulate receptors; partial agonists do not elicit full activity even at saturating concentrations. Partial agonists may also function as "blockers" by preventing binding of more potent agonists. "antagonist" refers to a ligand that binds to a receptor without stimulating any activity. "antagonists" are also referred to as "blockers" because of their ability to prevent binding of other ligands and thus block agonist-induced activity. Furthermore, "inverse agonist" refers to an antagonist that, in addition to blocking the agonist effect, also reduces the basal or constitutive activity of the receptor below that of the receptor to which the ligand is not bound.
According to the invention, the amino acid three letter codes and the one letter code used are as described in J.biol. chem, 243, p3558 (1968).
According to the present invention, the term "host cell" refers to a cell into which an expression vector has been introduced. Host cells may include bacterial, microbial, plant or animal cells. Bacteria susceptible to transformation include Enterobacteriaceae (A), (B), (C)enterobacteriaceae) Members of (2), e.g. Escherichia coli (E. coli) ((II))Escherichia coli) Or Salmonella(Salmonella) The strain of (a); bacillaceae (B)Bacillaceae) Such as Bacillus subtilis (B.), (Bacillus subtilis) (ii) a Pneumococcus (A), (B), (C)Pneumococcus) (ii) a Streptococcus (Streptococcus) And Haemophilus influenzae: (Haemophilus influenzae). Suitable microorganisms include Saccharomyces cerevisiae (seeSaccharomyces cerevisiae) And Pichia pastoris (Pichia pastoris). Suitable animalsHost cell lines include CHO (chinese hamster ovary cell line) and NS0 cells.
According to the present invention, amino acid "addition" refers to the addition of an amino acid at the C-terminus or N-terminus of an amino acid sequence. According to the invention, an amino acid "deletion" means that 1, 2 or more than 3 amino acids can be deleted from the amino acid sequence. According to the present invention, the amino acid "insertion" refers to insertion of amino acid residues at appropriate positions in the amino acid sequence, and the inserted amino acid residues may be adjacent to each other in whole or in part, or none of the inserted amino acids may be adjacent to each other.
According to the present invention, an amino acid "substitution" refers to the replacement of an amino acid residue at a certain position in an amino acid sequence with another amino acid residue; wherein "substitution" may be a conservative amino acid substitution.
According to the present invention, "conservative modification", "conservative substitution" or "conservative substitution" refers to the replacement of an amino acid in a protein with another amino acid having similar characteristics (e.g., charge, side chain size, hydrophobicity/hydrophilicity, backbone conformation, rigidity, etc.) so that changes can be frequently made without changing the biological activity of the protein. It is known to The person skilled in The art that, in general, a single amino acid substitution in a non-essential region of a polypeptide does not substantially alter The biological activity (see, for example, Watson et al (1987) Molecular Biology of The Gene, The Benjamin/Cummings pub. Co., p. 224, (4 th edition)). In addition, substitution of structurally or functionally similar amino acids is unlikely to abolish biological activity. Exemplary conservative substitutions are set forth in the following table "exemplary amino acid conservative substitutions".
Exemplary amino acid conservative substitutions
According to the present invention, "medium to very high stringency conditions" include "medium stringency conditions", "medium-high stringency conditions", "high stringency conditions" or "very high stringency conditions", which describe conditions for nucleic acid hybridization and washing. For guidance in performing hybridization reactions see Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6, which is incorporated herein by reference. Aqueous and non-aqueous methods are described in this document, and either may be used. For example, specific hybridization conditions are as follows: (1) low stringency hybridization conditions are washed 2 times in 6 x sodium chloride/sodium citrate (SSC), at about 45 ℃, then at least 50 ℃, in 0.2 x SSC, 0.1% SDS (for low stringency conditions, the wash temperature can be raised to 55 ℃); (2) moderate stringency hybridization conditions wash 1 or more times in 6 XSSC, at about 45 ℃, then in 0.2 XSSC, 0.1% SDS at 60 ℃; (3) high stringency hybridization conditions are 1 or more washes in 6 XSSC, at about 45 ℃, then 65 ℃ in 0.2 XSSC, 0.1% SDS and preferably; (4) very high stringency hybridization conditions are 0.5M sodium phosphate, 7% SDS, 1 or more washes in 0.2 XSSC, 1% SDS at 65 ℃.
According to the present invention, "exogenous" refers to a substance produced outside an organism, cell or human body depending on the case. "endogenous" refers to a substance produced in a cell, organism, or human body as the case may be.
According to the invention, "homology" refers to sequence similarity between two polynucleotide sequences or between two polypeptides. When a position in both of the two compared sequences is occupied by the same base or amino acid monomer subunit, e.g., if each position of two DNA molecules is occupied by adenine, then the molecules are homologous at that position. The percent homology between two sequences is a function of the number of matching or homologous positions shared by the two sequences divided by the number of positions compared x 100. For example, if there are 6 matches or homologies at 10 positions in two sequences when the sequences are optimally aligned, then the two sequences are 60% homologous; two sequences are 95% homologous if there are 95 matches or homologies at 100 positions in the two sequences. Typically, a comparison is made when aligning two sequences to give the maximum percent homology. For example, the comparison may be performed by the BLAST algorithm, where the parameters of the algorithm are selected to give the maximum match between the respective sequences over the entire length of the respective reference sequence. The following references relate to the BLAST algorithm often used for sequence analysis: BLAST algorithm (BLAST ALGORITHMS) Altschul, S.F. et al, (1990) J.mol.biol.215: 403-; gish, W. et al, (1993) Nature Genet.3: 266-; madden, T.L. et al, (1996) meth.Enzymol.266: 131-; altschul, S.F. et al, (1997) Nucleic Acids Res. 25: 3389-3402; zhang, J et al, (1997) Genome Res.7: 649-656. Other conventional BLAST algorithms, such as provided by NCBI BLAST, are also well known to those skilled in the art.
According to the present invention, the term "codon optimized" means that the nucleotide sequence encoding the polypeptide has been configured to comprise codons preferred by the host cell or organism to improve gene expression and increase translation efficiency in the host cell or organism.
According to the present invention, the term "tag" refers to a short peptide that is fused or linked to a protein of interest (e.g., the modified GPR75 of the present invention) and thereby facilitates soluble expression, detection, and/or purification of the recombinant protein. The tag may be fused or linked to the N-terminus and/or C-terminus of the protein of interest (optionally via a linker or protease cleavage site). Such tags are well known to those skilled in the art and have been described in detail in the prior art literature. Such tags include, for example, but are not limited to, histidine tag (Sockolosky, J.T. and F.C.Szoka (2013) Protein Expr Purif 87(2):129-135), glutathione transferase (GST) tag (Hayashi, K. and C.Kojima (2008), Protein ExprPurif 62(1):120-127), Maltose Binding Protein (MBP) tag (Bataille, L., W.Dieryck, A.Hocqellet, C.Cabanne, K.Bathany, S.Lecommandox, B.rbay and E.Garanger, Protein Express and Purfion Volume 110, June 2015, Pages 165-171), thioredoxin (Trax) tag (Tomalay, M.S.Lantin, Lanti, Schnei and Purif, K.S.S.Lecomando, B.g.S.1000, K.Garzan. Garlander, Schnein Expresson and Purison Volume 110, Jun.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.G.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S.S, Disulfide bond isomerases DsbA tags (Zhang, Y., D.R.Olsen, K.B.Nguyen, P.S.Olson, E.T.Rhodes and D.Mascarenhas (1998) Protein Expr Purif 12(2):159-165), DsbC tags (Kurokawa, Y., H.Yanagi and T.yura (2001) J Biol Chem 276(17): 14393-99), SUMO tags (Marbletone, J.G., S.C.Edavettal, Y.Lim, P.Lim, X.Zuo and T.R.Butt (2006) Protein Sci 15 (1)) 182- (189), MSyB tags (Zhang, Z., L.o, P.Sahong, Y.R.Butt (2006) Protein Sci 15- (182-42), JamJ.H.J.S.E.H.J.E.H.J.J.E.S.E.E.E.E.H.J.E.E.E.E.E.E., E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E.E, shuster, Jeffrey R., Barr, Philip J. (1989) Nature Biotechnology 7(7):705- > 709), Myc tag, Flag tag, fluorescent protein (e.g., GFP) tag (Pedelacq, J.D., S.Cabantouts, T.Tran, T.C.Terwilliger and G.S.Waldo (2006) Nat Biotechnology 24(1):79-88), biotin tag, and avidin tag.
According to the present invention, the term "signal peptide", "signal sequence" or "signal peptide sequence" refers to a short peptide which, when fused to a protein of interest (e.g., the modified GPR75 of the present invention), promotes secretion of the protein of interest expressed by a cell onto or out of the cell membrane. Signal peptides are typically located at the N-terminus of the protein of interest, and various signal peptides are known to those skilled in the art, such as, but not limited to, the hemagglutinin signal sequence, the human insulin signal sequence, the human interleukin 2 signal sequence, the albumin signal sequence, and the like.
According to the present invention, the term "protease cleavage site" refers to a site that is specifically recognized and cleaved by a protease. Various specific proteases and their recognition sites are well known to those skilled in the art and are found in many prior art documents. The skilled worker can, depending on the circumstances, use suitable protease cleavage sites in the fusion protein and cleave with the corresponding proteases. The use of protease cleavage sites may be advantageous, for example, in that they may be used to cleave signal peptides and/or tags from fusion proteins, thereby obtaining mature proteins with the desired activity.
According to the present invention, the term "peptide linker" refers to a short peptide used to connect two molecules (e.g., proteins). Typically, a fusion protein, such as protein of interest 1-peptide linker-protein of interest 2, is obtained by introducing (e.g., by PCR amplification or ligase) a polynucleotide sequence encoding the short peptide between two DNA fragments respectively encoding the two proteins of interest to be ligated, and performing protein expression.
According to the present invention, the term "vector" refers to a nucleic acid vehicle into which a polynucleotide may be inserted. When a vector is capable of expressing a protein encoded by an inserted polynucleotide, the vector is referred to as an expression vector. The vector may be introduced into a host cell by transformation, transduction, or transfection, and the genetic material elements carried thereby are expressed in the host cell. Vectors are well known to those skilled in the art and include, but are not limited to: plasmids, phages, cosmids, and the like.
According to the present invention, the terms "cell," "cell line," and "cell culture" are used interchangeably, and all such designations include progeny. Thus, the words "transformant" and "transformed cell" include the primary test cell and cultures derived therefrom, regardless of the number of transfers. It is also understood that all progeny may not be precisely identical in DNA content due to deliberate or inadvertent mutations. Mutant progeny that have the same function or biological activity as screened for in the originally transformed cell are included. Where different names are intended, they are clearly visible from the context.
The invention is further illustrated by the following examples, but is not to be construed as being limited thereto, in conjunction with the accompanying drawings. The following provides specific materials and sources thereof used in embodiments of the present invention. However, it should be understood that these are exemplary only and not intended to limit the invention, and that materials of the same or similar type, quality, nature or function as the following reagents and instruments may be used in the practice of the invention. The experimental procedures used in the following examples are all conventional procedures unless otherwise specified. Materials, reagents and the like used in the following examples are commercially available unless otherwise specified.
Example (b): preparation of modified GPR75
First, sequence optimization
In this example, the predicted protein structure software used was AlphaFold local edition (v2.1.0) and rosettafald.
1.1 truncation and engineering of wild-type GPR75
The wild-type GPR75 has a long random sequence region, and is not suitable for protein structure analysis. In this example, the transmembrane region prediction results were very similar for GPR75 secondary structure prediction (fig. 1A) based on both AlphaFold2 and rosettafeold models. The regions of random sequence are of very low confidence, so we truncated the original regions of random sequence of the GPR75 receptor, leaving 7 membrane spanning regions critical to the receptor (fig. 1B). The wild type and its truncated sequence are shown below (see in particular SEQ ID NO: 1), truncated between the fifth and sixth transmembrane helix of wild type GPR75, and the random sequence at the N-and C-termini of wild type GPR75 and the random sequence between the fifth and sixth transmembrane helix of wild type GPR75 were deleted (deleted).
Wild type GPR75 (GPR 75-WT) amino acid sequence (SEQ ID NO: 1):
MNSTGHLQDAPNATSLHVPHSQEGNSTSLQE GLQDLIHTATLVTCTFLLAVIFCLGSYGNFIVFLSFFD PAFRKFRTNFDFMILNLSFCDLFICGVTAPMFTFVLFFSSASSIPDAFCFTFHLTSSGFIIMSLKTVAVIALHRLRM VLGKQPNRTASFPCTVLLTLLLWATSFTLATLATLKTSKSHLCLPMSSLIAGKGKAILSLYVVDFTFCVAVVSVSYI MIAQTLRKNAQVRKCPPVITVDASRPQPFMGVPVQGGGDPIQCAMPALYRNQNYNKLQHVQTRGYTKSPNQLVTPAASRLQLVSAINLSTAKDSKAVVTCVIIVLSVLVCCLPLGISLVQVVLSSNGSFILYQFELFGFTLIFFKSGLNPFIYS RNSAGLRRKVLWCLQYIGLGFFCCKQKTRLRAMGKGNLEVNRNKSSHHETNSAYMLSPKPQKKFVDQACGPSHSKESMVSPKISAGHQHCGQSSSTPINTRIEPYYSIYNSSPSQEESSPCNLQPVNSFGFANSYIAMHYHTTNDLVQEYDSTSAKQIPVPSV
remarking: of the above wild-type GPR75 amino acid sequences, the amino acid sequences retained in modified GPR75 are single underlined and correspond to the first to fifth transmembrane helices from the N-terminus of wild-type GPR75 and the sixth to seventh transmembrane helices from the N-terminus of wild-type GPR75, respectively; the italic portion is a sequence retained in truncated GPR75 and deleted in modified GPR 75.
1.2 ligation of BRIL fusion proteins to truncated GPR75
It is contemplated that the fusion junction at the fifth and sixth transmembrane helix cut (FIG. 2), which has 4 transmembrane helices, has been used in the resolution of multiple GPCR crystals (e.g., PDB ID:7F83, 7VOD, 6LPJ, 6KO5, 6OS0, etc.) and electron microscopy structures (e.g., PDB ID: 7S8O, 6WW2, 6USF, etc.) using the classical BRIL fusion protein derived from bacterial soluble cytochrome b 562. The BRIL fusion protein can increase the size of the extracellular region of the modified GPR75, and can be used as a marker in electron microscope structure analysis.
The fusion site of the BRIL fusion protein needs to be optimized. In this example, 16 sites of fusion sites of BRIL fusion protein were screened (FIG. 3), and according to the prediction of the structure of AlphaFold2 protein, the modified sequence capable of forming stable helix with the fifth and sixth transmembrane helices was selected in this example, and the fifth and sixth transmembrane helices were fused and connected.
The BRIL fusion protein sequence employed in the engineered GPR75 in this example (SEQ ID NO: 2):
ARRQLADLEDNWETLNDNLKVIEKADNAAQVKDALTKMRAAALDAQKATPPKLEDKSPDSPEMKDFRHGFDILVGQIDDALKLANEGKVKEAQAAAEQLKTTRNAYIQKYLERAR
to further optimize the location of the fusion site, we distributed a single amino acid float to the fifth and sixth transmembrane helix to BRIL junction region and used AlphaFold2 to predict the secondary structure of the fusion protein. From the predicted effect profile of AlphaFold2 (fig. 4), we compared the effect of the selection of 16 fusion sites. The sequence obtained by fusion joining the fifth and sixth transmembrane helix truncations described above by the BRIL fusion protein sequence is (SEQ ID NO: 3):
GLQDLIHTATLVTCTFLLAVIFCLGSYGNFIVFLSFFDPAFRKFRTNFDFMILNLSFCDLFICGVTAPM FTFVLFFSSASSIPDAFCFTFHLTSSGFIIMSLKTVAVIALHRLRMVLGKQPNRTASFPCTVLLTLLLWATSFTLAT LATLKTSKSHLCLPMSSLIAGKGKAILSLYVVDFTFCVAVVSVSYIMIAQTLRKNAQVARRQLADLEDNWETLNDNLKVIEKADNAAQVKDALTKMRAAALDAQKATPPKLEDKSPDSPEMKDFRHGFDILVGQIDDALKLANEGKVKEAQAAAEQLKTTRNAYIQKYLERARSAINLSTAKDSKAVVTCVIIVLSVLVCCLPLGISLVQVVLSSNGSFILYQFELFGFTL IFFKSGLNPFIYSRNSAGLRRKVLWCLQYIGL
1.3N-terminal fusion of beta 2 adrenoceptor sequences
In this example, a 24 amino acid sequence of the human β 2 adrenoceptor was fused to the N-terminus to improve receptor stability and expression (UNIPORT: P07550).
The sequence of the β 2 adrenoceptor employed in the modified GPR75 in this example is (SEQ ID NO: 4):
MGQPGNGSAFLLAPNRSHAPDHDV
in this example, the sequence of a modified GPR75 with an added β 2 adrenoreceptor sequence is as follows (SEQ ID NO: 5)
MGQPGNGSAFLLAPNRSHAPDHDVGLQDLIHTATLVTCTFLLAVIFCLGSYGNFIVFLSFFDPAFRKFR TNFDFMILNLSFCDLFICGVTAPMFTFVLFFSSASSIPDAFCFTFHLTSSGFIIMSLKTVAVIALHRLRMVLGKQPN RTASFPCTVLLTLLLWATSFTLATLATLKTSKSHLCLPMSSLIAGKGKAILSLYVVDFTFCVAVVSVSYIMIAQTLR KNAQVARRQLADLEDNWETLNDNLKVIEKADNAAQVKDALTKMRAAALDAQKATPPKLEDKSPDSPEMKDFRHGFDILVGQIDDALKLANEGKVKEAQAAAEQLKTTRNAYIQKYLERARSAINLSTAKDSKAVVTCVIIVLSVLVCCLPLGIS LVQVVLSSNGSFILYQFELFGFTLIFFKSGLNPFIYSRNSAGLRRKVLWCLQYIGL
1.4 addition of Signal peptide, cleavage site and tag sequence
In this example, the N-terminus and C-terminus of modified GPR75 were designed with Flag tag sequence, Strep tag sequence and 6 × His tag sequence, respectively, to facilitate purification of heterologously expressed modified GPR 75. Finally, in this embodiment, the sequence of the protein is subjected to multiple enzyme cleavage site (3C/TEV) design and Sortase a fusion sequence design, so that the purified protein can be conveniently subjected to reverse purification, matrix fixation, site-specific fluorescence labeling and the like of different schemes according to needs in the later period, and the modified sequence can be conveniently applied to a nucleic acid-encoded small molecule drug library, a drug binding experiment and the like.
Specifically, the modified GPR75 in this embodiment is, in order from the N-terminus to the C-terminus:
the hemagglutinin signal sequence (MKTIIALSYIFCLVFA; SEQ ID NO: 6);
flag tag sequence (DYKDDDDA; SEQ ID NO: 7);
the beta 2 adrenoceptor sequence (MGQPGNGSAFLLAPNRSHAPDHDV; SEQ ID NO: 4);
TEV protease cleavage site (ENLYFQG; SEQ ID NO: 8);
the sequence obtained for truncated GPR75 linked by the BRIL fusion protein sequence (SEQ ID NO: 3);
sortase A fusion sequence (LPETG; SEQ ID NO: 9); it is a USB interface, a connection site of Sortase A enzyme.
Strep tag sequence (SAWSHPQFEK; SEQ ID NO: 10);
HRV 3C protease cleavage site (LEVLFQGP; SEQ ID NO: 11);
6 × His tag sequence (HHHHHHHH; SEQ ID NO: 12).
The HRV 3C protease cleavage site was linked to the 6 XHis tag sequence by GS.
The complete amino acid sequence of modified GPR75 (GPR 75-M) is (SEQ ID NO: 13):
MKTIIALSYIFCLVFADYKDDDDAMGQPGNGSAFLLAPNRSHAPDHDVENLYFQGGLQDLIHTATLVTC TFLLAVIFCLGSYGNFIVFLSFFDPAFRKFRTNFDFMILNLSFCDLFICGVTAPMFTFVLFFSSASSIPDAFCFTFH LTSSGFIIMSLKTVAVIALHRLRMVLGKQPNRTASFPCTVLLTLLLWATSFTLATLATLKTSKSHLCLPMSSLIAGK GKAILSLYVVDFTFCVAVVSVSYIMIAQTLRKNAQVARRQLADLEDNWETLNDNLKVIEKADNAAQVKDALTKMRAAALDAQKATPPKLEDKSPDSPEMKDFRHGFDILVGQIDDALKLANEGKVKEAQAAAEQLKTTRNAYIQKYLERARSAI NLSTAKDSKAVVTCVIIVLSVLVCCLPLGISLVQVVLSSNGSFILYQFELFGFTLIFFKSGLNPFIYSRNSAGLRRK VLWCLQYIGLPETGSAWSHPQFEKLEVLFQGPGSHHHHHH
remarking: the tail end of the C end of the sequence shown in SEQ ID NO. 3 shares amino acid L with the N end of the Sortase A fusion sequence.
GPR75-M codon optimized (sf 9 insect cell based) nucleic acid sequence (SEQ ID NO: 14):
ATGAAAACGATTATCGCACTGTCTTACATCTTCTGCCTGGTTTTTGCAGACTACAAAGACGACGATGATGCAATGGGTCAACCCGGAAACGGTTCAGCATTTTTGTTGGCGCCGAATCGTTCACACGCTCCCGATCACGACGTGGAGAATCTGTATTTCCAAGGCGGTCTGCAGGACTTGATACACACGGCTACGCTTGTCACCTGCACTTTTCTTCTTGCTGTAATATTTTGTTTGGGATCGTACGGCAATTTCATAGTCTTCCTGTCATTTTTCGATCCGGCTTTCCGCAAGTTTAGGACCAATTTTGACTTCATGATCCTTAACCTCTCTTTCTGTGATTTGTTCATATGCGGTGTGACTGCGCCTATGTTTACATTCGTGCTGTTTTTCTCAAGCGCATCATCCATACCCGATGCTTTTTGCTTCACGTTCCATTTGACCTCCTCGGGCTTCATCATTATGTCTTTGAAGACTGTTGCAGTAATAGCACTTCATAGGCTTCGTATGGTCCTCGGCAAACAGCCTAATCGCACTGCGTCGTTCCCTTGCACTGTCCTCTTGACCCTGCTCCTTTGGGCGACATCGTTTACTCTTGCCACCTTGGCTACACTCAAAACAAGCAAGTCTCATCTCTGTTTGCCAATGAGTAGTCTCATTGCCGGTAAAGGAAAGGCAATTTTGTCTTTGTACGTGGTTGACTTTACTTTCTGCGTTGCCGTTGTGTCAGTTTCTTACATCATGATTGCGCAGACACTGCGTAAAAATGCGCAGGTCGCAAGGAGACAGCTCGCCGATCTTGAAGACAATTGGGAGACGTTGAATGACAACCTGAAAGTGATTGAGAAAGCAGACAATGCAGCGCAAGTAAAGGATGCACTCACTAAGATGCGTGCCGCTGCGCTCGACGCGCAAAAGGCAACTCCGCCTAAATTGGAGGATAAGTCCCCTGACTCACCAGAGATGAAGGATTTCAGACATGGCTTCGACATCCTGGTAGGACAGATTGACGATGCGTTGAAGCTCGCGAACGAAGGAAAGGTGAAAGAGGCCCAGGCAGCGGCTGAACAGCTCAAGACCACAAGGAACGCATACATACAAAAATACCTGGAGCGTGCAAGGTCAGCTATAAATCTTTCAACCGCTAAAGATTCCAAGGCGGTAGTCACCTGCGTAATTATAGTACTTTCCGTCTTGGTTTGTTGTCTCCCGTTGGGCATATCCCTCGTACAAGTGGTCCTTTCGAGTAATGGTTCCTTCATTCTGTATCAATTCGAGCTTTTCGGCTTCACTCTGATATTCTTTAAGTCAGGTCTGAATCCCTTTATTTACTCCCGTAATTCAGCGGGACTCAGACGCAAGGTGCTCTGGTGTCTCCAGTACATCGGCCTGCCCGAAACCGGTTCGGCATGGTCTCACCCCCAGTTTGAAAAACTCGAGGTTCTCTTTCAAGGACCGGGAAGTCATCATCACCATCATCATTAG
1.5 Gene Synthesis and plasmid construction
Genes with optimized sequences, including wild type GPR75, truncated type GPR75 and a modified GPR75 gene, are sent to Beijing Honghong biotechnology limited for gene synthesis, carry NotI and HindIII enzyme cutting sites, and are connected and recombined to pFastbac-1 plasmid.
Wherein, the amino acid sequence of truncated GPR75 (SEQ ID NO: 15):
MKTIIALSYIFCLVFADYKDDDDAMGQPGNGSAFLLAPNRSHAPDHDVENLYFQGMNSTGHLQDAPNATSLHVPHSQEGNSTSLQEGLQDLIHTATLVTCTFLLAVIFCLGSYGNFIVFLSFFDPAFRKFRTNFDFMILNLSFCDLFICGVTAPMFTFVLFFSSASSIPDAFCFTFHLTSSGFIIMSLKTVAVIALHRLRMVLGKQPNRTASFPCTVLLTLLLWATSFTLATLATLKTSKSHLCLPMSSLIAGKGKAILSLYVVDFTFCVAVVSVSYIMIAQTLRKNAQVRKCPPVITVDASRPQPFMGVPVQGGGDPIQCAMPALYRNQNYNKLQHVQTRGYTKSPNQLVTPAASRLQLVSAINLSTAKDSKAVVTCVIIVLSVLVCCLPLGISLVQVVLSSNGSFILYQFELFGFTLIFFKSGLNPFIYSRNSAGLRRKVLWCLQYIGLPETGSAWSHPQFEKLEVLFQGPGSHHHHHH
in truncated GPR75, no BRIL fusion protein sequence is present, the random sequence at the N-terminal in wild GPR75 is not deleted, and other signal peptides, enzyme cutting sites and tag sequences are the same as those of modified GPR 75.
Preparation of recombinant baculovirus
2.1, the recombinant pFastbac plasmid containing the gene of interest was introduced into E.coli DH10Bac competent cells (Bomeide organism) by heat shock transformation, and cultured in LB solid medium containing 50. mu.g/mL kanamycin (BioBomei), 7. mu.g/mL gentamicin (BioBomei), 10. mu.g/mL tetracycline (BioBomei), 200. mu.g/mL X-gal (inalco), and 40. mu.g/mL IPTG (inalco) at 37 ℃ for 48 hours. Selecting uniform white spots to 3 mL LB liquid culture medium containing three antibiotics (50. mu.g/mL kanamycin, 7. mu.g/mL gentamicin, 10. mu.g/mL tetracycline), culturing overnight at 37 deg.C under 200 rpm, and waiting for OD of bacterial liquid600At about 0.6, a recombinant baculovirus plasmid was extracted.
2.2, 1 mL of the isolated Medium (Graces) was incubated with 15. mu.L of the transfection reagent (FuGENE) and 5. mu.g of the recombinant baculovirus plasmid at room temperature for 15 minutes, and the mixture solution was used to resuspend 10-12X 10 cells6sf9 insect cells (obtained by centrifuging a cell culture solution at 500 rpm for 10 minutes at room temperature) were cultured at 27 ℃ for 4 hours at 200 rpm, and 5 mL of ESF921 insect cell culture medium (Expression Systems) was added, and the culture was continued at 27 ℃ for 48 hours at 200 rpm. The sf9 cells cultured for 48 hours were transferred to a 100 mL Erlenmeyer flask and cultured at 27 ℃ and 110 rpm until the cell density reached 2-4X 106At room temperature, 2500 rpm at/mLThe supernatant was the P1 generation recombinant baculovirus after 10 minutes.
2.3, transfecting the P1 generation recombinant baculovirus according to the proportion of 1:10000 to 100 mL of cells with the cell density of 1.5 multiplied by 106The insect cells were cultured at 27 ℃ and 110 rpm in the presence of sf9 to a cell density of 6X 106And when the cell volume is expanded and more uniform, centrifuging at 2500 rpm for 10 minutes at room temperature, and filtering the supernatant by using a 0.22-micron needle filter to obtain the P2-generation recombinant baculovirus.
Third, protein purification laboratory
The P2 generation recombinant baculovirus was transfected into 20 mL of 4X 10 density medium at a ratio of 1:506The insect cells were cultured at 27 ℃ and 110 rpm for 48 hours in the presence of/mL sf 9. After the culture is finished, sampling and carrying out Western blot detection to determine the expression of the target protein.
We performed Western blot parallel experiments on wild type GPR75 and modified GPR 75. The experimental results showed that no band was significantly expressed in wild type GPR75, whereas target band expression was observed in modified GPR75 (fig. 5). The expression level of GPR75 is improved after modification. Since wild-type GPR75 was tested for no expression, a further large expression purification comparison was subsequently performed for truncated GPR75 and modified GPR 75.
Fourth, protein mass expression and purification
The P2 generation recombinant baculovirus was transfected at a ratio of 1:50 to give 1L recombinant baculovirus with a density of 4X 106The insect cells were cultured at 27 ℃ and 110 rpm for 48 hours in a/mL sf 9.
After the cell culture is finished, centrifuging at 4 ℃ and 4000 rpm for 20 minutes, collecting cells, re-suspending cell sediment by using 100 mL Buffer A, stirring at 4 ℃ for 10 minutes to fully lyse the cells, centrifuging the lysed cells at 4 ℃ and 15000 rpm for 10 minutes, discarding supernatant, re-suspending the sediment by using 25 mL Buffer B and homogenizing by using a homogenizer, and performing membrane dissolving on the homogenate at 4 ℃ for 90 minutes. After the membrane dissolution, the mixture is centrifuged at 4 ℃ and 37000 rpm for 15 minutes, the supernatant is incubated with nickel affinity chromatography packing for 1 hour, and then the foreign proteins are eluted by Buffer C and Buffer D, and the target protein is eluted by Buffer E. Loading an eluent containing target protein on a Flag affinity chromatography column, eluting hybrid protein by using Buffer F and target protein by using Buffer G, then concentrating the target protein by using a 50 KDa ultrafiltration tube, carrying out gel filtration chromatography when the volume of the concentrated target protein is about 500 mu L, wherein the model of the gel column is Superdex 200 Incrase 10/300 GL (cytiva), the Buffer solution is Buffer H, and collecting a protein sample at a UV-280 ultraviolet absorption peak to carry out SDS-PAGE gel electrophoresis to detect the content and the purity of the target protein.
Wherein, the Buffer solution (Buffer) A-F comprises the following specific components:
buffer A: 20mM Tris, pH 7.5, 2 mg/mL Iodoacetamide (Iodoacetamide);
buffer B: 20mM Tris, pH 7.5, 1 mg/mL iodoacetamide, 750 mM NaCl, 0.5% LMNG, 0.03% CHS, 0.2% sodium cholate (sodium cholate), 1/1000 protease inhibitor;
buffer C: 20mM Tris, pH 7.5, 150 mM NaCl, 0.05% LMNG, 0.003% CHS, 0.02% sodium cholate, 1/1000 protease inhibitor, 20mM Imidazole (Imidazole);
buffer D: 20mM Tris, pH 7.5, 150 mM NaCl, 0.05% LMNG, 0.003% CHS, 0.02% sodium cholate, 1/1000 protease inhibitor, 30 mM imidazole;
buffer E: 20mM Tris, pH 7.5, 150 mM NaCl, 0.05% LMNG, 0.003% CHS, 0.02% sodium cholate, 1/1000 protease inhibitor, 250 mM imidazole;
buffer F: 20mM Tris, pH 7.5, 150 mM NaCl, 0.05% LMNG, 0.003% CHS, 0.02% sodium cholate, 1/1000 protease inhibitor;
buffer G: 20mM Tris, pH 7.5, 150 mM NaCl, 0.05% LMNG, 0.003% CHS, 0.02% sodium cholate, 1/1000 protease inhibitor, 0.13 mg/mL Flag peptide;
buffer H: 20mM Tris, pH 7.5, 150 mM NaCl, 0.00075% LMNG, 0.0001% CHS, 0.00025% GDN, 1/1000 protease inhibitor, 100. mu.M Tris (2-carboxyethyl) phosphine (TCEP).
The experimental results are as follows: as shown in FIG. 6, in the gel filtration chromatography experiment, UV-280 absorption peak began to appear in modified GPR75 when Buffer H eluted in a volume of 11.7 mL, with a maximum absorption of 228.2 mAu, corresponding to a Buffer elution volume of 12.6 mL. As shown in figure 6, the expression of modified GPR75 was significantly increased compared to truncated GPR75 (without the BRIL fusion and without deletion of the N-terminal random sequence of wild-type GPR 75). The modified GPR75 also had a higher proportion of monomer peaks. The molecular weight of the modified GPR75 protein is 58.03 KDa, as shown in figure 7A and figure 7B, and the purity of the protein obtained after affinity chromatography purification and gel filtration chromatography purification is about 90% after SDS-PAGE gel electrophoresis analysis. Cryo-electron microscopy data showed that the samples were in good aggregate state and were available for further structural analysis as shown in fig. 7A and 7B.
Test example
Test example 1 identification of Activity of modified GPR75 protein
To verify whether the modified GPR75 protein prepared and purified in the examples has activity, the test example determined the efficiency of promoting GTP hydrolysis by Gq protein by using that the downstream Gq protein can be activated by G protein-coupled receptor21. Experimental procedure according to GTPase-GloTMAssay (Promega) kit instructions. The principle of the method is that 10 mu M GTP molecules are added into an experimental system, and because Gq protein has GTP hydrolysis activity, GTP molecules in the system are gradually consumed. The non-ligand bound, modified GPR75 protein has a background level of activation activity that will accelerate GTP depletion in the system.
The method comprises the following specific steps: first, the buffer conditions used for the experiment were configured: 20mM Tris pH 7.5, 100mM NaCl, 0.01% MNG, 100. mu.M TCEP, 5mM MgCl2. In the experiment, the purified control buffer solution, Gq protein, modified GPR75 protein, modified GPR75 protein-Gq protein complex and modified GPR75 protein-Gq protein-20-HETE ligand complex are respectively mixed with 10 mu M GTP, wherein the concentrations of the modified GPR75 protein and the Gq protein are both 3 mu M, and the concentration of 20-HETE is 10 mu M. The samples were incubated at room temperature for 2 hours. Subsequently, a Glo reaction solution was prepared: glo reagent in kit was diluted 500-fold into double distilled water and 10 μ M ADP molecules were added. mu.L of Glo reaction solution was added to 5 samples of the previous step at a volume ratio of 1:1 and incubated for 30 minutes. Subsequently, 40. mu.L of Glo detection solution was added at a volume ratio of 1:1,adding into the previous step. Finally, the samples were aliquoted into 384-well plates and read using an Ensight plate reader (perkinelmer).
The hydrolytic activity of Gq protein was enhanced after addition of the modified GPR75 protein compared to the hydrolytic GTP activity of Gq protein itself (fig. 8A). This indicates that the modified GPR75 protein without ligand binding state has a certain level of background activity. The test example shows that the ligand 20-HETE of GPR75 protein reported in the literature shows the effect of inhibiting the activation of the modified GPR75 protein. To further investigate the effect of 20-HETE, this test example was conducted on the IC of 20-HETE50The measurement was carried out (FIG. 8B), and the results showed IC of 20-HETE50The value was approximately 2 nM. The modified GPR75 has certain activity, and the modified GPR75 provided by the invention can be combined with a ligand of wild GPR75, such as 20-HETE, and can be applied to GPR75 structure analysis, GPR75 activity analysis, related nucleic acid coding small molecule library screening, computer-assisted drug design and drug screening and the like. And as demonstrated in the examples, compared with wild type and truncated type GPR75, the modified GPR75 provided by the invention has higher expression level and is suitable for subsequent structural analysis and other applications.
Test example 2 modification of GPR75 protein for Structure analysis
For further structural analysis work using the modified GPR75 protein, the modified GPR75 protein prepared and purified in the examples was incubated with the anti-BRIL fab fragment, and the complex was purified by molecular sieving (fig. 9A). The purified complex was verified in a further SDS-PAGE experiment (FIG. 9B). Furthermore, as shown in fig. 9C, the single-particle two-dimensional classification by cryoelectron microscopy revealed features of the receptor and the complex with fab fragment of anti-BRIL. Test examples 1 and 2 demonstrate that in the modified GPR75, the ligand binding site of wild-type GPR75 was partially retained in GPR75, and the BRIL fusion protein was recognized by its antibody and was used as a marker in electron microscopy structural analysis. Subsequently, in this test example, the obtained stable compound was subjected to freezing sample preparation, electron microscopy data collection, and structure analysis. The single-particle two-dimensional classification data show that the modified 75 protein-BRIL resistant fab fragment forms a stable compound, which lays a solid foundation for further performing structure analysis on the modified GPR75 protein.
The above description of exemplary embodiments has been presented only to illustrate the technical solution of the invention and is not intended to be exhaustive or to limit the invention to the precise form described. Obviously, many modifications and variations are possible in light of the above teaching to those skilled in the art. The exemplary embodiments were chosen and described in order to explain certain principles of the invention and its practical application to thereby enable others skilled in the art to understand, implement and utilize the invention in various exemplary embodiments and with various alternatives and modifications. It is intended that the scope of the invention be defined by the following claims and their equivalents.
Reference documents:
1. Venter, J. C. et al. The Sequence of the Human Genome. Science 291, 1304–1351 (2001).
2. Wise, A., Jupe, S. C. & Rees, S. THE IDENTIFICATION OF LIGANDS AT ORPHAN G-PROTEIN COUPLED RECEPTORS. Pharmacol Toxicol 44, 43–66 (2004).
3. Sriram, K. & Insel, P. A. GPCRs as targets for approved drugs: How many targets and how many drugs. Mol Pharmacol 93, mol.117.111062 (2018).
4. Flock, T. et al. Selectivity determinants of GPCR-G-protein binding. Nature 545, 317–322 (2017).
5. Garcia, V. et al. 20-HETE Signals Through G-Protein–Coupled Receptor GPR75 (Gq) to Affect Vascular Function and Trigger Hypertension. Circ Res 120, 1776–1788 (2017).
6. Liu, B. et al. The novel chemokine receptor, G-protein-coupled receptor 75, is expressed by islets and is coupled to stimulation of insulin secretion and improved glucose homeostasis. Diabetologia 56, 2467–2476 (2013).
7. Akbari, P. et al. Sequencing of 640,000 exomes identifies GPR75 variants associated with protection from obesity. Science 373, eabf8683 (2021).
8. Tarttelin, E. E. et al. Cloning and Characterization of a Novel Orphan G-Protein-Coupled Receptor Localized to Human Chromosome 2p16. Biochem Bioph Res Co 260, 174–180 (1999).
9. Pease, J. E. Tails of the unexpected – an atypical receptor for the chemokine RANTES/CCL5 expressed in brain. Brit J Pharmacol 149, 460–462 (2006).
10. Ignatov, A., Robert, J., Gregory‐Evans, C. & Schaller, H. C. RANTES stimulates Ca2+ mobilization and inositol trisphosphate (IP3) formation in cells transfected with G protein‐coupled receptor 75. Brit J Pharmacol 149, 490–497 (2006).
11. Shen, P. S. The 2017 Nobel Prize in Chemistry: cryo-EM comes of age. Anal Bioanal Chem 410, 2053–2057 (2018).
12. Callaway, E. Revolutionary cryo-EM is taking over structural biology. Nature 578, 201–201 (2020).
13. Kooistra, A. J. et al. GPCRdb in 2021: integrating GPCR sequence, structure and function. Nucleic Acids Res 49, gkaa1080- (2020).
14. Schneider, G. & Fechner, U. Computer-based de novo design of drug-like molecules. Nat Rev Drug Discov 4, 649–663 (2005).
15. Renaud, J.-P. et al. Cryo-EM in drug discovery: achievements, limitations and prospects. Nat Rev Drug Discov 17, 471–492 (2018).
16. Palczewski, K. et al. Crystal Structure of Rhodopsin: A G Protein-Coupled Receptor. Science 289, 739–745 (2000).
17. Rasmussen, S. G. F. et al. Crystal structure of the human β2 adrenergic G-protein-coupled receptor. Nature 450, 383–387 (2007).
18. Cherezov, V. et al. High-Resolution Crystal Structure of an Engineered Human β2-Adrenergic G Protein–Coupled Receptor. Science 318, 1258–1265 (2007).
19. Rasmussen, S. G. F. et al. Crystal structure of the β2 adrenergic receptor-Gs protein complex. Nature 477, 549–555 (2011).
20. Manglik, A. et al. Structure-based discovery of opioid analgesics with reduced side effects. Nature 537, 185–190 (2016).
21. Gregorio, G. G. et al. Single-molecule analysis of ligand efficacy in β2AR-G-protein activation. Nature 547, 68–73 (2017).
sequence listing
<110> Shuimu future (Beijing) Tech Co Ltd
<120> modified GPR75 and use thereof
<130> 6C95-2128763IP
<160> 15
<170> SIPOSequenceListing 1.0
<210> 1
<211> 540
<212> PRT
<213> human ()
<400> 1
Met Asn Ser Thr Gly His Leu Gln Asp Ala Pro Asn Ala Thr Ser Leu
1 5 10 15
His Val Pro His Ser Gln Glu Gly Asn Ser Thr Ser Leu Gln Glu Gly
20 25 30
Leu Gln Asp Leu Ile His Thr Ala Thr Leu Val Thr Cys Thr Phe Leu
35 40 45
Leu Ala Val Ile Phe Cys Leu Gly Ser Tyr Gly Asn Phe Ile Val Phe
50 55 60
Leu Ser Phe Phe Asp Pro Ala Phe Arg Lys Phe Arg Thr Asn Phe Asp
65 70 75 80
Phe Met Ile Leu Asn Leu Ser Phe Cys Asp Leu Phe Ile Cys Gly Val
85 90 95
Thr Ala Pro Met Phe Thr Phe Val Leu Phe Phe Ser Ser Ala Ser Ser
100 105 110
Ile Pro Asp Ala Phe Cys Phe Thr Phe His Leu Thr Ser Ser Gly Phe
115 120 125
Ile Ile Met Ser Leu Lys Thr Val Ala Val Ile Ala Leu His Arg Leu
130 135 140
Arg Met Val Leu Gly Lys Gln Pro Asn Arg Thr Ala Ser Phe Pro Cys
145 150 155 160
Thr Val Leu Leu Thr Leu Leu Leu Trp Ala Thr Ser Phe Thr Leu Ala
165 170 175
Thr Leu Ala Thr Leu Lys Thr Ser Lys Ser His Leu Cys Leu Pro Met
180 185 190
Ser Ser Leu Ile Ala Gly Lys Gly Lys Ala Ile Leu Ser Leu Tyr Val
195 200 205
Val Asp Phe Thr Phe Cys Val Ala Val Val Ser Val Ser Tyr Ile Met
210 215 220
Ile Ala Gln Thr Leu Arg Lys Asn Ala Gln Val Arg Lys Cys Pro Pro
225 230 235 240
Val Ile Thr Val Asp Ala Ser Arg Pro Gln Pro Phe Met Gly Val Pro
245 250 255
Val Gln Gly Gly Gly Asp Pro Ile Gln Cys Ala Met Pro Ala Leu Tyr
260 265 270
Arg Asn Gln Asn Tyr Asn Lys Leu Gln His Val Gln Thr Arg Gly Tyr
275 280 285
Thr Lys Ser Pro Asn Gln Leu Val Thr Pro Ala Ala Ser Arg Leu Gln
290 295 300
Leu Val Ser Ala Ile Asn Leu Ser Thr Ala Lys Asp Ser Lys Ala Val
305 310 315 320
Val Thr Cys Val Ile Ile Val Leu Ser Val Leu Val Cys Cys Leu Pro
325 330 335
Leu Gly Ile Ser Leu Val Gln Val Val Leu Ser Ser Asn Gly Ser Phe
340 345 350
Ile Leu Tyr Gln Phe Glu Leu Phe Gly Phe Thr Leu Ile Phe Phe Lys
355 360 365
Ser Gly Leu Asn Pro Phe Ile Tyr Ser Arg Asn Ser Ala Gly Leu Arg
370 375 380
Arg Lys Val Leu Trp Cys Leu Gln Tyr Ile Gly Leu Gly Phe Phe Cys
385 390 395 400
Cys Lys Gln Lys Thr Arg Leu Arg Ala Met Gly Lys Gly Asn Leu Glu
405 410 415
Val Asn Arg Asn Lys Ser Ser His His Glu Thr Asn Ser Ala Tyr Met
420 425 430
Leu Ser Pro Lys Pro Gln Lys Lys Phe Val Asp Gln Ala Cys Gly Pro
435 440 445
Ser His Ser Lys Glu Ser Met Val Ser Pro Lys Ile Ser Ala Gly His
450 455 460
Gln His Cys Gly Gln Ser Ser Ser Thr Pro Ile Asn Thr Arg Ile Glu
465 470 475 480
Pro Tyr Tyr Ser Ile Tyr Asn Ser Ser Pro Ser Gln Glu Glu Ser Ser
485 490 495
Pro Cys Asn Leu Gln Pro Val Asn Ser Phe Gly Phe Ala Asn Ser Tyr
500 505 510
Ile Ala Met His Tyr His Thr Thr Asn Asp Leu Val Gln Glu Tyr Asp
515 520 525
Ser Thr Ser Ala Lys Gln Ile Pro Val Pro Ser Val
530 535 540
<210> 2
<211> 115
<212> PRT
<213> Artificial sequence ()
<220>
Claims (8)
1. A modified GPR75, characterized in that the modified GPR75 comprises:
a first domain comprising an amino acid sequence derived from a β 2 adrenergic receptor; and the combination of (a) and (b),
a second domain that is a domain in which the random sequence between the fifth and sixth transmembrane helices and the N-terminal, C-terminal random sequence are deleted in wild-type GPR75 and linked between the fifth and sixth transmembrane helices by an amino acid sequence derived from a BRIL fusion protein;
the amino acid sequence of the modified GPR75 is shown as SEQ ID NO: shown at 13.
2. The modified GPR75 of claim 1 wherein the first domain comprises the amino acid sequence set forth as SEQ ID No. 4; and/or
The second domain comprises an amino acid sequence as set forth in SEQ ID NO 3.
3. Modified GPR75 according to claim 1 or 2, characterized in that the modified GPR75 comprises the following sequence:
(i) the amino acid sequence shown as SEQ ID NO. 5.
4. A polynucleotide encoding the engineered GPR75 of any one of claims 1 to 3.
5. The polynucleotide of claim 4, having a nucleotide sequence as set forth in SEQ ID NO: as shown at 14.
6. An expression vector comprising the polynucleotide of claim 4 or 5.
7. A host cell comprising the expression vector of claim 6.
8. Use of a modified GPR75 according to any one of claims 1 to 3, a polynucleotide according to claim 4 or 5, an expression vector according to claim 6 or a host cell according to claim 7 for use in GPR75 structural analysis, fluorescent molecular labeling, fusion of phosphorylated polypeptides or signaling proteins, GPR75 activity analysis, screening of nucleic acid-encoded small molecule libraries, computer-assisted drug design and drug screening, as a non-disease treatment or diagnostic method.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210205737.1A CN114276460B (en) | 2022-03-04 | 2022-03-04 | Modified GPR75 and use thereof |
PCT/CN2022/117816 WO2023165108A1 (en) | 2022-03-04 | 2022-09-08 | Modified gpr75 and uses thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210205737.1A CN114276460B (en) | 2022-03-04 | 2022-03-04 | Modified GPR75 and use thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114276460A CN114276460A (en) | 2022-04-05 |
CN114276460B true CN114276460B (en) | 2022-06-24 |
Family
ID=80882118
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210205737.1A Active CN114276460B (en) | 2022-03-04 | 2022-03-04 | Modified GPR75 and use thereof |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114276460B (en) |
WO (1) | WO2023165108A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114276460B (en) * | 2022-03-04 | 2022-06-24 | 水木未来(北京)科技有限公司 | Modified GPR75 and use thereof |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103874711A (en) * | 2011-05-13 | 2014-06-18 | 瑞塞普托斯公司 | Novel fusion partners for the purpose of crystallizing G-protein coupled receptors |
WO2018207950A1 (en) * | 2017-05-12 | 2018-11-15 | 横山 茂之 | Class a gpcr-binding compound modifier |
CN110914430A (en) * | 2017-07-13 | 2020-03-24 | 协和麒麟株式会社 | anti-BRIL antibody and method for stabilizing BRIL fusion protein using same |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6538107B1 (en) * | 1994-09-30 | 2003-03-25 | Takeda Chemical Industries, Ltd. | G protein coupled receptor protein production, and use thereof |
EP1278839A2 (en) * | 2000-03-29 | 2003-01-29 | PE Corporation (NY) | Isolated human g-protein coupled receptors, nucleic acid molecules encoding human gpcr proteins, and uses thereof |
CN114276460B (en) * | 2022-03-04 | 2022-06-24 | 水木未来(北京)科技有限公司 | Modified GPR75 and use thereof |
-
2022
- 2022-03-04 CN CN202210205737.1A patent/CN114276460B/en active Active
- 2022-09-08 WO PCT/CN2022/117816 patent/WO2023165108A1/en unknown
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103874711A (en) * | 2011-05-13 | 2014-06-18 | 瑞塞普托斯公司 | Novel fusion partners for the purpose of crystallizing G-protein coupled receptors |
WO2018207950A1 (en) * | 2017-05-12 | 2018-11-15 | 横山 茂之 | Class a gpcr-binding compound modifier |
CN110869362A (en) * | 2017-05-12 | 2020-03-06 | 国立研究开发法人理化学研究所 | Modified class A GPCR binding compounds |
CN110914430A (en) * | 2017-07-13 | 2020-03-24 | 协和麒麟株式会社 | anti-BRIL antibody and method for stabilizing BRIL fusion protein using same |
Also Published As
Publication number | Publication date |
---|---|
CN114276460A (en) | 2022-04-05 |
WO2023165108A1 (en) | 2023-09-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2681415C (en) | Mutant g-protein coupled receptors and methods for selecting them | |
CN114276460B (en) | Modified GPR75 and use thereof | |
WO2001014577A1 (en) | Molecular cloning of a galanin like 7tm receptor (axor40) | |
WO2001016159A1 (en) | Gpcr, theant | |
US6461836B1 (en) | Molecular cloning of a 7TM receptor (AxOR34) and screening methods thereof | |
JP2003510027A (en) | PGPCR-3 polypeptide and its DNA sequence | |
US20020004222A1 (en) | Cloning of a monkey 7TM receptor (AXOR8) | |
WO2001042486A1 (en) | Axor12 | |
CN115947870A (en) | Modified GPR35 and application thereof | |
CN115850518A (en) | Modified BRIL and application thereof | |
AU2001283930B2 (en) | Novel g-protein coupled receptor | |
GB2367295A (en) | AXOR69 polypeptides and polynucleotides | |
WO2001064836A2 (en) | Cloning of a gpr38 variant | |
US20050054034A1 (en) | Thyrotropin-releasing hormone receptor-like gpcr(gprfwki) | |
JP2004500864A (en) | Novel G-protein coupled receptor | |
JP2003506040A (en) | G protein-coupled receptor and its DNA sequence | |
US20060154244A1 (en) | Novel g-protein coupled receptor | |
GB2371546A (en) | G protein coupled receptor AXOR 109 | |
GB2368065A (en) | AXOR52, a NPY-like G protein coupled receptor | |
WO2001025280A1 (en) | Paul, a g-protein coupled receptor | |
EP1232265A2 (en) | G-protein coupled receptor htogh35 | |
GB2365870A (en) | Polynucleotides and polypeptides of the G-protein coupled receptor AXOR83 | |
GB2364058A (en) | AXOR79, a G-protein coupled receptor | |
JP2004509623A (en) | New GPCRHFRBN63 | |
WO2001014410A1 (en) | Cloning of a pafr like 7tm receptor (axor11) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |