CN114940979A - Method for improving cation-pi interaction by genetic code expansion and application - Google Patents
Method for improving cation-pi interaction by genetic code expansion and application Download PDFInfo
- Publication number
- CN114940979A CN114940979A CN202210140263.7A CN202210140263A CN114940979A CN 114940979 A CN114940979 A CN 114940979A CN 202210140263 A CN202210140263 A CN 202210140263A CN 114940979 A CN114940979 A CN 114940979A
- Authority
- CN
- China
- Prior art keywords
- tryptophan
- protein
- phd
- indole
- compound
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 65
- 230000003993 interaction Effects 0.000 title claims abstract description 52
- 230000002068 genetic effect Effects 0.000 title claims abstract description 25
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 172
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 165
- 108010033040 Histones Proteins 0.000 claims abstract description 98
- 230000011987 methylation Effects 0.000 claims abstract description 90
- 238000007069 methylation reaction Methods 0.000 claims abstract description 90
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims abstract description 46
- 238000005516 engineering process Methods 0.000 claims abstract description 32
- 125000003118 aryl group Chemical group 0.000 claims abstract description 14
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims abstract description 10
- 238000011160 research Methods 0.000 claims abstract description 10
- 230000002194 synthesizing effect Effects 0.000 claims abstract description 6
- 239000000126 substance Substances 0.000 claims abstract description 3
- ASMBUJRMSWTSLE-UHFFFAOYSA-N 2-azaniumyl-3-(6-methoxy-1h-indol-3-yl)propanoate Chemical compound COC1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 ASMBUJRMSWTSLE-UHFFFAOYSA-N 0.000 claims description 37
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 claims description 37
- 150000001413 amino acids Chemical class 0.000 claims description 31
- 229940024606 amino acid Drugs 0.000 claims description 30
- 150000001875 compounds Chemical class 0.000 claims description 29
- 238000003786 synthesis reaction Methods 0.000 claims description 23
- 102000002798 Phenylalanine-tRNA Ligase Human genes 0.000 claims description 22
- 108010004478 Phenylalanine-tRNA Ligase Proteins 0.000 claims description 22
- MUZROTSTMQSBFK-VIFPVBQESA-N (2s)-2-amino-3-(7-methoxy-1h-indol-3-yl)propanoic acid Chemical compound COC1=CC=CC2=C1NC=C2C[C@H](N)C(O)=O MUZROTSTMQSBFK-VIFPVBQESA-N 0.000 claims description 21
- 239000002773 nucleotide Substances 0.000 claims description 21
- 125000003729 nucleotide group Chemical group 0.000 claims description 21
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 claims description 20
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 claims description 20
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 20
- 230000015572 biosynthetic process Effects 0.000 claims description 20
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 20
- 238000006243 chemical reaction Methods 0.000 claims description 19
- 230000014509 gene expression Effects 0.000 claims description 19
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 claims description 15
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 claims description 15
- KBOZNJNHBBROHM-JTQLQIEISA-N (2s)-2-azaniumyl-3-(7-methyl-1h-indol-3-yl)propanoate Chemical compound CC1=CC=CC2=C1NC=C2C[C@H]([NH3+])C([O-])=O KBOZNJNHBBROHM-JTQLQIEISA-N 0.000 claims description 14
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 claims description 14
- 238000012216 screening Methods 0.000 claims description 13
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 claims description 12
- 239000003054 catalyst Substances 0.000 claims description 12
- 239000000376 reactant Substances 0.000 claims description 12
- 230000035484 reaction time Effects 0.000 claims description 12
- 239000000047 product Substances 0.000 claims description 11
- 239000002904 solvent Substances 0.000 claims description 10
- -1 lithium aluminum hydride Chemical compound 0.000 claims description 9
- PAYRUJLWNCNPSJ-UHFFFAOYSA-N Aniline Chemical compound NC1=CC=CC=C1 PAYRUJLWNCNPSJ-UHFFFAOYSA-N 0.000 claims description 8
- PZOUSPYUWWUPPK-UHFFFAOYSA-N indole Natural products CC1=CC=CC2=C1C=CN2 PZOUSPYUWWUPPK-UHFFFAOYSA-N 0.000 claims description 8
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 claims description 8
- 239000007858 starting material Substances 0.000 claims description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 8
- 229910052757 nitrogen Inorganic materials 0.000 claims description 7
- LVTJOONKWUXEFR-FZRMHRINSA-N protoneodioscin Natural products O(C[C@@H](CC[C@]1(O)[C@H](C)[C@@H]2[C@]3(C)[C@H]([C@H]4[C@@H]([C@]5(C)C(=CC4)C[C@@H](O[C@@H]4[C@H](O[C@H]6[C@@H](O)[C@@H](O)[C@@H](O)[C@H](C)O6)[C@@H](O)[C@H](O[C@H]6[C@@H](O)[C@@H](O)[C@@H](O)[C@H](C)O6)[C@H](CO)O4)CC5)CC3)C[C@@H]2O1)C)[C@H]1[C@H](O)[C@H](O)[C@H](O)[C@@H](CO)O1 LVTJOONKWUXEFR-FZRMHRINSA-N 0.000 claims description 7
- SIKJAQJRHWYJAI-UHFFFAOYSA-N Indole Chemical compound C1=CC=C2NC=CC2=C1 SIKJAQJRHWYJAI-UHFFFAOYSA-N 0.000 claims description 6
- ZMANZCXQSJIPKH-UHFFFAOYSA-N Triethylamine Chemical compound CCN(CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-N 0.000 claims description 6
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 claims description 6
- 239000003513 alkali Substances 0.000 claims description 6
- JNGZXGGOCLZBFB-IVCQMTBJSA-N compound E Chemical compound N([C@@H](C)C(=O)N[C@@H]1C(N(C)C2=CC=CC=C2C(C=2C=CC=CC=2)=N1)=O)C(=O)CC1=CC(F)=CC(F)=C1 JNGZXGGOCLZBFB-IVCQMTBJSA-N 0.000 claims description 6
- JXDYKVIHCLTXOP-UHFFFAOYSA-N isatin Chemical compound C1=CC=C2C(=O)C(=O)NC2=C1 JXDYKVIHCLTXOP-UHFFFAOYSA-N 0.000 claims description 6
- GDMRVYIFGPMUCG-JTQLQIEISA-N (2s)-2-azaniumyl-3-(6-methyl-1h-indol-3-yl)propanoate Chemical compound CC1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 GDMRVYIFGPMUCG-JTQLQIEISA-N 0.000 claims description 5
- VHYFNPMBLIVWCW-UHFFFAOYSA-N 4-Dimethylaminopyridine Chemical compound CN(C)C1=CC=NC=C1 VHYFNPMBLIVWCW-UHFFFAOYSA-N 0.000 claims description 4
- ONYNOPPOVKYGRS-UHFFFAOYSA-N 6-methyl-1h-indole Chemical compound CC1=CC=C2C=CNC2=C1 ONYNOPPOVKYGRS-UHFFFAOYSA-N 0.000 claims description 4
- FSOPPXYMWZOKRM-UHFFFAOYSA-N 7-methoxy-1h-indole Chemical compound COC1=CC=CC2=C1NC=C2 FSOPPXYMWZOKRM-UHFFFAOYSA-N 0.000 claims description 4
- KGWPHCDTOLQQEP-UHFFFAOYSA-N 7-methylindole Chemical compound CC1=CC=CC2=C1NC=C2 KGWPHCDTOLQQEP-UHFFFAOYSA-N 0.000 claims description 4
- WTDHULULXKLSOZ-UHFFFAOYSA-N Hydroxylamine hydrochloride Chemical compound Cl.ON WTDHULULXKLSOZ-UHFFFAOYSA-N 0.000 claims description 4
- AFVFQIVMOAPDHO-UHFFFAOYSA-N Methanesulfonic acid Chemical compound CS(O)(=O)=O AFVFQIVMOAPDHO-UHFFFAOYSA-N 0.000 claims description 4
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 claims description 4
- 239000012043 crude product Substances 0.000 claims description 4
- RKJUIXBNRJVNHR-UHFFFAOYSA-N indolenine Natural products C1=CC=C2CC=NC2=C1 RKJUIXBNRJVNHR-UHFFFAOYSA-N 0.000 claims description 4
- 238000004949 mass spectrometry Methods 0.000 claims description 4
- YJVFFLUZDVXJQI-UHFFFAOYSA-L palladium(ii) acetate Chemical compound [Pd+2].CC([O-])=O.CC([O-])=O YJVFFLUZDVXJQI-UHFFFAOYSA-L 0.000 claims description 4
- 230000008569 process Effects 0.000 claims description 4
- 239000000758 substrate Substances 0.000 claims description 4
- DYHSDKLCOJIUFX-UHFFFAOYSA-N tert-butoxycarbonyl anhydride Chemical compound CC(C)(C)OC(=O)OC(=O)OC(C)(C)C DYHSDKLCOJIUFX-UHFFFAOYSA-N 0.000 claims description 4
- ZCYVEMRRCGMTRW-UHFFFAOYSA-N 7553-56-2 Chemical compound [I] ZCYVEMRRCGMTRW-UHFFFAOYSA-N 0.000 claims description 3
- 241000894006 Bacteria Species 0.000 claims description 3
- 102000003960 Ligases Human genes 0.000 claims description 3
- 108090000364 Ligases Proteins 0.000 claims description 3
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Triethanolamine Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 claims description 3
- 229910052740 iodine Inorganic materials 0.000 claims description 3
- 239000011630 iodine Substances 0.000 claims description 3
- VNFWTIYUKDMAOP-UHFFFAOYSA-N sphos Chemical compound COC1=CC=CC(OC)=C1C1=CC=CC=C1P(C1CCCCC1)C1CCCCC1 VNFWTIYUKDMAOP-UHFFFAOYSA-N 0.000 claims description 3
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 claims description 2
- 229960000549 4-dimethylaminophenol Drugs 0.000 claims description 2
- QJRWYBIKLXNYLF-UHFFFAOYSA-N 6-methoxy-1h-indole Chemical compound COC1=CC=C2C=CNC2=C1 QJRWYBIKLXNYLF-UHFFFAOYSA-N 0.000 claims description 2
- 108020004705 Codon Proteins 0.000 claims description 2
- 229940126062 Compound A Drugs 0.000 claims description 2
- NLDMNSXOCDLTTB-UHFFFAOYSA-N Heterophylliin A Natural products O1C2COC(=O)C3=CC(O)=C(O)C(O)=C3C3=C(O)C(O)=C(O)C=C3C(=O)OC2C(OC(=O)C=2C=C(O)C(O)=C(O)C=2)C(O)C1OC(=O)C1=CC(O)=C(O)C(O)=C1 NLDMNSXOCDLTTB-UHFFFAOYSA-N 0.000 claims description 2
- LWLSVNFEVKJDBZ-UHFFFAOYSA-N N-[4-(trifluoromethoxy)phenyl]-4-[[3-[5-(trifluoromethyl)pyridin-2-yl]oxyphenyl]methyl]piperidine-1-carboxamide Chemical compound FC(OC1=CC=C(C=C1)NC(=O)N1CCC(CC1)CC1=CC(=CC=C1)OC1=NC=C(C=C1)C(F)(F)F)(F)F LWLSVNFEVKJDBZ-UHFFFAOYSA-N 0.000 claims description 2
- 241000700605 Viruses Species 0.000 claims description 2
- 229910052799 carbon Inorganic materials 0.000 claims description 2
- 125000004432 carbon atom Chemical group C* 0.000 claims description 2
- 239000007795 chemical reaction product Substances 0.000 claims description 2
- RNFNDJAIBTYOQL-UHFFFAOYSA-N chloral hydrate Chemical compound OC(O)C(Cl)(Cl)Cl RNFNDJAIBTYOQL-UHFFFAOYSA-N 0.000 claims description 2
- 229960002327 chloral hydrate Drugs 0.000 claims description 2
- 150000002475 indoles Chemical class 0.000 claims description 2
- 125000001041 indolyl group Chemical group 0.000 claims description 2
- 239000003446 ligand Substances 0.000 claims description 2
- 239000012280 lithium aluminium hydride Substances 0.000 claims description 2
- 229940098779 methanesulfonic acid Drugs 0.000 claims description 2
- 231100000219 mutagenic Toxicity 0.000 claims description 2
- 230000003505 mutagenic effect Effects 0.000 claims description 2
- 125000004430 oxygen atom Chemical group O* 0.000 claims description 2
- 238000001742 protein purification Methods 0.000 claims description 2
- 239000002994 raw material Substances 0.000 claims description 2
- 230000009467 reduction Effects 0.000 claims description 2
- 229920006395 saturated elastomer Polymers 0.000 claims description 2
- 238000001308 synthesis method Methods 0.000 claims description 2
- 230000004048 modification Effects 0.000 abstract description 62
- 238000012986 modification Methods 0.000 abstract description 62
- 230000008827 biological function Effects 0.000 abstract description 3
- 125000000430 tryptophan group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C2=C([H])C([H])=C([H])C([H])=C12 0.000 abstract description 2
- 235000018102 proteins Nutrition 0.000 description 146
- 210000004027 cell Anatomy 0.000 description 66
- 108010051779 histone H3 trimethyl Lys4 Proteins 0.000 description 65
- 108020004414 DNA Proteins 0.000 description 44
- 239000013612 plasmid Substances 0.000 description 44
- 235000001014 amino acid Nutrition 0.000 description 27
- 241000282414 Homo sapiens Species 0.000 description 19
- 238000003384 imaging method Methods 0.000 description 19
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 18
- 102100037247 Prolyl hydroxylase EGLN3 Human genes 0.000 description 17
- 101710170720 Prolyl hydroxylase EGLN3 Proteins 0.000 description 17
- 239000000243 solution Substances 0.000 description 16
- 239000013598 vector Substances 0.000 description 15
- 108020001580 protein domains Proteins 0.000 description 14
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 12
- 239000000203 mixture Substances 0.000 description 12
- 108090000765 processed proteins & peptides Proteins 0.000 description 12
- 239000006228 supernatant Substances 0.000 description 12
- 238000001514 detection method Methods 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 11
- 101001088892 Homo sapiens Lysine-specific demethylase 5A Proteins 0.000 description 10
- 102100033246 Lysine-specific demethylase 5A Human genes 0.000 description 10
- 238000010276 construction Methods 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 10
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 9
- 229920001184 polypeptide Polymers 0.000 description 9
- 102000004196 processed proteins & peptides Human genes 0.000 description 9
- 239000000523 sample Substances 0.000 description 9
- 239000002033 PVDF binder Substances 0.000 description 8
- 108010026552 Proteome Proteins 0.000 description 8
- 239000000872 buffer Substances 0.000 description 8
- 238000013461 design Methods 0.000 description 8
- 239000012528 membrane Substances 0.000 description 8
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 8
- 238000002372 labelling Methods 0.000 description 7
- 239000002609 medium Substances 0.000 description 7
- 238000003756 stirring Methods 0.000 description 7
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 6
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 6
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 239000005090 green fluorescent protein Substances 0.000 description 6
- 238000011534 incubation Methods 0.000 description 6
- 239000007788 liquid Substances 0.000 description 6
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 239000011780 sodium chloride Substances 0.000 description 6
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 5
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 5
- SRBFZHDQGSBBOR-HWQSCIPKSA-N L-arabinopyranose Chemical compound O[C@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-HWQSCIPKSA-N 0.000 description 5
- 102000009353 PWWP domains Human genes 0.000 description 5
- 108050000223 PWWP domains Proteins 0.000 description 5
- 229960000723 ampicillin Drugs 0.000 description 5
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 5
- 239000011324 bead Substances 0.000 description 5
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 238000011033 desalting Methods 0.000 description 5
- 102000017589 Chromo domains Human genes 0.000 description 4
- 108050005811 Chromo domains Proteins 0.000 description 4
- 102100024810 DNA (cytosine-5)-methyltransferase 3B Human genes 0.000 description 4
- 101710123222 DNA (cytosine-5)-methyltransferase 3B Proteins 0.000 description 4
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 4
- 244000068988 Glycine max Species 0.000 description 4
- 235000010469 Glycine max Nutrition 0.000 description 4
- 102000006947 Histones Human genes 0.000 description 4
- 101000777789 Homo sapiens Testis-specific chromodomain protein Y 1 Proteins 0.000 description 4
- 108010090804 Streptavidin Proteins 0.000 description 4
- 239000006180 TBST buffer Substances 0.000 description 4
- 102100031664 Testis-specific chromodomain protein Y 1 Human genes 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000001962 electrophoresis Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 239000012139 lysis buffer Substances 0.000 description 4
- 210000004962 mammalian cell Anatomy 0.000 description 4
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 238000009482 thermal adhesion granulation Methods 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 102100027359 Bromo adjacent homology domain-containing 1 protein Human genes 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- 101000937839 Homo sapiens Bromo adjacent homology domain-containing 1 protein Proteins 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 108020005038 Terminator Codon Proteins 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 239000012472 biological sample Substances 0.000 description 3
- 229960002685 biotin Drugs 0.000 description 3
- 235000020958 biotin Nutrition 0.000 description 3
- 239000011616 biotin Substances 0.000 description 3
- 230000000903 blocking effect Effects 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 238000010828 elution Methods 0.000 description 3
- 238000000684 flow cytometry Methods 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 238000010166 immunofluorescence Methods 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 238000001819 mass spectrum Methods 0.000 description 3
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 3
- 239000012044 organic layer Substances 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 239000012071 phase Substances 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 238000007789 sealing Methods 0.000 description 3
- HPALAKNZSZLMCH-UHFFFAOYSA-M sodium;chloride;hydrate Chemical class O.[Na+].[Cl-] HPALAKNZSZLMCH-UHFFFAOYSA-M 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- 239000011701 zinc Substances 0.000 description 3
- 229910052725 zinc Inorganic materials 0.000 description 3
- QFVPXIGESRCWHK-JTQLQIEISA-N (2S)-2-amino-3-(6-cyano-1H-indol-3-yl)propanoic acid Chemical compound C(#N)C=1C=C2NC=C(C[C@H](N)C(=O)O)C2=CC=1 QFVPXIGESRCWHK-JTQLQIEISA-N 0.000 description 2
- FJGPUTWOJHMELC-JTQLQIEISA-N (2S)-2-amino-3-(7-cyano-1H-indol-3-yl)propanoic acid Chemical compound C(#N)C1=C2NC=C(C[C@H](N)C(=O)O)C2=CC=C1 FJGPUTWOJHMELC-JTQLQIEISA-N 0.000 description 2
- YYROPELSRYBVMQ-UHFFFAOYSA-N 4-toluenesulfonyl chloride Chemical compound CC1=CC=C(S(Cl)(=O)=O)C=C1 YYROPELSRYBVMQ-UHFFFAOYSA-N 0.000 description 2
- FICLVQOYKYBXFN-VIFPVBQESA-N 6-chloro-L-tryptophan Chemical compound ClC1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 FICLVQOYKYBXFN-VIFPVBQESA-N 0.000 description 2
- DMQFGLHRDFQKNR-VIFPVBQESA-N 7-chloro-L-tryptophan Chemical compound C1=CC=C2C(C[C@H]([NH3+])C([O-])=O)=CNC2=C1Cl DMQFGLHRDFQKNR-VIFPVBQESA-N 0.000 description 2
- 108010039627 Aprotinin Proteins 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- 238000011537 Coomassie blue staining Methods 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 2
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- MHAJPDPJQMAIIY-UHFFFAOYSA-N Hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 2
- 238000012404 In vitro experiment Methods 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- 239000006142 Luria-Bertani Agar Substances 0.000 description 2
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 2
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 2
- 108010021466 Mutant Proteins Proteins 0.000 description 2
- 102000008300 Mutant Proteins Human genes 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- 239000013504 Triton X-100 Substances 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 229960004405 aprotinin Drugs 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 108091006004 biotinylated proteins Proteins 0.000 description 2
- 150000001768 cations Chemical class 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 210000003855 cell nucleus Anatomy 0.000 description 2
- IJOOHPMOJXWVHK-UHFFFAOYSA-N chlorotrimethylsilane Chemical compound C[Si](C)(C)Cl IJOOHPMOJXWVHK-UHFFFAOYSA-N 0.000 description 2
- 238000004440 column chromatography Methods 0.000 description 2
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 2
- 230000005284 excitation Effects 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 238000010569 immunofluorescence imaging Methods 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- ZPNFWUPYTFPOJU-LPYSRVMUSA-N iniprol Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@H]2CSSC[C@H]3C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC=4C=CC=CC=4)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=4C=CC=CC=4)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC2=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]2N(CCC2)C(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2[C@@H](CCC2)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N2[C@@H](CCC2)C(=O)N3)C(=O)NCC(=O)NCC(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](C(=O)N1)C(C)C)[C@@H](C)O)[C@@H](C)CC)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 ZPNFWUPYTFPOJU-LPYSRVMUSA-N 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 230000010070 molecular adhesion Effects 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 2
- 239000003208 petroleum Substances 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 229950010131 puromycin Drugs 0.000 description 2
- 238000001338 self-assembly Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 238000001089 thermophoresis Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 1
- PAAZPARNPHGIKF-UHFFFAOYSA-N 1,2-dibromoethane Chemical compound BrCCBr PAAZPARNPHGIKF-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- FQVDXLYJTMHMCG-UHFFFAOYSA-N 3-iodo-1h-indole Chemical compound C1=CC=C2C(I)=CNC2=C1 FQVDXLYJTMHMCG-UHFFFAOYSA-N 0.000 description 1
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- VZWXNOBHWODXCW-ZOBUZTSGSA-N 5-[(3as,4s,6ar)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]-n-[2-(4-hydroxyphenyl)ethyl]pentanamide Chemical compound C1=CC(O)=CC=C1CCNC(=O)CCCC[C@H]1[C@H]2NC(=O)N[C@H]2CS1 VZWXNOBHWODXCW-ZOBUZTSGSA-N 0.000 description 1
- 102000010195 ADD domains Human genes 0.000 description 1
- 108050001756 ADD domains Proteins 0.000 description 1
- 101150052280 APEX2 gene Proteins 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- SHKGHIFSEAGTNL-DLOVCJGASA-N Ala-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 SHKGHIFSEAGTNL-DLOVCJGASA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- ZCSHHTFOZULVLN-SZMVWBNQSA-N Arg-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 ZCSHHTFOZULVLN-SZMVWBNQSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 108060006004 Ascorbate peroxidase Proteins 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- BKZFBJYIVSBXCO-KKUMJFAQSA-N Asn-Phe-His Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O BKZFBJYIVSBXCO-KKUMJFAQSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- KCOPOPKJRHVGPE-AQZXSJQPSA-N Asp-Thr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O KCOPOPKJRHVGPE-AQZXSJQPSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- 102100026044 Biotinidase Human genes 0.000 description 1
- 108010039206 Biotinidase Proteins 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- LRZPRGJXAZFXCR-DCAQKATOSA-N Cys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N LRZPRGJXAZFXCR-DCAQKATOSA-N 0.000 description 1
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 1
- OTXLNICGSXPGQF-KBIXCLLPSA-N Cys-Ile-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTXLNICGSXPGQF-KBIXCLLPSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- 108010024491 DNA Methyltransferase 3A Proteins 0.000 description 1
- 230000005971 DNA damage repair Effects 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241001302160 Escherichia coli str. K-12 substr. DH10B Species 0.000 description 1
- PXGOKWXKJXAPGV-UHFFFAOYSA-N Fluorine Chemical compound FF PXGOKWXKJXAPGV-UHFFFAOYSA-N 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- SAEBUDRWKUXLOM-ACZMJKKPSA-N Glu-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O SAEBUDRWKUXLOM-ACZMJKKPSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- PDLGMYVCPJOYAR-DKIMLUQUSA-N Glu-Leu-Phe-Ala Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 PDLGMYVCPJOYAR-DKIMLUQUSA-N 0.000 description 1
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- FPNWKONEZAVQJF-GUBZILKMSA-N His-Asn-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FPNWKONEZAVQJF-GUBZILKMSA-N 0.000 description 1
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 1
- IMPKSPYRPUXYAP-SZMVWBNQSA-N His-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CN=CN3)N IMPKSPYRPUXYAP-SZMVWBNQSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- GUXQAPACZVVOKX-AVGNSLFASA-N His-Lys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GUXQAPACZVVOKX-AVGNSLFASA-N 0.000 description 1
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 1
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 1
- 108700039609 IRW peptide Proteins 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- ZXJFURYTPZMUNY-VKOGCVSHSA-N Ile-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 ZXJFURYTPZMUNY-VKOGCVSHSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 239000012880 LB liquid culture medium Substances 0.000 description 1
- 102100030657 Lethal(3)malignant brain tumor-like protein 1 Human genes 0.000 description 1
- 101710173086 Lethal(3)malignant brain tumor-like protein 1 Proteins 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- FZMNAYBEFGZEIF-AVGNSLFASA-N Leu-Met-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N FZMNAYBEFGZEIF-AVGNSLFASA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- WGAZVKFCPHXZLO-SZMVWBNQSA-N Leu-Trp-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N WGAZVKFCPHXZLO-SZMVWBNQSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- ZTPWXNOOKAXPPE-DCAQKATOSA-N Lys-Arg-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N ZTPWXNOOKAXPPE-DCAQKATOSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- 230000027311 M phase Effects 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- BXNZDLVLGYYFIB-FXQIFTODSA-N Met-Asn-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BXNZDLVLGYYFIB-FXQIFTODSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- STTRPDDKDVKIDF-KKUMJFAQSA-N Met-Glu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 STTRPDDKDVKIDF-KKUMJFAQSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 240000007019 Oxalis corniculata Species 0.000 description 1
- 108010019160 Pancreatin Proteins 0.000 description 1
- 229930040373 Paraformaldehyde Natural products 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- MQWISMJKHOUEMW-ULQDDVLXSA-N Phe-Arg-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 MQWISMJKHOUEMW-ULQDDVLXSA-N 0.000 description 1
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- OAAWNUBFRMVIQS-IHPCNDPISA-N Phe-Trp-Cys Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CS)CC1=CNC2=CC=CC=C12)CC1=CC=CC=C1 OAAWNUBFRMVIQS-IHPCNDPISA-N 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 1
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- TYIHBQYLIPJSIV-NYVOZVTQSA-N Ser-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CO)N TYIHBQYLIPJSIV-NYVOZVTQSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- KLCCPYZXGXHAGS-QTKMDUPCSA-N Thr-His-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N)O KLCCPYZXGXHAGS-QTKMDUPCSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- ACGIVBXINJFALS-HKUYNNGSSA-N Trp-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N ACGIVBXINJFALS-HKUYNNGSSA-N 0.000 description 1
- PALLCTDPFINNMM-JQHSSLGASA-N Trp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N PALLCTDPFINNMM-JQHSSLGASA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- KLGFILUOTCBNLJ-IHRRRGAJSA-N Tyr-Cys-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O KLGFILUOTCBNLJ-IHRRRGAJSA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- 102000056014 X-linked Nuclear Human genes 0.000 description 1
- 108700042462 X-linked Nuclear Proteins 0.000 description 1
- YVNQAIFQFWTPLQ-UHFFFAOYSA-O [4-[[4-(4-ethoxyanilino)phenyl]-[4-[ethyl-[(3-sulfophenyl)methyl]amino]-2-methylphenyl]methylidene]-3-methylcyclohexa-2,5-dien-1-ylidene]-ethyl-[(3-sulfophenyl)methyl]azanium Chemical compound C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C(=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S(O)(=O)=O)C)C=2C(=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S(O)(=O)=O)C)C=C1 YVNQAIFQFWTPLQ-UHFFFAOYSA-O 0.000 description 1
- 210000001015 abdomen Anatomy 0.000 description 1
- XBJFCYDKBDVADW-UHFFFAOYSA-N acetonitrile;formic acid Chemical compound CC#N.OC=O XBJFCYDKBDVADW-UHFFFAOYSA-N 0.000 description 1
- 230000001464 adherent effect Effects 0.000 description 1
- 229960003767 alanine Drugs 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 230000029936 alkylation Effects 0.000 description 1
- 238000005804 alkylation reaction Methods 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000006287 biotinylation Effects 0.000 description 1
- 238000007413 biotinylation Methods 0.000 description 1
- 239000012267 brine Substances 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 230000000723 chemosensory effect Effects 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 230000008045 co-localization Effects 0.000 description 1
- 239000012230 colorless oil Substances 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000006059 cover glass Substances 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000018044 dehydration Effects 0.000 description 1
- 238000006297 dehydration reaction Methods 0.000 description 1
- 229960003964 deoxycholic acid Drugs 0.000 description 1
- KXGVEGMKQFWNSR-LLQZFEROSA-N deoxycholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 KXGVEGMKQFWNSR-LLQZFEROSA-N 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000000132 electrospray ionisation Methods 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000000834 fixative Substances 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- 238000012632 fluorescent imaging Methods 0.000 description 1
- 229910052731 fluorine Inorganic materials 0.000 description 1
- 239000011737 fluorine Substances 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000005457 ice water Substances 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000009878 intermolecular interaction Effects 0.000 description 1
- 238000001948 isotopic labelling Methods 0.000 description 1
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- UGZBFCCHLUWCQI-LURJTMIESA-N methyl (2r)-3-iodo-2-[(2-methylpropan-2-yl)oxycarbonylamino]propanoate Chemical compound COC(=O)[C@H](CI)NC(=O)OC(C)(C)C UGZBFCCHLUWCQI-LURJTMIESA-N 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 230000011278 mitosis Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 238000006386 neutralization reaction Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- KDLHZDBZIXYQEI-UHFFFAOYSA-N palladium Substances [Pd] KDLHZDBZIXYQEI-UHFFFAOYSA-N 0.000 description 1
- 229940055695 pancreatin Drugs 0.000 description 1
- 229920002866 paraformaldehyde Polymers 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000008823 permeabilization Effects 0.000 description 1
- 101150109515 phd gene Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 229930002371 pyridine alkaloid Natural products 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000006722 reduction reaction Methods 0.000 description 1
- 238000010992 reflux Methods 0.000 description 1
- 230000022983 regulation of cell cycle Effects 0.000 description 1
- 230000014493 regulation of gene expression Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 108010029895 rubimetide Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 235000020183 skimmed milk Nutrition 0.000 description 1
- AKHNMLFCWUSKQB-UHFFFAOYSA-L sodium thiosulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=S AKHNMLFCWUSKQB-UHFFFAOYSA-L 0.000 description 1
- 235000019345 sodium thiosulphate Nutrition 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 238000004885 tandem mass spectrometry Methods 0.000 description 1
- 238000009210 therapy by ultrasound Methods 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 238000003146 transient transfection Methods 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y601/00—Ligases forming carbon-oxygen bonds (6.1)
- C12Y601/01—Ligases forming aminoacyl-tRNA and related compounds (6.1.1)
- C12Y601/0102—Phenylalanine-tRNA ligase (6.1.1.20)
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Peptides Or Proteins (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The cation-pi interaction is an important non-covalent interaction between molecules, playing an important role in the biological and chemical fields, and although there is great success in understanding the origin and biological functions of cation-pi, research for designing and synthesizing stronger cation-pi interactions is scarce. The invention provides a method for improving cation-pi interaction by using genetic code expansion and application thereof, taking decoded protein subjected to histone methylation modification as an example, tryptophan analogues substituted by strong electron-donating side chain groups are introduced into tryptophan sites of an aromatic cage of the decoded protein by using a genetic code expansion technology, the affinity of the decoded protein and the histone methylation modification is improved, and a super-parent molecular recognition system for recognizing histone methylation modification is established.
Description
Technical Field
The invention relates to a method for improving cation-pi interaction, in particular to a method for improving cation-pi interaction by genetic code expansion and application, and belongs to the technical field of biology.
Background
Non-covalent interactions regulate the structure and function of biomolecules, playing a key role in molecular folding and molecular recognition. Non-covalent interactions include cation-pi interactions, hydrogen bonding interactions, ionic interactions, and hydrophobic interactions, where cation-pi is a strong non-covalent interaction that occurs between cations and the pi electron cloud, playing an important role in biomolecule self-assembly, molecular recognition, molecular adhesion, and molecular folding, and a series of recent work on the origin and rationale of cation-pi interactions suggest that cation-pi interactions play a critical role during substrate-receptor binding and recognition of histone post-translational modifications. It has been reported that substitution of aromatic amino acids in aromatic cages by fluorine-substituted tryptophan analogs impairs cation-pi interactions due to the electron withdrawing ability of fluorine. In addition, aromatic amino acids in the aromatic cages of the mutant decoder proteins significantly reduce or disrupt the interaction. Despite great success in understanding the origin and biological function of cation-pi interactions, research to design and synthesize stronger cation-pi interactions is essentially blanked.
Taking histone methylation decoding protein as an example, histone methylation refers to methylation modification which is mediated by methyltransferase and occurs on arginine or lysine residues at the N-terminal of H3 and H4 histones, histone methylation modification is recognized by decoding protein to participate in important life processes such as regulation of gene expression, DNA replication, DNA damage repair and regulation of cell cycle, the study of distribution and abundance of histone methylation is the basis for understanding the molecular mechanism of histone code and chromatin regulation, and the current histone methylation-based antibody is a technical method for mainly detecting histone methylation genome distribution and site specificity. Unfortunately, antibodies have the disadvantages of sequence-dependent affinity, low substrate resolution, non-specific recognition, and suitability only for in vitro experiments, which limits their use and accurate resolution of histone methylation functions. Therefore, a new method for detecting histone methylation modification with high affinity is urgently needed to be developed.
Studies have shown that a histone methylation-modified decoding protein forms a hydrophobic pocket from 2 to 4 aromatic amino acids to specifically recognize histone methylation modification through cation-pi interaction, and in view of the property that the decoding protein can specifically recognize histone methylation modification, a method for detecting histone methylation modification based on a decoding protein domain is widely focused as an alternative to specific antibodies, and the ADD domain of ATRX protein and the PWWP domain of DNMT3A protein are used to capture H3K9me3 and H3K36me3, respectively; the MBT2 domain of L3MBTL1 broadly recognizes methylated lysine or double methylated lysine modifications and thus was developed as a method to capture the methylated lysine proteome. The detection method based on the decoding protein domain has the advantages of easy modification, economy and capture of a plurality of PTMs, but the affinity of the decoding protein domain and histone methylation modification is in micromolar level, so that the wide application of the technology is limited. Therefore, it is highly desirable to design high affinity histone methylation decoding proteins to facilitate application of decoding protein domains in enrichment, imaging, sequencing and other aspects of histone methylation modification.
The genetic code expansion technology (GCE for short) specifically introduces unnatural amino acids with novel structures and unique properties on proteins, and expands tiles for synthesizing the proteins, thereby providing a powerful tool for precise protein control and identification and optimization of protein functions. The invention, entitled "construction of orthogonal aminoacyl-tRNA synthetase/tRNA System Using chimeric design" patent ZL 201910440254.8 discloses the use of protein chimeric design to transplant the characteristics of Pyrrolysinyl tRNA synthetase (PylRS)/tRNACUA orthogonal pair to universal orthogonality to human Source mitochondrial Phenylaminoacyl tRNA synthetase (PheRS)/tRNA pair to construct chimeric Phenylaminoacyl tRNA synthetase (chPheRS)/tRNA system with universal orthogonality, thereby broadening the recognition of the types of unnatural amino acids and providing new tools for genetic code expansion technology, where the chimeric Phenylaminoacyl tRNA synthetase of the system can specifically recognize tryptophan analogs, such as: 6-methyl-tryptophan, 7-methyl-tryptophan, 6-chloro-tryptophan, 7-chloro-tryptophan, 6-cyano-tryptophan, and 7-cyano-tryptophan.
Disclosure of Invention
The invention aims to provide a method for improving cation-pi interaction by genetic code expansion, which utilizes a genetic code expansion technology to replace tryptophan forming cation-pi interaction in a biological molecule with a tryptophan analogue so as to improve the binding energy of the cation-pi interaction.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a method for improving cation-pi interaction by genetic code expansion features that the tryptophan of aromatic cage for forming cation-pi interaction in biologic molecule is substituted by the tryptophan analog to improve the binding energy of cation-pi interaction.
Non-covalent interactions regulate the structure and function of biomolecules playing a key role in molecular folding and molecular recognition, where cation-pi is a strong non-covalent interaction occurring between cations and pi electron clouds playing an important role in biomolecule self-assembly, molecular recognition, molecular adhesion and molecular folding, but the invention of how to improve cation-pi interactions is in the blank state. The invention provides a synthesis method of tryptophan compounds with electron donating groups, and develops a method for remarkably improving the cation-pi interaction binding energy of a verified and synthesized compound A1-A6 by replacing tryptophan of an aromatic cage with a genetic code expansion technology. The method mainly depends on specific antibodies, but the antibodies have the defects of sequence-dependent affinity, low substrate resolution, non-specific recognition, suitability for in vitro experiments and the like, and the invention takes the decoded protein of the histone methylation modification as an example, introduces tryptophan analogues strongly substituted by electron-donating side chain groups into tryptophan sites of an aromatic cage of the decoded protein by using a genetic code expansion technology, improves the affinity of the decoded protein and the histone methylation modification by 4-8 times, does not influence the functions of the protein, and shows the potential value of the histone methylation modification. The method is utilized to establish a super-parent molecular recognition system for recognizing histone methylation modification, and the system is applied to the aspects of detection, imaging, sequencing and the like of histone methylation modification.
In the invention, a series of tryptophan analogues are introduced into an aromatic cage of the decoding protein site-specifically to regulate the binding affinity of the decoding protein and histone methylation modification, the affinity of histone methylation and the decoding protein thereof is improved by 4-8 times by utilizing the strategy, the affinity of H3K4me3 and a PHD decoding protein structural domain reaches nanomolar level by series repeated design, and the strategy is utilized to develop a super-parent molecule recognition system for detecting histone methylation modification.
The method of the invention can be applied to the study of any biomacromolecule forming cation-pi interaction.
Preferably, the method comprises the steps of:
s1, designing and synthesizing tryptophan analogues with strong electron supply side chain substitution, wherein the tryptophan analogues are non-natural amino acids and are selected from one of 6-methyl-tryptophan (A1), 6-methoxy-tryptophan (A2), 7-methyl-tryptophan (A3), 7-methoxy-tryptophan (A4), 6, 7-methoxy-tryptophan (A5), 6, 7-methyl-tryptophan (A6), 7, 8-dihydrofuran-tryptophan (A7), 6, 7-dihydrofuran-tryptophan (A8), 7, 8-furan-tryptophan (A9), 6, 7-furan-tryptophan (A10), 6, 7-dioxole-tryptophan (A11) or 6, 7-cyclopentane-tryptophan (A12), the structural formulas of the tryptophan analogs A1 to A12 are as follows:
s2, screening a chimeric phenylalanine aminoacyl-tRNA synthetase mutant specifically recognizing tryptophan analogs A1 to A12;
s3, taking the biological molecule forming cation-pi interaction as the research object, utilizing the genetic code expansion technology to specifically introduce tryptophan analogues into the biological molecule through the chimera phenylalanine aminoacyl-tRNA synthetase mutant, and obtaining the protein with the tryptophan analogues.
Preferably, indole B substituted at different positions is used as a reactant to react to obtain a target product, wherein the chemical structural formula of the indole substituted at different positions is as follows:the general structural formula of the target product is as follows:wherein X is selected from: oxygen atom or carbon atom.
Preferably, the method for synthesizing the tryptophan analogs A1 to A12 comprises the following steps:
the method comprises the following steps: synthesis of starting material compound B:
starting material B is selected from one of 6-methyl-indole (B1), 6-methoxy-indole (B2), 7-methyl-indole (B3), 7-methoxy-indole (B4), 6, 7-methoxy-indole (B5), 6, 7-methyl-indole (B6), 7, 8-dihydrofuran-indole (B7), 6, 7-dihydrofuran-indole (B8), 7, 8-furan-indole (B9), 6, 7-furan-indole (B10), 6, 7-dioxole-indole (B11) or 6, 7-cyclopentane-indole (B12), the structural formulae of the above indole analogs B1 to B12 are:
(1) synthesis of compounds B6, B7, B8, B9, B10: aniline (G6, G7, G8, G9 or G10) and triethanolamine as reactants, and RuCl as a reaction product 3 ·nH 2 O,SnCl 2 ·2H 2 O and PPh 3 As a catalyst, reacting in anhydrous dioxane to obtain a starting material compound B; (2) synthesis of compound B11, B12: using aniline (G11 or G12), chloral hydrate and hydroxylamine hydrochloride as reactants, sulfuric acid as a catalyst, water as a solvent to obtain a crude product, then reacting the crude product with methanesulfonic acid to obtain an isatin product, and finally reducing the isatin product by lithium aluminum hydride to obtain a starting material compound B;
step two: synthesis of Compound C: reacting an initial raw material compound B and iodine as reactants, potassium hydroxide as alkali and anhydrous N, N-dimethylformamide as a solvent to obtain an intermediate compound C;
step three: synthesis of Compound D: reacting the compound C and di-tert-butyl dicarbonate serving as reactants in anhydrous dichloromethane by using triethylamine as alkali and DMAP (dimethyl formamide) as a catalyst to obtain an intermediate compound D;
step four: synthesis of Compound E: reacting the compound D and Boc-3-iodine-L-alanine methyl ester serving as reactants in an anhydrous N, N-dimethylformamide solvent under the protection of nitrogen by using palladium acetate as a catalyst and S-Phos as a ligand to obtain an intermediate compound E;
step five: synthesis of Compound F: under the condition that methanol and water are used as solvents, potassium hydroxide is used as alkali, and an intermediate compound F is obtained through reaction;
step six: synthesis of Compound A: and reacting the compound F under the condition of taking anhydrous dichloromethane as a solvent and taking trifluoroacetic acid as a catalyst to obtain a target product (tryptophan analogues A1-A12).
Preferably, in the fourth step, the catalyst palladium acetate is used in an amount of 2% by mole of the substrate (compound D);
the reaction time of the first step is 1-2h, the reaction time of the second step is 2h, the reaction time of the third step is 8h, the reaction time of the fourth step is 5h, the reaction time of the fifth step is 2-3h, and the reaction time of the sixth step is 2 h;
the reaction temperature in the first step is 90 ℃, the reaction temperature in the second step is 0 ℃, the reaction temperature in the third step is 0 ℃, the reaction temperature in the fourth step is 40 ℃, the reaction temperature in the fifth step is 25 ℃, and the reaction temperature in the sixth step is 0 ℃.
Preferably, in S2, (1) screening a mutant of chimeric phenylalanyl-tRNA synthetase that specifically recognizes a tryptophan analog by constructing a saturated mutagenic gene library for amino acids in the amino acid binding pocket of the chimeric phenylalanyl-tRNA synthetase; (2) identifying the recognition efficiency and specificity of the phenylalanine aminoacyl-tRNA synthetase mutant by GFP fluorescence and LC-MS mass spectrometry; (3) the obtained chimera phenylalanine aminoacyl-tRNA mutant is screened and applied to the expression of bacteria, cells, viruses and other hosts.
In screening for chimeric phenylalanyl-tRNA synthetase mutants that specifically recognize 6-methoxy-tryptophan (a2), 7-methoxy-tryptophan (a4), 6, 7-methoxy-tryptophan (a5), 6, 7-methyl-tryptophan (a6), 7, 8-dihydrofuran-tryptophan (a7), 6, 7-dihydrofuran-tryptophan (A8), 7, 8-furan-tryptophan (a9), 6, 7-furan-tryptophan (a10), 6, 7-dioxol-tryptophan (a11), 6, 7-cyclopentane-tryptophan (a12), the amino acid binding pocket for the chimeric phenylalanyl-tRNA synthetase amino acid was determined by comparing the results of the results obtained using the results of the screening for chimeric phenylalanyl-tRNA synthetase mutants that specifically recognize 6-methoxy-tryptophan (a2), 7-methoxy-tryptophan (a4), 6, 7-dihydrofuran-tryptophan (A8), m490, T467 and a507) to construct saturation mutagenesis gene libraries (E391NNK, V393NNK, F464NNK, M490NNK, T467G and a507G), screening chimeric phenylalanyl-tRNA synthetase mutants specifically recognizing 6-methoxy-tryptophan (a2), 7-methoxy-tryptophan (a4), 6, 7-methoxy-tryptophan (A5), 6, 7-methyl-tryptophan (A6), 7, 8-dihydrofuran-tryptophan (a7), 6, 7-dihydrofuran-tryptophan (A8), 7, 8-furan-tryptophan (a9), 6, 7-furan-tryptophan (a10), 6, 7-dioxole-tryptophan (a11), 6, 7-cyclopentane-tryptophan (a12) by positive and negative selection strategies, finally obtaining a mutant comprising E391D, six mutant phenylalanine aminoacyl-tRNA synthetase mutants of V393G, M490V, F464V, T467G and a507G, wherein the nucleotide sequences and amino acid sequences of the phenylalanine aminoacyl-tRNA synthetase mutants are as shown in SEQ: ID 1-2.
Preferably, in S3, (1) the tryptophan-corresponding site of the decoded protein forming the aromatic cage is mutated to the termination codon (TAG), (2) the decoded protein mutant is cotransferred with the chimeric phenylalanyl-tRNA synthetase mutant and the corresponding tryptophan analog (typically 1mM) is added during expression, (3) the decoded protein variant is purified according to the GST-TAG protein purification method and the fidelity of the decoded protein variant is identified by LC-MS.
Preferably, the nucleotide sequence and amino acid sequence of the chimeric phenylalanine-tRNA synthetase mutant recognizing the tryptophan analogue a1-a6 are respectively shown in SEQ ID NO: 1-2.
Preferably, the method takes histone methylation decoding protein structural domain as a research object, and the decoding protein structural domain is any one of Chromo, PHD, PWWP, Tudor, MBT, CW, SPIN and BAH structural domain.
The application of the protein with the tryptophan analogues obtained by the method in establishing a decoded protein super-parent recognition system for specifically recognizing histone methylation modification. Taking H3K4me3 as an example, establishing a decoding protein KDM5A-PHD3 super-parent for identifying H3K4me3, and naming the super-parent as PHD; taking H3K9me3 as an example, establishing a decoding protein CDY1-Chromo super-parent for identifying H3K9me3, and naming the super-parent as Chromo super-parent; taking H3K27me3 as an example, establishing a decoding protein BAHD1-BAH super-parent for identifying H3K27me3, and naming the protein as BAH super-parent; taking H3K36me3 as an example, establishing a decoding protein DNMT3B-PWWP super-parent for identifying H3K36me3, and naming the super-parent as PWWP.
The method establishes the decoding protein super-parent specifically recognizing the histone methylation modification, the affinity of the decoding protein super-parent reaches nanomolar level, the titer is superior to that of a histone methylation modification specific antibody, and the decoding protein super-parent is used for detecting the histone methylation modification in a biological sample. The decoded protein super-parent specifically recognizing histone methylation modification is marked by a fluorescent group, and can be applied to detecting histone methylation modification in a biological sample by an imaging technology. The decoded protein super-parent recognition system specifically recognizing histone methylation modification can be applied to living body imaging and dynamically detecting the change of histone methylation modification. The histone methylation modified decoded protein super-parent recognition system can be used for histone methylation modification of enriched samples and applied to a single cell sequencing technology. The tryptophan analogue with strong electron supply side chain substitution can improve the affinity of the decoded protein and histone methylation modification by 4-8 times, and the decoded protein with repeated tandem improves the affinity of the decoded protein and histone methylation modification.
Taking KDM5A PHD3 as an example,
(1) the specific recognition of H3K4me3 by the aromatic cage formed by W18 and W28 is judged by the crystal structure (PDB: 2KGI), so that the W18 site and the W28 site of KDM5A PHD3 are mutated into stop codons, and the nucleotide sequences are respectively shown as SEQ ID NO: 3 to 4.
(2) Specifically introducing tryptophan analogs such as 6-cyano-tryptophan, 7-cyano-tryptophan, 6-chloro-tryptophan, 7-chloro-tryptophan, 6-methyl-tryptophan (A1), 6-methoxy-tryptophan (A2), 7-methyl-tryptophan (A3), 7-methoxy-tryptophan (A4), 6, 7-methoxy-tryptophan (A5), 6, 7-methyl-tryptophan (A6), 7, 8-dihydrofuran-tryptophan (A7), 6, 7-dihydrofuran-tryptophan (A8), 7, 8-furan-tryptophan (A9) into the W18 and W28 sites of the PHD3 decoding protein domain, respectively, using a chimeric phenylalanine translation system, Any one of 6, 7-furan-tryptophan (a10), 6, 7-dioxole-tryptophan (a11), 6, 7-cyclopentane-tryptophan (a12) to obtain PHD3 decoding protein domain variant protein.
(3) The micro calorimetric electrophoresis apparatus is used for measuring the affinity of the PHD3 decoding protein domain variant and H3K4me3, when the non-natural amino acid A2 is inserted into the W28 site of the PHD3 decoding protein domain, the affinity of the PHD3 and H3K4me3 can be improved by 8 times, and the decoding protein variant is PHD3-W28-A2 and is named as PHD. The amino acid sequence of the specific H3K4me3 polypeptide is detailed in Table 2.
(4) In some specific embodiments, the site-specific introduction of 6-methoxy-tryptophan (a2) into different histone methylation-modified decoding protein domains increases the affinity of the decoding protein domain for histone methylation modifications.
Taking H3K9me3 as an example, a Chromo domain of CDY1 is selected as a research object, and after 6-methoxy-tryptophan is inserted into a W28 site of the Chromo domain of CDY1, the affinity of the Chromo domains of H3K9me3 and CDY1 is improved by 2 times; taking H3K27me3 as an example, the BAH domain of BAHD1 is selected as a research object, and when 6-methoxy-tryptophan is inserted into the W667 site of the BAH domain of BAHD1, the affinity of H3K27me3 and the BAH domain is improved by 5 times; taking H3K36me3 as an example, the PWWP domain of DNMT3B is selected as a research object, and when 6-methoxy-tryptophan is inserted into the W263 site of the PWWP domain of DNMT3B, the affinity of the PWWP domains of H3K36me3 and DNMT3B is improved by 7 times. Wherein the nucleotide sequence and protein sequence of Chromo domain of CDY1 are as shown in SEQ ID NO: 5-6; wherein the nucleotide sequence and protein sequence of BAH domain of BAHD1 are shown in SEQ ID NO: 7-8; wherein the nucleotide sequence and protein sequence of the PWWP domain of DNMT3B are set forth in SEQ ID NO: 9-10.
The following illustrates the application of several aspects of the present invention.
The invention provides a method for establishing tandem repeat histone methylation decoding protein to improve the affinity of histone methylation modification and decoding protein.
Taking the PHD3 as an example,
(1) constructing a decoding protein variant with multiple repeats, and converting SEQ ID NO: 4 as a template, constructing duplex and triplet decoding protein mutants, wherein the duplex and triplet decoding proteins carry 2 and 3 amber Terminators (TAGs) respectively, and the nucleotide sequences of the specific duplex and triplet mutants are shown in SEQ ID NO: 11 to 12.
(2) The duplex and tripartite PHD protein variants, designated 2x PHD and 3x PHD, respectively, were obtained by site-specific introduction of 2 or 3 6-methoxy-tryptophan (a2) at the amber terminator site of the duplex or tripartite PHD protein by genetic code expansion techniques.
The affinity of the duplex or triplex PHD protein variants to H3K4me3 was determined using a microcalorimetry electrophoresis apparatus. The obtained duplex and triplet PHD variants had 14.7-fold and 62.9-fold improved affinity for H3K4me 3.
In some specific embodiments, the above strategy is equally applicable to decoding protein domains that are methylation modified with other histones.
The invention provides a method for detecting histone methylation modification by a histone methylation modification super-parent molecule recognition system.
(1) Expressing and purifying the PHD protein variant substituted by 6-methoxy-tryptophan (A2) to obtain PHD protein, 2xPHD protein and 3xPHD protein.
(2) In the case of HeLa cell lysate, HeLa cells were lysed, then they were diluted in a gradient to different concentrations, and the protein samples were separated by SDS-PAGE running gel and transferred to PVDF membrane.
(3) After PVDF membrane milk blocking, H3K4me3 specific antibody, PHD protein, 2x PHD protein and 3x PHD protein were respectively incubated overnight.
(4) PVDF membranes incubated with H3K4me 3-specific antibodies incubated with the corresponding secondary antibodies. The PVDF membrane incubated with PHD protein further incubated with GST-specific antibodies, and finally the corresponding secondary antibodies.
(5) And (4) performing chemiluminescence imaging. Compared to the H3K4me3 specific antibody, the 2x PHD protein and the 3x PHD protein showed higher detection ability.
The invention provides a method for detecting histone methylation modification by a histone methylation modification super-parent molecule recognition system through an imaging technology. The method can be applied to living cell imaging and can also be applied to in vitro immunofluorescence imaging technology. The method can be applied to different cells, such as: HEK 293T cell line, HeLa cell line, NCI-60 cell line, CHOs cell line, etc.
The application of the method to living cell imaging comprises the following steps:
(1) constructing a plasmid expressed by cells, and expressing the plasmid with SEQ ID NO: 4, cloning a PHD-W28TAG fragment by taking the pEGFP-EGFP as a template, cloning a vector fragment by taking the pEGFP-EGFP as a template, and constructing a plasmid of the pEGFP-PHD-W28TAG-EGFP by Gibson assembly; the peptide represented by SEQ ID NO: cloning phenylalanine aminoacyl-tRNA synthetase (chPheRS) fragment with 1 as template, cloning carrier fragment with pCDNA3.1 as template, and assembling with Gibson to construct pCDNA3.1-chPheR9 plasmid. The plasmid map is shown in FIG. 1.
(2) The two plasmids were prepared according to 1: HEK 293T cells were transfected with a molar ratio of 1. After 6-8 hours the solution was changed and 2mM 6-methoxy-tryptophan (A2) was added.
(3) And detecting the EGFP fluorescence signal change by a living cell imaging microscope.
The immunofluorescence imaging technology applied to in vitro comprises the following steps:
(1) expressing and purifying the PHD protein variant substituted by 6-methoxy-tryptophan to obtain PHD protein, 2xPHD protein and 3xPHD protein.
(2) The marker proteins, PHD protein, 2XPHD protein and PHD protein were labeled with NHS-Cy5 activated lipid, respectively.
(3) Taking the HeLa cell line as an example, the formaldehyde-fixed cells were incubated with the Cy 5-labeled PHD protein or histone methylation-modified specific antibody, respectively, and after incubation, the localization of the corresponding histone methylation modification was detected by confocal microscope imaging. Compared with an H3K4me3 specific antibody, the 2xPHD protein variant and the 3xPHD protein variant have higher signal-to-noise ratio.
The invention provides a method for detecting a histone methylation modification interaction relation group by a histone methylation modification super-affinity molecule recognition system through a proximity labeling technology.
The development of proximity labeling technology provides a supplement to the traditional methods for studying intermolecular interactions in living cells, and the technology generally utilizes CRISPR gene editing technology or plasmid-based expression to express proximity biotinylation enzyme and bait protein in a fusion manner in cells. After the exogenous biotin is added, the protein adjacent to the bait protein is biotinylated, and the biotinylated protein can be enriched by streptavidin-coupled magnetic beads and then identified by mass spectrometry. The affinity of the H3K4me3 super-parent molecular recognition system provided by the invention to H3K4me3 reaches 7nM, PHD-W28TAG and ascorbate peroxidase APEX/APEX2 can be expressed in a fusion manner, when a PHD variant is specifically combined with H3K4me3, the APEX2 can mark a proteome adjacent to H3K4me3 with Biotin under the stimulation of hydrogen peroxide, then the proteome is enriched through Streptavidin, and finally the proteome is analyzed through LC-MS. The method is not limited to the proximity labeling technique based on APEX2, but is also applicable to other proximity biotinidase-based techniques. Such as: horseradish peroxidase HRP and biotin ligase BioID, BASU, TurboID, miniTurbo, and the like.
Compared with the prior art, the invention has the main advantages that:
1. the present invention provides a method for enhancing cation-pi interactions, which can be applied to any biomacromolecule that undergoes cation-pi interactions. The present invention provides synthetic routes of 6-methyl-tryptophan (A1), 6-methoxy-tryptophan (A2), 7-methyl-tryptophan (A3), 7-methoxy-tryptophan (A4), 6, 7-methoxy-tryptophan (A5), 6, 7-methyl-tryptophan (A6), 7, 8-dihydrofuran-tryptophan (A7), 6, 7-dihydrofuran-tryptophan (A8), 7, 8-furan-tryptophan (A9), 6, 7-furan-tryptophan (A10), 6, 7-dioxole-tryptophan (A11), 6, 7-cyclopentane-tryptophan (A12), all of which are applicable to genetic code expansion technology, the cation-pi interaction can be improved.
2. The present invention has a wide range of applications including (but not limited to): the method is combined with a living cell imaging technology and applied to detecting the dynamic change of histone methylation modification; detecting histone methylation modification in a biological sample by combining with an immunofluorescence technique; the genome related to histone methylation modification can be analyzed by combining with a genome sequencing technology and the like; the histone methylation modification interaction proteome can be identified by combining with a proximity labeling technology and the like. Specifically, the method comprises the following steps: (1) the 6-methoxy-tryptophan (A2) is introduced into the tryptophan site forming the cation-pi interaction by using the genetic code expansion technology, so that the affinity between the biomolecules can be improved by 4-8 times. (2) The PHD site specificity of the decoding protein of H3K4me3 is introduced into 6-methoxy-tryptophan (A2), so that the affinity of H3K4me3 and PHD is improved by 8 times, and the affinity of the PHD variant and H3K4me3 reaches 7nM after triple design. (3) The histone methylation super-parent molecule recognition system has high sensitivity in recognizing histone methylation modification, and shows higher specificity and sensitivity compared with a histone methylation modification specific antibody. (4) The histone methylation modified super-parent molecule recognition system has the advantages of easy modification, economy and capture of a plurality of PTMs, and can develop a plurality of super-parent molecule recognition systems aiming at specific methylation modification.
Drawings
FIG. 1 is a plasmid map;
FIG. 2 is a chemical synthesis pathway, A is the synthesis pathway of 6-methoxy-tryptophan (A2) and 7-methoxy-tryptophan (A4), B is the chemical synthesis pathway of 6, 7-cyclopentane-tryptophan A12;
FIG. 3 is an inventive strategy for modulating cation- π interactions between histone methylation and its decoded proteins using genetic code expansion techniques. (A) The amount of tryptophan in the aromatic cage component of the decoded protein is modified by methylation of different histones. (B) A flow diagram of a histone methylation super-parent molecule recognition system developed by applying genetic code expansion technology. Taking H3K4me3 as an example, the tryptophan in the aromatic cage of the decoded protein is replaced by the tryptophan analogue by utilizing the genetic code expansion technology, so that the cation-pi interaction in the aromatic cage is regulated and controlled, and the unnatural amino acid analogue which obviously improves the methylation affinity of the decoded protein and histone is obtained. (C) The structural formula of the non-natural amino acid used in the present invention;
FIG. 4 is a graph identifying the efficiency and specificity of chimeric alanine aminoacyl-tRNA synthetases A2 and A4, wherein: GFP fluorescence report experiments identify the efficiency of chimeric phenylalanine aminoacyltRNA synthetases A2RS (A) and A4RS (B) in recognizing A2 and A4 respectively, and mass spectrometry identifies the fidelity of chimeric phenylalanine aminoacyltRNA synthetases A2RS (C) and A4RS (D) in recognizing A2 and A4 respectively;
figure 5 is a PHD domain variant protein designed to increase affinity to H3K4me 3. (A) Complex structure (PDB: 2KGI) of KDM5A PHD3 protein and H3K4me3 polypeptide, wherein PHD3 and polypeptide are displayed in cartoon mode, and aromatic amino acid and H3K4me3 are modified and displayed in stick-shaped structure. (B) Coomassie Brilliant blue displays PHD-W18-UAA and PHD-W28-UAA variants. (C) The affinity of the PHD-W18-UAA variant to H3K4me3 was determined by microcalorimetry, in which H3K4me3 was labeled with FITC fluorophore. (D) The affinity of the PHD-W28-UAA variant and H3K4me3 is measured by a micro thermophoresis kinetic instrument, wherein H3K4me3 is marked by FITC fluorescent group;
figure 6 is a multivalent tandem repeat PHD domain designed to recognize H3K4me 3. (A) Multivalent tandem repeat PHD domains design cartoon diagrams. (B) Coomassie blue staining identifies the purity of the concatameric PHD protein variants. (C) The micro-calorimetric electrophoresis apparatus is used for measuring the affinity of the multi-linked PHD protein variant and H3K4me3, wherein H3K4me3 is labeled by FITC fluorescent group;
FIG. 7 is the detection and imaging of H3K4me3 using the histone methylation super-parent molecule recognition system. (A) A strategy diagram of a histone methylation super-parent molecule recognition system applied to detection and imaging is shown. (B) H3K4me3 levels of HeLa cells were detected using histone methylated super-philic molecules. H3 specific antibody and H3K4me3 specific antibody are used as control groups, and PHD-WT, 2xPHD and 3xPHD are respectively used for detecting H3K4me 3. (C) The histone methylation super-parent molecular recognition system is applied to the fluorescent imaging detection of the H3K4me3 positioning of cells, wherein PHD protein is marked by Cy 5;
FIG. 8 shows the efficiency of the system in mammalian cell flow cytometry to detect the efficiency of screened chimeric phenylalanyl-tRNA synthetases in recognizing 6-methoxy-tryptophan, 6, 7-methoxy-tryptophan and 6, 7-methyl-tryptophan in mammalian cells;
FIG. 9 shows the application of histone methylation super-affinity molecule recognition system established by genetic code expansion technology to the near label detection protein interaction group, (A) experimental flow chart, and (B) GO analysis data.
Detailed Description
The technical solution of the present invention will be further specifically described below by way of specific examples. It is to be understood that the practice of the present invention is not limited to the following examples, and that any variations and/or modifications may be made to the present invention without departing from the scope thereof.
In the present invention, all parts and percentages are by weight unless otherwise specified, and the equipment and materials used are commercially available or commonly used in the art. The methods in the following examples are conventional in the art unless otherwise specified.
The sequences of the primers used in the construction of the vectors of the invention in the specific examples are shown in Table 1:
table 1: primer sequences for constructing vectors
The inventive strategy for modulating cation-pi interactions between histone methylation and its decoded proteins using genetic code expansion techniques is shown in FIG. 3, and the following example illustrates a specific method.
Example 1: chemical Synthesis of Compounds A2 and A4
To 50mL of anhydrous N, N-dimethylformamide solution in B (2.0g,13.6mmol) was added potassium hydroxide (1.68g,29.9mmol), and the mixture was stirred at room temperature for 20 min. 30mL of an iodine solution of anhydrous N, N-dimethylformamide (4.14g,16.3mmol) was added dropwise to the reaction flask, and stirring was continued at room temperature for 2 h. The reaction mixture was poured into an ice-water solution containing 0.1% sodium thiosulfate. The mixture was put in a refrigerator to ensure complete precipitation. The precipitate was filtered, washed with cold water and then dried in vacuo. 3-iodo-1H-indole B (90% yield of B2, 93% yield of B4) was obtained as a light yellow solid and used in the next step without further purification.
The solid B obtained in the first step (2.73g,10.0mmol) was dissolved in 30mL of anhydrous N, N-dimethylformamide. After washing 60% NaH (391.2mg,16.3mmol) with hexane, it was suspended in 10mL of anhydrous N, N-dimethylformamide under nitrogen. Feed B was slowly added to the suspension under ice-bath conditions, and after stirring for 10min, p-toluenesulfonyl chloride (2.1g,11.0mmol) was added and stirred at 25 ℃ for 5 h. The mixture was poured into water, extracted three times with ethyl acetate, and then the ethyl acetate organic layer was washed with saturated brine and dried over anhydrous sodium sulfate. The organic phase is then concentrated under reduced pressure. Column chromatography using petroleum ether and ethyl acetate gave compound C as a white solid (85% C2, 81% C4).
Dry degassed N, N-dimethylformamide was charged under nitrogen to a vessel containing zinc dust (3.9g,50.0 mmol). TMSCl (108.6mg,1.0mmol) was added and stirred vigorously at room temperature for 30min, and after stopping stirring, zinc was precipitated. The supernatant was extracted with a syringe under a nitrogen flow, and then a new N, N-bisMethyl formamide is added to the zinc. After stirring was continued for 2min, stirring was stopped to precipitate zinc powder, and the supernatant was removed as before, and this step was repeated twice more. 1, 2-dibromoethane (751.4mg,4.0mmol) was then added to the vessel and stirred at 80 ℃ for 30 min. After the mixture was cooled to 25 ℃, TMSCl (325.8mg,3.0mmol) was added and the resulting mixture was stirred for a further 30 min. Boc-3-iodo-L-alanine methyl ester (3.95g,12mmol) was dissolved in 10mL of N, N-dimethylformamide and added to the activated zinc powder and the mixture stirred vigorously. After the exotherm subsided (controlled by the ice bath), stirring was continued for a further 30min, at which time stirring was stopped and zinc was allowed to precipitate. The supernatant was gently removed with a syringe and poured into a clean reaction flask under a flow of nitrogen. The supernatant was transferred by syringe to Compound D (2.13g,5.0mmol), Pd (OAc) 2 (112.2mg,0.5mmol) and S-Phos (410.5mg, 1.0 mmol). And reacting for 4 hours under the protection of nitrogen. After completion of the reaction, the mixture was poured into water, extracted with ethyl acetate, and the upper organic layer was washed with brine, dried over anhydrous sodium sulfate, and concentrated under reduced pressure, followed by purification by petroleum ether and ethyl acetate column chromatography to give compound E (57% yield of E2, 45% yield of E4) as a pale yellow oil.
The product E was analyzed and the results were as follows: E2) 1 H NMR(500MHz,CDCl 3 )δ7.70(d,J=8.5 Hz,2H),7.48(d,J=2.3Hz,1H),7.30(d,J=8.7Hz,1H),7.22(d,J=8.1Hz,3H),6.84 (dd,J=8.7,2.3Hz,1H),5.05(d,J=8.0Hz,1H),4.60(d,J=7.1Hz,1H),3.86(s,3H), 3.62(s,3H),3.14(qd,J=14.7,5.6Hz,2H),2.34(s,3H),1.49–1.26(m,9H). 13 C NMR (125MHz,CDCl 3 )δ172.15,158.25,155.13,145.01,136.29,135.28,129.98,126.82, 123.22,120.13,117.44,112.55,98.09,80.26,55.91,53.69,52.47,28.46,21.70.HRMS (ESI)m/z calcd.For C 20 H 23 N 2 O 5 S + (M-Boc) + 403.1322,found 403.1331.E4) 1 H NMR (500MHz,CDCl 3 )δ7.69(d,J=8.1Hz,2H),7.62(s,1H),7.24(d,J=8.1Hz,2H),7.16 –7.06(m,2H),6.67(dd,J=7.3,1.5Hz,1H),5.15(d,J=8.0Hz,1H),4.65(dt,J=8.0, 5.5Hz,1H),3.71(s,3H),3.67(s,3H),3.32–3.11(m,2H),2.37(s,3H),1.44(s,9H). 13 C NMR(125MHz,CDCl 3 )δ172.28,155.16,147.49,144.21,137.36,133.77,129.43, 127.30,126.91,124.79,124.04,114.95,112.05,107.16,80.12,55.54,53.85,52.49,28.41, 28.01,26.99,21.67.HRMS(ESI)m/z calcd.For C 20 H 23 N 2 O 5 S + (M-Boc) + 403.1322, found 403.1334.
compound E (973.2mg,2.0mmol) was dissolved in 50mL of methanol, NaOH (1.2g,30.0 mmol) was added and dissolved in 20mL of H 2 And (4) in O. The mixture was heated at reflux for 8h, and then methanol was evaporated under reduced pressure to a volume of about half the reaction volume. Acidified with ice cold 2M dilute hydrochloric acid and adjusted to pH 3. The aqueous solution was extracted with cold ethyl acetate and the upper organic layer was washed with saturated brine, dried over anhydrous sodium sulfate and evaporated in vacuo to give a colorless oil which gave carbamate F without further purification. F was then dissolved in dichloromethane and trifluoroacetic acid (112.2mg,0.5mmol) was added to deprotect to afford the title compound a (74% yield of a 2%, 68% yield of a4) as a pale yellow solid, the complete synthetic route being shown in figure 2.
The product a was analyzed and the results were as follows: A2) 1 H NMR(500MHz,D 2 O)δ7.49(d,J=8.7Hz, 1H),7.01(s,1H),6.95(d,J=2.4Hz,1H),6.72(dd,J=8.7,2.4Hz,1H),3.75(s,3H), 3.45(dd,J=7.3,5.2Hz,1H),3.03(dd,J=14.4,5.2Hz,1H),2.86(dd,J=14.4,7.3Hz, 1H). 13 C NMR(125MHz,D 2 O)δ182.83,155.19,136.74,123.18,122.03,119.51, 110.63,108.80,95.26,56.42,55.78,30.43.HRMS(ESI)m/z calcd.For C 12 H 15 N 2 O 3 S + (M+H) + 235.1077,found 235.1081.A4) 1 H NMR(500MHz,D 2 O)δ7.27–7.22(m,2H), 7.08(td,J=7.9,0.9Hz,1H),6.78(d,J=7.7Hz,1H),4.31(ddd,J=6.3,5.4,0.9Hz, 1H),3.94(s,3H),3.44(ddt,J=15.4,5.3,0.9Hz,1H),3.36(dd,J=15.4,7.3Hz,1H)). 13 C NMR(125MHz,D 2 O)δ171.83,146.06,128.04,126.58,124.97,120.10,117.44, 115.12,106.92,55.63,53.27,25.83.HRMS(ESI)m/z calcd.For C 12 H 13 N 2 O 3 S - (M-H) - 233.0932,found 233.0939.
example 2: library construction and positive-negative screening of chimera phenylalanyl-tRNA synthetase mutant
The gene sequence of the chimera phenylalanyl-tRNA synthetase chPheRS in the embodiment is shown as SEQ ID NO: 1 is shown.
(1) Selecting the amino acid binding site of the chimeric phenylalanyl-tRNA synthetase by taking the structure of the human mitochondria phenylalanyl-tRNA synthetase as a reference: f464, T467 and a507, and the amino acids surrounding the binding pocket: e391, V393, M490.
(2) The gene fragment is amplified by taking chimeric phenylalanyl-tRNA synthetase (T467G and A507G) as a template and primers chPheRS-E391NNK-V393NNK-R/F, chPheRS-M490NNK-R/F and chPheRS-F464NNK-R/F, wherein the nucleotide sequence of the primers is shown as SEQ ID NO: 19-24, cloning the mutant library into the pBK vector by Gibson assembly to generate chPheRS mutant gene library (E391NNK, V393NNK, M490NNK, F464NNK, T467G, and A507G).
(3) Transforming pNEG-chPheT-Barnase-2 TAG into Escherichia coli DH10B to prepare negative selection competent cells, wherein the plasmid map of the negative selection competent cells is shown in figure 1; positive screening competent cells were prepared by transforming pNEG-3C11-CAT-112TAG-GFP190TAG into E.coli DH10B, and the plasmid map is shown in FIG. 1.
(4) The screening library of (2) was transformed into negative screening competent cells, and the bacterial solution was spread on LB plate (kanamycin, 50. mu.g/mL; ampicillin, 100. mu.g/mL; 0.2% L-arabinose) and incubated at 37 ℃.
(5) The plasmids were extracted from the clones in (4) and transformed into positive selection competent cells, and the whole culture was spread on LB agar plate (kanamycin, 50. mu.g/mL; ampicillin, 100. mu.g/mL; chloramphenicol, 10. mu.g/mL; 0.2% L-arabinose; 2mM unnatural amino acid) supplemented with an unnatural amino acid, cultured at 37 ℃ for 12 hours, and further cultured at 30 ℃ for 48 hours.
Example 3: screening of chimeric phenylalanine aminoacyl-tRNA synthetase mutant for specifically recognizing unnatural amino acid through GFP (green fluorescent protein) fluorescence report experiment
(1) After two rounds of forward screening, the single clones with fluorescent signals from example 2 were picked for overnight culture.
(2) According to the following steps: 100 percent of the strain solution in the step (1) is inoculated, when the strain solution is cultured at 37 ℃ until OD600 is 0.6-0.8, 0.2 percent of L-arabinose is added for induction expression, and 1mL of the strain solution is added with 1mM of corresponding unnatural amino acid and expression is carried out for 20h at 30 ℃.
(3) After 750. mu.L of the bacterial suspension in (2) was centrifuged, 150. mu.L of 1 XBugbuster (Millipore, Lot: 3492682) was added and the mixture was incubated at 25 ℃ for 30min, followed by centrifugation, 100. mu.L of the supernatant was transferred to a 96-well plate, and 100. mu.L of the bacterial suspension in (2) was simultaneously subjected to measurement of the GFP fluorescence signal intensity and OD of the corresponding clone by means of a microplate reader Bio Tek Synergy NEO2 600 And calculating the efficiency of the mutant for recognizing the unnatural amino acid.
(4) The chimera phenylalanine aminoacyl-tRNA synthetase mutant which can efficiently identify corresponding unnatural amino acid is sequenced to obtain a specific mutant sequence, and the corresponding cloned plasmid is placed at the temperature of minus 20 ℃ for standby.
(6) Finally, the mutant of the chimeric phenylalanyl-tRNA synthetase which recognizes 6-methoxy-tryptophan, 7-methoxy-tryptophan, 6, 7-methyl-tryptophan and 6, 7-methoxy-tryptophan was identified, and the mutant of the phenylalanyl-tRNA synthetase which comprises six mutations of E391D, V393G, M490V, F464V, T467G and A507G is named chPheRS9, and the nucleotide sequence and the amino acid sequence of the mutant of the phenylalanyl-tRNA synthetase are detailed in SEQ ID NO: 1-2.
(7) The efficiency of the chimera phenylalanine translation system for recognizing the unnatural amino acid under different concentrations of the unnatural amino acid is determined by a GFP fluorescence report experiment. The efficiency and fidelity of recognition of 6-methoxy-tryptophan and 7-methoxy-tryptophan by the chimeric phenylalanyl-tRNA synthetase is shown in FIG. 4.
Example 4: serial plasmid construction of KDM5A PHD3(PHD) Domain
All plasmids were constructed by the Gibson assembly system, except where specifically indicated. A series of plasmid constructs of the KDM5A PHD3(PHD) domain are exemplified.
1. PHD wild type plasmid: and (3) amplifying a GST tag by using a primer pNEG-GST-F/R by taking a pGEX-6p vector as a template, wherein the nucleotide sequence is shown as SEQ ID NO: 25-26; the cDNA was used as a template to amplify the PHD domain (Uniport ID: P29375, nucleotide 1598-1663, nucleotide sequence of the primer is shown in SEQ ID NO: 27-28), pNEG-2 chPheT vector was used as a template to amplify the vector by using primer pNEG-PHD-V-F/R, nucleotide sequence of the primer is shown in SEQ ID NO: 29-30, and plasmid map is shown in 1.
2. PHD mutant plasmid: using pNEG-2 chPheT-PHD-GST as a template, introducing site-directed mutagenesis of amber codon in a PHD domain W28 by using a primer pNEG-PHD-W28TAG-F/R, and constructing a plasmid pNEG-2 chPheT-PHD-W28TAG-GST through Gibson assembly; plasmid pNEG-2 × chPheT-PHD-W18TAG-GST was constructed by Gibson assembly using primer pNEG-PHD-W18TAG-F/R to introduce site-directed mutagenesis of the amber codon in PHD domain W18, the nucleotide sequence of the primer being as shown in SEQ ID NO: 31-34.
3. Multivalent tandem repeat PHD domain plasmids: and (2) amplifying a PHD-W28TAG fragment containing 6x-linker (GGSGGS) by using pNEG-2 chPheT-PHD-W28TAG-GST as a template and adopting a primer pNEG-2 PHD-F/R, wherein the nucleotide sequence of the PHD-W28TAG fragment is shown as SEQ ID NO: 35-36; and (2) amplifying a vector by adopting primers pNEG-2 PHD-V-R and pNEG-PHD-V-F, wherein the nucleotide sequence of the vector is shown as SEQ ID NO: 37 and SEQ ID NO: 29, construction of duplex or triplex PHD plasmids by Gibson assembly: pNEG-2 × chPheT-2 xPHHD-W28 TAG-GST and pNEG-2 × chPheT-3 xPHHD-W28 TAG-GST.
4. Multicomponent tandem repeat PHD-Chromo domain plasmid: amplifying the vector by using pNEG-2 chPheT-PHD-W28-GST as a template and adopting primers pNGE-PHD-V-F and pNEG-2 PHD-V-R; and (2) amplifying a CDY1-W2TAG fragment by using pNEG-2 chPheT-CDY1-W28TAG-GST as a template and adopting a primer pNEG-PHD-CDY1-F/R, wherein the nucleotide sequence of the primer is shown as SEQ ID NO: 38-39. Construction of multicomponent tandem plasmids by Gibson assembly: pNEG-2 chPheT-PHD-W28TAG-CDY1-W28 TAG-GST. The plasmid map is shown in FIG. 1.
5. Eukaryotic cell expression plasmid: plasmid construction of pEGFP-PHD3-W28 TAG-EGFP: pEGFP-EGFP is taken as a template, and a primer pEGFP-PHD-V-F/R is used for amplifying a vector, wherein the nucleotide sequence of the primer is shown as SEQ ID NO: 40-41; and amplifying a PHD structure domain by using pNEG-2 chPheT-PHD-W28TAG-GST as a template and using a primer pEGFP-PHD-F/R, wherein the nucleotide sequence of the PHD structure domain is shown as SEQ ID NO: 42-43, the plasmid was constructed by Gibson assembly and the map of the plasmid is shown in FIG. 1. plasmid construction of pCDNA3.1-chPheRS 9: primers were designed to amplify the chimeric phenylalanyl-tRNA synthetase (chPheRS9) cloned into pcdna3.1 vector under the control of CMV and U6 promoters, respectively, and the primers of the cloned gene and vector were as shown in SEQ ID NO: 44-47, and the map of the plasmid is shown in FIG. 1.
The sequencing of the plasmids is completed by Beijing Okagaku Biotech. The construction of the remaining plasmids was the same as above.
Example 5: expression and purification of wild type and mutant proteins of KDM5A PHD3(PHD)
Expression of KDM5A PHD3(PHD) wild type protein
1. And (3) plasmid transformation: taking out the DH10B chemosensory strain from a refrigerator at-80 ℃, immediately placing the strain into an ice box, adding the plasmid pNEG-2 chPheT-PHD-GST after the strain is melted, and flicking the belly to uniformly mix the strain. Standing in ice bath for 30min, heat-shocking at 42 deg.C for 90s, standing in ice bath for 2min, adding non-anti LB liquid culture medium, recovering at 37 deg.C for 40min, spreading 200 μ L of the bacterial liquid on LB agar plate (ampicillin, 100 μ g/mL), and culturing at 37 deg.C overnight.
2. Inducing expression: single colonies were picked from the resistant plates described above into 3mL of LB liquid medium (ampicillin, 100. mu.g/mL), and cultured overnight with shaking (37 ℃ C., 220 rpm); according to the following steps of 1: inoculating the above bacterial liquid at a ratio of 100, culturing at 37 deg.C to OD 600 When the concentration is 0.6-0.8, L-arabinose (final concentration: 0.2%) and ZnCl are added 2 (final concentration: 0.1mM), expression was induced at 22 ℃ for 24 h.
Second, expression of KDM5A PHD3 mutant protein
The PHD3-W28-6MeOW mutant is exemplified.
1. The plasmids pNEG-2. multidot. chPheT-PHD-W28TAG-GST and pBK-chPheRS9 were co-transformed into E.coli DH10B by the same procedure as above.
2. Inducing expression: single colonies were picked from the resistant plates described above into 3mL of LB liquid medium (ampicillin, 100. mu.g/mL; kanamycin, 50. mu.g/mL), and cultured overnight with shaking (37 ℃ C., 220 rpm); according to the following steps of 1: 100 in 100mL LB liquid medium, cultured at 37 ℃ to OD 600 When the concentration is 0.6-0.8, L-arabinose (final concentration: 0.2%) and ZnCl are added 2 (final concentration: 0.1mM) and a non-natural amino acid (final concentration: 0.5 mM), and induced expression was performed at 22 ℃ for 24 hours.
Thirdly, purification of KDM5A PHD3(PHD)
1, collecting bacterial liquid. The mixture was centrifuged (4 ℃, 4000rpm, 20min) and the deposited bacteria were collected.
2 resuspending the cells. Lysis buffer (20mM Tris-HCl, pH7.5, 150mM NaCl,0.1mM ZnCl) was used 2 2mM beta-Me, protease inhibitors PMSF, Aprotinin).
3, carrying out ultrasonic crushing. Setting an ultrasonic instrument program: working for 2s, intermittent for 5s, power for 60 percent and ultrasonic treatment at 4 ℃.
4 centrifugation (4 ℃, 12000rpm, 20min) and collection of the supernatant.
5 apply 0.5mL of GST beads to the gravity column, add ddH 2 The beads were washed and the column equilibrated with 10 column volumes of lysis buffer.
6 the supernatant from 4 was added to the equilibrated GST column.
7 lysis buffer (20mM Tris-HCl, pH 7.5150 mM NaCl,0.1mM ZnCl) in 20 column volumes 2 2mM beta-Me, protease inhibitors PMSF, Aprotinin) elute the unspecifically adsorbed heteroproteins.
8 the eluate, i.e., the target protein fraction, was collected with 10 column volumes of elution buffer (20mM Tris-HCl, pH7.5, 150mM NaCl, 20mM glutathione).
9 the protein after elution was subjected to SDS polyacrylamide gel electrophoresis (SDS-PAGE) to determine the protein expression purity, and the amount of protein expression was measured using Nanodrop (microspectrophotometer, fluorospectrophotometer, Saimer fly). The protein is used for subsequent SDS protein gel electrophoresis analysis, mass spectrum identification and MST experiment.
Figure 5 is a PHD domain variant protein designed to increase affinity to H3K4me 3. Wherein (A) the complex structure of KDM5A PHD3 protein and H3K4me3 polypeptide (PDB: 2KGI) wherein PHD3 and polypeptide are displayed in cartoon mode, and aromatic amino acid and H3K4me3 are modified and displayed in stick structure. (B) Coomassie Brilliant blue shows PHD-W18-UAA and PHD-W28-UAA variants. (C) The affinity of the PHD-W18-UAA variant to H3K4me3 was determined by microcalorimetry, in which H3K4me3 was labeled with FITC fluorophore. (D) The affinity of the PHD-W28-UAA variant and H3K4me3 is measured by a micro thermophoresis kinetic instrument, wherein H3K4me3 is marked by FITC fluorescent group;
the results of SDS protein gel electrophoresis are shown in FIG. 5B, and the purity of the protein reached 90% or more.
Fourth, LC-MS identification of proteins
The purified proteins were analyzed by SCIEX Triple TOF 6600MS mass spectrometer using electrospray ionization and SCIEX analysis TF software. Using a PHENOMENEX AERIS wide pore C4 column (2.1 × 50mm,3.6 μm) was desalted by separation. Mobile phase a was 0.1% formic acid in water and mobile phase B was 0.1% formic acid acetonitrile. A constant flow rate of 0.2mL/min was set. Mass spectrum data were analyzed by deconvolution of mass spectra using SCIEX OS-Q software (version2.0, SCIEX Corporation). The molecular weight of the protein was predicted using the ExPASy computer pI/Mw tool.
The LC-MS identification results are shown in FIGS. 4C and 4D, the theoretical molecular weight of the target protein is 33378Da, and the actual molecular weights are 33377Da and 33378Da respectively, so that the chimeric phenylalanyl-tRNA synthetase mutant can be proved to specifically recognize 6-methoxy-tryptophan and 7-methoxy-tryptophan.
Example 6: microcalorimetry (MST) measures the affinity of the decoded protein domain variants to histone methylation-modified polypeptides. The polypeptides used in the experiment are all synthesized by Beijing cloisonne department of China, Biotechnology, Inc., and the C end of the peptide segment is marked by Fluorescein Isothiocyanate (FITC), and the specific sequence is shown in Table 2.
TABLE 2 polypeptide sequence information used in MST experiments
The MST determination method is specifically described by taking decoded proteins PHD and H3K4me3 as examples.
(1) Desalting of the protein sample. Protein samples were dialyzed 3 times against 2L of MST buffer (20mM Tris-HCl,50mM NaCl, 1mM DTT, 0.05% Tween-20, pH 7.5).
(2) The protein is concentrated. Protein samples were concentrated to the appropriate concentration using a10 Kd protein concentration tube (Millipore).
(3) Preparing 16 PCR tubes, adding 10 mu L of MST buffer into the No. 2-16 PCR tube, taking 20 mu L of protein sample to the No. 1 tube, pipetting 10 mu L of protein sample from the No. 1 tube to the No. 2 tube, and iteratively diluting the protein sample;
(4) adding 10 μ L of 100nM polypeptide molecule into each tube, and mixing well to obtain 20 μ L total;
(5) and (4) loading the capillary.
(6) Kd values were measured. This was done using a NT.115Monolith instrument (Nano temperature Technologies, Munich, Germany) using a blue LED excitation light source at a constant temperature of 25 ℃. The instrument is set as follows: 20% of blue LED excitation power and 40% of infrared laser power. All measurements were performed using standard glass capillaries (Nano tester Technologies, # catMO-K022) and each set of experiments was repeated 3 times, unless otherwise specified.
(7) And (6) data processing. By NT analysis software, the protein of interest and the fluorescent peptide fragment were expressed in a ratio of 1: 1, fitting the data by using a model of proportional binding to obtain a dissociation constant Kd of the target protein. All data were analyzed by Origin software processing.
(8) Other histone methylation modified decoding proteins have the same affinity determination procedure as histone methylation modification.
The experimental results are as follows: experimental data as shown in figures 5C and 5D: the affinity of the PHD wild-type domain and H3K4me3 is 440nM, the affinity of the PHD variant introduced with 6-methoxy-tryptophan at the W28 site is 52nM with H3K4me3, compared with the PHD wild-type domain of PHD3, the affinity of the PHD variant introduced with 6-methoxy-tryptophan at the W28 site is improved by 8 times with H3K4me3, and the affinity of the PHD protein and H3K4me3 is improved by 2-6 times with other electron-donating tryptophan analogs. Similarly, the introduction of 6-methoxy-tryptophan site specificity into other decoding protein domains also improves the affinity of the decoding protein for methylation modification of the corresponding histone by 2-4 times.
Example 7: construction of multivalent tandem repeat PHD Domain to increase its affinity for H3K4me3
1. The duplex and triplet-repeat PHD domain plasmids were constructed as described in example 4: pNEG-2. multidot. chPheT-2 xPHHD-W28 TAG-GST (2x PHD) and pNEG-2. multidot. chPheT-3 xPHHD-W28 TAG-GST (3x PHD).
2. Duplex and triplex PHD variants of 6-methoxy-tryptophan (2x PHD, 3x PHD) were site-specifically introduced by expression purification of W28 as described in example 5, and protein expression purity was identified by SDS polyacrylamide gel electrophoresis (SDS-PAGE) and protein molecular weight was identified by LC-MS.
3. The affinity of the multivalent tandem repeat PHD domain 2x PHD, 3x PHD to H3K4me3 was determined as described in example 6.
4. The strategy is not limited to the interaction between the PHD structural domain and H3K4me3, and can be expanded to the methylation modification of other decoding proteins and other histones, and experiments prove that the strategy can improve the affinity between different decoding protein structural domains and the corresponding histone methylation modifications.
Figure 6 is a multivalent tandem repeat PHD domain designed to recognize H3K4me 3. (A) Multivalent tandem repeat PHD domains design cartoon figures. (B) Coomassie blue staining identifies the purity of the concatemeric PHD protein variants. (C) The micro-calorimetric electrophoresis apparatus is used for measuring the affinity of the multi-linked PHD protein variant and H3K4me3, wherein the H3K4me3 is labeled by FITC fluorescent group.
The experimental results are as follows: the experimental data are shown in fig. 6C: the affinity of the PHD wild-type domain and H3K4me3 is 440nM, the affinity of the duplex and triplet PHD variants with H3K4me3, which are introduced with 6-methoxy-tryptophan by W28 site specificity, is 30nM and 7nM respectively, and the affinity of the duplex and triplet PHD variants with H3K4me3, which are introduced with 6-methoxy-tryptophan by W28 site specificity, is improved by 14.7 times and 62.9 times compared with the PHD wild-type domain of PHD 3. The strategy is expanded to the affinity determination result of other decoding proteins and corresponding histone methylation modification, which shows that: the multivalent tandem repeat histone methylation modification decoding protein structural domain can improve the affinity of the multivalent tandem repeat histone methylation modification decoding protein structural domain to corresponding histone methylation modification.
Example 8: Far-Western Blot evaluation of recognition efficiency of PHD (phospholipoprotein) super-parent molecule on H3K4me3
1. And (4) protein expression. The 6-methoxy-tryptophan substituted PHD protein variants were expressed and purified as described in example 4, example 5 to obtain PHD protein, 2x PHD protein and 3x PHD protein.
2. SDS Polyacrylamide gel electrophoresis. Taking HeLa cell lysate as an example, Hela cells are lysed and then are diluted to different concentrations in gradient, and the protein sample is separated by SDS-PAGE running gel
3. And (5) transferring the film. Proteins were transferred to PVDF membranes. Constant current 300mA, and rotating the film for 2.5 h.
4. And (3) sealing: the membrane was placed in a plastic box containing 5% skim milk/TBST, placed on a shaker, sealed for 1h, and the blocking solution was decanted. Wash 3 times with TBST for 10 min.
5. And (5) incubating the bait protein. H3K4me 3-specific antibodies, PHD protein, 2x PHD protein, and 3x PHD protein were each incubated overnight. TBST washing 3 times, once for 10 min.
6. And (4) incubating the antibody. PVDF membranes incubated with H3K4me3 specific antibodies incubated with the corresponding secondary antibodies at room temperature. PVDF membrane incubation of PHD protein further incubation of GST specific antibody (Sigma-Aldrich, cat # G7781), finally incubation of the corresponding secondary antibody (Proteitech, cat # SA 00001-2). TBST washing 3 times, once for 10 min.
7. And (4) performing chemiluminescence imaging. The PVDF film was covered on the developer, with care for uniform coverage, left at room temperature for 3 minutes, and then developed imagewise on a multifunctional imager.
FIG. 7 is the detection and imaging of H3K4me3 using the histone methylation super-parent molecule recognition system. (A) A strategy diagram of a histone methylation super-parent molecule recognition system applied to detection and imaging is shown. (B) H3K4me3 levels of HeLa cells were detected using histone methylated super-philic molecules. H3-specific antibody and H3K4me 3-specific antibody were used as control groups, and PHD-WT, 2xPHD and 3xPHD were used to detect H3K4me3, respectively. (C) The histone methylation super-parent molecular recognition system is applied to fluorescence imaging detection of H3K4me3 positioning of cells, wherein PHD protein is labeled by Cy 5.
The experimental results are as follows: experimental data as shown in fig. 7B, the 2x PHD protein and the 3x PHD protein showed higher signal to noise ratios compared to the H3K4me3 specific antibody. The experiment is not limited to the interaction of PHD and H3K4me3, and is also applicable to the detection of the corresponding histone methylation modification by different decoding proteins.
Example 9: the histone methylation super-parent molecular recognition system is combined with an immunofluorescence technology to detect histone methylation modification.
Histone methylation H3K4me3 and PHD decoding protein are taken as examples.
1. The PHD variant is labeled. The PHD decoding protein domain was labeled with Cy5 dye. NHS-Cy5 was dissolved in DMSO and the PHD-decoded protein was in PBS. NHS-Cy5 and PHD proteins were expressed as 1: 2 molar ratio, incubating at 37 ℃ for 1h in the absence of light, and terminating the reaction with 50mM Tris-HCl, pH 8.0 solution.
2. The PHD protein was purified using a PD MiniTratTM G-25 desalting column (Cytiva, cat # 28918007).
3. And (4) preparing a cell sheet. HeLa cells were seeded on a petri dish on which a treated cover glass was placed in advance, and cultured at 37 ℃.
4. And (4) fixing the cells. After the cells were fully adherent, the medium was removed, rinsed 1 time with PBS, fixed with 4% paraformaldehyde (4% PFA/PBS) for 10min at room temperature, and rinsed 3 times with PBS.
5. And (4) cell permeabilization. Cells were permeabilized with PBS containing 0.5% Triton X-100 for 10min and rinsed 3 times with PBS.
6. And (5) sealing. Blocking was performed at room temperature for 30min using 3% BSA in PBS.
7. Primary antibody incubation. Incubation was performed for 2H at room temperature using the Cy 5-labeled PHD protein (wild type and mutant) and H3K4me3 antibody (Abcam, cat # ab8580) in (2), respectively.
8. And (5) incubating a secondary antibody. Cells incubated with H3K4me3 antibody were rinsed with PBS for 10min each, repeated 3 times. Incubated with Dylight 488, goat rabbit antibody IgG (Abbkine, cat # A23220) for 1h at room temperature. PBS rinse 3 times, 10min each time. Cells incubated with Cy 5-labeled PHD protein were rinsed directly 3 times for 10min each with PBS.
9. And (6) sealing the sheet. Coverslips cells were covered face down on slides with DAPI fixative (Abcam, cat # ab104139) dropped and left overnight in the dark. Mounted on a glass slide for imaging.
10. And (6) imaging. Imaging was performed at room temperature using an LSM710 confocal microscope (Zessi) with a 63x oil lens. All images were analyzed and processed using ZEN 2.3lite software from Zeiss.
The experimental results are as follows: experimental data as shown in figure 7C immunofluorescence results indicate that Cy 5-labeled PHD super-parent molecule was able to detect co-localization of H3K4me3 with M-phase condensed chromosomes during mitosis. The PHD super-parent molecule has a higher signal-to-noise ratio compared to the commercially available H3K4me3 antibody (Abcam, cat # ab 8580).
Example 10: flow cytometry analysis of efficiency of chimeric phenylalanine translation System in mammalian cells
1. Cells were transfected. 293T cells were transfected according to the standard plasmid transient transfection protocol, the experimental group was cells co-transfected with plasmid pCDNA3.1-chPheRS9 expressing the chimeric phenylalanine translation system and the fluorescent reporter plasmid pEGFP-mCherry-T2A-EGFP-190TAG, and the control group was cells infected with pEGFP-mCherry and pEGFP-EGFP alone.
2. After 48h of cell transfection, the medium was aspirated off and 1 × PBS was added to wash out residual medium.
3. The PBS solution was aspirated, trypsinized cells were added, 1mL of DMEM medium was added to resuspend the cells, and the cells were transferred to a 1.5mL centrifuge tube.
4. The forward and side scatter gates of the flow cytometer were set with 293T cells, the parameters and gates of the PE channel were set with cells expressing mCherry, and the parameters and gates of the FITC channel were set with cells expressing EGFP.
5. The experimental group of cells was assayed, setting up 50000 cells collected per sample. Data was analyzed using the software FlowJo.
The experimental results are as follows: experimental data as shown in fig. 8, the results of flow cytometry experiments showed that the chimeric phenylalanine aminoacyl-tRNA synthetase (chPheRS9) can efficiently recognize any one of unnatural amino acids of 6-methoxy-tryptophan (6MeOW), 7-methoxy-tryptophan (7MeOW), 6, 7-methyl-tryptophan (67MW), and 6, 7-methoxy-tryptophan (67MeOW) in mammalian cells. The experimental procedure is applicable to 293T cell lines, but not limited to 293T cell lines, and is applicable to various cell lines.
Example 11: capture of proteomes interacting with histone methylation modifications using proximity labeling technology
1. The pT3 vector is used as a template, a primer pT3-PHD-APEX-V-F/R is used for amplifying the vector, and the nucleotide sequence of the primer is shown as SEQ ID NO: 52-53; and (3) amplifying a PHD gene fragment by using a primer pT3-PHD-F/R by taking pNEG-2 chPheT-PHD-W28TAG-GST as a template, wherein the nucleotide sequence of the primer is shown as SEQ ID NO: 56-57; the primer pT3-APEX2-F/R is used for amplifying an APEX2 gene fragment, and the nucleotide sequence of the primer is shown as SEQ ID NO: 54-55, and the nucleotide sequence and amino acid sequence of APEX2 are shown in SEQ ID NO: 13-14 plasmid pT3-PHD-APEX2 was constructed by Gibson assembly. The plasmid map is shown in FIG. 1.
2. Constructing a stable transgenic cell line for stably expressing the fusion protein of PHD and APEX 2. The plasmid pCMV-SB100 (the specific plasmid map is shown in figure 1) containing the Sleeping Beauty transposon system and the plasmid pT3-APEX2-PHD were co-transformed into HeLa cells, and after 24 hours of culture at 37 ℃, the cells were cultured by DMEM containing 2 mug/mL puromycin, and the solution was changed periodically. After the cells of the blank control group all died, the cells of the experimental group were cultured with DMEM containing 1. mu.g/mL puromycin to obtain a mixed clone stable cell line.
3. The cells were transfected and over-expressed pCDNA3.1-chPheRS9 in a clonally stable cell line, with 2mM addition of 6-methoxy-tryptophan and incubated for 36 h.
4. Proximity labeling catalyzed by APEX2. The stable cell line in step (3) was incubated with DMEM containing 500. mu.M biotin phenol at 37 ℃ for 30 min. The change solution is 1mM H 2 O 2 PBS solution, standing at room temperature for 5 min. The cells were rinsed 4 times and 1 time with PBS in turn with pre-chilled 20mM ascorbic acid/PBS. Digestion with pancreatin, neutralization in DMEM, centrifugation (1000g, 1min) and discarding of the supernatant. Finally, PBS was added to resuspend the cells, centrifuged (1000g, 1min), the supernatant was discarded, and the above procedure was repeated 1 time.
5. And (4) separating cell nucleuses. To the cells obtained in step (4), 1.5mL of hypotonic buffer (10mM HEPES,10mM KCl, 0.05% NP40) was added, resuspended, allowed to stand on ice for 10min, centrifuged (4 ℃, 12000rpm, 20min), and the supernatant was discarded. Repeating the above steps for 5-8 times.
6. Lysis of the cell nucleus. To the pellet in step (5), 400. mu.L of lysis buffer (25mM TEOA pH7.5, 150mM NaCl, 0.1% SDS, 1% Triton X-100, 0.5% sodium deoxycholate, 1mM PMSF,1 XPIC) and 20. mu.L of DNase were added, resuspended, left at room temperature for 20min, centrifuged (4 ℃, 18000rpm, 15min), and the supernatant was collected.
7. Enrichment of biotinylated proteins. Streptavidin-coupled magnetic Beads (Streptavidin Beads) were added to the supernatant collected in step (6), and the mixture was incubated overnight at 4 ℃. The cells were washed 1 time with 0.01% NP40/PBS buffer, then washed 3 times with 0.01% NP40/PBS buffer containing 500mM NaCl, 0.01% NP40/PBS buffer containing 0.2% SDS, and 0.01% NP40/PBS buffer containing 2M urea, respectively, washed 1 time with 0.01% NP40/PBS buffer, and finally resuspended by adding 100. mu.L 1 × SDS loading buffer and boiled at 100 ℃ for 10 min.
8. SDS Polyacrylamide gel electrophoresis. The protein sample in step (7) was separated by SDS polyacrylamide gel electrophoresis (SDS-PAGE), stained with Coomassie Brilliant blue G250, and then destained.
9. LC-MS/MS detects proteomes that interact with histone methylation modifications. And (3) cutting the protein strips of the protein gel in the separation step (8) by using a clean blade, and performing decoloration dehydration, drying, reduction, alkylation, enzymolysis, peptide segment extraction, desalting, isotope labeling and desalting treatment respectively. The processed sample passes Q activeThe Orbitrap mass spectrometer analysis was performed by Proxeon nanospray ionization and the HPLC instrument was Proxeon Easy-nLC II HPLC. The samples were loaded into a 100-micron x 20mm Magic C18Desalting in 5U reverse column, and passing through 75-micronx 100mm Magic C18The 3U reverse phase column separates the protein sample. And setting the elution flow rate to be 300nL/min and the elution time to be 60min to obtain an MS/MS result. Data processing: the experimental software MaxQuant and pLabel software analysis process the experimental results.
The operation flow of this example is shown in fig. 9, and combines the proximity labeling technology and the histone methylation super-parent recognition system to capture the proteome interacting with histone methylation modification (H3K4me3), which is beneficial to analyze the biological functions performed by H3K4me3 in the life process.
In conclusion, the invention provides a synthetic method of a pyridine alkaloid compound, and the compound is applied to remarkably improve the cation-pi interaction, thereby providing a research method for researching biomacromolecules of the cation-pi interaction, providing a theoretical basis for developing the biotechnology of histone methylation modified super-parents based on decoded proteins, providing possibility for further application, and having great clinical value and development and application value.
It should be understood that the above detailed description of the present invention is only for illustrating the present invention and is not limited to the technical solutions described in the embodiments of the present invention, and those skilled in the art should understand that the present invention can still be modified or substituted equally to achieve the same technical effects; and are within the scope of the present invention as long as the requirements of use are met.
Sequence listing
<110> Zhejiang university
<120> method for improving cation-pi interaction by genetic code expansion and application
<130> ZJDX-002
<160> 49
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1668
<212> DNA
<213> Artificial Synthesis (synthetic sequence)
<400> 1
atggataaga agccgctgga tgttctgatc tctgcgaccg gtctgtggat gtcccgtacc 60
ggcacgctgc acaagatcaa gcactatgag atttctcgtt ctaaaatcta catcgaaatg 120
gcgtgtggtg accatctggt tgtgaacaac tctcgttctt gtcgtcccgc acgtgcattc 180
cgttatcata aataccgtaa aacctgcaaa cgttgtcgtg tttctgacga agatatcaac 240
aacttcctga cccgttctac cgaaggcaaa acctctgtta aagttaaagt tgtttctgag 300
ccgaaagtga aaaaagcgat gccgaaatct gtttctcgtg cgccgaaacc gctggaaaat 360
ccggtttctg cgaaagcgtc taccgacacc tctcgttctg ttccgtctcc ggcgaaatct 420
accccgaact ctccggttcc gacctctgca agcgccccag ctctgactaa atcccagacg 480
gaccgtctgg aggtgctgct gaacccaaag gatgaaatct ctctgaacag cggcaagcct 540
ttccgtgagc tggaaagcga gctgctgtct cgtcgtaaaa aggatctgca acagatctac 600
gctgaggaac gcgagggtgg cggaagcggc ggcggtggcg gaagcggcgg cggtggcgga 660
agcggcggcg gtggaagcca ggcctgggga tcgaggcctc ctgcagcaga gtgtgccacc 720
caaagagctc caggcagtgt ggtggagctg ctgggcaaat cctaccctca ggacgaccac 780
agcaacctca cccggaaggt cctcaccaga gttggcagga acctgcacaa ccagcagcat 840
caccctctgt ggctgatcaa ggagagggtg ttggagcact tcaacaagca gtatgtgggc 900
agctctggga ccccgttgtt ctcggtctat gacaaccttt cgccagtggt cacgacctgg 960
cagaactttg acagcctgct catcccagct gatcacccct gcaggaagaa gggggacaac 1020
tattacctga atcggactca catgctgaga gcgcacacgt ccgcacacca gtgggacttg 1080
ctgcacgcgg gactggatgc cttcctggtg gtgggtgatg tctacaggcg tgaccagatc 1140
gactcccagc actaccctat tttccaccag ctggacgccg gtcggctctt ctctaagcat 1200
gagttatttg ctggtataaa ggatggggaa agcctgcagc tctttgaaca aagttctcgc 1260
tctgcgcata aacaagagac acacaccatg gaggccgtga agcttgttga gtttgatctt 1320
aagcaaacgc ttaccaggct catggcacat ctttttggag atgagccgga gataaggtgg 1380
gtagactgct acgttccttt tggacatcct tcctttgaga tggagatcaa ctttcatgga 1440
gaatggctgg aagttcttgg ctgcggggtg gttgaacaac aactggtcaa ttcagctggt 1500
gctcaagacc gaatcggctg gggatttggc ctagggttag aaaggctagc catgatcctc 1560
tacgacatcc ctgatatccg tctcttctgg tgtgaggacg agcgcttcct gaagcagttc 1620
tgtgtatcca acattaatca gaaggtgaag tttcagcctc ttagcaaa 1668
<210> 2
<211> 556
<212> PRT
<213> Artificial Synthesis (synthetic sequence)
<400> 2
Met Asp Lys Lys Pro Leu Asp Val Leu Ile Ser Ala Thr Gly Leu Trp
1 5 10 15
Met Ser Arg Thr Gly Thr Leu His Lys Ile Lys His Tyr Glu Ile Ser
20 25 30
Arg Ser Lys Ile Tyr Ile Glu Met Ala Cys Gly Asp His Leu Val Val
35 40 45
Asn Asn Ser Arg Ser Cys Arg Pro Ala Arg Ala Phe Arg Tyr His Lys
50 55 60
Tyr Arg Lys Thr Cys Lys Arg Cys Arg Val Ser Asp Glu Asp Ile Asn
65 70 75 80
Asn Phe Leu Thr Arg Ser Thr Glu Gly Lys Thr Ser Val Lys Val Lys
85 90 95
Val Val Ser Glu Pro Lys Val Lys Lys Ala Met Pro Lys Ser Val Ser
100 105 110
Arg Ala Pro Lys Pro Leu Glu Asn Pro Val Ser Ala Lys Ala Ser Thr
115 120 125
Asp Thr Ser Arg Ser Val Pro Ser Pro Ala Lys Ser Thr Pro Asn Ser
130 135 140
Pro Val Pro Thr Ser Ala Ser Ala Pro Ala Leu Thr Lys Ser Gln Thr
145 150 155 160
Asp Arg Leu Glu Val Leu Leu Asn Pro Lys Asp Glu Ile Ser Leu Asn
165 170 175
Ser Gly Lys Pro Phe Arg Glu Leu Glu Ser Glu Leu Leu Ser Arg Arg
180 185 190
Lys Lys Asp Leu Gln Gln Ile Tyr Ala Glu Glu Arg Glu Gly Gly Gly
195 200 205
Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly
210 215 220
Gly Ser Gln Ala Trp Gly Ser Arg Pro Pro Ala Ala Glu Cys Ala Thr
225 230 235 240
Gln Arg Ala Pro Gly Ser Val Val Glu Leu Leu Gly Lys Ser Tyr Pro
245 250 255
Gln Asp Asp His Ser Asn Leu Thr Arg Lys Val Leu Thr Arg Val Gly
260 265 270
Arg Asn Leu His Asn Gln Gln His His Pro Leu Trp Leu Ile Lys Glu
275 280 285
Arg Val Leu Glu His Phe Asn Lys Gln Tyr Val Gly Ser Ser Gly Thr
290 295 300
Pro Leu Phe Ser Val Tyr Asp Asn Leu Ser Pro Val Val Thr Thr Trp
305 310 315 320
Gln Asn Phe Asp Ser Leu Leu Ile Pro Ala Asp His Pro Cys Arg Lys
325 330 335
Lys Gly Asp Asn Tyr Tyr Leu Asn Arg Thr His Met Leu Arg Ala His
340 345 350
Thr Ser Ala His Gln Trp Asp Leu Leu His Ala Gly Leu Asp Ala Phe
355 360 365
Leu Val Val Gly Asp Val Tyr Arg Arg Asp Gln Ile Asp Ser Gln His
370 375 380
Tyr Pro Ile Phe His Gln Leu Asp Ala Gly Arg Leu Phe Ser Lys His
385 390 395 400
Glu Leu Phe Ala Gly Ile Lys Asp Gly Glu Ser Leu Gln Leu Phe Glu
405 410 415
Gln Ser Ser Arg Ser Ala His Lys Gln Glu Thr His Thr Met Glu Ala
420 425 430
Val Lys Leu Val Glu Phe Asp Leu Lys Gln Thr Leu Thr Arg Leu Met
435 440 445
Ala His Leu Phe Gly Asp Glu Pro Glu Ile Arg Trp Val Asp Cys Tyr
450 455 460
Val Pro Phe Gly His Pro Ser Phe Glu Met Glu Ile Asn Phe His Gly
465 470 475 480
Glu Trp Leu Glu Val Leu Gly Cys Gly Val Val Glu Gln Gln Leu Val
485 490 495
Asn Ser Ala Gly Ala Gln Asp Arg Ile Gly Trp Gly Phe Gly Leu Gly
500 505 510
Leu Glu Arg Leu Ala Met Ile Leu Tyr Asp Ile Pro Asp Ile Arg Leu
515 520 525
Phe Trp Cys Glu Asp Glu Arg Phe Leu Lys Gln Phe Cys Val Ser Asn
530 535 540
Ile Asn Gln Lys Val Lys Phe Gln Pro Leu Ser Lys
545 550 555
<210> 3
<211> 201
<212> DNA
<213> human (H. sapiens)
<400> 3
atgagcggtg cagaagaatc agatgatgaa aatgcagttt gtgcagcaca gaattgtcag 60
cgcccgtgta aagataaagt tgattaggtt cagtgtgatg gtggttgtga tgaatggttt 120
catcaggttt gtgttggtgt tagcccggaa atggcagaaa atgaagatta tatttgcatc 180
aactgcgcaa aaaaacaggg t 201
<210> 4
<211> 201
<212> DNA
<213> person (H. sapiens)
<400> 4
atgagcggtg cagaagaatc agatgatgaa aatgcagttt gtgcagcaca gaattgtcag 60
cgcccgtgta aagataaagt tgattgggtt cagtgtgatg gtggttgtga tgaatagttt 120
catcaggttt gtgttggtgt tagcccggaa atggcagaaa atgaagatta tatttgcatc 180
aactgcgcaa aaaaacaggg t 201
<210> 5
<211> 198
<212> DNA
<213> human (H. sapiens)
<400> 5
atggcaagtc aggaatttga agtagaagca attgttgata aacgtcaaga taaaaacggt 60
aatacccaat atctggttcg ttggaaaggt tatgataaac aggatgatac atgggaaccg 120
gaacagcatc tgatgaattg tgaaaaatgt gtgcatgatt tcaaccgtcg ccaaaccgaa 180
aaacagaaag gtggaagc 198
<210> 6
<211> 66
<212> PRT
<213> human (H. sapiens)
<400> 6
Met Ala Ser Gln Glu Phe Glu Val Glu Ala Ile Val Asp Lys Arg Gln
1 5 10 15
Asp Lys Asn Gly Asn Thr Gln Tyr Leu Val Arg Trp Lys Gly Tyr Asp
20 25 30
Lys Gln Asp Asp Thr Trp Glu Pro Glu Gln His Leu Met Asn Cys Glu
35 40 45
Lys Cys Val His Asp Phe Asn Arg Arg Gln Thr Glu Lys Gln Lys Gly
50 55 60
Gly Ser
65
<210> 7
<211> 579
<212> DNA
<213> person (H. sapiens)
<400> 7
atgaatggct gggtacctgt tggggctgcg tgtgagaagg ctgtgtatgt cttggatgag 60
ccggagccag ccatccgaaa gagctaccag gcggtagagc ggcatgggga gacaatccga 120
gtccgggaca ccgtccttct caaatcaggc ccacgaaaga cctccacacc ttatgtggcc 180
aagatctctg ccctctggga gaaccccgag tcaggagagc tgatgatgag cctcctgtgg 240
tattacagac ctgagcactt acagggaggc cgcagtccca gcatgcacga gcccttgcag 300
aatgaagtgt ttgcatcgcg acatcaggac cagaacagtg tggcctgcat tgaggagaag 360
tgctatgtgc tgacttttgc cgagtactgc aggttctgtg ccatggccaa gcgccgaggt 420
gaaggcctcc ccagccgaaa gacagcactg gttcccccct ctgcagacta ttccacccca 480
ccccaccgca cagtgccaga ggacacggac cctgagctgg tgttcctttg ccgccatgtc 540
tatgacttcc gccacgggcg catccttaag aacccccag 579
<210> 8
<211> 193
<212> PRT
<213> human (H. sapiens)
<400> 8
Met Asn Gly Trp Val Pro Val Gly Ala Ala Cys Glu Lys Ala Val Tyr
1 5 10 15
Val Leu Asp Glu Pro Glu Pro Ala Ile Arg Lys Ser Tyr Gln Ala Val
20 25 30
Glu Arg His Gly Glu Thr Ile Arg Val Arg Asp Thr Val Leu Leu Lys
35 40 45
Ser Gly Pro Arg Lys Thr Ser Thr Pro Tyr Val Ala Lys Ile Ser Ala
50 55 60
Leu Trp Glu Asn Pro Glu Ser Gly Glu Leu Met Met Ser Leu Leu Trp
65 70 75 80
Tyr Tyr Arg Pro Glu His Leu Gln Gly Gly Arg Ser Pro Ser Met His
85 90 95
Glu Pro Leu Gln Asn Glu Val Phe Ala Ser Arg His Gln Asp Gln Asn
100 105 110
Ser Val Ala Cys Ile Glu Glu Lys Cys Tyr Val Leu Thr Phe Ala Glu
115 120 125
Tyr Cys Arg Phe Cys Ala Met Ala Lys Arg Arg Gly Glu Gly Leu Pro
130 135 140
Ser Arg Lys Thr Ala Leu Val Pro Pro Ser Ala Asp Tyr Ser Thr Pro
145 150 155 160
Pro His Arg Thr Val Pro Glu Asp Thr Asp Pro Glu Leu Val Phe Leu
165 170 175
Cys Arg His Val Tyr Asp Phe Arg His Gly Arg Ile Leu Lys Asn Pro
180 185 190
Gln
<210> 9
<211> 207
<212> DNA
<213> person (H. sapiens)
<400> 9
atggagtatc aggatgggaa ggagtttgga ataggggacc tcgtgtgggg aaagatcaag 60
ggcttctcct ggtggcccgc catggtggtg tcttggaagg ccacctccaa gcgacaggct 120
atgtctggca tgcggtgggt ccagtggttt ggcgatggca agttctccga ggtctctgca 180
gacaaactgg tggcactggg gctgttc 207
<210> 10
<211> 69
<212> PRT
<213> human (H. sapiens)
<400> 10
Met Glu Tyr Gln Asp Gly Lys Glu Phe Gly Ile Gly Asp Leu Val Trp
1 5 10 15
Gly Lys Ile Lys Gly Phe Ser Trp Trp Pro Ala Met Val Val Ser Trp
20 25 30
Lys Ala Thr Ser Lys Arg Gln Ala Met Ser Gly Met Arg Trp Val Gln
35 40 45
Trp Phe Gly Asp Gly Lys Phe Ser Glu Val Ser Ala Asp Lys Leu Val
50 55 60
Ala Leu Gly Leu Phe
65
<210> 11
<211> 417
<212> DNA
<213> human (H. sapiens)
<400> 11
atgagcggtg cagaagaatc agatgatgaa aatgcagttt gtgcagcaca gaattgtcag 60
cgcccgtgta aagataaagt tgattgggtt cagtgtgatg gtggttgtga tgaatagttt 120
catcaggttt gtgttggtgt tagcccggaa atggcagaaa atgaagatta tatttgcatc 180
aactgcgcaa aaaaacaggg tggcagcagc ggcagcagca gcggtgcaga agaatcagat 240
gatgaaaatg cagtttgtgc agcacagaat tgtcagcgcc cgtgtaaaga taaagttgat 300
tgggttcagt gtgatggtgg ttgtgatgaa tagtttcatc aggtttgtgt tggtgttagc 360
ccggaaatgg cagaaaatga agattatatt tgcatcaact gcgcaaaaaa acagggt 417
<210> 12
<211> 636
<212> DNA
<213> human (H. sapiens)
<400> 12
atgagcggtg cagaagaatc agatgatgaa aatgcagttt gtgcagcaca gaattgtcag 60
cgcccgtgta aagataaagt tgattgggtt cagtgtgatg gtggttgtga tgaatggttt 120
catcaggttt gtgttggtgt tagcccggaa atggcagaaa atgaagatta tatttgcatc 180
aactgcgcaa aaaaacaggg tggcagcagc ggcagcagca gcggtgcaga agaatcagat 240
gatgaaaatg cagtttgtgc agcacagaat tgtcagcgcc cgtgtaaaga taaagttgat 300
tgggttcagt gtgatggtgg ttgtgatgaa tggtttcatc aggtttgtgt tggtgttagc 360
ccggaaatgg cagaaaatga agattatatt tgcatcaact gcgcaaaaaa acagggtctg 420
gtgccgcgcg gcagcagcag cggtgcagaa gaatcagatg atgaaaatgc agtttgtgca 480
gcacagaatt gtcagcgccc gtgtaaagat aaagttgatt gggttcagtg tgatggtggt 540
tgtgatgaat ggtttcatca ggtttgtgtt ggtgttagcc cggaaatggc agaaaatgaa 600
gattatattt gcatcaactg cgcaaaaaaa cagggt 636
<210> 13
<211> 747
<212> DNA
<213> Soybean (Glycine max)
<400> 13
ggaaagtctt acccaactgt gagtgctgat taccaggacg ccgttgagaa ggcgaagaag 60
aagctcagag gcttcatcgc tgagaagaga tgcgctcctc taatgctccg tttggcattc 120
cactctgctg gaacctttga caagggcacg aagaccggtg gacccttcgg aaccatcaag 180
caccctgccg aactggctca cagcgctaac aacggtcttg acatcgctgt taggcttttg 240
gagccactca aggcggagtt ccctattttg agctacgccg atttctacca gttggctggc 300
gttgttgccg ttgaggtcac gggtggacct aaggttccat tccaccctgg aagagaggac 360
aagcctgagc caccaccaga gggtcgcttg cccgatccca ctaagggttc tgaccatttg 420
agagatgtgt ttggcaaagc tatggggctt actgaccaag atatcgttgc tctatctggg 480
ggtcacacta ttggagctgc acacaaggag cgttctggat ttgagggtcc ctggacctct 540
aatcctctta ttttcgacaa ctcatacttc acggagttgt tgagtggtga gaaggaaggt 600
ctccttcagc taccttctga caaggctctt ttgtctgacc ctgtattccg ccctctcgtt 660
gacaaatatg cagcggacga agatgccttc tttgctgatt acgctgaggc tcaccaaaag 720
ctttccgagc ttgggtttgc tgatgcc 747
<210> 14
<211> 249
<212> PRT
<213> Soybean (Glycine max)
<400> 14
Gly Lys Ser Tyr Pro Thr Val Ser Ala Asp Tyr Gln Asp Ala Val Glu
1 5 10 15
Lys Ala Lys Lys Lys Leu Arg Gly Phe Ile Ala Glu Lys Arg Cys Ala
20 25 30
Pro Leu Met Leu Arg Leu Ala Phe His Ser Ala Gly Thr Phe Asp Lys
35 40 45
Gly Thr Lys Thr Gly Gly Pro Phe Gly Thr Ile Lys His Pro Ala Glu
50 55 60
Leu Ala His Ser Ala Asn Asn Gly Leu Asp Ile Ala Val Arg Leu Leu
65 70 75 80
Glu Pro Leu Lys Ala Glu Phe Pro Ile Leu Ser Tyr Ala Asp Phe Tyr
85 90 95
Gln Leu Ala Gly Val Val Ala Val Glu Val Thr Gly Gly Pro Lys Val
100 105 110
Pro Phe His Pro Gly Arg Glu Asp Lys Pro Glu Pro Pro Pro Glu Gly
115 120 125
Arg Leu Pro Asp Pro Thr Lys Gly Ser Asp His Leu Arg Asp Val Phe
130 135 140
Gly Lys Ala Met Gly Leu Thr Asp Gln Asp Ile Val Ala Leu Ser Gly
145 150 155 160
Gly His Thr Ile Gly Ala Ala His Lys Glu Arg Ser Gly Phe Glu Gly
165 170 175
Pro Trp Thr Ser Asn Pro Leu Ile Phe Asp Asn Ser Tyr Phe Thr Glu
180 185 190
Leu Leu Ser Gly Glu Lys Glu Gly Leu Leu Gln Leu Pro Ser Asp Lys
195 200 205
Ala Leu Leu Ser Asp Pro Val Phe Arg Pro Leu Val Asp Lys Tyr Ala
210 215 220
Ala Asp Glu Asp Ala Phe Phe Ala Asp Tyr Ala Glu Ala His Gln Lys
225 230 235 240
Leu Ser Glu Leu Gly Phe Ala Asp Ala
245
<210> 15
<211> 47
<212> DNA
<213> Artificial sequence (synthetic sequence)
<220>
<221> misc_feature
<222> (21)..(22)
<223> n is a, c, g, or t
<400> 15
taagatgggt agactgctac nnkccttttg gtcatccttc ttttgag 47
<210> 16
<211> 25
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 16
gtagcagtct acccatctta tctcc 25
<210> 17
<211> 46
<212> DNA
<213> Artificial sequence (synthetic sequence)
<220>
<221> misc_feature
<222> (21)..(22)
<223> n is a, c, g, or t
<400> 17
aagttcttgg ctgcggggtg nnkgaacaac aactggtcaa ttcagc 46
<210> 18
<211> 20
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 18
<210> 19
<211> 50
<212> DNA
<213> Artificial sequence (synthetic sequence)
<220>
<221> misc_feature
<222> (21)..(22)
<223> n is a, c, g, or t
<220>
<221> misc_feature
<222> (27)..(28)
<223> n is a, c, g, or t
<400> 19
accctatttt ccaccagctg nnkgccnnkc ggctcttctc caagcatgag 50
<210> 20
<211> 24
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 20
cagctggtgg aaaatagggt agtg 24
<210> 21
<211> 46
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 21
ctggtgccgc gcggcagcat gtcccctata ctaggttatt ggaaaa 46
<210> 22
<211> 40
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 22
gtggcgacca tcctccaaaa tgaagcatgc accattcctt 40
<210> 23
<211> 45
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 23
taagaaggag atatacatat gagcggtgca gaagaatcag atgat 45
<210> 24
<211> 44
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 24
catgctgccg cgcggcacca gaccctgttt ttttgcgcag ttga 44
<210> 25
<211> 22
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 25
tgaagcatgc accattcctt gc 22
<210> 26
<211> 58
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 26
catatgtata tctccttctt aaagttaaac aaaattattt ctagcccaaa aaaacggg 58
<210> 27
<211> 46
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 27
cgtgtaaaga taaagttgat taggttcagt gtgatggtgg ttgtga 46
<210> 28
<211> 25
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 28
atcaacttta tctttacacg ggcgc 25
<210> 29
<211> 27
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 29
tttcatcagg tttgtgttgg tgttagc 27
<210> 30
<211> 47
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 30
ccaacacaaa cctgatgaaa ctattcatca caaccaccat cacactg 47
<210> 31
<211> 59
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 31
actgcgcaaa aaaacagggt ggcagcagcg gcagcagcag cggtgcagaa gaatcagat 59
<210> 32
<211> 42
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 32
atgctgccgc gcggcaccag accctgtttt tttgcgcagt tg 42
<210> 33
<211> 39
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 33
accctgtttt tttgcgcagt tgatgcaaat ataatcttc 39
<210> 34
<211> 42
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 34
atgctgccgc gcggcaccag gcttccacct ttctgttttt cg 42
<210> 35
<211> 59
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 35
actgcgcaaa aaaacagggt ggcagcagcg gcagcagcgc aagtcaggaa tttgaagta 59
<210> 36
<211> 40
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 36
ggcagcagcg gcagcagcgt gagcaagggc gaggagctgt 40
<210> 37
<211> 22
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 37
catggtggcg accggtagcg ct 22
<210> 38
<211> 42
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 38
cgctaccggt cgccaccatg agcggtgcag aagaatcaga tg 42
<210> 39
<211> 43
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 39
cgctgctgcc gctgctgcca ccctgttttt ttgcgcagtt gat 43
<210> 40
<211> 43
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 40
ctgcacggaa gcttgccacc atggataaga agccgctgga tgt 43
<210> 41
<211> 46
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 41
tagtgatggt gatggtggtg tttgctaaga ggctgaaact tcacct 46
<210> 42
<211> 25
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 42
caccaccatc accatcacta aaccc 25
<210> 43
<211> 22
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 43
ggtggcaagc ttccgtgcag tt 22
<210> 44
<211> 23
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 44
taactagtcc actgagatcg acg 23
<210> 45
<211> 26
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 45
cttatcgtcg tcatccttgt agtcca 26
<210> 46
<211> 46
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 46
tctggcagcg gttctgctag cggaaagtct tacccaactg tgagtg 46
<210> 47
<211> 40
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 47
cgatctcagt ggactagtta ggcatcagca aacccaagct 40
<210> 48
<211> 41
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 48
acaaggatga cgacgataag agcggtgcag aagaatcaga t 41
<210> 49
<211> 51
<212> DNA
<213> Artificial sequence (synthetic sequence)
<400> 49
gctagcagaa ccgctgccag aaccgctgcc accctgtttt tttgcgcagt t 51
Claims (10)
1. A method for improving cation-pi interaction by genetic code expansion is characterized in that tryptophan of an aromatic cage forming cation-pi interaction in a biological molecule is replaced by a tryptophan analogue by utilizing a genetic code expansion technology so as to improve the binding energy of the cation-pi interaction.
2. A method according to claim 1, characterized in that the method comprises the steps of:
s1, designing and synthesizing strong electron-donating side chain substituted tryptophan analogues, wherein the tryptophan analogues are unnatural amino acids and are selected from one of 6-methyl-tryptophan (A1), 6-methoxy-tryptophan (A2), 7-methyl-tryptophan (A3), 7-methoxy-tryptophan (A4), 6, 7-methoxy-tryptophan (A5), 6, 7-methyl-tryptophan (A6), 7, 8-dihydrofuran-tryptophan (A7), 6, 7-dihydrofuran-tryptophan (A8), 7, 8-furan-tryptophan (A9), 6, 7-furan-tryptophan (A10), 6, 7-dioxole-tryptophan (A11) or 6, 7-cyclopentane-tryptophan (A12), the structural formulas of the tryptophan analogs A1 to A12 are as follows:
s2, screening a chimeric phenylalanine aminoacyl-tRNA synthetase mutant specifically recognizing tryptophan analogs A1 to A12;
s3, taking the biological molecule forming the cation-pi interaction as a research object, and specifically introducing tryptophan analogues into the biological molecule through the chimeric phenylalanine aminoacyl-tRNA synthetase mutant by utilizing the genetic code expansion technology to obtain the protein with the tryptophan analogues.
3. The method according to claim 1, characterized in that the synthesis of the tryptophan analogues is: indole B substituted at different positions is taken as a reactant to react to obtain a target product,
4. The method of claim 1, wherein: the method for synthesizing the tryptophan analogs A1 to A12 comprises the following steps:
the method comprises the following steps: synthesis of starting material compound B:
starting material B is selected from one of 6-methyl-indole (B1), 6-methoxy-indole (B2), 7-methyl-indole (B3), 7-methoxy-indole (B4), 6, 7-methoxy-indole (B5), 6, 7-methyl-indole (B6), 7, 8-dihydrofuran-indole (B7), 6, 7-dihydrofuran-indole (B8), 7, 8-furan-indole (B9), 6, 7-furan-indole (B10), 6, 7-dioxole-indole (B11) or 6, 7-cyclopentane-indole (B12), the structural formulae of the above indole analogs B1 to B12 are:
(1) synthesis of compounds B6, B7, B8, B9, B10: aniline (G6, G7, G8, G9 or G10) and triethanolamine as reactants, and RuCl as a reaction product 3 ·nH 2 O,SnCl 2 ·2H 2 O and PPh 3 As a catalyst, reacting in anhydrous dioxane to obtain a starting material compound B; (2) synthesis of compound B11, B12: aniline (G11 or G12), chloral hydrate and hydroxylamine hydrochloride are used as reactants, sulfuric acid is used as a catalyst, water is used as a solvent to obtain a crude product, the crude product is reacted with methanesulfonic acid to obtain an isatin product, and finally, a starting material compound B is obtained by reduction of lithium aluminum hydride;
step two: synthesis of Compound C: reacting an initial raw material compound B and iodine as reactants, potassium hydroxide as alkali and anhydrous N, N-dimethylformamide as a solvent to obtain an intermediate compound C;
step three: synthesis of Compound D: reacting the compound C and di-tert-butyl dicarbonate serving as reactants in anhydrous dichloromethane by using triethylamine as alkali and DMAP (dimethyl formamide) as a catalyst to obtain an intermediate compound D;
step four: synthesis of Compound E: reacting the compound D and Boc-3-iodine-L-alanine methyl ester serving as reactants in an anhydrous N, N-dimethylformamide solvent under the protection of nitrogen by using palladium acetate as a catalyst and S-Phos as a ligand to obtain an intermediate compound E;
step five: synthesis of Compound F: under the condition that methanol and water are used as solvents, potassium hydroxide is used as alkali, and an intermediate compound F is obtained through reaction;
step six: synthesis of Compound A: and reacting the compound F under the condition of taking anhydrous dichloromethane as a solvent and taking trifluoroacetic acid as a catalyst to obtain a target product (tryptophan analogues A1-A12).
5. The synthesis method of claim 4, wherein in the fourth step, the amount of the catalyst palladium acetate is 2% of the substrate (compound D) by molar weight;
the reaction time of the first step is 1-2h, the reaction time of the second step is 2h, the reaction time of the third step is 8h, the reaction time of the fourth step is 5h, the reaction time of the fifth step is 2-3h, and the reaction time of the sixth step is 2 h;
the reaction temperature in the first step is 90 ℃, the reaction temperature in the second step is 0 ℃, the reaction temperature in the third step is 0 ℃, the reaction temperature in the fourth step is 40 ℃, the reaction temperature in the fifth step is 25 ℃ and the reaction temperature in the sixth step is 0 ℃.
6. The method of claim 1, wherein: in the step S2, the first step,
(1) constructing a saturated mutagenic gene library for amino acids in an amino acid binding pocket of the chimeric phenylalanyl-tRNA synthetase, and screening a chimeric phenylalanyl-tRNA synthetase mutant for specifically recognizing a tryptophan analogue;
(2) identifying the recognition efficiency and specificity of the phenylalanine aminoacyl-tRNA synthetase mutant by GFP fluorescence and LC-MS mass spectrometry;
(3) the obtained chimera phenylalanine aminoacyl-tRNA mutant is screened and applied to the expression of bacteria, cells, viruses and other hosts.
7. The method of claim 1, wherein: in the step S3, the first step,
(1) the tryptophan corresponding site for decoding the protein to form an aromatic cage is mutated into a stop codon (TAG),
(2) co-transformingly expressing the decoding protein mutant and the chimera phenylalanyl-tRNA synthetase mutant, adding corresponding tryptophan analogues in the expression process,
(3) the decoding protein variant was purified according to the GST-tag protein purification method, and the fidelity of the decoding protein variant was identified by LC-MS.
8. The method of claim 1, wherein: the nucleotide sequence and the amino acid sequence of the chimeric phenylalanine-tRNA synthetase mutant for recognizing the tryptophan analogue A1-A6 are respectively shown as SEQ ID NO: 1-2.
9. The method of claim 1, wherein: the method takes histone methylation decoding protein structural domain as a research object, and the decoding protein structural domain is any one of Chromo, PHD, PWWP, Tudor, MBT, CW, SPIN and BAH structural domain.
10. Use of the protein with tryptophan analogues obtained by the method of claim 1 to establish a super-parent recognition system for specifically recognizing histone methylation-modified decoded proteins.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210140263.7A CN114940979B (en) | 2022-02-16 | 2022-02-16 | Method for improving cation-pi interaction by utilizing genetic code expansion and application |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210140263.7A CN114940979B (en) | 2022-02-16 | 2022-02-16 | Method for improving cation-pi interaction by utilizing genetic code expansion and application |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114940979A true CN114940979A (en) | 2022-08-26 |
CN114940979B CN114940979B (en) | 2024-01-23 |
Family
ID=82905867
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210140263.7A Active CN114940979B (en) | 2022-02-16 | 2022-02-16 | Method for improving cation-pi interaction by utilizing genetic code expansion and application |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114940979B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2006330947A1 (en) * | 2005-12-22 | 2007-07-05 | Pacific Biosciences Of California, Inc. | Polymerases for nucleotide analogue incorporation |
CN110172467A (en) * | 2019-05-24 | 2019-08-27 | 浙江大学 | It is a kind of to construct orthogonal aminoacyl-tRNA synthetase/tRNA system using chimeric design method |
CN111118048A (en) * | 2019-11-11 | 2020-05-08 | 浙江大学 | Use of chimeric phenylalanyl-tRNA synthetases/tRNAs |
CN116990270A (en) * | 2023-06-26 | 2023-11-03 | 浙江大学绍兴研究院 | Method for improving pi effect in living cells by utilizing genetic code expansion and application |
-
2022
- 2022-02-16 CN CN202210140263.7A patent/CN114940979B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2006330947A1 (en) * | 2005-12-22 | 2007-07-05 | Pacific Biosciences Of California, Inc. | Polymerases for nucleotide analogue incorporation |
CN110172467A (en) * | 2019-05-24 | 2019-08-27 | 浙江大学 | It is a kind of to construct orthogonal aminoacyl-tRNA synthetase/tRNA system using chimeric design method |
CN111118048A (en) * | 2019-11-11 | 2020-05-08 | 浙江大学 | Use of chimeric phenylalanyl-tRNA synthetases/tRNAs |
CN116990270A (en) * | 2023-06-26 | 2023-11-03 | 浙江大学绍兴研究院 | Method for improving pi effect in living cells by utilizing genetic code expansion and application |
Non-Patent Citations (8)
Also Published As
Publication number | Publication date |
---|---|
CN114940979B (en) | 2024-01-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2438174B1 (en) | METHOD FOR INCORPORATING ALIPHATIC AMINO ACIDS COMPRISING ALKYNE, AZIDE OR ALIPHATIC KETONE FUNCTIONAL GROUPS USING APPROPRIATE tRNA/tRNA SYNTHASE PAIRS | |
JP6603390B2 (en) | Novel peptide library and use thereof | |
CN110577564B (en) | Polypeptides and methods | |
CN112358414B (en) | Unnatural amino acids and their use in protein site-directed modification and protein interactions | |
US20170152287A1 (en) | Methods and compositions for site-specific labeling of peptides and proteins | |
Lin et al. | A tri-functional amino acid enables mapping of binding sites for posttranslational-modification-mediated protein-protein interactions | |
TW201840582A (en) | Peptide compound and method for producing same, composition for screening use, and method for selecting peptide compound | |
CA2755877A1 (en) | Biomolecular labelling using multifunctional biotin analogues | |
WO2015107071A1 (en) | Genetically encoded spin label | |
WO2011024887A1 (en) | Conjugate containing cyclic peptide and method for producing same | |
CN114940979B (en) | Method for improving cation-pi interaction by utilizing genetic code expansion and application | |
KR20160134669A (en) | Cyclopropene amino acids and methods | |
KR101910169B1 (en) | Methods for identification of proteins using phenolic compounds for protein labeling | |
EP3024823B1 (en) | Intercalating amino acids | |
CN114560819B (en) | Substituted triazine compound, preparation method thereof and application thereof in amino acid, peptide, protein and cell marker | |
CN114231553A (en) | High-throughput screening method of signal peptide library based on fluorescent probe Rho-IDA-CoII | |
WO2016017194A1 (en) | Method for fluorescently labeling methylated dna | |
WO2018207077A1 (en) | Genetically encoded biotin and biotin-analogs and use thereof | |
KR101868917B1 (en) | Phenolic compound for labeling to protein and preparation method thereof | |
JP5686385B2 (en) | Methods for fluorescently labeling proteins | |
CN117292741A (en) | Method for developing cell membrane-binding and/or serum albumin lipidation analogues by using computer aided design and application | |
EP4102227A1 (en) | Novel compound to be photo-crosslinked by visible light, and use thereof | |
Metts | Synthesis and Validation of a Trifunctional Trimethoprim-based Probe for Use with Degradation Domain System | |
EP4347572A1 (en) | Amino acids bearing a tetrazine moiety | |
CN117098768A (en) | Biologically reactive compounds and methods of use thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20230116 Address after: 311100 Room 520, Building 2, No. 366, Tongyun Street, Liangzhu Street, Yuhang District, Hangzhou City, Zhejiang Province Applicant after: Hangzhou Chihua Hesheng Pharmaceutical Technology Co.,Ltd. Address before: 310058 Yuhang Tang Road, Xihu District, Hangzhou, Zhejiang 866 Applicant before: ZHEJIANG University |
|
TA01 | Transfer of patent application right | ||
GR01 | Patent grant | ||
GR01 | Patent grant |