CN113930400A - Securinega suffruticosa derived oxidase and application thereof - Google Patents
Securinega suffruticosa derived oxidase and application thereof Download PDFInfo
- Publication number
- CN113930400A CN113930400A CN202010606465.7A CN202010606465A CN113930400A CN 113930400 A CN113930400 A CN 113930400A CN 202010606465 A CN202010606465 A CN 202010606465A CN 113930400 A CN113930400 A CN 113930400A
- Authority
- CN
- China
- Prior art keywords
- polypeptide
- leu
- ser
- compound
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 241000132346 Securinega suffruticosa Species 0.000 title abstract description 60
- 102000004316 Oxidoreductases Human genes 0.000 title abstract description 17
- 108090000854 Oxidoreductases Proteins 0.000 title abstract description 17
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 claims abstract description 85
- 239000002211 L-ascorbic acid Substances 0.000 claims abstract description 37
- 235000000069 L-ascorbic acid Nutrition 0.000 claims abstract description 37
- 229960005070 ascorbic acid Drugs 0.000 claims abstract description 37
- 229930013930 alkaloid Natural products 0.000 claims abstract description 23
- 150000003797 alkaloid derivatives Chemical class 0.000 claims abstract description 21
- SBJKKFFYIZUCET-JLAZNSOCSA-N Dehydro-L-ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(=O)C1=O SBJKKFFYIZUCET-JLAZNSOCSA-N 0.000 claims abstract description 18
- SBJKKFFYIZUCET-UHFFFAOYSA-N Dehydroascorbic acid Natural products OCC(O)C1OC(=O)C(=O)C1=O SBJKKFFYIZUCET-UHFFFAOYSA-N 0.000 claims abstract description 18
- 239000011615 dehydroascorbic acid Substances 0.000 claims abstract description 18
- 235000020960 dehydroascorbic acid Nutrition 0.000 claims abstract description 18
- 230000001590 oxidative effect Effects 0.000 claims abstract description 9
- 238000006482 condensation reaction Methods 0.000 claims abstract description 6
- 229920001184 polypeptide Polymers 0.000 claims description 29
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 29
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 29
- 150000001875 compounds Chemical class 0.000 claims description 23
- 108091033319 polynucleotide Proteins 0.000 claims description 14
- 102000040430 polynucleotide Human genes 0.000 claims description 14
- 239000002157 polynucleotide Substances 0.000 claims description 14
- 244000005700 microbiome Species 0.000 claims description 10
- 239000002773 nucleotide Substances 0.000 claims description 10
- 125000003729 nucleotide group Chemical group 0.000 claims description 10
- CIWBSHSKHKDKBQ-DUZGATOHSA-N D-araboascorbic acid Natural products OC[C@@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-DUZGATOHSA-N 0.000 claims description 9
- 235000010350 erythorbic acid Nutrition 0.000 claims description 9
- 229940026239 isoascorbic acid Drugs 0.000 claims description 9
- 238000002360 preparation method Methods 0.000 claims description 8
- 239000000758 substrate Substances 0.000 claims description 8
- 238000006555 catalytic reaction Methods 0.000 claims description 5
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 4
- 239000002994 raw material Substances 0.000 claims description 4
- 238000007792 addition Methods 0.000 claims description 3
- 239000004318 erythorbic acid Substances 0.000 claims description 3
- 125000000539 amino acid group Chemical group 0.000 claims description 2
- 230000000295 complement effect Effects 0.000 claims description 2
- 238000012217 deletion Methods 0.000 claims description 2
- 230000037430 deletion Effects 0.000 claims description 2
- 238000006467 substitution reaction Methods 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 claims 1
- 239000003513 alkali Substances 0.000 abstract description 3
- 238000012827 research and development Methods 0.000 abstract description 2
- 108090000623 proteins and genes Proteins 0.000 description 71
- 102000004169 proteins and genes Human genes 0.000 description 52
- 235000018102 proteins Nutrition 0.000 description 48
- 229940125904 compound 1 Drugs 0.000 description 45
- SWZMSZQQJRKFBP-WZRBSPASSA-N Securinine Chemical compound N12CCCC[C@@H]2[C@@]23OC(=O)C=C2C=C[C@@H]1C3 SWZMSZQQJRKFBP-WZRBSPASSA-N 0.000 description 32
- SWZMSZQQJRKFBP-UHFFFAOYSA-N Vivosecurinine Natural products N12CCCCC2C23OC(=O)C=C2C=CC1C3 SWZMSZQQJRKFBP-UHFFFAOYSA-N 0.000 description 32
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 24
- 230000000694 effects Effects 0.000 description 21
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 19
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 18
- GEWDNTWNSAZUDX-UHFFFAOYSA-N Jasmonic Acid Methyl Ester Chemical compound CCC=CCC1C(CC(=O)OC)CCC1=O GEWDNTWNSAZUDX-UHFFFAOYSA-N 0.000 description 16
- SWZMSZQQJRKFBP-KUNJKIHDSA-N O=C1O[C@@]23CC(C=CC2=C1)N1CCCC[C@@H]31 Chemical compound O=C1O[C@@]23CC(C=CC2=C1)N1CCCC[C@@H]31 SWZMSZQQJRKFBP-KUNJKIHDSA-N 0.000 description 16
- 229950005774 securinine Drugs 0.000 description 16
- 102000004190 Enzymes Human genes 0.000 description 13
- 229940125782 compound 2 Drugs 0.000 description 13
- 238000012360 testing method Methods 0.000 description 13
- 108090000790 Enzymes Proteins 0.000 description 12
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 12
- 210000004027 cell Anatomy 0.000 description 12
- 229940088598 enzyme Drugs 0.000 description 12
- 238000000034 method Methods 0.000 description 12
- 241000208125 Nicotiana Species 0.000 description 11
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 11
- 239000000872 buffer Substances 0.000 description 11
- 239000000243 solution Substances 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 9
- 238000002474 experimental method Methods 0.000 description 9
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 8
- 239000003814 drug Substances 0.000 description 8
- 239000002243 precursor Substances 0.000 description 8
- 239000006228 supernatant Substances 0.000 description 8
- 241000196324 Embryophyta Species 0.000 description 7
- 239000013078 crystal Substances 0.000 description 7
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 6
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 150000001413 amino acids Chemical group 0.000 description 6
- 238000009833 condensation Methods 0.000 description 6
- 230000005494 condensation Effects 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- AICOOMRHRUFYCM-ZRRPKQBOSA-N oxazine, 1 Chemical compound C([C@@H]1[C@H](C(C[C@]2(C)[C@@H]([C@H](C)N(C)C)[C@H](O)C[C@]21C)=O)CC1=CC2)C[C@H]1[C@@]1(C)[C@H]2N=C(C(C)C)OC1 AICOOMRHRUFYCM-ZRRPKQBOSA-N 0.000 description 6
- 239000012071 phase Substances 0.000 description 6
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 5
- QTBSBXVTEAMEQO-UHFFFAOYSA-N acetic acid Substances CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 239000007864 aqueous solution Substances 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 229940126214 compound 3 Drugs 0.000 description 5
- 238000000605 extraction Methods 0.000 description 5
- -1 fermentation liquor Proteins 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- GEWDNTWNSAZUDX-WQMVXFAESA-N (-)-methyl jasmonate Chemical compound CC\C=C/C[C@@H]1[C@@H](CC(=O)OC)CCC1=O GEWDNTWNSAZUDX-WQMVXFAESA-N 0.000 description 4
- 208000002109 Argyria Diseases 0.000 description 4
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 4
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 4
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 4
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 4
- HEDRZPFGACZZDS-MICDWDOJSA-N Trichloro(2H)methane Chemical compound [2H]C(Cl)(Cl)Cl HEDRZPFGACZZDS-MICDWDOJSA-N 0.000 description 4
- 238000011033 desalting Methods 0.000 description 4
- 108010009297 diglycyl-histidine Proteins 0.000 description 4
- 238000001035 drying Methods 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 238000000227 grinding Methods 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 230000000155 isotopic effect Effects 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 210000004881 tumor cell Anatomy 0.000 description 4
- 241000282326 Felis catus Species 0.000 description 3
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- 239000012980 RPMI-1640 medium Substances 0.000 description 3
- 241000867909 Securinega Species 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 3
- 239000007853 buffer solution Substances 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 238000012512 characterization method Methods 0.000 description 3
- DQLATGHUWYMOKM-UHFFFAOYSA-L cisplatin Chemical compound N[Pt](N)(Cl)Cl DQLATGHUWYMOKM-UHFFFAOYSA-L 0.000 description 3
- 229960004316 cisplatin Drugs 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 238000001502 gel electrophoresis Methods 0.000 description 3
- 239000007791 liquid phase Substances 0.000 description 3
- 238000004949 mass spectrometry Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 229910052757 nitrogen Inorganic materials 0.000 description 3
- 230000003647 oxidation Effects 0.000 description 3
- 238000007254 oxidation reaction Methods 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 239000008223 sterile water Substances 0.000 description 3
- 238000000108 ultra-filtration Methods 0.000 description 3
- PCTMTFRHKVHKIS-BMFZQQSSSA-N (1s,3r,4e,6e,8e,10e,12e,14e,16e,18s,19r,20r,21s,25r,27r,30r,31r,33s,35r,37s,38r)-3-[(2r,3s,4s,5s,6r)-4-amino-3,5-dihydroxy-6-methyloxan-2-yl]oxy-19,25,27,30,31,33,35,37-octahydroxy-18,20,21-trimethyl-23-oxo-22,39-dioxabicyclo[33.3.1]nonatriaconta-4,6,8,10 Chemical group C1C=C2C[C@@H](OS(O)(=O)=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2.O[C@H]1[C@@H](N)[C@H](O)[C@@H](C)O[C@H]1O[C@H]1/C=C/C=C/C=C/C=C/C=C/C=C/C=C/[C@H](C)[C@@H](O)[C@@H](C)[C@H](C)OC(=O)C[C@H](O)C[C@H](O)CC[C@@H](O)[C@H](O)C[C@H](O)C[C@](O)(C[C@H](O)[C@H]2C(O)=O)O[C@H]2C1 PCTMTFRHKVHKIS-BMFZQQSSSA-N 0.000 description 2
- 238000001644 13C nuclear magnetic resonance spectroscopy Methods 0.000 description 2
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 2
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 2
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 2
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 2
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 2
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 2
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 2
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 2
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 2
- OKKJLVBELUTLKV-MZCSYVLQSA-N Deuterated methanol Chemical compound [2H]OC([2H])([2H])[2H] OKKJLVBELUTLKV-MZCSYVLQSA-N 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 2
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 2
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 2
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 2
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 2
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 2
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 2
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 2
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 2
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 2
- 108010093096 Immobilized Enzymes Proteins 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 2
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 2
- HWMQRQIFVGEAPH-XIRDDKMYSA-N Leu-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 HWMQRQIFVGEAPH-XIRDDKMYSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 2
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 2
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 2
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 2
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 2
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 2
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- OEYKVQKYCHATHO-SZMVWBNQSA-N Lys-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N OEYKVQKYCHATHO-SZMVWBNQSA-N 0.000 description 2
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 2
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 2
- 238000005481 NMR spectroscopy Methods 0.000 description 2
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 2
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 2
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 2
- ZFVWWUILVLLVFA-AVGNSLFASA-N Phe-Gln-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N ZFVWWUILVLLVFA-AVGNSLFASA-N 0.000 description 2
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 2
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 2
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 2
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 2
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 2
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 2
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 2
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 2
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 2
- DWAQJAXMDSEUJJ-UHFFFAOYSA-M Sodium bisulfite Chemical compound [Na+].OS([O-])=O DWAQJAXMDSEUJJ-UHFFFAOYSA-M 0.000 description 2
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 2
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- QFEYTTHKPSOFLV-OSUNSFLBSA-N Thr-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H]([C@@H](C)O)N QFEYTTHKPSOFLV-OSUNSFLBSA-N 0.000 description 2
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 2
- GLNADSQYFUSGOU-GPTZEZBUSA-J Trypan blue Chemical compound [Na+].[Na+].[Na+].[Na+].C1=C(S([O-])(=O)=O)C=C2C=C(S([O-])(=O)=O)C(/N=N/C3=CC=C(C=C3C)C=3C=C(C(=CC=3)\N=N\C=3C(=CC4=CC(=CC(N)=C4C=3O)S([O-])(=O)=O)S([O-])(=O)=O)C)=C(O)C2=C1N GLNADSQYFUSGOU-GPTZEZBUSA-J 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 2
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 2
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 2
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 2
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 2
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 2
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 2
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 2
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 238000002441 X-ray diffraction Methods 0.000 description 2
- 241000235015 Yarrowia lipolytica Species 0.000 description 2
- 229960000583 acetic acid Drugs 0.000 description 2
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 2
- 238000007605 air drying Methods 0.000 description 2
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 230000010261 cell growth Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 2
- 238000002330 electrospray ionisation mass spectrometry Methods 0.000 description 2
- 150000002081 enamines Chemical class 0.000 description 2
- 239000000469 ethanolic extract Substances 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- HQVFCQRVQFYGRJ-UHFFFAOYSA-N formic acid;hydrate Chemical compound O.OC=O HQVFCQRVQFYGRJ-UHFFFAOYSA-N 0.000 description 2
- 238000005194 fractionation Methods 0.000 description 2
- 230000008014 freezing Effects 0.000 description 2
- 238000007710 freezing Methods 0.000 description 2
- 239000012362 glacial acetic acid Substances 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000005286 illumination Methods 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 239000012452 mother liquor Substances 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 229920000523 polyvinylpolypyrrolidone Polymers 0.000 description 2
- 235000013809 polyvinylpolypyrrolidone Nutrition 0.000 description 2
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 238000002953 preparative HPLC Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 229910052709 silver Inorganic materials 0.000 description 2
- 239000004332 silver Substances 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 235000010267 sodium hydrogen sulphite Nutrition 0.000 description 2
- 230000001954 sterilising effect Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- XDIYNQZUNSSENW-UUBOPVPUSA-N (2R,3S,4R,5R)-2,3,4,5,6-pentahydroxyhexanal Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O.OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O XDIYNQZUNSSENW-UUBOPVPUSA-N 0.000 description 1
- XDIYNQZUNSSENW-KZXKDKCNSA-N (2s,3s,4r,5r)-2,3,4,5,6-pentahydroxyhexanal Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O.OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O XDIYNQZUNSSENW-KZXKDKCNSA-N 0.000 description 1
- 238000005160 1H NMR spectroscopy Methods 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- 238000009010 Bradford assay Methods 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- ZZZCUOFIHGPKAK-UHFFFAOYSA-N D-erythro-ascorbic acid Natural products OCC1OC(=O)C(O)=C1O ZZZCUOFIHGPKAK-UHFFFAOYSA-N 0.000 description 1
- 108020004414 DNA Proteins 0.000 description 1
- 241000221017 Euphorbiaceae Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 241001200922 Gagata Species 0.000 description 1
- 208000032612 Glial tumor Diseases 0.000 description 1
- 206010018338 Glioma Diseases 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 1
- 235000019766 L-Lysine Nutrition 0.000 description 1
- 150000000996 L-ascorbic acids Chemical class 0.000 description 1
- WQZGKKKJIJFFOK-DHVFOXMCSA-N L-galactose Chemical compound OC[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O WQZGKKKJIJFFOK-DHVFOXMCSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 101710138460 Leaf protein Proteins 0.000 description 1
- 206010024264 Lethargy Diseases 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 208000007443 Neurasthenia Diseases 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- XDIYNQZUNSSENW-PIIYHDNRSA-N O=C[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)CO.O=C[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)CO Chemical compound O=C[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)CO.O=C[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)CO XDIYNQZUNSSENW-PIIYHDNRSA-N 0.000 description 1
- 108010019160 Pancreatin Proteins 0.000 description 1
- 206010033799 Paralysis Diseases 0.000 description 1
- 241001130943 Phyllanthus <Aves> Species 0.000 description 1
- 208000000474 Poliomyelitis Diseases 0.000 description 1
- 108010026552 Proteome Proteins 0.000 description 1
- 229930189077 Rifamycin Natural products 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- 239000005708 Sodium hypochlorite Substances 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 239000012505 Superdex™ Substances 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 229930003268 Vitamin C Natural products 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000002250 absorbent Substances 0.000 description 1
- 230000002745 absorbent Effects 0.000 description 1
- 235000011114 ammonium hydroxide Nutrition 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 239000002246 antineoplastic agent Substances 0.000 description 1
- 229940041181 antineoplastic drug Drugs 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 206010003549 asthenia Diseases 0.000 description 1
- 210000001130 astrocyte Anatomy 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 239000011942 biocatalyst Substances 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000007859 condensation product Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 239000003712 decolorant Substances 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 239000011536 extraction buffer Substances 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 230000009036 growth inhibition Effects 0.000 description 1
- 230000002440 hepatic effect Effects 0.000 description 1
- ZRJBHWIHUMBLCN-YQEJDHNASA-N huperzine A Chemical compound N1C(=O)C=CC2=C1C[C@H]1\C(=C/C)[C@]2(N)CC(C)=C1 ZRJBHWIHUMBLCN-YQEJDHNASA-N 0.000 description 1
- ZRJBHWIHUMBLCN-SEQYCRGISA-N huperzine a Chemical compound N1C(=O)C=CC2=C1C[C@H]1/C(=C/C)[C@]2(N)CC(C)=C1 ZRJBHWIHUMBLCN-SEQYCRGISA-N 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000002329 infrared spectrum Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 230000001678 irradiating effect Effects 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 235000018977 lysine Nutrition 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000005374 membrane filtration Methods 0.000 description 1
- 125000001434 methanylylidene group Chemical group [H]C#[*] 0.000 description 1
- YMWUJEATGCHHMB-UHFFFAOYSA-N methylene chloride Substances ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 1
- 239000000693 micelle Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 1
- 150000007523 nucleic acids Chemical group 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 229940055695 pancreatin Drugs 0.000 description 1
- 238000005191 phase separation Methods 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000000425 proton nuclear magnetic resonance spectrum Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 229960003292 rifamycin Drugs 0.000 description 1
- HJYYPODYNSCCOU-ODRIEIDWSA-N rifamycin SV Chemical compound OC1=C(C(O)=C2C)C3=C(O)C=C1NC(=O)\C(C)=C/C=C/[C@H](C)[C@H](O)[C@@H](C)[C@@H](O)[C@@H](C)[C@H](OC(C)=O)[C@H](C)[C@@H](OC)\C=C\O[C@@]1(C)OC2=C3C1=O HJYYPODYNSCCOU-ODRIEIDWSA-N 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000007873 sieving Methods 0.000 description 1
- 239000000741 silica gel Substances 0.000 description 1
- 229910002027 silica gel Inorganic materials 0.000 description 1
- 238000010898 silica gel chromatography Methods 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- SUKJFIGYRHOWBL-UHFFFAOYSA-N sodium hypochlorite Chemical compound [Na+].Cl[O-] SUKJFIGYRHOWBL-UHFFFAOYSA-N 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000001269 time-of-flight mass spectrometry Methods 0.000 description 1
- YNJBWRMUSHSURL-UHFFFAOYSA-N trichloroacetic acid Chemical compound OC(=O)C(Cl)(Cl)Cl YNJBWRMUSHSURL-UHFFFAOYSA-N 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 230000004565 tumor cell growth Effects 0.000 description 1
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 description 1
- 238000002137 ultrasound extraction Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000019154 vitamin C Nutrition 0.000 description 1
- 239000011718 vitamin C Substances 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/18—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
- C12P17/188—Heterocyclic compound containing in the condensed system at least one hetero ring having nitrogen atoms and oxygen atoms as the only ring heteroatoms
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Peptides Or Proteins (AREA)
Abstract
The invention discloses an oxidase SEQ ID NO. 2 derived from securinega suffruticosa, which can catalyze L-ascorbic acid/dehydroascorbic acid to carry out an oxidative condensation reaction with securinega suffruticosa alkali to prepare securinega suffruticosa alkaloid flueaffine A and analogues thereof and has research and development values.
Description
Technical Field
The invention belongs to the field of bioengineering, and particularly relates to a securinega suffruticosa derived oxidase FsBBE and application thereof in preparation of securinega suffruticosa alkaloid fluesuffine A.
Background
Securinega suffruticosa, known as Fluegea suffrutinosa (Pall.) Baill, is a plant of phyllanthus of Euphorbiaceae, is widely distributed in China, and is not found in China except northwest. Securinega suffruticosa can be used for treating nervous related diseases such as paralysis, neurasthenia, lethargy, etc. The securinega suffruticosa contains abundant securinega alkaloid, is considered as a biological active component of a securinega plant, and comprises securinine (securinine) and securinine (allosecurinine), wherein the securinine is a medicament used for treating sequela of poliomyelitis, active compounds with novel structures are separated from the securinega suffruticosa in recent years, and the new compounds have multiple activities of resisting tumors, viruses, bacteria, central nerves and the like. The research on the alkaloid in the securinega suffruticosa can discover more bioactive components, and the analysis on the synthetic pathway of the alkaloid in the securinega suffruticosa can find some enzymes with new functions and new biocatalysis tools, so that a foundation is laid for synthesizing the securinega suffruticosa alkaloid by a synthetic biology method, and the plant resource of the traditional Chinese medicine can be more fully developed and utilized.
Disclosure of Invention
Some alkaloid components in the securinega suffruticosa are extracted, separated and identified, and the securinega suffruticosa alkaloid with a novel structure is obtained by separation, has a molecular structure shown in the following formula 1 and is named as fluebuffine A:
the ABCD ring part in the molecular structure of the compound 1 is very similar to the structure of allosecurinine/securinine, and is shown in the following formula.
It is speculated that the ABCD loop moiety is likely to be derived from allocoresine or securinine, the 6-carbon moiety in the structure in which the FG loop is highly oxidized, is likely to be derived from vitamin C or L-ascorbic acid (L-AA), and thus compound 1 is likely to be a vitamin C-modified natural product. The biosynthetic pathway of compound 1 is inferred to be the following pathway.
Tyrosine (L-tyrosine) and lysine (L-lysine) are polymerized into alloneuronine/securinine after multi-step enzyme catalytic modification, and then the alloneuronine/securinine and L-ascorbic acid (L-AA) are polymerized into the compound 1 under the catalytic action of one or more oxidase (oxidase) in one step or multiple steps.
To verify the above speculation, we isolated an oxidase from the plant of securinega suffruticosa and its coding gene, and found that it should belong to the BBE family of oxidoreductases through sequence analysis and comparison, so named as FsBBE, which can catalyze the oxidative condensation reaction of L-ascorbic acid or dehydroascorbic acid (DHA) and securine to obtain compound 1.
Accordingly, in a first aspect the present invention provides an isolated polypeptide selected from the group consisting of:
(a) a polypeptide having the amino acid sequence of SEQ ID NO: 2:
MNPLKHSSSTPLVFVLLTVCSCATSVTIPELFFQCLSNTTTTSTSIFNVLYTPRNTSYTSILESRIQNLRFNTTDTPKPLAIVTPLDASHIQATIICARKHNLQIRIRSGGHDYEGLSYVSPLPFVVLDLINLRNITVDVENRVAWVGCGATLGEFYYRIAEKTRTLAFPAGACPTVGVGGHFSGGGYGYLLRKFGLAADNILDASLVDVNGRILDRASMGEDLFWAIRGGGGNSFGVVIAWKVNLVPVPSTLTSFKVSKSLEQNTMIQLLNKWQYVANKLPDELSMFAVVSKKNSTISVKFYSLYVGGIDSLLPLMEERFPELGLKRADCNEMSWIESAVSFAGYASNTSLDVLLNHTNNYEIASGRFKGKSDFVKEPVPEAALEGLLKWLSDKDITNAAIYMVPLGGKMGEITETSIPFPHRAGNLYLLAYYVKWEGQGTEAAQKPLSWIRKGYKYMAPYVSKNPREAHLNDRDLDIGTNNISGNTSYEQASIWGTKYFKNNFDRLVRVKTSVDPSDFFRNEQSVPPLLS(SEQ ID NO:2);
(b) 2 by substitution, deletion or addition of one or more amino acid residues, and has (a) polypeptide function;
(c) a polypeptide derived from (a) having a homology of 95% or more, preferably 98% or more, more preferably 99% or more with the polypeptide sequence defined in (a) and having the function of the polypeptide of (a); or
(d) A derivative polypeptide of the polypeptide sequence in (a) or (b) or (c) is contained in the sequence.
In a second aspect the present invention provides an isolated polynucleotide selected from the group consisting of:
(A) a polynucleotide encoding the polypeptide of claim 1;
(B) a polynucleotide encoding a polypeptide having the amino acid sequence shown in SEQ ID NO. 2;
(C) the polynucleotide with the nucleotide sequence shown as SEQ ID NO. 1;
(D) polynucleotide whose nucleotide sequence has homology of 95% or more, preferably 98% or more, more preferably 99% or more with the nucleotide sequence shown in SEQ ID NO. 1;
(E) a nucleotide sequence complementary to the nucleotide sequence of any one of (A) to (D).
Preferably, the polynucleotide is SEQ ID NO 1.
Another aspect of the present invention provides a vector comprising the above polynucleotide, and a microorganism transformed with the vector.
The microorganism can be selected from Escherichia coli, Pichia pastoris, Saccharomyces cerevisiae, yarrowia lipolytica, and Bacillus subtilis. Preferably E.coli BL21(DE 3).
The fourth aspect of the invention provides the use of the polypeptide or the microorganism in the preparation of securinega suffruticosa alkaloid (flueuffine A) and compound 2 (flueuffine B) shown in formula 1.
In the preparation of compound 1, compound 1 is prepared by an oxidative condensation reaction catalyzed by the above-mentioned polypeptide using, for example, L-ascorbic acid (L-AA) or dehydroascorbic acid (DHA) and securinine (allosecurinine) as substrate raw materials.
Specifically, the substrate material is selected from L-ascorbic acid and securinega suffruticosa alkaloid or dehydroascorbic acid and securinega suffruticosa alkaloid.
In the preparation of compound 2, compound 2 is prepared by, for example, oxidation-condensation reaction using isoascorbic acid (isoascorbyl acid) and securinine (allosecurinine) as substrate raw materials under the catalysis of the above-mentioned polypeptide.
Specifically, the substrate material is selected from isoascorbic acid and securinega suffruticosa alkali.
The polypeptide SEQ ID NO. 2 disclosed by the invention can catalyze the oxidation condensation of securinega suffruticosa alkaloid (allosecurinine) and L-ascorbic acid (L-AA) or dehydroascorbic acid (DHA) to prepare a novel securinega suffruticosa alkaloid fluensuine A; and can catalyze the securinega suffruticosa alkali and the isoascorbic acid to carry out oxidative condensation to prepare a novel compound flueuffine B, lays a foundation for analyzing the synthetic biology of the alkaloid in the securinega suffruticosa, promotes the development and utilization of new natural medicines, and has further research and development prospects.
Drawings
Fig. 1 shows a photograph of a culture of a sterilized seedling of securinega suffruticosa and a photograph of feeding L-AA precursor to securinega suffruticosa leaves.
FIG. 2 shows the results of the contents of different L-AA precursor compounds 1 fed to Securinega suffruticosa and isotopically labeled L-ascorbic acid (1-13C) Statistical plot of isotopic abundance of compound 1. FIG. A: results of feeding the contents of the different substrate compounds 1, the ordinate is relative content; and B: i: feeding non-isotopically labeled L-ascorbic acid (1-13C) Isotopic abundance of compound 1; ii: feeding isotopically labelled L-ascorbic acid (1-13C) Isotopic abundance of compound 1; iii: 3 and L-AA in D2Isotopic abundance of compound 1 obtained by the reaction in O.
FIG. 3 is a statistical graph showing the relative content of Compound 1 in Securinega suffruticosa leaves induced by MeJA for various periods of time. Wherein the ordinate is relative fresh weight content, water is water, leaf is leaf, root is root, stem is stem.
FIG. 4 is a graph showing the results of Q-TOF detection of proteins in Securinega suffruticosa leaves. Wherein the abscissa is the retention time. The left side shows the results of extracting EIC (390.1183), i: the results of the activities of the total protein of the leaves of the securinega suffruticosa, allosecurinine and L-AA; ii: the results of the activities of the total protein of the securinega suffruticosa leaf, allosecurinine and DHA; iii: the activity results of the purified FsBBE protein, allosecurinine and L-AA; iv: the activity results of the purified FsBBE protein, allocorenin and DHA; v: the activity results of the purified FsBBE protein, allosecurinine and L-AA; vi: the activity results of the purified FsBBE protein, allocoresine and isoascorbic acid; vii: compound 1 standard. The right side is the research result of the FsBBE mechanism, i and ii are graphs of EIC 216.1019 extracted by the reaction of purified FsBBE protein and allocoresine and the reaction of total protein of tobacco leaves injected with FsBBE gene and allocoresine respectively; iii, iv and v are graphs of the reaction extraction 390.1183 results of enamine intermediate 3 and L-AA, DHA and isoascorbic acid, respectively; vi and vii are standards for compounds 2 and 1, respectively.
FIG. 5 shows the photograph of SDS-PAGE gel electrophoresis silver staining analysis of the active components after the total protein of Securinega suffruticosa leaves passes through the ion exchange column.
FIG. 6 shows a photograph of an SDS-PAGE gel electrophoresis silver stain analysis of the active fraction after molecular sieving. The bands marked FS-F0-11-33-1 and FS-F0-11-34-1 are bands of the target protein.
FIG. 7 shows a photograph of SDS-PAGE gel electrophoresis silver staining of candidate active proteins purified from tobacco, with FsBBE protein bands marked by arrows.
Figure 8 shows the absolute configuration of compounds 1 and 2.
FIG. 9 shows the proposed synthetic mechanism for compound 1 and compound 2.
Detailed Description
In order to analyze and identify alkaloid components in securinega suffruticosa, branches and leaves of securinega suffruticosa are extracted by organic solvents such as ethanol and ethyl acetate, and the obtained total alkaloids are subjected to silica gel column chromatography and semi-preparative high performance liquid chromatography for separation to obtain compound 1. Using spectral analysis including high resolution electrospray ionization mass spectrometry (HRESIMS),1H and13C-NMR, in combination with two-dimensional nuclear magnetic resonance, Infrared (IR), etc., determines its molecular structure, and determines its absolute configuration by X-ray diffraction.
The compound 1 comprises the skeleton structure of allosecurine (securinine) or securinine (securinine), and should be the oxidation condensation product of allosecurinine/securinine and L-ascorbic acid/dehydroascorbic acid, and the oxidation condensation process is carried out under the catalysis of oxidase FsBBE.
In order to allow the oxidase FsBBE to be applied to similar catalytic reactions, expression of the protein by a microorganism is the best method for producing the enzyme.
Since the amino acid sequence of the oxidase FsBBE of the present invention is clear, the genes encoding the same, expression cassettes and plasmids containing the genes, and transformants containing the plasmids are easily obtained by those skilled in the art. These genes, expression cassettes, plasmids, and transformants can be obtained by genetic engineering construction means well known to those skilled in the art.
The above-described transformant host may be any microorganism suitable for expressing the enzyme SEQ ID NO. 1, including bacteria and fungi. Preferably the microorganism is selected from the group consisting of Escherichia coli, Pichia pastoris, Saccharomyces cerevisiae, yarrowia lipolytica, Bacillus subtilis. More preferably E.coli BL21(DE 3).
When used as a biocatalyst for the production of alkaloids of similar structure such as flueuffine A, the oxidase FsBBE of the present invention may be in the form of an enzyme or in the form of a bacterial cell. The enzyme forms comprise free enzyme and immobilized enzyme, including purified enzyme, crude enzyme, fermentation liquor, enzyme immobilized by a carrier and the like; the form of the thallus comprises a viable thallus and a dead thallus.
The isolation and purification of the oxidase FsBBE of the present invention, including immobilized enzyme preparation techniques, are also well known to those skilled in the art.
The present invention will be described in further detail with reference to specific examples. It should be understood that the following examples are illustrative only and are not intended to limit the scope of the present invention.
The addition amount, content and concentration of various substances are referred to herein, wherein the percentage refers to the mass percentage unless otherwise specified.
In the examples herein, if no specific description is made about the reaction temperature or the operation temperature, the temperature is usually referred to as room temperature (15 to 30 ℃).
EXAMPLE 1 isolation and characterization of Compound 1
1.1 extraction and isolation of Securinega suffruticosa alkaloid
Stems and leaves of securinega suffruticosa, purchased in south-south of Henan Yang in 2017 in 9 months, identified by Yangye hong researcher at Shanghai mountain plant science research center of Chinese academy of sciences.
Grinding the branches and leaves of Securinega suffruticosa (L.) Securinega Suffruticosa (L.) Securinega) S.C. and dry weight of leaves of 1.8 kg), repeatedly extracting with 5L ethanol for 4 times, distilling and concentrating ethanol phase to obtain ethanol extract, re-suspending the ethanol extract with 1L diluted hydrochloric acid aqueous solution with pH of 6.0, extracting with ethyl acetate of equal volume for three times, adjusting pH of water layer with ammonia water to 8.0, and collecting the water layerExtracting with equal volume of ethyl acetate for three times, mixing ethyl acetate phases, distilling and concentrating to obtain total alkaloid 27.0g in terms of CH2Cl2The mobile phase of MeOH is 100:0,30:1,20:1,15:1,10:1,7:1,4:1,2:1,1:1 and 0:1, the silica gel column is used for separation, TLC detection is carried out, similar fractions are combined, and 8 components are obtained: F1-F8, fraction F5 was separated again with semi-preparative high performance liquid chromatography on a Luna C18 semi-preparative column (250X 10mm,5 μm) with mobile phase: methanol: 0.1% formic acid water is 5:95-25:75, 0-10 min; 25:75-25:75, 10-15 min; 25:75-95:5, 15-25 min; gradient eluting for 95:5,25-30min, separating to obtain compound 1 (t)R22.5 min). Compound 1 was dissolved in deuterated chloroform for nuclear magnetic identification in the solvent ethyl acetate: the methanol was slowly evaporated at room temperature under the condition of 7:1 to obtain a single crystal of compound 1, and the absolute configuration of the single crystal was determined by X-ray diffraction.
1.2 isolation and Structure characterization of Compound 1
The compound 1 obtained by separation is light yellow solid, and HRESIMS (high resolution electrospray ionization mass spectrometry) M/z [ M + H ]]+390.1183, shows its molecular formula C19H19NO8The unsaturation degree was 11. IR spectrum showed hydroxyl group (3452 cm)-1) Carbonyl group (1745 cm)-1) And double bonds (2943 cm)-1、1441cm-1And 1378cm-1) And (4) absorbing.1H and13the C-NMR spectrum combined with the two-dimensional nuclear magnetic spectrum showed (see Table 1) that Compound 1 had two ester or amide carbonyl groups (. delta.) (C171.4,174.3), 1 sp2Quaternary carbon deltaC163.7, 3 sp2Methine group (. delta.)C/H148.8/7.05,122.4/6.57,111.3/5.86), 4 lower fields of sp3Quaternary carbon (. delta.)C112.3,101.5,93.1,80.4), 4 sp3Methine group (. delta.)C/H90.9/5.05,74.4/4.52,56.5/3.92, 37.0/2.11)), one of which is methine in the lower field, 5 sp3Methylene (table 1.3.1). Comparing the data with reported alkaloid nuclear magnetic data separated from securinega suffruticosa, the compound 1 contains allosecurinine or mixed source dipolymer of securinine framework structure. Detailed analysis of the two-dimensional nuclear magnetic spectrum of compound 1 shows that the planar structure of compound 1 is shown in formula 1, and compound 1 has 8 handsAnd 4 of the carbon atoms are quaternary carbons. After several attempts we fortunately obtained a single crystal that was able to determine the quality of the absolute configuration of the compound, and single crystal diffraction results showed that the compound was in the 2R,3R,7S,9S,2 'S, 3' R,4 'R, and 6' S configurations. We named compound 1 fluesuffine a.
TABLE 1 preparation of Compound 11H spectrum and13c NMR data
The biosynthetic pathway of compound 1 was analyzed and validated as follows.
Example 2 culture of aseptic seedlings of Securinega suffruticosa
Seed was placed at 0.1% v/w 20, slowly oscillating for 5min at 90rpm of a shaking table, and washing with distilled water for 3 times; then placing the seeds into ethanol with the mass fraction of 70% for surface sterilization for 1min, immediately transferring to 0.5% v/v sodium hypochlorite solution and slowly oscillating at 90rpm for 12 min; then washing the seeds with sterile water in a super clean bench for 3 times, putting the seeds into the sterile water overnight, pouring the sterile water in the super clean bench, sucking the excess water on the surfaces of the seeds with sterile filter paper, putting the seeds into a sterile and moist filter paper of a sterile flat plate, standing overnight at 4 ℃, and putting the seeds into a 24 ℃ constant temperature incubator to be cultured in a dark place. After the seeds are cultured on the flat plate for about one week, the seeds begin to germinate, the seeds are continuously cultured for 3 days at 24 ℃ for 16 h/day under illumination, aseptic seedlings grow into aseptic seedlings, the roots of the aseptic seedlings are cut off, the aseptic seedlings with the roots cut off are transferred to a PGMO5 culture medium, and the aseptic seedlings are continuously cultured for about 3-4 weeks in a constant temperature incubator at 24 ℃ for 16 h/day under illumination, as shown in figure 1.
PGMO5 medium: 2.2g MS culture medium, 10g sucrose, 4g phytogel and water to make 1L, adjusting pH to 5.8 with NaOH solution, and sterilizing at 121 deg.C for 20 min.
Example 3L-AA precursor and isotopically labeled L-ascorbic acid (1-13C) Feeding experiment of
Feeding experiment of L-AA precursor including D-glucose (D-glucose), D-mannose (D-mannose), L-galactose (L-galactose), L-galactono-1,4-lactone (L-galactono-1,4-lactone), L-ascorbic acid (L-ascorbic acid) and the like was performed with sterilized leaf of Securinega suffruticosa grown for about 5 weeks, then three seedlings of Securinega suffruticosa were taken out from the bottle (see FIG. 1), the leaves of sterilized seedlings were cut off (the lowermost and uppermost leaves were discarded), and the leaves were cut off from the middle of the leaves, the leaves were immersed in an aqueous solution (concentration of 5mM, as shown in FIG. 1) of the precursor feeding compound, three replicates were performed for each of the precursor feeding compound, water was used as a control group, and the sterilized leaf of Securinega suffruticosa was completely immersed in the aqueous solution of the precursor compound during feeding, shaking at 90rpm, irradiating at 24 deg.C for 72 hr, and culturing in constant temperature incubator for 72 hr. Taking out the leaves after 72h, sucking excessive water by using absorbent paper, weighing, putting into a 1.5ml EP tube, freezing and grinding by using liquid nitrogen, adding 1ml of methanol (HupA with the concentration of 50nM is an internal standard) extracting solution containing 0.1% formic acid into each 100mg of wet weight sample in equal proportion, carrying out ultrasonic extraction for 10min, centrifuging for 10min at 17000g, filtering supernatant by using a 0.2 mu M filter membrane, detecting Q-TOF, calculating the relative content of the compound 1, and calculating the method: (1/hupzine a 50x10-6 x 1x10-3) 389.36/100 (compound 1 and hupzine a (huperzine a) represent peak areas from which EIC 390.1183 and 243.1497 were extracted when mass spectrometrically identified. Content results for feeding different substrate Compound 1 and feeding isotopically labeled L-ascorbic acid (1-13C) The isotopic abundance of compound 1 is shown in figure 2.
As can be seen from FIG. 2, the yield of compound 1 was significantly improved after the leaves of Securinega suffruticosa were fed with L-galactose (L-galactonase), L-galactono-1,4-lactone (L-galactono-1,4-lactone) and L-ascorbic acid (L-ascorbyl acid), indicating that these added compounds are all precursors for the biosynthesis of compound 1, proving our inference about the biosynthetic pathway of compound 1.
Example 4 treatment of Securinega suffruticosa seedling with methyl jasmonate and measurement of the content of Compound 1 in the seedling leaves after the treatment
About one month old seedlings of securinega suffruticosa were grown, at which time about 7 leaves were grown (see fig. 1), 8 plants were placed in each bottle, sprayed with a 50 μ M aqueous solution of Methyl Jasmonate (Methyl Jasmonate, MeJA), treated for 24, 48 and 72 hours, respectively, and water (water) was used as a blank control group (sprayed every 12 hours), for 4 experiments, each of which was repeated three times. The method for measuring the content of the novel compound in the seedling leaves is the same as that in example 3. The content of compound 1 in cacumen Securinegae Suffruticosae is shown in figure 3.
As can be seen from fig. 3, treatment of securinega suffruticosa seedlings with methyl jasmonate MeJA can increase the biosynthesis amount of compound 1, with the longer the treatment time, the higher the yield of compound 1.
Example 5 MeJA Induction experiment of sterile seedlings of Securinega suffruticosa and transcriptome sequencing of MeJA induced leaves at different times
Example 4 seedlings leaves at 0h (water blank), 24h and 72h after MeJA treatment were subjected to second generation transcriptome sequencing using Illumina HiSeq as the instrumentTMAnd filtering the obtained data to obtain clean data, and splicing the obtained clean data by using Trinity to obtain 145153 transcripts.
The obtained transcript is subjected to coding sequence CDS (coding sequence) prediction, and the CDS prediction method comprises the following steps: 1. comparing Gene according to the priority sequence of an NR protein library and a Swissprot protein library, if the comparison is up, extracting ORF coding frame information of a transcript from the comparison result, and translating a coding region sequence into an amino acid sequence (according to the sequence of 5'- > 3'); 2. for sequences without aligned NR protein library, Swissprot protein library or sequences with results not predicted on alignment, the open reading frame ORF (open reading frame) is predicted by using estscan (3.0.3) software, and the nucleic acid sequence and amino acid sequence coded by the part of genes are obtained.
Example 6 extraction and Activity experiment of Securinega suffruticosa aseptic seedling leaf Total protein
After the sterilized seedlings of Securinega suffruticosa (Securinega suffruticosa) growing for about one month were cut, the leaves were weighed, and the wet weight of the seedlings was 8.2g, and the seedlings were subjected to nitrogen freeze-grinding, 41ml of extraction buffer A (100mM NaPi, pH 7.4,5mM sodium bisulfite, 5mM dithiothreitol, 1mM EDTA, 10% (v/v) glycerol, 1% (w/v) PVP, 4% (w/v) PVPP) was added thereto, incubated on ice, gently shaken up and down several times every 15 minutes, after 1 hour, centrifuged at 5000g at 4 ℃ for 30min, the supernatant was desalted through a pD10 column and replaced with buffer B (10mM Tris-HCl, pH 7.5). After the protein was extracted, the concentration of the protein was measured by the Bradford method, and the concentration of the protein was 3.5 mg/ml.
50 mu L of desalted protein is added with allocerinine/securinine (0.5 mu L, the initial concentration is 100mM) and L-ascorbic acid (L-ascorbyl acid, L-AA)/dehydroascorbic acid (DHA) (0.5 mu L, the initial concentration is 100mM), the control group is heat-inactivated protein, the rest are the same, after the protein is placed in a shaking table at 30 ℃ for 2h, the reaction is stopped by adding equal volume of methanol, 17000g is centrifuged for 10min, and after the filter membrane of 0.2 mu M is filtered, Q-TOF (four-rod-time of flight mass spectrometry) is detected. The results are shown in FIG. 4.
As can be seen from the left graph in FIG. 4, the total protein of Securinega suffruticosa leaf can catalyze the oxidative condensation of allosecurinine and L-AA to obtain the compound 1 (see i in the graph), and can catalyze the oxidative condensation of allosecurinine and DHA to obtain the compound 1 (see ii in the graph).
Example 7 fractionation and protein Mass Spectrometry verification of Total protein of Securinega suffruticosa leaf
The extraction method of total protein of Securinega suffruticosa was the same as that in example 6, and about 175.0mg of desalted total protein was fractionated with Hitrap Q HP column (GE), and eluted by gradually increasing NaCl concentration in buffer B at a flow rate of 2.5 ml/min. The F0 protein fraction, which did not bind to the column, was shown to be active by activity testing (activity testing method was the same as total protein testing method, data not shown). Therefore, the protein fraction F0 which did not bind to the column was concentrated using a 10kDa ultrafiltration tube, and buffer C (10mM MES, pH 6.5) was gradually added to the ultrafiltration tube to replace the buffer.
The buffer-exchanged fraction F0 was then fractionated with Hitrap SP HP column (GE), eluting with buffer C at increasing NaCl concentration at a flow rate of 1.0 ml/min. One component was received every 1.5 ml. A total of 40 components F0-F1-F0-F40 were received. And activity test was performed on each fraction (activity test conditions were the same as F0, EIC (390.0, [ M + H ])]+) And the peak areas obtained by extraction were extracted to judge the strength of the activity of each component), F0-F10 and F0 were foundThe F11 fraction was most active (data not shown), while the active fractions F0-F9-F0-F17 were analyzed by SDS PAGE (12% gel) silver staining, and multiple protein bands were still present in fractions F0-F10 and F0-F11, requiring further fractionation (FIG. 5).
The fraction F0-11 with the best activity separated by Hitrap SP HP column was concentrated to below 500. mu.L with a 10kDa ultrafiltration tube, and further separated and purified with Superdex 200 Increate 5/150GL molecular sieve column at 0.3ml/min, and eluted with buffer B, receiving one fraction every 0.5ml, and receiving 60 fractions in total (F0-11-1-F0-11-60). The activity test showed that the components F0-11-33 and F0-11-34 were the most active (the activity test conditions were the same as those of the F0 component, and the data of the test results were not shown), and the active components (F0-11-29-F0-11-37) were analyzed by SDS PAGE (10% gel) silver staining (FIG. 6).
Corresponding to the activity test result, the red marked band in FIG. 6 is the target protein band, so that the mass spectrometric identification of the enzyme-cleaved protein in the gel is carried out on F0-11-33-1 (marked as the target band in FIG. 6) and F0-11-34-1 (marked as the target band in FIG. 6). Cutting the glue into 1mm multiplied by 1mm colloidal particles, putting the colloidal particles into a 1.5ml centrifuge tube, and simultaneously and respectively carrying out the following operations on the samples: 200 μ L ddH2O rinse 3 times. Adding silver dye decolorant (100mM NaS)2O3And 30mM potassium ferricyanide) was incubated at 37 ℃ for 30mins to decolorize, and the operation was repeated several times until the color had gone. Add 200. mu.L of ddH2And O incubation for 5min, and repeating the operation once. After 5min incubation 100. mu.L of 50% AcN was added, centrifugation was carried out, the supernatant was discarded, and 100. mu.L of AcN was added and incubated for 5min until the gel became white. Centrifuging at 1000g for 10min, discarding the supernatant, and air drying at room temperature. Add 200. mu.L of 10mM DTT (25 mM NH for 1M DTT)4HCO3Diluted), gently shaken and incubated at 56 ℃ for 1h, 100. mu.L of 55mM IAA was quickly added to the gel and incubated at room temperature for 30mins in the dark. With 50mM NH4HCO3Adding AcN after rinsing once, repeating for 2-3 times until the colloidal particles become white, and air drying at room temperature. A trypsin solution was added to cover the micelles and incubated at 37 ℃ for 16 h. 1000g centrifugation, supernatant and 50. mu.L 25mM NH4HCO3Extracting twice, drying at 80 deg.C, dissolving the sample with 0.1% formic acid water, desalting with C18 column, 20 μ L0.1% formic acid deionized water, and performing mass spectrometryAnalysis and protein identification.
The protein raw data obtained by mass spectrometry is stored in a Proteome distributorTMAnd (3) performing comparison analysis on Software 2.3 with the obtained securinega suffruticosa protein database to obtain a candidate protein list.
EXAMPLE 83 'and 5' RACE of candidate genes to obtain full-Length genes
The candidate gene Cluster-10344.81450(FsBBE) is likely to be the target gene to be searched. However, the gene has no full length in transcriptome data, and is only 438bp, so that the full-length sequence of the gene needs to be obtained by a RACE method. Primers required for 3 'and 5' RACE were designed based on the existing sequences as described in the materials section. RACE method is shown in clongtech kit. The full-length sequence of the obtained gene (FsBBE) is shown as SEQ ID NO. 1, and the amino acid sequence of the coded oxidase is shown as SEQ ID NO. 2.
Example 9 heterologous expression, protein purification and functional verification of candidate genes
OD was tested by growing about 24h in LB medium containing rifamycin and kanamycin with GV3101 Agrobacterium containing the constructed pGambia 2301::81450-HIS Tag (with BamHI and SalI cleavage sites) vector600After centrifugation at 4000g, the cells were washed with buffer (10mM MES,10mM MgCl, 1.0 g)2And 100. mu.M acetosyringone, pH 5.6) resuspended to OD600After 0.6, injecting tobacco, collecting tobacco leaves after 4 days of injection, freezing and grinding, and extracting protein by the same method as that of extracting the total protein of the suffruticosa: 1g fresh leaves were freeze-ground with liquid nitrogen and extracted for 1h on ice using 5mL of buffer A (100mM NaPi, pH 7.4,5mM sodium bisulfite, 5mM dithiothreitol, 1mM EDTA, 10% (v/v) glycerol, 1% (w/v) PVP, 4% (w/v) PVPP) at 4 ℃. Centrifuge at 5,000x g for 30min at 4 ℃. The supernatant was desalted with pD10 columns (GE-Healthcare) and buffer B (10mM Tris-HCl, pH 7.5) was replaced. Then purifying the reconstructed protein of the Cluster-10344.81450 gene HIS-TAG label, adsorbing the total protein by a nickel column, eluting by imidazole with different gradients, verifying by SDS PAGE (see figure 7) of fractions washed by 100mM imidazole, desalting by a pD10 desalting column, replacing with Tris-HCl pH 7.5 buffer solution, and performing enzyme activity test, wherein the test result is shown inFig. 4.
As can be seen from the left graph in FIG. 4, after the FsBBE protein is expressed in tobacco in a heterologous way, the total tobacco leaf protein can also catalyze allocorenine to perform oxidative condensation with L-AA or DHA or isoascorbyl acid to obtain the compound 1 or 2 (see iii, iv and v in the right graph).
Example 10 validation of the catalytic mechanism of oxidase FsBBE
Incubating FsBBE protein purified from tobacco leaves and allosecurinine, taking heat-inactivated protein as a control group, incubating tobacco leaf total protein expressing Cluster-10344.81450 gene and allosecurinine, injecting unloaded pGambia2301 Agrobacterium tumefaciens tobacco leaf total protein as a control group, reacting for 3 hours at 30 ℃, adding equal volume of methanol to terminate the reaction, centrifuging 17000g to take supernatant, injecting 1 mu L of sample, and detecting by Q-TOF. The results are shown in FIG. 4.
Incubating tobacco leaf total protein expressing Cluster-10344.81450 gene with allosecurinine, reacting at 30 ℃ for 3h, adding equal volume of n-butanol, extracting for 3 times, combining n-butanol phases, spin-drying, separating liquid phase, spin-drying separated compound 3 (enamine intermediate) component, dissolving with 40 μ L of methanol, and performing the following experiment: mu.L of Compound 3 solution was added to 50. mu.L of Tris-HCl pH 7.5 buffer as a control (Compound 3); 1 μ L L-AA (100mM in ddH)2O) adding 50. mu.L Tris-HCl buffer solution with pH 7.5 to the aqueous solution to obtain a control group (L-AA); 1 μ L of Compound 3 solution and 1 μ L L-AA added to 50 μ L Tris-HCl pH 7.5 buffer as experimental group (Compound 3+ L-AA); DHA is analogous to the reaction above, in D2Experiments in O except that the solution was D2The rest of the buffer solution prepared by O is the same, after reaction is carried out for 3 hours at 30 ℃, the sample is injected with 1 mu L of filter membrane filtration and Q-TOF detection. The results are shown in FIG. 4.
Example 11 oxidase FsBBE catalyzes the synthesis and structural characterization of another novel compound
When looking for L-AA analogs, only erythorbic acid (isoascorbyl acid) was available, and therefore activity tests were performed with purified FsBBE protein and allocoreurinine and erythorbic acid, and the test results are shown in FIG. 4. The method for separating, purifying and identifying the large-scale enzyme activity of the new compound comprises the following steps: desalting total protein of tobacco leaf expressing Cluster-10344.81450 gene, adding allocoreurinine with final concentration of 1mM and isoascorbic acid with final concentration of 1mM, reacting at 30 deg.C for 3h, adding n-butanol with equal volume, extracting for 3 times, mixing n-butanol phases, spin drying, and separating liquid phase, wherein the liquid phase separation method is the same as that for separating compound 1. Respectively obtaining a compound 2, dissolving in deuterated chloroform and deuterated methanol, and performing nuclear magnetic identification. Both compounds were single crystal tried in different solvents, compound 2 in ethyl acetate: the methanol was slowly evaporated at room temperature in a solvent of 4:1 to obtain crystals, and the absolute configuration was determined by crystal diffraction (see fig. 8). That is, the molecular structural formula of compound 2 is as follows:
the molecular structure of the compound is different from that of the compound 1 in the spatial configuration of 6' -hydroxyl, and the compound is named fluesuffine B.
EXAMPLE 12 two novel Compound physiological Activity assays
The growth inhibitory activity of compound 1(fluesuffine A) and compound 2(fluesuffine B) on tumor cells was tested by the SRB method (Skehan P.et al.1990).
The tumor cell strain is glioma cell SF-268, breast cancer cell MCF-7, liver cancer cell HePG-2, non-small cell lung cancer cell A549 and hepatic astrocyte LX-2.
The experimental method comprises the following steps: respectively dissolving the compound 1(fluesuffine A) and the compound 2(fluesuffine B) in dimethyl sulfoxide (DMSO) to obtain mother liquor with the concentration of 10mmol/L, and then diluting the mother liquor to the required concentration by using an RPMI-1640 culture medium, wherein the positive control medicament is cisplatin. Collecting SF-268, MCF-7, HepG-2, A549 and LX-2 cells in logarithmic growth phase, digesting with pancreatin, staining and counting with trypan blue, detecting cell viability to be more than 95% by trypan blue exclusion experiment, and adjusting cell concentration to 3 × 10 with fresh RPMI-1640 culture medium4one/mL, cells were seeded in 96-well plates, 180. mu.L of cell suspension was added to each well, and 3 blank wells were set to zero, at 37 ℃ with 5% CO2Culturing in an incubator for 24 h. After the cells adhere to the wall, 20 mu L of the mixture with a certain concentration is added into each holeCompound solution, negative control plus 20. mu.L RPMI-1640 medium, with cisplatin as positive control. Placing at 37 ℃ and 5% CO2After culturing for 72h in an incubator, 50 μ L of 50% cold trichloroacetic acid is added to fix the cells, the cells are placed at 4 ℃ for 1h, washed with distilled water for 5 times, and naturally dried in the air. Then, 100. mu.L/well of SRB solution prepared from 1% glacial acetic acid and having a concentration of 4mg/mL was added, stained at room temperature for 30min, the supernatant was removed, and washed 5 times with 1% glacial acetic acid. Finally adding 200 mu L/hole 10mmol/mL Tris solution for dissolution, measuring the absorbance value (A) at 570nm by a microplate reader, and calculating the inhibition rate of the drug on the cell growth by using the following formula: cell growth inhibition (%) - (1-A)Sample set/AControl group)×100%。
The experimental results are as follows: the IC of the compound 1, the compound 2 and the positive contrast medicament cisplatin prepared by the invention on tumor cell strains SF-268, MCF-7, HePG-2, A549 and LX-250The values are shown in Table 2.
The results show that: the compound 1 and the compound 2 have obvious growth inhibition activity on tumor cells, so the implementation of the invention provides candidate compounds for researching and developing new antitumor drugs, and provides scientific basis for developing and utilizing traditional Chinese medicine plant resources.
The biosynthesis of the compound 1 and the compound 2 lays a foundation for analyzing an alkaloid synthesis path in the securinega suffruticosa so as to promote the development and utilization of new natural medicines.
Sequence listing
<110> China academy of sciences molecular plant science remarkable innovation center
<120> Securinega suffruticosa derived oxidase and application thereof
<130> SHPI2010290
<160> 2
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1599
<212> DNA
<213> Flueggea suffruticosa (Pall.) Baill.
<220>
<221> CDS
<222> (1)..(1596)
<400> 1
atg aat cca tta aag cac tct tca tca acg cct tta gtg ttt gtg ttg 48
Met Asn Pro Leu Lys His Ser Ser Ser Thr Pro Leu Val Phe Val Leu
1 5 10 15
ctg act gta tgt tca tgt gca act tca gta aca att cct gag ctg ttc 96
Leu Thr Val Cys Ser Cys Ala Thr Ser Val Thr Ile Pro Glu Leu Phe
20 25 30
ttt caa tgc ctg tcc aat aca acc aca acc agt act agt att ttc aat 144
Phe Gln Cys Leu Ser Asn Thr Thr Thr Thr Ser Thr Ser Ile Phe Asn
35 40 45
gtc cta tac aca cca aga aac acg tcc tac act tcc atc tta gaa tca 192
Val Leu Tyr Thr Pro Arg Asn Thr Ser Tyr Thr Ser Ile Leu Glu Ser
50 55 60
cgc att caa aac ctc agg ttc aat aca act gac acg ccg aaa cct ctg 240
Arg Ile Gln Asn Leu Arg Phe Asn Thr Thr Asp Thr Pro Lys Pro Leu
65 70 75 80
gcc ata gtc aca ccg ctg gat gca tct cac att caa gcc acc atc ata 288
Ala Ile Val Thr Pro Leu Asp Ala Ser His Ile Gln Ala Thr Ile Ile
85 90 95
tgt gcc cgg aaa cac aac ctc caa atc aga atc cga agc ggt ggc cac 336
Cys Ala Arg Lys His Asn Leu Gln Ile Arg Ile Arg Ser Gly Gly His
100 105 110
gac tat gag ggg ttg tct tat gtc tcg ccc ctc cct ttt gtt gtg ctt 384
Asp Tyr Glu Gly Leu Ser Tyr Val Ser Pro Leu Pro Phe Val Val Leu
115 120 125
gat cta atc aat ctt cga aac atc acc gtt gat gta gaa aat aga gtc 432
Asp Leu Ile Asn Leu Arg Asn Ile Thr Val Asp Val Glu Asn Arg Val
130 135 140
gca tgg gtc ggg tgc gga gca aca tta gga gaa ttc tac tat aga att 480
Ala Trp Val Gly Cys Gly Ala Thr Leu Gly Glu Phe Tyr Tyr Arg Ile
145 150 155 160
gca gag aaa act agg acc ctg gca ttc cct gca ggt gct tgt cct act 528
Ala Glu Lys Thr Arg Thr Leu Ala Phe Pro Ala Gly Ala Cys Pro Thr
165 170 175
gta gga gtt ggt ggg cat ttc agt gga ggc gga tac ggg tat ttg ttg 576
Val Gly Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Tyr Leu Leu
180 185 190
cgt aaa ttt ggc ctc gca gca gat aac atc ctt gat gca agt tta gtt 624
Arg Lys Phe Gly Leu Ala Ala Asp Asn Ile Leu Asp Ala Ser Leu Val
195 200 205
gat gtg aat ggt aga att ctc gat aga gct tcc atg ggg gaa gat ttg 672
Asp Val Asn Gly Arg Ile Leu Asp Arg Ala Ser Met Gly Glu Asp Leu
210 215 220
ttt tgg gca att aga ggt ggt gga gga aat agt ttc gga gta gtt att 720
Phe Trp Ala Ile Arg Gly Gly Gly Gly Asn Ser Phe Gly Val Val Ile
225 230 235 240
gct tgg aag gtt aat ttg gtt cca gtc cct tcc aca ttg act tct ttc 768
Ala Trp Lys Val Asn Leu Val Pro Val Pro Ser Thr Leu Thr Ser Phe
245 250 255
aaa gtc tca aaa agt ttg gaa cag aat acg atg att cag ctt ctc aac 816
Lys Val Ser Lys Ser Leu Glu Gln Asn Thr Met Ile Gln Leu Leu Asn
260 265 270
aag tgg caa tat gtt gca aat aaa ctt cct gat gaa tta agc atg ttt 864
Lys Trp Gln Tyr Val Ala Asn Lys Leu Pro Asp Glu Leu Ser Met Phe
275 280 285
gct gta gtt tct aaa aaa aac tca aca ata tct gtt aag ttt tat tcc 912
Ala Val Val Ser Lys Lys Asn Ser Thr Ile Ser Val Lys Phe Tyr Ser
290 295 300
ttg tat gta ggt gga att gat agc ctc ctt cca tta atg gaa gaa agg 960
Leu Tyr Val Gly Gly Ile Asp Ser Leu Leu Pro Leu Met Glu Glu Arg
305 310 315 320
ttt cct gag ctt ggt tta aaa aga gcg gat tgc aat gag atg agc tgg 1008
Phe Pro Glu Leu Gly Leu Lys Arg Ala Asp Cys Asn Glu Met Ser Trp
325 330 335
ata gag tca gca gta tct ttc gcc ggg tac gca agt aat aca tca ttg 1056
Ile Glu Ser Ala Val Ser Phe Ala Gly Tyr Ala Ser Asn Thr Ser Leu
340 345 350
gat gtt ctc ctc aat cac act aat aat tac gaa att gct agt gga aga 1104
Asp Val Leu Leu Asn His Thr Asn Asn Tyr Glu Ile Ala Ser Gly Arg
355 360 365
ttc aaa ggc aaa tcg gac ttt gtc aaa gag ccc gtg cca gaa gct gca 1152
Phe Lys Gly Lys Ser Asp Phe Val Lys Glu Pro Val Pro Glu Ala Ala
370 375 380
tta gaa ggc tta ttg aaa tgg ctt tca gac aaa gac ata acg aat gca 1200
Leu Glu Gly Leu Leu Lys Trp Leu Ser Asp Lys Asp Ile Thr Asn Ala
385 390 395 400
gcg atc tat atg gtt cca tta gga gga aaa atg ggt gag ata aca gaa 1248
Ala Ile Tyr Met Val Pro Leu Gly Gly Lys Met Gly Glu Ile Thr Glu
405 410 415
aca agc att cca ttc cca cat aga gca ggg aat cta tac ttg ttg gcg 1296
Thr Ser Ile Pro Phe Pro His Arg Ala Gly Asn Leu Tyr Leu Leu Ala
420 425 430
tat tat gtt aaa tgg gag ggg caa gga aca gaa gca gct caa aag ccc 1344
Tyr Tyr Val Lys Trp Glu Gly Gln Gly Thr Glu Ala Ala Gln Lys Pro
435 440 445
cta agt tgg atc aga aag ggt tac aaa tac atg gct ccc tat gtc tcc 1392
Leu Ser Trp Ile Arg Lys Gly Tyr Lys Tyr Met Ala Pro Tyr Val Ser
450 455 460
aaa aat cca aga gaa gca cat ctc aac gac aga gat ctt gat att ggt 1440
Lys Asn Pro Arg Glu Ala His Leu Asn Asp Arg Asp Leu Asp Ile Gly
465 470 475 480
act aac aat atc tca gga aat acc agt tac gaa cag gct agt att tgg 1488
Thr Asn Asn Ile Ser Gly Asn Thr Ser Tyr Glu Gln Ala Ser Ile Trp
485 490 495
gga acc aag tat ttt aaa aat aat ttt gac agg ttg gtt cgg gtg aag 1536
Gly Thr Lys Tyr Phe Lys Asn Asn Phe Asp Arg Leu Val Arg Val Lys
500 505 510
act agt gtt gat cct tca gat ttt ttc aga aat gaa caa agc gtc cct 1584
Thr Ser Val Asp Pro Ser Asp Phe Phe Arg Asn Glu Gln Ser Val Pro
515 520 525
cct ctg tta tct tga 1599
Pro Leu Leu Ser
530
<210> 2
<211> 532
<212> PRT
<213> Flueggea suffruticosa (Pall.) Baill.
<400> 2
Met Asn Pro Leu Lys His Ser Ser Ser Thr Pro Leu Val Phe Val Leu
1 5 10 15
Leu Thr Val Cys Ser Cys Ala Thr Ser Val Thr Ile Pro Glu Leu Phe
20 25 30
Phe Gln Cys Leu Ser Asn Thr Thr Thr Thr Ser Thr Ser Ile Phe Asn
35 40 45
Val Leu Tyr Thr Pro Arg Asn Thr Ser Tyr Thr Ser Ile Leu Glu Ser
50 55 60
Arg Ile Gln Asn Leu Arg Phe Asn Thr Thr Asp Thr Pro Lys Pro Leu
65 70 75 80
Ala Ile Val Thr Pro Leu Asp Ala Ser His Ile Gln Ala Thr Ile Ile
85 90 95
Cys Ala Arg Lys His Asn Leu Gln Ile Arg Ile Arg Ser Gly Gly His
100 105 110
Asp Tyr Glu Gly Leu Ser Tyr Val Ser Pro Leu Pro Phe Val Val Leu
115 120 125
Asp Leu Ile Asn Leu Arg Asn Ile Thr Val Asp Val Glu Asn Arg Val
130 135 140
Ala Trp Val Gly Cys Gly Ala Thr Leu Gly Glu Phe Tyr Tyr Arg Ile
145 150 155 160
Ala Glu Lys Thr Arg Thr Leu Ala Phe Pro Ala Gly Ala Cys Pro Thr
165 170 175
Val Gly Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Tyr Leu Leu
180 185 190
Arg Lys Phe Gly Leu Ala Ala Asp Asn Ile Leu Asp Ala Ser Leu Val
195 200 205
Asp Val Asn Gly Arg Ile Leu Asp Arg Ala Ser Met Gly Glu Asp Leu
210 215 220
Phe Trp Ala Ile Arg Gly Gly Gly Gly Asn Ser Phe Gly Val Val Ile
225 230 235 240
Ala Trp Lys Val Asn Leu Val Pro Val Pro Ser Thr Leu Thr Ser Phe
245 250 255
Lys Val Ser Lys Ser Leu Glu Gln Asn Thr Met Ile Gln Leu Leu Asn
260 265 270
Lys Trp Gln Tyr Val Ala Asn Lys Leu Pro Asp Glu Leu Ser Met Phe
275 280 285
Ala Val Val Ser Lys Lys Asn Ser Thr Ile Ser Val Lys Phe Tyr Ser
290 295 300
Leu Tyr Val Gly Gly Ile Asp Ser Leu Leu Pro Leu Met Glu Glu Arg
305 310 315 320
Phe Pro Glu Leu Gly Leu Lys Arg Ala Asp Cys Asn Glu Met Ser Trp
325 330 335
Ile Glu Ser Ala Val Ser Phe Ala Gly Tyr Ala Ser Asn Thr Ser Leu
340 345 350
Asp Val Leu Leu Asn His Thr Asn Asn Tyr Glu Ile Ala Ser Gly Arg
355 360 365
Phe Lys Gly Lys Ser Asp Phe Val Lys Glu Pro Val Pro Glu Ala Ala
370 375 380
Leu Glu Gly Leu Leu Lys Trp Leu Ser Asp Lys Asp Ile Thr Asn Ala
385 390 395 400
Ala Ile Tyr Met Val Pro Leu Gly Gly Lys Met Gly Glu Ile Thr Glu
405 410 415
Thr Ser Ile Pro Phe Pro His Arg Ala Gly Asn Leu Tyr Leu Leu Ala
420 425 430
Tyr Tyr Val Lys Trp Glu Gly Gln Gly Thr Glu Ala Ala Gln Lys Pro
435 440 445
Leu Ser Trp Ile Arg Lys Gly Tyr Lys Tyr Met Ala Pro Tyr Val Ser
450 455 460
Lys Asn Pro Arg Glu Ala His Leu Asn Asp Arg Asp Leu Asp Ile Gly
465 470 475 480
Thr Asn Asn Ile Ser Gly Asn Thr Ser Tyr Glu Gln Ala Ser Ile Trp
485 490 495
Gly Thr Lys Tyr Phe Lys Asn Asn Phe Asp Arg Leu Val Arg Val Lys
500 505 510
Thr Ser Val Asp Pro Ser Asp Phe Phe Arg Asn Glu Gln Ser Val Pro
515 520 525
Pro Leu Leu Ser
530
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010606465.7A CN113930400A (en) | 2020-06-29 | 2020-06-29 | Securinega suffruticosa derived oxidase and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010606465.7A CN113930400A (en) | 2020-06-29 | 2020-06-29 | Securinega suffruticosa derived oxidase and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113930400A true CN113930400A (en) | 2022-01-14 |
Family
ID=79273105
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010606465.7A Pending CN113930400A (en) | 2020-06-29 | 2020-06-29 | Securinega suffruticosa derived oxidase and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113930400A (en) |
-
2020
- 2020-06-29 CN CN202010606465.7A patent/CN113930400A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
RU2198173C2 (en) | Epotylones c, d, e and f, their synthesis and agents based on thereof | |
TEMPÉ et al. | Occurrence and biosynthesis of opines | |
Campanella et al. | A novel auxin conjugate hydrolase from wheat with substrate specificity for longer side-chain auxin amide conjugates | |
CN108299462B (en) | Mixed source terpene compound and separation method and application thereof | |
JP2013540769A5 (en) | ||
CN114409660B (en) | CPA type indole alkaloid compound and preparation method and application thereof | |
CN104031875B (en) | A kind of S-equol produces engineering bacteria and application | |
CN102993275B (en) | Bleomycin derivatives and anti-tumour activity thereof | |
CN111808902B (en) | C-Glycosyltransferase and its application in the synthesis of schaftosides and isosschaftosides | |
CN101186912A (en) | Method for increasing terpene lactone content in ginkgo callus by transgenic ggpps gene | |
CN113930400A (en) | Securinega suffruticosa derived oxidase and application thereof | |
CN101302555B (en) | Molecule identification method for taxol-producing endophytic fungi in yew | |
Wang et al. | Establishment of genetic transformation system of peach callus | |
CN107880134B (en) | A kind of method of enzymatically synthesizing kaempferol | |
CN108841769B (en) | A kind of fidaxomicin genetically engineered bacteria and construction method and application | |
CN113402357B (en) | 5-12-5 Tricyclic diterpene skeleton compound and preparation thereof | |
CN114350524B (en) | Mycolepothilone indole sesquiterpene compound, biosynthesis method and application thereof | |
KR100984480B1 (en) | Transformation microorganism comprising gene encoding β-glucosidase and production method of indigo dye using same | |
CN113801802B (en) | Streptomyces and application thereof | |
CN104004065B (en) | A kind of Isolation and purification method of bleomycin race derivative | |
CN100387609C (en) | The manufacture method of novel anthracycline polyketide antibiotics | |
CN110055232B (en) | Two glycyrrhetinic acid sucrose synthases and application thereof in synthesis of glycyrrhetinic acid glycosylated derivatives | |
CN112341467A (en) | Diketopiperazine alkaloid compound and preparation method and application thereof | |
CN111807952A (en) | A novel compound involved in lipid metabolism and preparation method thereof | |
CN116003416B (en) | Isoquinazolinone compounds, preparation method and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |