CN116640752A - Aconitate hydratase mutant and application thereof - Google Patents
Aconitate hydratase mutant and application thereof Download PDFInfo
- Publication number
- CN116640752A CN116640752A CN202210141296.3A CN202210141296A CN116640752A CN 116640752 A CN116640752 A CN 116640752A CN 202210141296 A CN202210141296 A CN 202210141296A CN 116640752 A CN116640752 A CN 116640752A
- Authority
- CN
- China
- Prior art keywords
- leu
- gly
- ala
- val
- asp
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 102000009836 Aconitate hydratase Human genes 0.000 title claims abstract description 69
- 108010009924 Aconitate hydratase Proteins 0.000 title claims abstract description 69
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims abstract description 35
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims abstract description 34
- 235000003704 aspartic acid Nutrition 0.000 claims abstract description 34
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims abstract description 34
- 239000004472 Lysine Substances 0.000 claims abstract description 30
- 244000005700 microbiome Species 0.000 claims abstract description 30
- 150000001413 amino acids Chemical group 0.000 claims abstract description 15
- 239000002207 metabolite Substances 0.000 claims abstract description 14
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 13
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 13
- 238000006243 chemical reaction Methods 0.000 claims abstract description 12
- 241000588724 Escherichia coli Species 0.000 claims description 25
- 108090000623 proteins and genes Proteins 0.000 claims description 25
- 239000002243 precursor Substances 0.000 claims description 20
- 238000010276 construction Methods 0.000 claims description 15
- 238000000034 method Methods 0.000 claims description 15
- 108020004707 nucleic acids Proteins 0.000 claims description 11
- 102000039446 nucleic acids Human genes 0.000 claims description 11
- 150000007523 nucleic acids Chemical class 0.000 claims description 11
- 238000012258 culturing Methods 0.000 claims description 8
- 238000004519 manufacturing process Methods 0.000 claims description 8
- 239000012620 biological material Substances 0.000 claims description 6
- 241000894006 Bacteria Species 0.000 claims description 5
- 239000013598 vector Substances 0.000 claims description 4
- 241000588722 Escherichia Species 0.000 claims description 3
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 claims description 3
- 150000001510 aspartic acids Chemical class 0.000 claims description 2
- 238000012262 fermentative production Methods 0.000 claims description 2
- 230000000694 effects Effects 0.000 abstract description 15
- 102000004190 Enzymes Human genes 0.000 abstract description 10
- 108090000790 Enzymes Proteins 0.000 abstract description 10
- KHPXUQMNIQBQEV-UHFFFAOYSA-N oxaloacetic acid Chemical compound OC(=O)CC(=O)C(O)=O KHPXUQMNIQBQEV-UHFFFAOYSA-N 0.000 abstract description 7
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 32
- 108010057821 leucylproline Proteins 0.000 description 30
- 108020004414 DNA Proteins 0.000 description 24
- 108010050848 glycylleucine Proteins 0.000 description 24
- 238000000855 fermentation Methods 0.000 description 19
- 230000004151 fermentation Effects 0.000 description 19
- 235000018977 lysine Nutrition 0.000 description 19
- 108010013835 arginine glutamate Proteins 0.000 description 18
- 108010047857 aspartylglycine Proteins 0.000 description 18
- 108010026333 seryl-proline Proteins 0.000 description 18
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 17
- 229940023064 escherichia coli Drugs 0.000 description 16
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 12
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 12
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 12
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 12
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 12
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 12
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 12
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 12
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 12
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 12
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 12
- 108010049041 glutamylalanine Proteins 0.000 description 12
- 108010037850 glycylvaline Proteins 0.000 description 12
- 239000013612 plasmid Substances 0.000 description 12
- 108010061238 threonyl-glycine Proteins 0.000 description 12
- 108700004896 tripeptide FEG Proteins 0.000 description 12
- 101150113917 acnA gene Proteins 0.000 description 10
- 210000004027 cell Anatomy 0.000 description 10
- 229930027917 kanamycin Natural products 0.000 description 10
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 10
- 229960000318 kanamycin Drugs 0.000 description 10
- 229930182823 kanamycin A Natural products 0.000 description 10
- 101100054574 Corynebacterium diphtheriae (strain ATCC 700971 / NCTC 13129 / Biotype gravis) acn gene Proteins 0.000 description 9
- 101100215150 Dictyostelium discoideum aco1 gene Proteins 0.000 description 9
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 9
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 9
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 7
- 235000019766 L-Lysine Nutrition 0.000 description 7
- 235000001014 amino acid Nutrition 0.000 description 7
- 229940024606 amino acid Drugs 0.000 description 7
- 239000002609 medium Substances 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 6
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 6
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 6
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 6
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 description 6
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 6
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 6
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 6
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 6
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 6
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 6
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 6
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 6
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 6
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 6
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 6
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 6
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 6
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 6
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 6
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 6
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 6
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 6
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 6
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 6
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 6
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 6
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 6
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 6
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 6
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 6
- KEUNWIXNKVWCFL-FXQIFTODSA-N Asn-Met-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O KEUNWIXNKVWCFL-FXQIFTODSA-N 0.000 description 6
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 6
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 6
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 6
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 6
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 6
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 6
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 6
- UZNSWMFLKVKJLI-VHWLVUOQSA-N Asp-Ile-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UZNSWMFLKVKJLI-VHWLVUOQSA-N 0.000 description 6
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 6
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 6
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 6
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 6
- NBKLEMWHDLAUEM-CIUDSAMLSA-N Asp-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N NBKLEMWHDLAUEM-CIUDSAMLSA-N 0.000 description 6
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 6
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 6
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 6
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 6
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 6
- VTJLJQGUMBWHBP-GUBZILKMSA-N Cys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N VTJLJQGUMBWHBP-GUBZILKMSA-N 0.000 description 6
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 6
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 6
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 6
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 6
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 6
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 6
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 6
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 6
- FGWRYRAVBVOHIB-XIRDDKMYSA-N Gln-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O FGWRYRAVBVOHIB-XIRDDKMYSA-N 0.000 description 6
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 6
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 6
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 6
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 6
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 6
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 6
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 6
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 6
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 6
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 6
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 6
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 6
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 6
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 6
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 6
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 6
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 6
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 6
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 6
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 6
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 6
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 6
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 6
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 6
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 6
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 6
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 6
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 6
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 6
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 6
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 6
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 6
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 6
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 6
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 6
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 6
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 6
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 6
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 6
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 6
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 6
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 6
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 6
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 6
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 6
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 6
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 6
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 6
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 6
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 6
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 6
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 6
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 6
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 6
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 6
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 6
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 6
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 6
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 6
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 6
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 6
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 6
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 6
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 6
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 6
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 6
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 6
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 6
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 6
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 6
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 6
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 6
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 6
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 6
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 6
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 6
- CFOLERIRBUAYAD-HOCLYGCPSA-N Lys-Trp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O CFOLERIRBUAYAD-HOCLYGCPSA-N 0.000 description 6
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 6
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 6
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 6
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 6
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 6
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 6
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 6
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 6
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 6
- UZBQXELAFPCGRV-SZMVWBNQSA-N Met-Trp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZBQXELAFPCGRV-SZMVWBNQSA-N 0.000 description 6
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 6
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 6
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 6
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 6
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 6
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 6
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 6
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 6
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 6
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 6
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 6
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 6
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 6
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 6
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 6
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 6
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 6
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 6
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 6
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 6
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 6
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 6
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 6
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 6
- 108010003201 RGH 0205 Proteins 0.000 description 6
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 6
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 6
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 6
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 6
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 6
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 6
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 6
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 6
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 6
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 6
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 6
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 6
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 6
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 6
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 6
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 6
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 6
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 6
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 6
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 6
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 6
- BIENEHRYNODTLP-HJGDQZAQSA-N Thr-Glu-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O BIENEHRYNODTLP-HJGDQZAQSA-N 0.000 description 6
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 6
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 6
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 6
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 6
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 6
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 6
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 6
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 6
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 6
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 6
- OBWQLWYNNZPWGX-QEJZJMRPSA-N Trp-Gln-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OBWQLWYNNZPWGX-QEJZJMRPSA-N 0.000 description 6
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 6
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 6
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 6
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 6
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 6
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 6
- ULHJJQYGMWONTD-HKUYNNGSSA-N Tyr-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ULHJJQYGMWONTD-HKUYNNGSSA-N 0.000 description 6
- OSXNCKRGMSHWSQ-ACRUOGEOSA-N Tyr-His-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSXNCKRGMSHWSQ-ACRUOGEOSA-N 0.000 description 6
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 6
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 6
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 6
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 6
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 6
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 6
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 6
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 6
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 6
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 6
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 6
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 6
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 6
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 6
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 6
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 6
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 6
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 6
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 6
- HPOSMQWRPMRMFO-GUBZILKMSA-N Val-Pro-Cys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HPOSMQWRPMRMFO-GUBZILKMSA-N 0.000 description 6
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 6
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 6
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 6
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 6
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 108010047495 alanylglycine Proteins 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010062796 arginyllysine Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 108010068265 aspartyltyrosine Proteins 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 108010079547 glutamylmethionine Proteins 0.000 description 6
- 108010089804 glycyl-threonine Proteins 0.000 description 6
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 239000001963 growth medium Substances 0.000 description 6
- 108010036413 histidylglycine Proteins 0.000 description 6
- 108010027338 isoleucylcysteine Proteins 0.000 description 6
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 6
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 6
- 108010091871 leucylmethionine Proteins 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010005942 methionylglycine Proteins 0.000 description 6
- 108010085203 methionylmethionine Proteins 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 6
- 238000004321 preservation Methods 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 108010015796 prolylisoleucine Proteins 0.000 description 6
- 108010090894 prolylleucine Proteins 0.000 description 6
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 6
- 108010071207 serylmethionine Proteins 0.000 description 6
- 108010084932 tryptophyl-proline Proteins 0.000 description 6
- 108010020532 tyrosyl-proline Proteins 0.000 description 6
- 108010003137 tyrosyltyrosine Proteins 0.000 description 6
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 6
- 108010009962 valyltyrosine Proteins 0.000 description 6
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 5
- 108010079364 N-glycylalanine Proteins 0.000 description 5
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 5
- 235000018102 proteins Nutrition 0.000 description 5
- 102000004169 proteins and genes Human genes 0.000 description 5
- 238000011218 seed culture Methods 0.000 description 5
- VTYYLEPIZMXCLO-UHFFFAOYSA-L Calcium carbonate Chemical compound [Ca+2].[O-]C([O-])=O VTYYLEPIZMXCLO-UHFFFAOYSA-L 0.000 description 4
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 4
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 230000002503 metabolic effect Effects 0.000 description 4
- GTZCVFVGUGFEME-UHFFFAOYSA-N trans-aconitic acid Natural products OC(=O)CC(C(O)=O)=CC(O)=O GTZCVFVGUGFEME-UHFFFAOYSA-N 0.000 description 4
- 239000001124 (E)-prop-1-ene-1,2,3-tricarboxylic acid Substances 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- 240000008042 Zea mays Species 0.000 description 3
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 3
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 3
- 229940091181 aconitic acid Drugs 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 3
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 3
- 235000011130 ammonium sulphate Nutrition 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- GTZCVFVGUGFEME-IWQZZHSRSA-N cis-aconitic acid Chemical compound OC(=O)C\C(C(O)=O)=C\C(O)=O GTZCVFVGUGFEME-IWQZZHSRSA-N 0.000 description 3
- 235000005822 corn Nutrition 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 235000013922 glutamic acid Nutrition 0.000 description 3
- 239000004220 glutamic acid Substances 0.000 description 3
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 3
- RUTXIHLAWFEWGM-UHFFFAOYSA-H iron(3+) sulfate Chemical compound [Fe+3].[Fe+3].[O-]S([O-])(=O)=O.[O-]S([O-])(=O)=O.[O-]S([O-])(=O)=O RUTXIHLAWFEWGM-UHFFFAOYSA-H 0.000 description 3
- 229910000360 iron(III) sulfate Inorganic materials 0.000 description 3
- 229960000310 isoleucine Drugs 0.000 description 3
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 3
- WRUGWIBCXHJTDG-UHFFFAOYSA-L magnesium sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Mg+2].[O-]S([O-])(=O)=O WRUGWIBCXHJTDG-UHFFFAOYSA-L 0.000 description 3
- 229940061634 magnesium sulfate heptahydrate Drugs 0.000 description 3
- 229940099596 manganese sulfate Drugs 0.000 description 3
- 239000011702 manganese sulphate Substances 0.000 description 3
- 235000007079 manganese sulphate Nutrition 0.000 description 3
- SQQMAOCOWKFBNP-UHFFFAOYSA-L manganese(II) sulfate Chemical compound [Mn+2].[O-]S([O-])(=O)=O SQQMAOCOWKFBNP-UHFFFAOYSA-L 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 3
- 235000019796 monopotassium phosphate Nutrition 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 239000000843 powder Substances 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 3
- 229960000268 spectinomycin Drugs 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 229910000019 calcium carbonate Inorganic materials 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000001952 enzyme assay Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 235000013379 molasses Nutrition 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K potassium phosphate Substances [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 230000003313 weakening effect Effects 0.000 description 2
- 101710159080 Aconitate hydratase A Proteins 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- ODBLHEXUDAPZAU-ZAFYKAAXSA-N D-threo-isocitric acid Chemical compound OC(=O)[C@H](O)[C@@H](C(O)=O)CC(O)=O ODBLHEXUDAPZAU-ZAFYKAAXSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 241000660147 Escherichia coli str. K-12 substr. MG1655 Species 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- YTRFFJUOYBMLPN-UHFFFAOYSA-N Ile-Lys-Lys-Ser Chemical compound CCC(C)C(N)C(=O)NC(CCCCN)C(=O)NC(CCCCN)C(=O)NC(CO)C(O)=O YTRFFJUOYBMLPN-UHFFFAOYSA-N 0.000 description 1
- ODBLHEXUDAPZAU-FONMRSAGSA-N Isocitric acid Natural products OC(=O)[C@@H](O)[C@H](C(O)=O)CC(O)=O ODBLHEXUDAPZAU-FONMRSAGSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 229940091179 aconitate Drugs 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 235000019730 animal feed additive Nutrition 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- -1 but not limited to Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000009655 industrial fermentation Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000012269 metabolic engineering Methods 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 238000009629 microbiological culture Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000005464 sample preparation method Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- ODBLHEXUDAPZAU-UHFFFAOYSA-N threo-D-isocitric acid Natural products OC(=O)C(O)C(C(O)=O)CC(O)=O ODBLHEXUDAPZAU-UHFFFAOYSA-N 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000004102 tricarboxylic acid cycle Effects 0.000 description 1
- 150000003628 tricarboxylic acids Chemical class 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
- C12P13/08—Lysine; Diaminopimelic acid; Threonine; Valine
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
- C12P13/20—Aspartic acid; Asparagine
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y402/00—Carbon-oxygen lyases (4.2)
- C12Y402/01—Hydro-lyases (4.2.1)
- C12Y402/01003—Aconitate hydratase (4.2.1.3)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention relates to the technical field of microorganisms, in particular to aconitate hydratase mutants and application thereof. The aconitate hydratase mutant provided by the invention has an amino acid sequence shown in any one of SEQ ID NO. 1-5. The aconitate hydratase mutant provided by the invention obviously reduces the enzyme activity of aconitate hydratase, and can enable oxaloacetate to flow to the synthesis path of aspartic acid and downstream metabolites thereof more. The recombinant microorganism constructed by the aconitate hydratase mutant has obviously improved lysine synthesis capability, obviously improved lysine yield and conversion rate, and better growth performance.
Description
Technical Field
The invention relates to the technical field of microorganisms, in particular to aconitate hydratase mutants and application thereof.
Background
L-lysine is basic essential amino acid with molecular formula of C 6 H 14 N 2 O 2 The appearance is white or almost white crystalline powder, darkens at 210 ℃, decomposes at 224.5 ℃, is easily soluble in water, slightly soluble in alcohol, and insoluble in ether. L-lysine is widely used in the fields of animal feed, medicine and food industry, of which about 90% is used in the feed industry and 10% is used in the food and medicine industry. The L-lysine is added into the animal feed additive to promote the organism to absorb other amino acids, thereby improving the quality of the feed.
At present, the most commonly used production method of L-lysine is a microbial fermentation method, and the microbial fermentation method has the advantages of low raw material cost, mild reaction conditions, easiness in realizing large-scale production and the like. Coli (Escherichia coli) has the advantages of high growth speed, clear genetic background, simple culture condition, mature metabolic engineering means and the like, is widely applied to the field of industrial fermentation, and can be used for producing L-amino acid, nucleotide, other organic acids and the like.
Aconitate hydratase (Aconitate hydratase A) is an enzyme in the tricarboxylic acid cycle that catalyzes two chemical reactions, namely conversion of citric acid to aconitic acid and conversion of aconitic acid to isocitric acid, respectively, whereas citric acid requires one enzyme catalytic step synthesis with oxaloacetic acid as a precursor, while oxaloacetic acid is also a precursor for synthesis of aspartic acid, whereas lysine is synthesized with aspartic acid as a precursor. Therefore, consumption of oxaloacetate can be reduced by weakening down metabolism of citric acid, which is advantageous for entrance of oxaloacetate into lysine synthesis pathway, thereby improving lysine yield. At present, the prior art reports on weakening aconitate hydratase expression, but no mutant with reduced enzyme activity obtained by mutating aconitate hydratase has been reported.
Disclosure of Invention
The invention aims to provide aconitate hydratase mutants and application thereof. Another object of the present invention is to provide a recombinant microorganism expressing the aconitate hydratase mutant and its use.
Specifically, the invention provides the following technical scheme:
the invention provides an aconitate hydratase mutant, which has an amino acid sequence shown in any one of SEQ ID NO. 1-5.
Aconitate hydratase mutants with sequences shown in SEQ ID No.1-5 are obtained by mutating 522 th amino acid into histidine (H), glutamic acid (E), isoleucine (I), tryptophan (W) and glycine (G) on the basis of aconitate hydratase of wild type escherichia coli.
The amino acid sequence of aconitate hydratase of wild type escherichia coli MG1655 is shown as SEQ ID NO. 6.
The invention discovers that mutation of 522 th amino acid of aconitate hydratase into the amino acid can obviously reduce the catalytic activity of aconitate hydratase, but can not lead to complete inactivation of the aconitate hydratase, the reduction of aconitate hydratase activity can lead oxaloacetic acid to be used for synthesizing aspartic acid more, enhance the metabolic flux of an aspartic acid pathway, further be beneficial to improving the synthesis of aspartic acid and metabolic products taking aspartic acid as precursors, and the residual aconitate hydratase activity can maintain the requirements of tricarboxylic acid circulation and bacterial growth, and the mutation can not lead to obvious inhibition of bacterial growth.
It will be appreciated by those skilled in the art that the addition of a tag protein to the N-or C-terminus of the aconitate hydratase mutant sequence or the fusion of the tag protein with other proteins to form a fusion protein will not significantly alter the activity of the aconitate hydratase mutant without altering the structure of the aconitate hydratase mutant itself, and therefore, the tagged protein or fusion protein is also within the scope of the invention.
The invention also provides a nucleic acid molecule encoding the aconitate hydratase mutant described above.
Based on the amino acid sequence of the aconitate hydratase mutants, one skilled in the art can determine the nucleotide sequence of the nucleic acid molecule encoding the mutants. Based on the degeneracy of the codons, more than one of the sequences of the above-mentioned nucleic acid molecules, all nucleic acid molecules capable of encoding the above-mentioned aconitate hydratase mutants are within the scope of the invention.
The invention also provides a biological material containing the nucleic acid molecule, wherein the biological material is an expression cassette, a vector or a host cell.
Wherein, the expression cassette may be a recombinant DNA molecule obtained by ligating elements for driving transcription and translation of the gene upstream or downstream of the gene.
The vector may be an expression vector or a cloning vector, including but not limited to, plasmid vectors, phage vectors, viral vectors, transposons, and the like.
The host cell may be a microbial cell.
On the basis of the aconitate hydratase mutant, the invention provides a recombinant microorganism which expresses the aconitate hydratase mutant.
Preferably, the recombinant microorganism does not express aconitate hydratase possessed by its starting strain.
Further preferably, in the recombinant microorganism, the gene encoding aconitate hydratase is replaced with the nucleic acid molecule encoding the aconitate hydratase mutant described above.
The recombinant microorganism described above is a bacterium of the genus Escherichia, preferably Escherichia coli (Escherichia coli).
The starting strain mentioned above refers to a starting strain for replacing the gene encoding aconitate hydratase with the gene encoding aconitate hydratase mutant, namely: the recombinant microorganism can be obtained by replacing the gene encoding aconitate hydratase of the original strain with the gene encoding aconitate hydratase mutant.
The recombinant microorganism obtained by mutating aconitate hydratase encoding genes in the initial strain into genes encoding aconitate hydratase mutants has the advantages that the yield and the conversion rate of aspartic acid or metabolic products taking aspartic acid as precursors are obviously improved compared with the initial strain, and the recombinant microorganism has better growth performance.
Preferably, the starting strain is E.coli capable of synthesizing and accumulating lysine.
As one embodiment of the invention, the starting strain is lysine-producing Escherichia coli MHZ-0914 obtained by mutagenesis, and the strain is preserved in China general microbiological culture Collection center (CGMCC, address: north Xielu No.1, 3 of the Beijing Chaoyang area, post code 100101 of the national academy of sciences of China) at 1/6/1 of 2021, and the preservation number is CGMCC No.22648, and the classification is named Escherichia coli.
The invention also provides a construction method of the recombinant microorganism, which comprises the following steps: and replacing a gene encoding aconitate hydratase in the original strain of the recombinant microorganism with a gene encoding the aconitate hydratase mutant.
The substitution of the above genes can be achieved by conventional means in the art, for example: and replacing a gene encoding aconitate hydratase on a chromosome of the original strain with a gene encoding the aconitate hydratase mutant by adopting a homologous recombination method.
The invention further provides the use of any one of said aconitate hydratase mutants or said nucleic acid molecules or said biological material or said recombinant microorganism as follows:
(1) Use in the construction of a production strain for the production of aspartic acid or a metabolite of aspartic acid as a synthetic precursor;
(2) Use in the fermentative production of aspartic acid or metabolites with aspartic acid as synthesis precursor;
(3) Use in increasing the yield and/or conversion of aspartic acid or a metabolite having aspartic acid as a synthetic precursor.
In the above (1), the production strain is preferably E.coli.
Preferably, the metabolic product of aspartic acid as a synthetic precursor according to the present invention is lysine.
The present invention provides a method for fermenting aspartic acid or a metabolite having aspartic acid as a synthetic precursor, the method comprising: culturing the recombinant microorganism to obtain a culture, and separating and extracting the culture to obtain aspartic acid or a metabolite taking the aspartic acid as a synthesis precursor.
Specifically, the method comprises the following steps: inoculating the recombinant microorganism into a seed culture medium for seed culture to obtain seed liquid, inoculating the seed liquid into a fermentation culture medium for culture to obtain fermentation liquor, and separating and extracting the fermentation liquor to obtain aspartic acid or a metabolite taking the aspartic acid as a synthesis precursor.
Preferably, the metabolite with aspartic acid as a synthetic precursor is lysine.
Preferably, the fermentation medium comprises the following components: 55-65g/L of glucose, 8-12g/L of molasses, 35-45g/L of ammonium sulfate, 8-12g/L of corn steep liquor, 1-2g/L of monopotassium phosphate, 0.5-1.5g/L of magnesium sulfate heptahydrate, 0.02-0.04g/L of ferric sulfate, 0.02-0.04g/L of manganese sulfate and 0-30g/L of calcium carbonate.
The invention has the beneficial effects that: the aconitate hydratase mutant provided by the invention obviously reduces the enzyme activity of aconitate hydratase, and can enable oxaloacetate to flow to the synthesis path of aspartic acid and downstream metabolites thereof more. The recombinant microorganism constructed by the aconitate hydratase mutant has obviously improved lysine synthesis capability, obviously improved lysine yield and conversion rate, and better growth performance.
Detailed Description
The amino acid sequence of the aconitate hydratase mutant provided by the invention is shown in any one of SEQ ID NO. 1-5.
The invention also provides a recombinant microorganism expressing the aconitate hydratase mutant, and the recombinant microorganism is preferably recombinant escherichia coli. The recombinant escherichia coli is obtained by mutating an original aconitate hydratase coding gene in an original strain into a coding gene of the aconitate hydratase mutant.
The present invention also provides a method for producing lysine by fermentation using the recombinant microorganism, the method comprising: culturing the recombinant microorganism to obtain a culture, and separating and extracting the culture to obtain lysine.
The following examples are illustrative of the invention and are not intended to limit the scope of the invention.
The following examples are not to be construed as limiting the specific techniques or conditions described in the literature in this field or as per the product specifications. The reagents or equipment used were conventional products available for purchase by regular vendors without the manufacturer's attention.
The names and sequences of the primers involved in the following examples are shown in Table 1.
TABLE 1 primer sequences
Example 1 contains the mutant Gene acnA S522H Construction of recombinant strains of (2)
In this example, escherichia coli MHZ-0914 (with the preservation number of CGMCC No. 22648) is used as an initial strain, and aconitate hydratase mutant (AcnA) expressed as SEQ ID No.1 is provided S522H ) The amino acid sequence of aconitic acid hydratase of the escherichia coli MHZ-0914 is shown as SEQ ID NO.6 through sequencing analysis.
The construction method of the recombinant escherichia coli comprises the following steps:
(1) pTargetF-N20 (acnA S522H) plasmid and Donor DNA construction
Step 1: using pTF-acnA-sgRNA-F/pTF-acnA-sgRNA-R as primers, using plasmid pTargetF as template (Multigene Editing in the Escherichiacoli Genome via the CRISPR-Cas9 System, jiang Y, chen B, et al, appl. EnvironMicrobiol, 2015), amplifying to obtain a linear plasmid of pTF with N20, assembling this linear plasmid using a seamless assembly ClonExpress kit at 37 ℃, then transforming Trans1-T1 competent cells to obtain plasmid pTargetF-N20 (acnA S522H), and subjecting the plasmid to PCR identification and sequencing verification;
step 2: the MHZ-0914 genome is used as a template, an acnAS522-UF/acnAS522H-UR primer pair is selected for amplification to obtain an upstream homology arm (1), an acnAS522H-DF/acnAS522-DR primer pair is selected for amplification to obtain a downstream homology arm (2), and the acnAS522-UF/acnAS522-DR primer pair is selected for amplification by using (1) and (2) as templates to obtain the Donor DNA.
(2) Competent cell preparation and electrotransformation
Step 1: the pCas plasmid (Multigene Editing in the Escherichia coliGenome via the CRISPR-Cas9 System, jiang Y, chen B, et al appl. EnvironMicrobiol, 2015) was electrotransferred into CGMCC No.22648 competent cells (both transformation and competent preparation methods refer to molecular clone III);
step 2: single colonies from step 1 were picked up and cultured to OD at 30℃at 200r/min in 5mL LB medium containing kanamycin and arabinose at a final concentration of 10mM 650 Electrotransformation competent cells were prepared after 0.4 (competent preparation methods reference molecular clone III);
step 3: the pTargetF-N20 (acnA S522H) plasmid constructed in (1) and the Donor DNA were simultaneously electrotransferred into cells with pCas competence (electrotransfer conditions: 2.5GV,200Ω, 25. Mu.F), plated on LB plates containing spectinomycin and kanamycin, and cultured at 30℃until single colonies were visible.
(3) Recombinant verification
Step 1: performing colony PCR verification on the single colony obtained in the step 3 of the step (2) by using a primer pair acnAS522H-F1/acnAS 522-R;
step 2: the PCR identification of the correct strain was amplified using the primer pair acnAS522-F/acnAS522-R and the amplified product was sequenced.
(4) Construction of recombinant strains with related plasmid loss
Step 1: picking the single colony which is verified to be correct by sequencing in the step (3), inoculating the single colony into 5mL of LB culture medium containing kanamycin and IPTG with the final concentration of 0.5mM, culturing overnight at 30 ℃, and streaking on an LB plate containing kanamycin;
step 2: selecting the single colony obtained in the step 1, and culturing the single colony on a LB plate containing kanamycin and spectinomycin and a LB plate only containing kanamycin at 30 ℃ overnight, if the single colony cannot grow on the LB plate containing kanamycin and spectinomycin, and growing on the LB plate containing kanamycin, wherein the single colony indicates that the pTargetF-N20 plasmid is lost;
step 3: selecting positive colonies lost by pTargetF-N20 plasmid, inoculating to an antibiotic-free LB medium, culturing at 42 ℃ for 8 hours, streaking on an LB plate, and culturing at 37 ℃ overnight;
step 4: picking single colony to be on the LB plate containing kanamycin and the non-resistant LB plate, if the colony can not grow on the LB plate containing kanamycin, the colony grows on the non-resistant LB plate, which shows that pCas plasmid is lost, and CGMCC No.22648-acnA is obtained S522H Compared with CGMCC No.22648 strain, the strain has mutated acnA encoding gene to make its encoding sequence shown in SEQ ID NO.1 aconitate hydratase mutant.
Example 2 containing the mutant Gene acnA S522E Construction of recombinant strains of (2)
In this example, escherichia coli MHZ-0914 (with the preservation number of CGMCC No. 22648) is used as an initial strain, and aconitate hydratase mutant (AcnA) expressed as SEQ ID No.2 is provided S522E ) The construction method of the recombinant E.coli of (2) refers to the method of example 1, and the obtained recombinant strain is named CGMCC No.22648-acnA S522E 。
Example 3 mutant Gene acnA S522I Recombinant strain construction of (2)
In this example, escherichia coli MHZ-0914 (with the preservation number of CGMCC No. 22648) is used as an initial strain, and aconitate hydratase mutant (AcnA) expressed as SEQ ID No.3 is provided S522I ) The construction method of the recombinant E.coli of (2) refers to the method of example 1, and the obtained recombinant strain is named CGMCC No.22648-acnA S522I 。
Example 4 mutant Gene acnA S522W Recombinant strain construction of (2)
In this example, escherichia coli MHZ-0914 (with the preservation number of CGMCC No. 22648) is used as an initial strain, and aconitate hydratase mutant (AcnA) expressed as SEQ ID No.4 is provided S522W ) The construction method of the recombinant E.coli of (2) refers to the method of example 1, and the obtained recombinant strain is named CGMCC No.22648-acnA S522W 。
Example 5 mutant Gene acnA S522G Recombinant strain construction of (2)
In this example, escherichia coli MHZ-0914 (with the preservation number of CGMCC No. 22648) is used as an initial strain, and aconitate hydratase mutant (AcnA) expressed as SEQ ID No.5 is provided S522G ) The construction method of the recombinant E.coli of (2) refers to the method of example 1, and the obtained recombinant strain is named CGMCC No.22648-acnA S522G 。
EXAMPLE 6 lysine fermentation experiment
The recombinant strains constructed in examples 1 to 5 were subjected to lysine fermentation experiments, and the medium formulation used in the lysine fermentation process was as follows:
seed activation medium: 10g/L peptone, 10g/L NaCl, 5g/L yeast powder and 18g/L agar powder, and adjusting the pH value to 7.0.
Seed culture medium: glucose 20g/L, ammonium sulfate 4g/L, corn steep liquor 2.0g/L, monopotassium phosphate 3g/L, magnesium sulfate heptahydrate 0.4g/L, ferric sulfate 0.01g/L and manganese sulfate 0.01g/L, and adjusting the pH to 7.0.
Fermentation medium: glucose 60g/L, molasses 10g/L, ammonium sulfate 40g/L, corn steep liquor 10g/L, potassium dihydrogen phosphate 1.6g/L, magnesium sulfate heptahydrate 1.0g/L, ferric sulfate 0.03g/L, manganese sulfate 0.03g/L, calcium carbonate 25g/L, and pH adjusted to 7.0.
The lysine fermentation method comprises the following steps:
1. seed activation: taking the strain to be verified from the freezing tube, streaking and activating on a seed activation culture medium, and culturing for 12 hours at 37 ℃;
2. seed culture: the plate activated seeds 1 are picked and looped into a 500mL triangular flask filled with 20mL of seed culture medium, and subjected to shaking culture at 33 ℃ and 220r/min for 7h to obtain seed liquid;
3. fermentation culture: 2mL of seed solution is inoculated into a 500mL triangular flask filled with 30mL of fermentation medium, and is subjected to shaking culture for 12h at 37 ℃ and 220r/min, and three strains are parallel to each other, so that fermentation liquid is obtained.
4、OD 600 And (3) measuring: diluting 100 μl of fermentation broth by a proper multiple, detecting OD at 600nm wavelength with spectrophotometer, performing three parallels for each strain, calculating average value, and detecting OD 600 As shown in table 2.
5. Lysine concentration measurement: 2mL of the fermentation broth was centrifuged (12000 rpm,2 min), the supernatant was collected, the content of L-lysine in the fermentation broth of the recombinant bacteria and the control bacteria was measured by HPLC, three bacteria were used in parallel, and the average value was calculated, and the measured lysine concentration was shown in Table 2.
6. Enzyme activity determination: aconitate hydratase activity assay was carried out using a aconitate enzyme assay kit from Biovision company (product number K716-100). The sample preparation method is as follows: the cells were collected by centrifugation at 12000rpm at 4℃for 10min, and the cells were dissolved in a pre-chilled buffer, sonicated for 20s, and the supernatant was collected for enzyme activity assay. Three strains were prepared in parallel, the average value was calculated, and the enzyme activities of aconitate hydratase detected are shown in Table 2.
TABLE 2 lysine production, growth and enzyme Activity detection of recombinant strains
Strain | L-lysine (g/L) | Sugar acid conversion% | OD 600 | Enzyme activity U/mg |
CGMCC No.22648 | 18.9 | 28.6 | 15.1 | 2.49 |
CGMCC No.22648-acnA S522H | 21.5 | 32.9 | 14.8 | 2.18 |
CGMCC No.22648-acnA S522E | 22.0 | 33.8 | 15.0 | 2.15 |
CGMCC No.22648-acnA S522I | 21.4 | 32.8 | 14.9 | 2.19 |
CGMCC No.22648-acnA S522W | 22.6 | 34.8 | 14.4 | 1.98 |
CGMCC No.22648-acnA S522G | 20.5 | 31.1 | 14.9 | 2.32 |
Fermentation results show that 522 th ammonia of acnA gene coding protein in original strainAfter the mutation of the amino acid from serine (S) to histidine (H), glutamic acid (E), isoleucine (I), tryptophan (W) and glycine (G), the activity of aconitate hydratase of the obtained recombinant strain was reduced to some extent as compared with that of the enzyme of the starting strain (same amino acid sequence as AcnA of wild-type E.coli MG 1655), and the activity was significantly different (P < 0.05). The final OD of the recombinant strain is equivalent to that of the original strain, but the lysine yield and the conversion rate are both obviously improved (P is less than 0.05), wherein the effect of mutation of 522 th amino acid into histidine (H), glutamic acid (E), isoleucine (I) and tryptophan (W) is obviously better than that of mutation into glycine (G), the effect of mutation of 522 th amino acid into tryptophan (W) is optimal, and the recombinant strain CGMCC No.22648-acnA S522W The L-lysine yield of the strain is improved by 3.7g/L compared with the original strain, and the conversion rate is improved by 6.2 percent compared with the original strain.
While the invention has been described in detail in the foregoing general description and with reference to specific embodiments thereof, it will be apparent to one skilled in the art that modifications and improvements can be made thereto. Accordingly, such modifications or improvements may be made without departing from the spirit of the invention and are intended to be within the scope of the invention as claimed.
Sequence listing
<110> gallery plum blossom biotechnology development Co., ltd
<120> aconitate hydratase mutant and application thereof
<130> KHP211125945.0
<160> 27
<170> SIPOSequenceListing 1.0
<210> 1
<211> 891
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 1
Met Ser Ser Thr Leu Arg Glu Ala Ser Lys Asp Thr Leu Gln Ala Lys
1 5 10 15
Asp Lys Thr Tyr His Tyr Tyr Ser Leu Pro Leu Ala Ala Lys Ser Leu
20 25 30
Gly Asp Ile Thr Arg Leu Pro Lys Ser Leu Lys Val Leu Leu Glu Asn
35 40 45
Leu Leu Arg Trp Gln Asp Gly Asn Ser Val Thr Glu Glu Asp Ile His
50 55 60
Ala Leu Ala Gly Trp Leu Lys Asn Ala His Ala Asp Arg Glu Ile Ala
65 70 75 80
Tyr Arg Pro Ala Arg Val Leu Met Gln Asp Phe Thr Gly Val Pro Ala
85 90 95
Val Val Asp Leu Ala Ala Met Arg Glu Ala Val Lys Arg Leu Gly Gly
100 105 110
Asp Thr Ala Lys Val Asn Pro Leu Ser Pro Val Asp Leu Val Ile Asp
115 120 125
His Ser Val Thr Val Asp Arg Phe Gly Asp Asp Glu Ala Phe Glu Glu
130 135 140
Asn Val Arg Leu Glu Met Glu Arg Asn His Glu Arg Tyr Val Phe Leu
145 150 155 160
Lys Trp Gly Lys Gln Ala Phe Ser Arg Phe Ser Val Val Pro Pro Gly
165 170 175
Thr Gly Ile Cys His Gln Val Asn Leu Glu Tyr Leu Gly Lys Ala Val
180 185 190
Trp Ser Glu Leu Gln Asp Gly Glu Trp Ile Ala Tyr Pro Asp Thr Leu
195 200 205
Val Gly Thr Asp Ser His Thr Thr Met Ile Asn Gly Leu Gly Val Leu
210 215 220
Gly Trp Gly Val Gly Gly Ile Glu Ala Glu Ala Ala Met Leu Gly Gln
225 230 235 240
Pro Val Ser Met Leu Ile Pro Asp Val Val Gly Phe Lys Leu Thr Gly
245 250 255
Lys Leu Arg Glu Gly Ile Thr Ala Thr Asp Leu Val Leu Thr Val Thr
260 265 270
Gln Met Leu Arg Lys His Gly Val Val Gly Lys Phe Val Glu Phe Tyr
275 280 285
Gly Asp Gly Leu Asp Ser Leu Pro Leu Ala Asp Arg Ala Thr Ile Ala
290 295 300
Asn Met Ser Pro Glu Tyr Gly Ala Thr Cys Gly Phe Phe Pro Ile Asp
305 310 315 320
Ala Val Thr Leu Asp Tyr Met Arg Leu Ser Gly Arg Ser Glu Asp Gln
325 330 335
Val Glu Leu Val Glu Lys Tyr Ala Lys Ala Gln Gly Met Trp Arg Asn
340 345 350
Pro Gly Asp Glu Pro Ile Phe Thr Ser Thr Leu Glu Leu Asp Met Asn
355 360 365
Asp Val Glu Ala Ser Leu Ala Gly Pro Lys Arg Pro Gln Asp Arg Val
370 375 380
Ala Leu Pro Asp Val Pro Lys Ala Phe Ala Ala Ser Asn Glu Leu Glu
385 390 395 400
Val Asn Ala Thr His Lys Asp Arg Gln Pro Val Asp Tyr Val Met Asn
405 410 415
Gly His Gln Tyr Gln Leu Pro Asp Gly Ala Val Val Ile Ala Ala Ile
420 425 430
Thr Ser Cys Thr Asn Thr Ser Asn Pro Ser Val Leu Met Ala Ala Gly
435 440 445
Leu Leu Ala Lys Lys Ala Val Thr Leu Gly Leu Lys Arg Gln Pro Trp
450 455 460
Val Lys Ala Ser Leu Ala Pro Gly Ser Lys Val Val Ser Asp Tyr Leu
465 470 475 480
Ala Lys Ala Lys Leu Thr Pro Tyr Leu Asp Glu Leu Gly Phe Asn Leu
485 490 495
Val Gly Tyr Gly Cys Thr Thr Cys Ile Gly Asn Ser Gly Pro Leu Pro
500 505 510
Asp Pro Ile Glu Thr Ala Ile Lys Lys His Asp Leu Thr Val Gly Ala
515 520 525
Val Leu Ser Gly Asn Arg Asn Phe Glu Gly Arg Ile His Pro Leu Val
530 535 540
Lys Thr Asn Trp Leu Ala Ser Pro Pro Leu Val Val Ala Tyr Ala Leu
545 550 555 560
Ala Gly Asn Met Asn Ile Asn Leu Ala Ser Glu Pro Ile Gly His Asp
565 570 575
Arg Lys Gly Asp Pro Val Tyr Leu Lys Asp Ile Trp Pro Ser Ala Gln
580 585 590
Glu Ile Ala Arg Ala Val Glu Gln Val Ser Thr Glu Met Phe Arg Lys
595 600 605
Glu Tyr Ala Glu Val Phe Glu Gly Thr Ala Glu Trp Lys Gly Ile Asn
610 615 620
Val Thr Arg Ser Asp Thr Tyr Gly Trp Gln Glu Asp Ser Thr Tyr Ile
625 630 635 640
Arg Leu Ser Pro Phe Phe Asp Glu Met Gln Ala Thr Pro Ala Pro Val
645 650 655
Glu Asp Ile His Gly Ala Arg Ile Leu Ala Met Leu Gly Asp Ser Val
660 665 670
Thr Thr Asp His Ile Ser Pro Ala Gly Ser Ile Lys Pro Asp Ser Pro
675 680 685
Ala Gly Arg Tyr Leu Gln Gly Arg Gly Val Glu Arg Lys Asp Phe Asn
690 695 700
Ser Tyr Gly Ser Arg Arg Gly Asn His Glu Val Met Met Arg Gly Thr
705 710 715 720
Phe Ala Asn Ile Arg Ile Arg Asn Glu Met Val Pro Gly Val Glu Gly
725 730 735
Gly Met Thr Arg His Leu Pro Asp Ser Asp Val Val Ser Ile Tyr Asp
740 745 750
Ala Ala Met Arg Tyr Lys Gln Glu Gln Thr Pro Leu Ala Val Ile Ala
755 760 765
Gly Lys Glu Tyr Gly Ser Gly Ser Ser Arg Asp Trp Ala Ala Lys Gly
770 775 780
Pro Arg Leu Leu Gly Ile Arg Val Val Ile Ala Glu Ser Phe Glu Arg
785 790 795 800
Ile His Arg Ser Asn Leu Ile Gly Met Gly Ile Leu Pro Leu Glu Phe
805 810 815
Pro Gln Gly Val Thr Arg Lys Thr Leu Gly Leu Thr Gly Glu Glu Lys
820 825 830
Ile Asp Ile Gly Asp Leu Gln Asn Leu Gln Pro Gly Ala Thr Val Pro
835 840 845
Val Thr Leu Thr Arg Ala Asp Gly Ser Gln Glu Val Val Pro Cys Arg
850 855 860
Cys Arg Ile Asp Thr Ala Thr Glu Leu Thr Tyr Tyr Gln Asn Asp Gly
865 870 875 880
Ile Leu His Tyr Val Ile Arg Asn Met Leu Lys
885 890
<210> 2
<211> 891
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 2
Met Ser Ser Thr Leu Arg Glu Ala Ser Lys Asp Thr Leu Gln Ala Lys
1 5 10 15
Asp Lys Thr Tyr His Tyr Tyr Ser Leu Pro Leu Ala Ala Lys Ser Leu
20 25 30
Gly Asp Ile Thr Arg Leu Pro Lys Ser Leu Lys Val Leu Leu Glu Asn
35 40 45
Leu Leu Arg Trp Gln Asp Gly Asn Ser Val Thr Glu Glu Asp Ile His
50 55 60
Ala Leu Ala Gly Trp Leu Lys Asn Ala His Ala Asp Arg Glu Ile Ala
65 70 75 80
Tyr Arg Pro Ala Arg Val Leu Met Gln Asp Phe Thr Gly Val Pro Ala
85 90 95
Val Val Asp Leu Ala Ala Met Arg Glu Ala Val Lys Arg Leu Gly Gly
100 105 110
Asp Thr Ala Lys Val Asn Pro Leu Ser Pro Val Asp Leu Val Ile Asp
115 120 125
His Ser Val Thr Val Asp Arg Phe Gly Asp Asp Glu Ala Phe Glu Glu
130 135 140
Asn Val Arg Leu Glu Met Glu Arg Asn His Glu Arg Tyr Val Phe Leu
145 150 155 160
Lys Trp Gly Lys Gln Ala Phe Ser Arg Phe Ser Val Val Pro Pro Gly
165 170 175
Thr Gly Ile Cys His Gln Val Asn Leu Glu Tyr Leu Gly Lys Ala Val
180 185 190
Trp Ser Glu Leu Gln Asp Gly Glu Trp Ile Ala Tyr Pro Asp Thr Leu
195 200 205
Val Gly Thr Asp Ser His Thr Thr Met Ile Asn Gly Leu Gly Val Leu
210 215 220
Gly Trp Gly Val Gly Gly Ile Glu Ala Glu Ala Ala Met Leu Gly Gln
225 230 235 240
Pro Val Ser Met Leu Ile Pro Asp Val Val Gly Phe Lys Leu Thr Gly
245 250 255
Lys Leu Arg Glu Gly Ile Thr Ala Thr Asp Leu Val Leu Thr Val Thr
260 265 270
Gln Met Leu Arg Lys His Gly Val Val Gly Lys Phe Val Glu Phe Tyr
275 280 285
Gly Asp Gly Leu Asp Ser Leu Pro Leu Ala Asp Arg Ala Thr Ile Ala
290 295 300
Asn Met Ser Pro Glu Tyr Gly Ala Thr Cys Gly Phe Phe Pro Ile Asp
305 310 315 320
Ala Val Thr Leu Asp Tyr Met Arg Leu Ser Gly Arg Ser Glu Asp Gln
325 330 335
Val Glu Leu Val Glu Lys Tyr Ala Lys Ala Gln Gly Met Trp Arg Asn
340 345 350
Pro Gly Asp Glu Pro Ile Phe Thr Ser Thr Leu Glu Leu Asp Met Asn
355 360 365
Asp Val Glu Ala Ser Leu Ala Gly Pro Lys Arg Pro Gln Asp Arg Val
370 375 380
Ala Leu Pro Asp Val Pro Lys Ala Phe Ala Ala Ser Asn Glu Leu Glu
385 390 395 400
Val Asn Ala Thr His Lys Asp Arg Gln Pro Val Asp Tyr Val Met Asn
405 410 415
Gly His Gln Tyr Gln Leu Pro Asp Gly Ala Val Val Ile Ala Ala Ile
420 425 430
Thr Ser Cys Thr Asn Thr Ser Asn Pro Ser Val Leu Met Ala Ala Gly
435 440 445
Leu Leu Ala Lys Lys Ala Val Thr Leu Gly Leu Lys Arg Gln Pro Trp
450 455 460
Val Lys Ala Ser Leu Ala Pro Gly Ser Lys Val Val Ser Asp Tyr Leu
465 470 475 480
Ala Lys Ala Lys Leu Thr Pro Tyr Leu Asp Glu Leu Gly Phe Asn Leu
485 490 495
Val Gly Tyr Gly Cys Thr Thr Cys Ile Gly Asn Ser Gly Pro Leu Pro
500 505 510
Asp Pro Ile Glu Thr Ala Ile Lys Lys Glu Asp Leu Thr Val Gly Ala
515 520 525
Val Leu Ser Gly Asn Arg Asn Phe Glu Gly Arg Ile His Pro Leu Val
530 535 540
Lys Thr Asn Trp Leu Ala Ser Pro Pro Leu Val Val Ala Tyr Ala Leu
545 550 555 560
Ala Gly Asn Met Asn Ile Asn Leu Ala Ser Glu Pro Ile Gly His Asp
565 570 575
Arg Lys Gly Asp Pro Val Tyr Leu Lys Asp Ile Trp Pro Ser Ala Gln
580 585 590
Glu Ile Ala Arg Ala Val Glu Gln Val Ser Thr Glu Met Phe Arg Lys
595 600 605
Glu Tyr Ala Glu Val Phe Glu Gly Thr Ala Glu Trp Lys Gly Ile Asn
610 615 620
Val Thr Arg Ser Asp Thr Tyr Gly Trp Gln Glu Asp Ser Thr Tyr Ile
625 630 635 640
Arg Leu Ser Pro Phe Phe Asp Glu Met Gln Ala Thr Pro Ala Pro Val
645 650 655
Glu Asp Ile His Gly Ala Arg Ile Leu Ala Met Leu Gly Asp Ser Val
660 665 670
Thr Thr Asp His Ile Ser Pro Ala Gly Ser Ile Lys Pro Asp Ser Pro
675 680 685
Ala Gly Arg Tyr Leu Gln Gly Arg Gly Val Glu Arg Lys Asp Phe Asn
690 695 700
Ser Tyr Gly Ser Arg Arg Gly Asn His Glu Val Met Met Arg Gly Thr
705 710 715 720
Phe Ala Asn Ile Arg Ile Arg Asn Glu Met Val Pro Gly Val Glu Gly
725 730 735
Gly Met Thr Arg His Leu Pro Asp Ser Asp Val Val Ser Ile Tyr Asp
740 745 750
Ala Ala Met Arg Tyr Lys Gln Glu Gln Thr Pro Leu Ala Val Ile Ala
755 760 765
Gly Lys Glu Tyr Gly Ser Gly Ser Ser Arg Asp Trp Ala Ala Lys Gly
770 775 780
Pro Arg Leu Leu Gly Ile Arg Val Val Ile Ala Glu Ser Phe Glu Arg
785 790 795 800
Ile His Arg Ser Asn Leu Ile Gly Met Gly Ile Leu Pro Leu Glu Phe
805 810 815
Pro Gln Gly Val Thr Arg Lys Thr Leu Gly Leu Thr Gly Glu Glu Lys
820 825 830
Ile Asp Ile Gly Asp Leu Gln Asn Leu Gln Pro Gly Ala Thr Val Pro
835 840 845
Val Thr Leu Thr Arg Ala Asp Gly Ser Gln Glu Val Val Pro Cys Arg
850 855 860
Cys Arg Ile Asp Thr Ala Thr Glu Leu Thr Tyr Tyr Gln Asn Asp Gly
865 870 875 880
Ile Leu His Tyr Val Ile Arg Asn Met Leu Lys
885 890
<210> 3
<211> 891
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 3
Met Ser Ser Thr Leu Arg Glu Ala Ser Lys Asp Thr Leu Gln Ala Lys
1 5 10 15
Asp Lys Thr Tyr His Tyr Tyr Ser Leu Pro Leu Ala Ala Lys Ser Leu
20 25 30
Gly Asp Ile Thr Arg Leu Pro Lys Ser Leu Lys Val Leu Leu Glu Asn
35 40 45
Leu Leu Arg Trp Gln Asp Gly Asn Ser Val Thr Glu Glu Asp Ile His
50 55 60
Ala Leu Ala Gly Trp Leu Lys Asn Ala His Ala Asp Arg Glu Ile Ala
65 70 75 80
Tyr Arg Pro Ala Arg Val Leu Met Gln Asp Phe Thr Gly Val Pro Ala
85 90 95
Val Val Asp Leu Ala Ala Met Arg Glu Ala Val Lys Arg Leu Gly Gly
100 105 110
Asp Thr Ala Lys Val Asn Pro Leu Ser Pro Val Asp Leu Val Ile Asp
115 120 125
His Ser Val Thr Val Asp Arg Phe Gly Asp Asp Glu Ala Phe Glu Glu
130 135 140
Asn Val Arg Leu Glu Met Glu Arg Asn His Glu Arg Tyr Val Phe Leu
145 150 155 160
Lys Trp Gly Lys Gln Ala Phe Ser Arg Phe Ser Val Val Pro Pro Gly
165 170 175
Thr Gly Ile Cys His Gln Val Asn Leu Glu Tyr Leu Gly Lys Ala Val
180 185 190
Trp Ser Glu Leu Gln Asp Gly Glu Trp Ile Ala Tyr Pro Asp Thr Leu
195 200 205
Val Gly Thr Asp Ser His Thr Thr Met Ile Asn Gly Leu Gly Val Leu
210 215 220
Gly Trp Gly Val Gly Gly Ile Glu Ala Glu Ala Ala Met Leu Gly Gln
225 230 235 240
Pro Val Ser Met Leu Ile Pro Asp Val Val Gly Phe Lys Leu Thr Gly
245 250 255
Lys Leu Arg Glu Gly Ile Thr Ala Thr Asp Leu Val Leu Thr Val Thr
260 265 270
Gln Met Leu Arg Lys His Gly Val Val Gly Lys Phe Val Glu Phe Tyr
275 280 285
Gly Asp Gly Leu Asp Ser Leu Pro Leu Ala Asp Arg Ala Thr Ile Ala
290 295 300
Asn Met Ser Pro Glu Tyr Gly Ala Thr Cys Gly Phe Phe Pro Ile Asp
305 310 315 320
Ala Val Thr Leu Asp Tyr Met Arg Leu Ser Gly Arg Ser Glu Asp Gln
325 330 335
Val Glu Leu Val Glu Lys Tyr Ala Lys Ala Gln Gly Met Trp Arg Asn
340 345 350
Pro Gly Asp Glu Pro Ile Phe Thr Ser Thr Leu Glu Leu Asp Met Asn
355 360 365
Asp Val Glu Ala Ser Leu Ala Gly Pro Lys Arg Pro Gln Asp Arg Val
370 375 380
Ala Leu Pro Asp Val Pro Lys Ala Phe Ala Ala Ser Asn Glu Leu Glu
385 390 395 400
Val Asn Ala Thr His Lys Asp Arg Gln Pro Val Asp Tyr Val Met Asn
405 410 415
Gly His Gln Tyr Gln Leu Pro Asp Gly Ala Val Val Ile Ala Ala Ile
420 425 430
Thr Ser Cys Thr Asn Thr Ser Asn Pro Ser Val Leu Met Ala Ala Gly
435 440 445
Leu Leu Ala Lys Lys Ala Val Thr Leu Gly Leu Lys Arg Gln Pro Trp
450 455 460
Val Lys Ala Ser Leu Ala Pro Gly Ser Lys Val Val Ser Asp Tyr Leu
465 470 475 480
Ala Lys Ala Lys Leu Thr Pro Tyr Leu Asp Glu Leu Gly Phe Asn Leu
485 490 495
Val Gly Tyr Gly Cys Thr Thr Cys Ile Gly Asn Ser Gly Pro Leu Pro
500 505 510
Asp Pro Ile Glu Thr Ala Ile Lys Lys Ile Asp Leu Thr Val Gly Ala
515 520 525
Val Leu Ser Gly Asn Arg Asn Phe Glu Gly Arg Ile His Pro Leu Val
530 535 540
Lys Thr Asn Trp Leu Ala Ser Pro Pro Leu Val Val Ala Tyr Ala Leu
545 550 555 560
Ala Gly Asn Met Asn Ile Asn Leu Ala Ser Glu Pro Ile Gly His Asp
565 570 575
Arg Lys Gly Asp Pro Val Tyr Leu Lys Asp Ile Trp Pro Ser Ala Gln
580 585 590
Glu Ile Ala Arg Ala Val Glu Gln Val Ser Thr Glu Met Phe Arg Lys
595 600 605
Glu Tyr Ala Glu Val Phe Glu Gly Thr Ala Glu Trp Lys Gly Ile Asn
610 615 620
Val Thr Arg Ser Asp Thr Tyr Gly Trp Gln Glu Asp Ser Thr Tyr Ile
625 630 635 640
Arg Leu Ser Pro Phe Phe Asp Glu Met Gln Ala Thr Pro Ala Pro Val
645 650 655
Glu Asp Ile His Gly Ala Arg Ile Leu Ala Met Leu Gly Asp Ser Val
660 665 670
Thr Thr Asp His Ile Ser Pro Ala Gly Ser Ile Lys Pro Asp Ser Pro
675 680 685
Ala Gly Arg Tyr Leu Gln Gly Arg Gly Val Glu Arg Lys Asp Phe Asn
690 695 700
Ser Tyr Gly Ser Arg Arg Gly Asn His Glu Val Met Met Arg Gly Thr
705 710 715 720
Phe Ala Asn Ile Arg Ile Arg Asn Glu Met Val Pro Gly Val Glu Gly
725 730 735
Gly Met Thr Arg His Leu Pro Asp Ser Asp Val Val Ser Ile Tyr Asp
740 745 750
Ala Ala Met Arg Tyr Lys Gln Glu Gln Thr Pro Leu Ala Val Ile Ala
755 760 765
Gly Lys Glu Tyr Gly Ser Gly Ser Ser Arg Asp Trp Ala Ala Lys Gly
770 775 780
Pro Arg Leu Leu Gly Ile Arg Val Val Ile Ala Glu Ser Phe Glu Arg
785 790 795 800
Ile His Arg Ser Asn Leu Ile Gly Met Gly Ile Leu Pro Leu Glu Phe
805 810 815
Pro Gln Gly Val Thr Arg Lys Thr Leu Gly Leu Thr Gly Glu Glu Lys
820 825 830
Ile Asp Ile Gly Asp Leu Gln Asn Leu Gln Pro Gly Ala Thr Val Pro
835 840 845
Val Thr Leu Thr Arg Ala Asp Gly Ser Gln Glu Val Val Pro Cys Arg
850 855 860
Cys Arg Ile Asp Thr Ala Thr Glu Leu Thr Tyr Tyr Gln Asn Asp Gly
865 870 875 880
Ile Leu His Tyr Val Ile Arg Asn Met Leu Lys
885 890
<210> 4
<211> 891
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 4
Met Ser Ser Thr Leu Arg Glu Ala Ser Lys Asp Thr Leu Gln Ala Lys
1 5 10 15
Asp Lys Thr Tyr His Tyr Tyr Ser Leu Pro Leu Ala Ala Lys Ser Leu
20 25 30
Gly Asp Ile Thr Arg Leu Pro Lys Ser Leu Lys Val Leu Leu Glu Asn
35 40 45
Leu Leu Arg Trp Gln Asp Gly Asn Ser Val Thr Glu Glu Asp Ile His
50 55 60
Ala Leu Ala Gly Trp Leu Lys Asn Ala His Ala Asp Arg Glu Ile Ala
65 70 75 80
Tyr Arg Pro Ala Arg Val Leu Met Gln Asp Phe Thr Gly Val Pro Ala
85 90 95
Val Val Asp Leu Ala Ala Met Arg Glu Ala Val Lys Arg Leu Gly Gly
100 105 110
Asp Thr Ala Lys Val Asn Pro Leu Ser Pro Val Asp Leu Val Ile Asp
115 120 125
His Ser Val Thr Val Asp Arg Phe Gly Asp Asp Glu Ala Phe Glu Glu
130 135 140
Asn Val Arg Leu Glu Met Glu Arg Asn His Glu Arg Tyr Val Phe Leu
145 150 155 160
Lys Trp Gly Lys Gln Ala Phe Ser Arg Phe Ser Val Val Pro Pro Gly
165 170 175
Thr Gly Ile Cys His Gln Val Asn Leu Glu Tyr Leu Gly Lys Ala Val
180 185 190
Trp Ser Glu Leu Gln Asp Gly Glu Trp Ile Ala Tyr Pro Asp Thr Leu
195 200 205
Val Gly Thr Asp Ser His Thr Thr Met Ile Asn Gly Leu Gly Val Leu
210 215 220
Gly Trp Gly Val Gly Gly Ile Glu Ala Glu Ala Ala Met Leu Gly Gln
225 230 235 240
Pro Val Ser Met Leu Ile Pro Asp Val Val Gly Phe Lys Leu Thr Gly
245 250 255
Lys Leu Arg Glu Gly Ile Thr Ala Thr Asp Leu Val Leu Thr Val Thr
260 265 270
Gln Met Leu Arg Lys His Gly Val Val Gly Lys Phe Val Glu Phe Tyr
275 280 285
Gly Asp Gly Leu Asp Ser Leu Pro Leu Ala Asp Arg Ala Thr Ile Ala
290 295 300
Asn Met Ser Pro Glu Tyr Gly Ala Thr Cys Gly Phe Phe Pro Ile Asp
305 310 315 320
Ala Val Thr Leu Asp Tyr Met Arg Leu Ser Gly Arg Ser Glu Asp Gln
325 330 335
Val Glu Leu Val Glu Lys Tyr Ala Lys Ala Gln Gly Met Trp Arg Asn
340 345 350
Pro Gly Asp Glu Pro Ile Phe Thr Ser Thr Leu Glu Leu Asp Met Asn
355 360 365
Asp Val Glu Ala Ser Leu Ala Gly Pro Lys Arg Pro Gln Asp Arg Val
370 375 380
Ala Leu Pro Asp Val Pro Lys Ala Phe Ala Ala Ser Asn Glu Leu Glu
385 390 395 400
Val Asn Ala Thr His Lys Asp Arg Gln Pro Val Asp Tyr Val Met Asn
405 410 415
Gly His Gln Tyr Gln Leu Pro Asp Gly Ala Val Val Ile Ala Ala Ile
420 425 430
Thr Ser Cys Thr Asn Thr Ser Asn Pro Ser Val Leu Met Ala Ala Gly
435 440 445
Leu Leu Ala Lys Lys Ala Val Thr Leu Gly Leu Lys Arg Gln Pro Trp
450 455 460
Val Lys Ala Ser Leu Ala Pro Gly Ser Lys Val Val Ser Asp Tyr Leu
465 470 475 480
Ala Lys Ala Lys Leu Thr Pro Tyr Leu Asp Glu Leu Gly Phe Asn Leu
485 490 495
Val Gly Tyr Gly Cys Thr Thr Cys Ile Gly Asn Ser Gly Pro Leu Pro
500 505 510
Asp Pro Ile Glu Thr Ala Ile Lys Lys Trp Asp Leu Thr Val Gly Ala
515 520 525
Val Leu Ser Gly Asn Arg Asn Phe Glu Gly Arg Ile His Pro Leu Val
530 535 540
Lys Thr Asn Trp Leu Ala Ser Pro Pro Leu Val Val Ala Tyr Ala Leu
545 550 555 560
Ala Gly Asn Met Asn Ile Asn Leu Ala Ser Glu Pro Ile Gly His Asp
565 570 575
Arg Lys Gly Asp Pro Val Tyr Leu Lys Asp Ile Trp Pro Ser Ala Gln
580 585 590
Glu Ile Ala Arg Ala Val Glu Gln Val Ser Thr Glu Met Phe Arg Lys
595 600 605
Glu Tyr Ala Glu Val Phe Glu Gly Thr Ala Glu Trp Lys Gly Ile Asn
610 615 620
Val Thr Arg Ser Asp Thr Tyr Gly Trp Gln Glu Asp Ser Thr Tyr Ile
625 630 635 640
Arg Leu Ser Pro Phe Phe Asp Glu Met Gln Ala Thr Pro Ala Pro Val
645 650 655
Glu Asp Ile His Gly Ala Arg Ile Leu Ala Met Leu Gly Asp Ser Val
660 665 670
Thr Thr Asp His Ile Ser Pro Ala Gly Ser Ile Lys Pro Asp Ser Pro
675 680 685
Ala Gly Arg Tyr Leu Gln Gly Arg Gly Val Glu Arg Lys Asp Phe Asn
690 695 700
Ser Tyr Gly Ser Arg Arg Gly Asn His Glu Val Met Met Arg Gly Thr
705 710 715 720
Phe Ala Asn Ile Arg Ile Arg Asn Glu Met Val Pro Gly Val Glu Gly
725 730 735
Gly Met Thr Arg His Leu Pro Asp Ser Asp Val Val Ser Ile Tyr Asp
740 745 750
Ala Ala Met Arg Tyr Lys Gln Glu Gln Thr Pro Leu Ala Val Ile Ala
755 760 765
Gly Lys Glu Tyr Gly Ser Gly Ser Ser Arg Asp Trp Ala Ala Lys Gly
770 775 780
Pro Arg Leu Leu Gly Ile Arg Val Val Ile Ala Glu Ser Phe Glu Arg
785 790 795 800
Ile His Arg Ser Asn Leu Ile Gly Met Gly Ile Leu Pro Leu Glu Phe
805 810 815
Pro Gln Gly Val Thr Arg Lys Thr Leu Gly Leu Thr Gly Glu Glu Lys
820 825 830
Ile Asp Ile Gly Asp Leu Gln Asn Leu Gln Pro Gly Ala Thr Val Pro
835 840 845
Val Thr Leu Thr Arg Ala Asp Gly Ser Gln Glu Val Val Pro Cys Arg
850 855 860
Cys Arg Ile Asp Thr Ala Thr Glu Leu Thr Tyr Tyr Gln Asn Asp Gly
865 870 875 880
Ile Leu His Tyr Val Ile Arg Asn Met Leu Lys
885 890
<210> 5
<211> 891
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 5
Met Ser Ser Thr Leu Arg Glu Ala Ser Lys Asp Thr Leu Gln Ala Lys
1 5 10 15
Asp Lys Thr Tyr His Tyr Tyr Ser Leu Pro Leu Ala Ala Lys Ser Leu
20 25 30
Gly Asp Ile Thr Arg Leu Pro Lys Ser Leu Lys Val Leu Leu Glu Asn
35 40 45
Leu Leu Arg Trp Gln Asp Gly Asn Ser Val Thr Glu Glu Asp Ile His
50 55 60
Ala Leu Ala Gly Trp Leu Lys Asn Ala His Ala Asp Arg Glu Ile Ala
65 70 75 80
Tyr Arg Pro Ala Arg Val Leu Met Gln Asp Phe Thr Gly Val Pro Ala
85 90 95
Val Val Asp Leu Ala Ala Met Arg Glu Ala Val Lys Arg Leu Gly Gly
100 105 110
Asp Thr Ala Lys Val Asn Pro Leu Ser Pro Val Asp Leu Val Ile Asp
115 120 125
His Ser Val Thr Val Asp Arg Phe Gly Asp Asp Glu Ala Phe Glu Glu
130 135 140
Asn Val Arg Leu Glu Met Glu Arg Asn His Glu Arg Tyr Val Phe Leu
145 150 155 160
Lys Trp Gly Lys Gln Ala Phe Ser Arg Phe Ser Val Val Pro Pro Gly
165 170 175
Thr Gly Ile Cys His Gln Val Asn Leu Glu Tyr Leu Gly Lys Ala Val
180 185 190
Trp Ser Glu Leu Gln Asp Gly Glu Trp Ile Ala Tyr Pro Asp Thr Leu
195 200 205
Val Gly Thr Asp Ser His Thr Thr Met Ile Asn Gly Leu Gly Val Leu
210 215 220
Gly Trp Gly Val Gly Gly Ile Glu Ala Glu Ala Ala Met Leu Gly Gln
225 230 235 240
Pro Val Ser Met Leu Ile Pro Asp Val Val Gly Phe Lys Leu Thr Gly
245 250 255
Lys Leu Arg Glu Gly Ile Thr Ala Thr Asp Leu Val Leu Thr Val Thr
260 265 270
Gln Met Leu Arg Lys His Gly Val Val Gly Lys Phe Val Glu Phe Tyr
275 280 285
Gly Asp Gly Leu Asp Ser Leu Pro Leu Ala Asp Arg Ala Thr Ile Ala
290 295 300
Asn Met Ser Pro Glu Tyr Gly Ala Thr Cys Gly Phe Phe Pro Ile Asp
305 310 315 320
Ala Val Thr Leu Asp Tyr Met Arg Leu Ser Gly Arg Ser Glu Asp Gln
325 330 335
Val Glu Leu Val Glu Lys Tyr Ala Lys Ala Gln Gly Met Trp Arg Asn
340 345 350
Pro Gly Asp Glu Pro Ile Phe Thr Ser Thr Leu Glu Leu Asp Met Asn
355 360 365
Asp Val Glu Ala Ser Leu Ala Gly Pro Lys Arg Pro Gln Asp Arg Val
370 375 380
Ala Leu Pro Asp Val Pro Lys Ala Phe Ala Ala Ser Asn Glu Leu Glu
385 390 395 400
Val Asn Ala Thr His Lys Asp Arg Gln Pro Val Asp Tyr Val Met Asn
405 410 415
Gly His Gln Tyr Gln Leu Pro Asp Gly Ala Val Val Ile Ala Ala Ile
420 425 430
Thr Ser Cys Thr Asn Thr Ser Asn Pro Ser Val Leu Met Ala Ala Gly
435 440 445
Leu Leu Ala Lys Lys Ala Val Thr Leu Gly Leu Lys Arg Gln Pro Trp
450 455 460
Val Lys Ala Ser Leu Ala Pro Gly Ser Lys Val Val Ser Asp Tyr Leu
465 470 475 480
Ala Lys Ala Lys Leu Thr Pro Tyr Leu Asp Glu Leu Gly Phe Asn Leu
485 490 495
Val Gly Tyr Gly Cys Thr Thr Cys Ile Gly Asn Ser Gly Pro Leu Pro
500 505 510
Asp Pro Ile Glu Thr Ala Ile Lys Lys Gly Asp Leu Thr Val Gly Ala
515 520 525
Val Leu Ser Gly Asn Arg Asn Phe Glu Gly Arg Ile His Pro Leu Val
530 535 540
Lys Thr Asn Trp Leu Ala Ser Pro Pro Leu Val Val Ala Tyr Ala Leu
545 550 555 560
Ala Gly Asn Met Asn Ile Asn Leu Ala Ser Glu Pro Ile Gly His Asp
565 570 575
Arg Lys Gly Asp Pro Val Tyr Leu Lys Asp Ile Trp Pro Ser Ala Gln
580 585 590
Glu Ile Ala Arg Ala Val Glu Gln Val Ser Thr Glu Met Phe Arg Lys
595 600 605
Glu Tyr Ala Glu Val Phe Glu Gly Thr Ala Glu Trp Lys Gly Ile Asn
610 615 620
Val Thr Arg Ser Asp Thr Tyr Gly Trp Gln Glu Asp Ser Thr Tyr Ile
625 630 635 640
Arg Leu Ser Pro Phe Phe Asp Glu Met Gln Ala Thr Pro Ala Pro Val
645 650 655
Glu Asp Ile His Gly Ala Arg Ile Leu Ala Met Leu Gly Asp Ser Val
660 665 670
Thr Thr Asp His Ile Ser Pro Ala Gly Ser Ile Lys Pro Asp Ser Pro
675 680 685
Ala Gly Arg Tyr Leu Gln Gly Arg Gly Val Glu Arg Lys Asp Phe Asn
690 695 700
Ser Tyr Gly Ser Arg Arg Gly Asn His Glu Val Met Met Arg Gly Thr
705 710 715 720
Phe Ala Asn Ile Arg Ile Arg Asn Glu Met Val Pro Gly Val Glu Gly
725 730 735
Gly Met Thr Arg His Leu Pro Asp Ser Asp Val Val Ser Ile Tyr Asp
740 745 750
Ala Ala Met Arg Tyr Lys Gln Glu Gln Thr Pro Leu Ala Val Ile Ala
755 760 765
Gly Lys Glu Tyr Gly Ser Gly Ser Ser Arg Asp Trp Ala Ala Lys Gly
770 775 780
Pro Arg Leu Leu Gly Ile Arg Val Val Ile Ala Glu Ser Phe Glu Arg
785 790 795 800
Ile His Arg Ser Asn Leu Ile Gly Met Gly Ile Leu Pro Leu Glu Phe
805 810 815
Pro Gln Gly Val Thr Arg Lys Thr Leu Gly Leu Thr Gly Glu Glu Lys
820 825 830
Ile Asp Ile Gly Asp Leu Gln Asn Leu Gln Pro Gly Ala Thr Val Pro
835 840 845
Val Thr Leu Thr Arg Ala Asp Gly Ser Gln Glu Val Val Pro Cys Arg
850 855 860
Cys Arg Ile Asp Thr Ala Thr Glu Leu Thr Tyr Tyr Gln Asn Asp Gly
865 870 875 880
Ile Leu His Tyr Val Ile Arg Asn Met Leu Lys
885 890
<210> 6
<211> 891
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 6
Met Ser Ser Thr Leu Arg Glu Ala Ser Lys Asp Thr Leu Gln Ala Lys
1 5 10 15
Asp Lys Thr Tyr His Tyr Tyr Ser Leu Pro Leu Ala Ala Lys Ser Leu
20 25 30
Gly Asp Ile Thr Arg Leu Pro Lys Ser Leu Lys Val Leu Leu Glu Asn
35 40 45
Leu Leu Arg Trp Gln Asp Gly Asn Ser Val Thr Glu Glu Asp Ile His
50 55 60
Ala Leu Ala Gly Trp Leu Lys Asn Ala His Ala Asp Arg Glu Ile Ala
65 70 75 80
Tyr Arg Pro Ala Arg Val Leu Met Gln Asp Phe Thr Gly Val Pro Ala
85 90 95
Val Val Asp Leu Ala Ala Met Arg Glu Ala Val Lys Arg Leu Gly Gly
100 105 110
Asp Thr Ala Lys Val Asn Pro Leu Ser Pro Val Asp Leu Val Ile Asp
115 120 125
His Ser Val Thr Val Asp Arg Phe Gly Asp Asp Glu Ala Phe Glu Glu
130 135 140
Asn Val Arg Leu Glu Met Glu Arg Asn His Glu Arg Tyr Val Phe Leu
145 150 155 160
Lys Trp Gly Lys Gln Ala Phe Ser Arg Phe Ser Val Val Pro Pro Gly
165 170 175
Thr Gly Ile Cys His Gln Val Asn Leu Glu Tyr Leu Gly Lys Ala Val
180 185 190
Trp Ser Glu Leu Gln Asp Gly Glu Trp Ile Ala Tyr Pro Asp Thr Leu
195 200 205
Val Gly Thr Asp Ser His Thr Thr Met Ile Asn Gly Leu Gly Val Leu
210 215 220
Gly Trp Gly Val Gly Gly Ile Glu Ala Glu Ala Ala Met Leu Gly Gln
225 230 235 240
Pro Val Ser Met Leu Ile Pro Asp Val Val Gly Phe Lys Leu Thr Gly
245 250 255
Lys Leu Arg Glu Gly Ile Thr Ala Thr Asp Leu Val Leu Thr Val Thr
260 265 270
Gln Met Leu Arg Lys His Gly Val Val Gly Lys Phe Val Glu Phe Tyr
275 280 285
Gly Asp Gly Leu Asp Ser Leu Pro Leu Ala Asp Arg Ala Thr Ile Ala
290 295 300
Asn Met Ser Pro Glu Tyr Gly Ala Thr Cys Gly Phe Phe Pro Ile Asp
305 310 315 320
Ala Val Thr Leu Asp Tyr Met Arg Leu Ser Gly Arg Ser Glu Asp Gln
325 330 335
Val Glu Leu Val Glu Lys Tyr Ala Lys Ala Gln Gly Met Trp Arg Asn
340 345 350
Pro Gly Asp Glu Pro Ile Phe Thr Ser Thr Leu Glu Leu Asp Met Asn
355 360 365
Asp Val Glu Ala Ser Leu Ala Gly Pro Lys Arg Pro Gln Asp Arg Val
370 375 380
Ala Leu Pro Asp Val Pro Lys Ala Phe Ala Ala Ser Asn Glu Leu Glu
385 390 395 400
Val Asn Ala Thr His Lys Asp Arg Gln Pro Val Asp Tyr Val Met Asn
405 410 415
Gly His Gln Tyr Gln Leu Pro Asp Gly Ala Val Val Ile Ala Ala Ile
420 425 430
Thr Ser Cys Thr Asn Thr Ser Asn Pro Ser Val Leu Met Ala Ala Gly
435 440 445
Leu Leu Ala Lys Lys Ala Val Thr Leu Gly Leu Lys Arg Gln Pro Trp
450 455 460
Val Lys Ala Ser Leu Ala Pro Gly Ser Lys Val Val Ser Asp Tyr Leu
465 470 475 480
Ala Lys Ala Lys Leu Thr Pro Tyr Leu Asp Glu Leu Gly Phe Asn Leu
485 490 495
Val Gly Tyr Gly Cys Thr Thr Cys Ile Gly Asn Ser Gly Pro Leu Pro
500 505 510
Asp Pro Ile Glu Thr Ala Ile Lys Lys Ser Asp Leu Thr Val Gly Ala
515 520 525
Val Leu Ser Gly Asn Arg Asn Phe Glu Gly Arg Ile His Pro Leu Val
530 535 540
Lys Thr Asn Trp Leu Ala Ser Pro Pro Leu Val Val Ala Tyr Ala Leu
545 550 555 560
Ala Gly Asn Met Asn Ile Asn Leu Ala Ser Glu Pro Ile Gly His Asp
565 570 575
Arg Lys Gly Asp Pro Val Tyr Leu Lys Asp Ile Trp Pro Ser Ala Gln
580 585 590
Glu Ile Ala Arg Ala Val Glu Gln Val Ser Thr Glu Met Phe Arg Lys
595 600 605
Glu Tyr Ala Glu Val Phe Glu Gly Thr Ala Glu Trp Lys Gly Ile Asn
610 615 620
Val Thr Arg Ser Asp Thr Tyr Gly Trp Gln Glu Asp Ser Thr Tyr Ile
625 630 635 640
Arg Leu Ser Pro Phe Phe Asp Glu Met Gln Ala Thr Pro Ala Pro Val
645 650 655
Glu Asp Ile His Gly Ala Arg Ile Leu Ala Met Leu Gly Asp Ser Val
660 665 670
Thr Thr Asp His Ile Ser Pro Ala Gly Ser Ile Lys Pro Asp Ser Pro
675 680 685
Ala Gly Arg Tyr Leu Gln Gly Arg Gly Val Glu Arg Lys Asp Phe Asn
690 695 700
Ser Tyr Gly Ser Arg Arg Gly Asn His Glu Val Met Met Arg Gly Thr
705 710 715 720
Phe Ala Asn Ile Arg Ile Arg Asn Glu Met Val Pro Gly Val Glu Gly
725 730 735
Gly Met Thr Arg His Leu Pro Asp Ser Asp Val Val Ser Ile Tyr Asp
740 745 750
Ala Ala Met Arg Tyr Lys Gln Glu Gln Thr Pro Leu Ala Val Ile Ala
755 760 765
Gly Lys Glu Tyr Gly Ser Gly Ser Ser Arg Asp Trp Ala Ala Lys Gly
770 775 780
Pro Arg Leu Leu Gly Ile Arg Val Val Ile Ala Glu Ser Phe Glu Arg
785 790 795 800
Ile His Arg Ser Asn Leu Ile Gly Met Gly Ile Leu Pro Leu Glu Phe
805 810 815
Pro Gln Gly Val Thr Arg Lys Thr Leu Gly Leu Thr Gly Glu Glu Lys
820 825 830
Ile Asp Ile Gly Asp Leu Gln Asn Leu Gln Pro Gly Ala Thr Val Pro
835 840 845
Val Thr Leu Thr Arg Ala Asp Gly Ser Gln Glu Val Val Pro Cys Arg
850 855 860
Cys Arg Ile Asp Thr Ala Thr Glu Leu Thr Tyr Tyr Gln Asn Asp Gly
865 870 875 880
Ile Leu His Tyr Val Ile Arg Asn Met Leu Lys
885 890
<210> 7
<211> 59
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 7
tcctaggtat aatactagtt gggcctcaag cggcaaccag ttttagagct agaaatagc 59
<210> 8
<211> 36
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 8
actagtatta tacctaggac tgagctagct gtcaag 36
<210> 9
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 9
ggacatcagt atcagttacc 20
<210> 10
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 10
atgttttttg attgccgttt 20
<210> 11
<211> 40
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 11
aaacggcaat caaaaaacat gatttaaccg tcggtgcggt 40
<210> 12
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 12
ctcttttttg attgccgttt 20
<210> 13
<211> 40
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 13
aaacggcaat caaaaaagag gatttaaccg tcggtgcggt 40
<210> 14
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 14
gccttttttg attgccgttt 20
<210> 15
<211> 40
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 15
aaacggcaat caaaaaaggc gatttaaccg tcggtgcggt 40
<210> 16
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 16
gatttttttg attgccgttt 20
<210> 17
<211> 40
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 17
aaacggcaat caaaaaaatc gatttaaccg tcggtgcggt 40
<210> 18
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 18
ccattttttg attgccgttt 20
<210> 19
<211> 40
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 19
aaacggcaat caaaaaatgg gatttaaccg tcggtgcggt 40
<210> 20
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 20
accgtaggta tcggatcgtg 20
<210> 21
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 21
aaacggcaat caaaaaacat 20
<210> 22
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 22
aaacggcaat caaaaaagag 20
<210> 23
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 23
aaacggcaat caaaaaaggc 20
<210> 24
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 24
aaacggcaat caaaaaaatc 20
<210> 25
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 25
aaacggcaat caaaaaatgg 20
<210> 26
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 26
ggatcgcgtt gcactgcccg 20
<210> 27
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 27
atcccccagc attgcgagga 20
Claims (10)
1. The aconitate hydratase mutant is characterized by having an amino acid sequence shown in any one of SEQ ID NO. 1-5.
2. A nucleic acid molecule encoding the aconitate hydratase mutant of claim 1.
3. A biological material comprising the nucleic acid molecule of claim 2, wherein the biological material is an expression cassette, a vector or a host cell.
4. A recombinant microorganism expressing the aconitate hydratase mutant of claim 1.
5. The recombinant microorganism according to claim 4, wherein the recombinant microorganism does not express aconitate hydratase possessed by its starting strain.
6. The recombinant microorganism according to claim 5, wherein the gene encoding aconitate hydratase is replaced with the nucleic acid molecule according to claim 2.
7. Recombinant microorganism according to any of claims 4 to 6, characterized in that it is a bacterium of the genus Escherichia, preferably Escherichia coli (Escherichia coli).
8. The method for constructing a recombinant microorganism according to any one of claims 4 to 7, comprising: replacing a gene encoding aconitate hydratase in an original strain of the recombinant microorganism with a gene encoding the aconitate hydratase mutant of claim 1.
9. Use of the aconitate hydratase mutant of claim 1 or the nucleic acid molecule of claim 2 or the biomaterial of claim 3 or the recombinant microorganism of any of claims 4-7, as follows:
(1) Use in the construction of a production strain for the production of aspartic acid or a metabolite of aspartic acid as a synthetic precursor;
(2) Use in the fermentative production of aspartic acid or metabolites with aspartic acid as synthesis precursor;
(3) Use in increasing the yield and/or conversion of aspartic acid or a metabolite having aspartic acid as a synthetic precursor;
preferably, the metabolite with aspartic acid as a synthetic precursor is lysine.
10. A method for fermenting aspartic acid or a metabolite having aspartic acid as a synthetic precursor, comprising: culturing the recombinant microorganism according to any one of claims 4 to 7 to obtain a culture, and separating and extracting the culture to obtain aspartic acid or a metabolite taking aspartic acid as a synthesis precursor;
preferably, the metabolite with aspartic acid as a synthetic precursor is lysine.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210141296.3A CN116640752A (en) | 2022-02-16 | 2022-02-16 | Aconitate hydratase mutant and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210141296.3A CN116640752A (en) | 2022-02-16 | 2022-02-16 | Aconitate hydratase mutant and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116640752A true CN116640752A (en) | 2023-08-25 |
Family
ID=87621697
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210141296.3A Pending CN116640752A (en) | 2022-02-16 | 2022-02-16 | Aconitate hydratase mutant and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116640752A (en) |
-
2022
- 2022-02-16 CN CN202210141296.3A patent/CN116640752A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110106206B (en) | Corynebacterium glutamicum construction method for improving yield and stability of L-lysine | |
CN110373370B (en) | Catalytic system coupled with ATP regeneration system and application of catalytic system in glutathione production process | |
CN113122488B (en) | Klebsiella engineering bacteria and application thereof in producing glycerol and dihydroxyacetone | |
CN109593702B (en) | Method for synthesizing L-phenyllactic acid by whole cell transformation of genetic engineering strain | |
CN111411092A (en) | Corynebacterium glutamicum with high L-lysine yield and application thereof | |
CN113201524B (en) | Inositol-3-phosphate synthase mutant and application thereof in constructing corynebacterium glutamicum capable of producing glutamine at high yield | |
CN111411066B (en) | Double-way composite neuraminic acid-producing bacillus subtilis and construction method thereof | |
CN106164252A (en) | The improvement microorganism produced for succinic acid | |
CN109295023B (en) | Glutamate oxidase mutant, nucleic acid molecule, application and method for preparing ketoglutaric acid | |
CN111172089A (en) | Method for synthesizing trehalose by using recombinant trehalose synthase | |
CN114426983B (en) | Method for producing 5-aminolevulinic acid by knocking out transcription regulatory factor Ncgl0580 in corynebacterium glutamicum | |
CN111549050B (en) | Vitreoscilla hemoglobin expression frame suitable for bacillus and application | |
CN114835783A (en) | NCgl2747 gene mutant and application thereof in preparation of L-lysine | |
CN116640752A (en) | Aconitate hydratase mutant and application thereof | |
CN116640751A (en) | Ammonium aspartate ion lyase mutant and application thereof | |
CN114806982B (en) | Modified 1, 3-propanediol producing strain and application thereof | |
CN116694540B (en) | Escherichia coli and application thereof in threonine production | |
CN115678863A (en) | Succinate dehydrogenase mutant, recombinant microorganism, preparation method and application thereof | |
KR102602060B1 (en) | Recombinant microorganism for producing 2,3-butanediol with reduced by-product production and method for producing 2,3-butanediol using the same | |
CN110872595B (en) | Acid-resistant expression cassette and application thereof in fermentation production of organic acid | |
CN114107270B (en) | L-aspartic acid beta-decarboxylase mutant | |
CN116286513B (en) | Lactobacillus johnsonii FR-1012 and method for industrially producing gamma-aminobutyric acid by same | |
CN115612681A (en) | Protein mutant, recombinant bacterium, preparation method and application thereof | |
CN116640810A (en) | Lysine production method, mutant, recombinant microorganism and application | |
CN109762779B (en) | Application of corynebacterium gene JNy31014 in improving L-lysine yield |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |