CN113683667A - Engineering bacterium obtained by YH66-RS10865 gene transformation and application thereof in preparation of valine - Google Patents
Engineering bacterium obtained by YH66-RS10865 gene transformation and application thereof in preparation of valine Download PDFInfo
- Publication number
- CN113683667A CN113683667A CN202110980059.1A CN202110980059A CN113683667A CN 113683667 A CN113683667 A CN 113683667A CN 202110980059 A CN202110980059 A CN 202110980059A CN 113683667 A CN113683667 A CN 113683667A
- Authority
- CN
- China
- Prior art keywords
- protein
- gly
- ala
- valine
- gene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 121
- 241000894006 Bacteria Species 0.000 title claims abstract description 111
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 title claims abstract description 81
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 title claims abstract description 59
- 239000004474 valine Substances 0.000 title claims abstract description 59
- 238000002360 preparation method Methods 0.000 title abstract description 9
- 230000009466 transformation Effects 0.000 title description 5
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 46
- 230000014509 gene expression Effects 0.000 claims abstract description 29
- 238000000034 method Methods 0.000 claims abstract description 21
- 230000002401 inhibitory effect Effects 0.000 claims abstract description 20
- 239000000126 substance Substances 0.000 claims abstract description 18
- 230000001580 bacterial effect Effects 0.000 claims abstract description 15
- 230000004952 protein activity Effects 0.000 claims abstract description 5
- 125000000539 amino acid group Chemical group 0.000 claims description 16
- 238000004519 manufacturing process Methods 0.000 claims description 14
- 108020004705 Codon Proteins 0.000 claims description 12
- 102000008300 Mutant Proteins Human genes 0.000 claims description 7
- 108010021466 Mutant Proteins Proteins 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 4
- 239000013598 vector Substances 0.000 claims description 4
- 230000001105 regulatory effect Effects 0.000 claims description 2
- 238000009776 industrial production Methods 0.000 abstract description 2
- 229960004295 valine Drugs 0.000 description 52
- 239000013612 plasmid Substances 0.000 description 47
- 241000186226 Corynebacterium glutamicum Species 0.000 description 46
- 108020004414 DNA Proteins 0.000 description 45
- 239000000047 product Substances 0.000 description 32
- 230000003321 amplification Effects 0.000 description 29
- 238000003199 nucleic acid amplification method Methods 0.000 description 29
- 102000053602 DNA Human genes 0.000 description 27
- 238000000855 fermentation Methods 0.000 description 22
- 230000004151 fermentation Effects 0.000 description 22
- 239000001963 growth medium Substances 0.000 description 18
- 238000012408 PCR amplification Methods 0.000 description 16
- 229930027917 kanamycin Natural products 0.000 description 15
- 229960000318 kanamycin Drugs 0.000 description 15
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 15
- 229930182823 kanamycin A Natural products 0.000 description 15
- 239000002773 nucleotide Substances 0.000 description 15
- 125000003729 nucleotide group Chemical group 0.000 description 15
- 239000002609 medium Substances 0.000 description 11
- 238000012163 sequencing technique Methods 0.000 description 9
- 229940024606 amino acid Drugs 0.000 description 8
- 150000001413 amino acids Chemical class 0.000 description 8
- 239000000203 mixture Substances 0.000 description 7
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 108010050848 glycylleucine Proteins 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 230000002829 reductive effect Effects 0.000 description 6
- 241000186146 Brevibacterium Species 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 5
- 238000012258 culturing Methods 0.000 description 5
- 238000001962 electrophoresis Methods 0.000 description 5
- 239000008103 glucose Substances 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 4
- 241000186216 Corynebacterium Species 0.000 description 4
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 4
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 4
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 4
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 4
- SGLXGEDPYJPGIQ-ACRUOGEOSA-N His-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N SGLXGEDPYJPGIQ-ACRUOGEOSA-N 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- 229930006000 Sucrose Natural products 0.000 description 4
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 4
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 4
- 239000002518 antifoaming agent Substances 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- 108010009297 diglycyl-histidine Proteins 0.000 description 4
- 238000009472 formulation Methods 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 108010056582 methionylglutamic acid Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 239000005720 sucrose Substances 0.000 description 4
- 238000012795 verification Methods 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 150000005693 branched-chain amino acids Chemical class 0.000 description 3
- 239000007789 gas Substances 0.000 description 3
- 238000012239 gene modification Methods 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 229910052757 nitrogen Inorganic materials 0.000 description 3
- 230000036961 partial effect Effects 0.000 description 3
- 239000000843 powder Substances 0.000 description 3
- 230000001629 suppression Effects 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 2
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 2
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 2
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 2
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 2
- GFGUPLIETCNQGF-DCAQKATOSA-N Asn-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O GFGUPLIETCNQGF-DCAQKATOSA-N 0.000 description 2
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- OGTCOKZFOJIZFG-CIUDSAMLSA-N Asp-His-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OGTCOKZFOJIZFG-CIUDSAMLSA-N 0.000 description 2
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- KOWYNSKRPUWSFG-IHPCNDPISA-N Asp-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)O)N KOWYNSKRPUWSFG-IHPCNDPISA-N 0.000 description 2
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- OZBXOELNJBSJOA-UBHSHLNASA-N Asp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OZBXOELNJBSJOA-UBHSHLNASA-N 0.000 description 2
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 2
- 241001517047 Corynebacterium acetoacidophilum Species 0.000 description 2
- 241000186145 Corynebacterium ammoniagenes Species 0.000 description 2
- 241000572303 Corynebacterium pekinense Species 0.000 description 2
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 2
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 2
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 2
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 2
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 2
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 2
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 2
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 2
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 2
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 2
- VJVAQZYGLMJPTK-QEJZJMRPSA-N Glu-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VJVAQZYGLMJPTK-QEJZJMRPSA-N 0.000 description 2
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 2
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 2
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 2
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 2
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 2
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 2
- LBHOVGUGOBINDL-KKUMJFAQSA-N His-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O LBHOVGUGOBINDL-KKUMJFAQSA-N 0.000 description 2
- QAMFAYSMNZBNCA-UWVGGRQHSA-N His-Gly-Met Chemical compound CSCC[C@H](NC(=O)CNC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O QAMFAYSMNZBNCA-UWVGGRQHSA-N 0.000 description 2
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 2
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 2
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 2
- CMPHFUWXKBPNRS-WDSOQIARSA-N His-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 CMPHFUWXKBPNRS-WDSOQIARSA-N 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- 108700039609 IRW peptide Proteins 0.000 description 2
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 2
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 2
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- RVNOXPZHMUWCLW-GMOBBJLQSA-N Ile-Met-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVNOXPZHMUWCLW-GMOBBJLQSA-N 0.000 description 2
- SNHYFFQZRFIRHO-CYDGBPFRSA-N Ile-Met-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N SNHYFFQZRFIRHO-CYDGBPFRSA-N 0.000 description 2
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 2
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 2
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 2
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 2
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 2
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 2
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 2
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 2
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 2
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 2
- NQOQDINRVQCAKD-ULQDDVLXSA-N Lys-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N NQOQDINRVQCAKD-ULQDDVLXSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 2
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 2
- IECZNARPMKQGJC-XIRDDKMYSA-N Met-Gln-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N IECZNARPMKQGJC-XIRDDKMYSA-N 0.000 description 2
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 2
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 2
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 2
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 2
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 2
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 2
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 2
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 2
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 2
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 2
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 2
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 2
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 2
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 2
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 2
- QAAYIXYLEMRULP-SRVKXCTJSA-N Pro-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 QAAYIXYLEMRULP-SRVKXCTJSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 2
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 2
- HEYZPTCCEIWHRO-IHRRRGAJSA-N Ser-Met-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HEYZPTCCEIWHRO-IHRRRGAJSA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 2
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 2
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 2
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 2
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 2
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 2
- GQHAIUPYZPTADF-FDARSICLSA-N Trp-Ile-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 GQHAIUPYZPTADF-FDARSICLSA-N 0.000 description 2
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 2
- NESIQDDPEFTWAH-BPUTZDHNSA-N Trp-Met-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O NESIQDDPEFTWAH-BPUTZDHNSA-N 0.000 description 2
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 2
- XKTWZYNTLXITCY-QRTARXTBSA-N Trp-Val-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 XKTWZYNTLXITCY-QRTARXTBSA-N 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 2
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 2
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 2
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 2
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 2
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 2
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 2
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 2
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 2
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 2
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 235000011114 ammonium hydroxide Nutrition 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- -1 aromatic amino acid Chemical class 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 230000008021 deposition Effects 0.000 description 2
- 235000005911 diet Nutrition 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 230000003828 downregulation Effects 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000006260 foam Substances 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 238000010362 genome editing Methods 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 2
- 238000009629 microbiological culture Methods 0.000 description 2
- 150000007523 nucleic acids Chemical group 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000001502 supplementing effect Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 235000001674 Agaricus brunnescens Nutrition 0.000 description 1
- 208000007848 Alcoholism Diseases 0.000 description 1
- 244000105624 Arachis hypogaea Species 0.000 description 1
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 1
- 206010003591 Ataxia Diseases 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 206010016654 Fibrosis Diseases 0.000 description 1
- OIIIRRTWYLCQNW-ACZMJKKPSA-N Gln-Cys-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O OIIIRRTWYLCQNW-ACZMJKKPSA-N 0.000 description 1
- 206010019663 Hepatic failure Diseases 0.000 description 1
- 206010060378 Hyperinsulinaemia Diseases 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- 125000003580 L-valyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(C([H])([H])[H])(C([H])([H])[H])[H] 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 208000010428 Muscle Weakness Diseases 0.000 description 1
- 206010028372 Muscular weakness Diseases 0.000 description 1
- KWYHDKDOAIKMQN-UHFFFAOYSA-N N,N,N',N'-tetramethylethylenediamine Chemical compound CN(C)CCN(C)C KWYHDKDOAIKMQN-UHFFFAOYSA-N 0.000 description 1
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 1
- 108010073771 Soybean Proteins Proteins 0.000 description 1
- 206010044565 Tremor Diseases 0.000 description 1
- 229930003451 Vitamin B1 Natural products 0.000 description 1
- 241000319304 [Brevibacterium] flavum Species 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 101150063416 add gene Proteins 0.000 description 1
- 201000007930 alcohol dependence Diseases 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 235000015278 beef Nutrition 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000005013 brain tissue Anatomy 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 210000003169 central nervous system Anatomy 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 230000007882 cirrhosis Effects 0.000 description 1
- 208000019425 cirrhosis of liver Diseases 0.000 description 1
- 235000009508 confectionery Nutrition 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 235000013365 dairy product Nutrition 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000007850 degeneration Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 230000000378 dietary effect Effects 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- 235000003891 ferrous sulphate Nutrition 0.000 description 1
- 239000011790 ferrous sulphate Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000003208 gene overexpression Methods 0.000 description 1
- 230000002394 glycogenic effect Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 230000003451 hyperinsulinaemic effect Effects 0.000 description 1
- 201000008980 hyperinsulinism Diseases 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- BAUYGSIQEAFULO-UHFFFAOYSA-L iron(2+) sulfate (anhydrous) Chemical compound [Fe+2].[O-]S([O-])(=O)=O BAUYGSIQEAFULO-UHFFFAOYSA-L 0.000 description 1
- 229910000359 iron(II) sulfate Inorganic materials 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 208000007903 liver failure Diseases 0.000 description 1
- 231100000835 liver failure Toxicity 0.000 description 1
- 230000003908 liver function Effects 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 229940099596 manganese sulfate Drugs 0.000 description 1
- 235000007079 manganese sulphate Nutrition 0.000 description 1
- 239000011702 manganese sulphate Substances 0.000 description 1
- SQQMAOCOWKFBNP-UHFFFAOYSA-L manganese(II) sulfate Chemical compound [Mn+2].[O-]S([O-])(=O)=O SQQMAOCOWKFBNP-UHFFFAOYSA-L 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 235000020232 peanut Nutrition 0.000 description 1
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 1
- 230000037081 physical activity Effects 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 238000012113 quantitative test Methods 0.000 description 1
- 210000000463 red nucleus Anatomy 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 229940001941 soy protein Drugs 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 229960003495 thiamine Drugs 0.000 description 1
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 235000010374 vitamin B1 Nutrition 0.000 description 1
- 239000011691 vitamin B1 Substances 0.000 description 1
- 230000029663 wound healing Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/34—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Corynebacterium (G)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/77—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Corynebacterium; for Brevibacterium
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
- C12P13/08—Lysine; Diaminopimelic acid; Threonine; Valine
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention discloses an engineering bacterium obtained by modifying YH66-RS10865 gene and application thereof in preparation of valine. The invention provides an application of a substance for inhibiting YH66-RS10865 gene expression or a substance for reducing YH66-RS10865 protein abundance or a substance for reducing YH66-RS10865 protein activity in improving the yield of bacterial valine. The invention provides a recombinant bacterium, which is obtained by inhibiting YH66-RS10865 gene expression in bacteria. The invention also protects the preparationValine method: and fermenting the recombinant strain. The invention finds that the YH66-RS10865 protein negatively regulates the valine yield of bacteria, inhibits the expression of the YH66-RS10865 gene to improve the valine yield, and overexpresses the YH66-RS10865 gene to reduce the valine yield. Further, YH66-RS10865 was found in the present inventionA844C,C846AProtein and its coding gene and application. The invention has great application value for industrial production of valine.
Description
Technical Field
The invention belongs to the technical field of biology, and relates to an engineering bacterium obtained by YH66-RS10865 gene modification and application thereof in valine preparation, wherein the modification specifically comprises A844C and C846A.
Background
Valine, which is one of 20 amino acids constituting proteins, is 8 amino acids and glycogenic amino acid essential to the human body, and works together with other two high-concentration amino acids (isoleucine and leucine) to promote the normal growth of the body, repair tissues, regulate blood glucose, and supply required energy. Valine can provide additional energy to the muscle to produce glucose when engaged in strenuous physical activity to prevent muscle weakness. Valine also helps to clear excess nitrogen (a potential toxin) from the liver and to transport the body's required nitrogen to various sites.
Valine is an essential amino acid, which means that the body cannot produce itself and must be supplemented by dietary sources. Its natural food sources include grains, dairy products, mushrooms, peanuts, soy protein, and meats. Although most people can obtain sufficient quantities from their diets, cases of valine deficiency are also rare. When valine is insufficient, the central nervous system of the brain is disturbed, and limb tremor occurs due to ataxia. When the brain tissue is dissected and sliced, the degeneration phenomenon of the red nucleus cells is found, hyperinsulinemia is easy to form due to the liver function damage of a patient with late cirrhosis, so that the branched chain amino acid in the blood is reduced, the ratio of the branched chain amino acid to the aromatic amino acid is reduced from 3.0-3.5 of a normal person to 1.0-1.5, and therefore, the injection of the branched chain amino acid such as valine is commonly used for treating liver failure and the damage to the organs caused by alcoholism and drug absorption. In addition, valine can also be used as a therapeutic agent for accelerating wound healing. L-valine, also known as 2-amino-3-methylbutyric acid, CAS number 72-18-4, MDL number MFCD00064220, EINECS number 200-773-6. The current method for preparing L-valine is mainly a chemical synthesis method. Limitations of chemical synthesis: the production cost is high, the reaction is complex, the steps are multiple, and a plurality of byproducts exist.
Disclosure of Invention
The invention aims to provide an engineering bacterium obtained by YH66-RS10865 gene modification and application thereof in valine preparation.
The invention provides application of a substance for inhibiting YH66-RS10865 gene expression or a substance for reducing YH66-RS10865 protein abundance or a substance for reducing YH66-RS10865 protein activity;
the application is as follows (I) or (II):
the application of (I) in improving the yield of the bacterial valine;
(II) the application in the production of valine.
The YH66-RS10865 gene is a gene encoding YH66-RS10865 protein;
the YH66-RS10865 protein is (a1) or (a2) or (a 3):
(a1) protein shown in a sequence 3 in a sequence table;
(a2) a protein derived from a bacterium, having 95% or more identity to (a1), and being associated with valine production by the bacterium;
(a3) and (b) a protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues to the protein shown in (a1) and is related to the production of valine by bacteria and derived from (a 1).
The term "identity" as used herein refers to sequence similarity to a native amino acid sequence. Identity can be assessed visually or by computer software. Using computer software, the identity between two or more sequences can be expressed in percent (%), which can be used to assess the identity between related sequences.
The identity of 95% or more may be 96% or more, 97% or more, 98% or more, or 99% or more.
Specifically, the YH66-RS10865 gene is (b1) or (b2) or (b 3):
(b1) the coding region is a DNA molecule shown as a sequence 4 in the sequence table;
(b2) a DNA molecule derived from a bacterium and having 95% or more identity to (b1) and encoding said protein;
(b3) a DNA molecule that hybridizes under stringent conditions to (b1) and encodes said protein.
The term "identity" as used herein refers to sequence similarity to a native nucleic acid sequence. Identity can be assessed visually or by computer software. Using computer software, the identity between two or more sequences can be expressed in percent (%), which can be used to assess the identity between related sequences.
The identity of 95% or more may be 96% or more, 97% or more, 98% or more, or 99% or more.
The stringent conditions may be hybridization and membrane washing at 65 ℃ in a solution of 0.1 XSSPE (or 0.1 XSSC), 0.1% SDS.
The suppression of the YH66-RS10865 gene expression can be realized by knocking out the YH66-RS10865 gene or mutating the YH66-RS10865 gene.
The knockout may be a partial segment of the knockout gene or may be the entire coding frame of the knockout gene.
Illustratively, the substance for inhibiting YH66-RS10865 gene expression may specifically be a DNA molecule represented by sequence 5 of the sequence table or a recombinant plasmid having a DNA molecule represented by sequence 5 of the sequence table.
Illustratively, the substance for inhibiting YH66-RS10865 gene expression may specifically be a DNA molecule represented by sequence 8 of the sequence table or a recombinant plasmid having a DNA molecule represented by sequence 8 of the sequence table.
The substance for inhibiting YH66-RS10865 gene expression can be exemplified by recombinant plasmid pK18-YH66-RS10865A844C,C846AOr recombinantPlasmid pK18- Δ YH66-RS 10865.
The invention also provides a recombinant bacterium obtained by inhibiting YH66-RS10865 gene expression in bacteria.
The YH66-RS10865 gene expression in the bacteria can be knocked out from YH66-RS10865 gene in the bacteria, and can also be YH66-RS10865 gene in mutant bacteria.
The knockout may be a partial segment of the knockout gene or may be the entire coding frame of the knockout gene.
Illustratively, the YH66-RS10865 gene in the knockout bacterium can be specifically: and (3) deleting the DNA molecule shown in the sequence 4 in the sequence table in the bacterial genome DNA.
For YH66-RS10865 gene in mutant bacteria, a person of ordinary skill in the art can easily adopt known methods such as directed mutation or gene editing, etc.
Illustratively, the YH66-RS10865 gene in the mutant bacterium specifically can be: the codon of amino acid 282 of YH66-RS10865 protein encoded in bacterial genomic DNA was mutated from the codon encoding N to the codon encoding other amino acid residues. In particular, the other amino acid residue is Q.
Illustratively, the YH66-RS10865 gene in the mutant bacterium specifically can be: the YH66-RS10865 gene in the bacterial genomic DNA was subjected to the following two point mutations: the 844 th nucleotide is mutated from A to other nucleotides (specifically C), and the 846 th nucleotide is mutated from C to other nucleotides (specifically A).
Illustratively, the manner of inhibiting YH66-RS10865 gene expression in bacteria may be: a substance for inhibiting YH66-RS10865 gene expression was introduced into the bacterium.
Illustratively, the substance for inhibiting YH66-RS10865 gene expression may specifically be a DNA molecule represented by sequence 5 of the sequence table or a recombinant plasmid having a DNA molecule represented by sequence 5 of the sequence table.
Illustratively, the substance for inhibiting YH66-RS10865 gene expression may specifically be a DNA molecule represented by sequence 8 of the sequence table or a recombinant plasmid having a DNA molecule represented by sequence 8 of the sequence table.
The substance for inhibiting YH66-RS10865 gene expression can be exemplified by recombinant plasmid pK18-YH66-RS10865A844C,C846AOr recombinant plasmid pK 18-delta YH66-RS 10865.
The invention also protects the application of the recombinant bacterium in the preparation of valine.
The invention also provides a method for preparing valine, which comprises the following steps: and fermenting the recombinant strain.
The fermentation can be carried out by a person skilled in the art using fermentation methods known in the art. Optimization and modification of the fermentation process can also be carried out by routine experimentation. Fermentation of the bacteria may be carried out in a suitable medium under fermentation conditions known in the art. The culture medium may comprise: carbon sources, nitrogen sources, trace elements, and combinations thereof. In the culture, the pH of the culture may be adjusted. Further, prevention of bubble generation, for example, by using an antifoaming agent, may be included in the culture. In addition, the culturing may include injecting a gas into the culture. The gas may include any gas capable of maintaining aerobic conditions of the culture. In the culture, the temperature of the culture may be 20 to 45 ℃.
The method may further comprise the steps of: valine was obtained from the culture. Obtaining valine from the culture can be accomplished in a variety of ways, including but not limited to: the culture is treated with sulfuric acid or hydrochloric acid or the like, followed by a combination of methods such as anion exchange chromatography, concentration, crystallization, and isoelectric precipitation.
In the fermentation, an exemplary fermentation medium formulation is shown in table 3, with the balance being water.
An exemplary fermentation control process for the fermentation is shown in table 4.
In the fermentation, the OD value of the system can be 0.3-0.5 at the initial moment of completing the inoculation.
Illustratively, the fermentation process of the fermentation is: ammonia water is used for adjusting the pH value; when foam exists in the fermentation system, adding a proper amount of antifoaming agent anifoam (CB-442); the sugar content (residual sugar) of the system is controlled by supplementing 70% glucose aqueous solution.
The invention also provides a method for improving the valine yield of bacteria, which comprises the following steps: inhibit YH66-RS10865 gene expression or reduce YH66-RS10865 protein abundance or reduce YH66-RS10865 protein activity in bacteria.
The YH66-RS10865 gene expression in the bacteria can be knocked out from YH66-RS10865 gene in the bacteria, and can also be YH66-RS10865 gene in mutant bacteria.
The knockout may be a partial segment of the knockout gene or may be the entire coding frame of the knockout gene.
Illustratively, the YH66-RS10865 gene in the knockout bacterium can be specifically: and (3) deleting the DNA molecule shown in the sequence 4 in the sequence table in the bacterial genome DNA.
For YH66-RS10865 gene in mutant bacteria, a person of ordinary skill in the art can easily adopt known methods such as directed mutation or gene editing, etc.
Illustratively, the YH66-RS10865 gene in the mutant bacterium specifically can be: the codon of amino acid 282 of YH66-RS10865 protein encoded in bacterial genomic DNA was mutated from the codon encoding N to the codon encoding other amino acid residues. In particular, the other amino acid residue is Q.
Illustratively, the YH66-RS10865 gene in the mutant bacterium specifically can be: the YH66-RS10865 gene in the bacterial genomic DNA was subjected to the following two point mutations: the 844 th nucleotide is mutated from A to other nucleotides (specifically C), and the 846 th nucleotide is mutated from C to other nucleotides (specifically A).
Illustratively, the manner of inhibiting YH66-RS10865 gene expression in bacteria may be: a substance for inhibiting YH66-RS10865 gene expression was introduced into the bacterium.
Illustratively, the substance for inhibiting YH66-RS10865 gene expression may specifically be a DNA molecule represented by sequence 5 of the sequence table or a recombinant plasmid having a DNA molecule represented by sequence 5 of the sequence table.
Illustratively, the substance for inhibiting YH66-RS10865 gene expression may specifically be a DNA molecule represented by sequence 8 of the sequence table or a recombinant plasmid having a DNA molecule represented by sequence 8 of the sequence table.
The substance for inhibiting YH66-RS10865 gene expression can be exemplified by recombinant plasmid pK18-YH66-RS10865A844C,C846AOr recombinant plasmid pK 18-delta YH66-RS 10865.
The invention also protects the application of YH66-RS10865 protein in regulating the valine yield of bacteria.
The regulation is negative regulation, namely YH66-RS10865 protein content is increased, and valine yield is reduced.
The regulation is negative regulation, namely YH66-RS10865 protein content is reduced, and valine yield is increased.
The invention also discloses a mutant protein named YH66-RS10865A844C,C846AThe protein is obtained by mutating YH66-RS10865 protein 282 amino acid residue from N to other amino acid residue.
In particular, the other amino acid residue is Q.
Illustratively, the mutant protein is shown as a sequence 1 in a sequence table.
The invention also protects YH66-RS10865A844C,C846AThe coding gene of the protein is named YH66-RS10865A844C ,C846AA gene.
Specifically, YH66-RS10865A844C,C846AThe genes are (c1) or (c2) or (c3) as follows:
(c1) the coding region is a DNA molecule shown as a sequence 2 in a sequence table;
(c2) a DNA molecule derived from a bacterium and having 95% or more identity to (c1) and encoding said protein;
(c3) a DNA molecule that hybridizes under stringent conditions to (c1) and encodes said protein.
The invention also protects the product with YH66-RS10865A844C,C846AExpression cassette for Gene or Gene having YH66-RS10865A844C ,C846ARecombinant vector of gene or gene with YH66-RS10865A844C,C846ARecombinant bacteria of genes.
The term "identity" as used herein refers to sequence similarity to a native nucleic acid sequence. Identity can be assessed visually or by computer software. Using computer software, the identity between two or more sequences can be expressed in percent (%), which can be used to assess the identity between related sequences.
The identity of 95% or more may be 96% or more, 97% or more, 98% or more, or 99% or more.
The stringent conditions may be hybridization and membrane washing at 65 ℃ in a solution of 0.1 XSSPE (or 0.1 XSSC), 0.1% SDS.
The invention also protects YH66-RS10865A844C,C846AProtein YH66-RS10865A844C,C846AGene, having YH66-RS10865A844C,C846AExpression cassette for Gene or Gene having YH66-RS10865A844C,C846ARecombinant vector of gene or gene with YH66-RS10865A844C,C846AApplication of recombinant bacteria of the gene in preparation of valine.
The invention also provides a method for improving the valine yield of bacteria, which comprises the following steps: the codon of amino acid 282 of YH66-RS10865 protein encoded in bacterial genomic DNA was mutated from the codon encoding N to the codon encoding other amino acid residues. In particular, the other amino acid residue is Q.
The method specifically comprises the following steps: the YH66-RS10865 gene in the bacterial genomic DNA was subjected to the following two point mutations: the 844 th nucleotide is mutated from A to other nucleotides (specifically C), and the 846 th nucleotide is mutated from C to other nucleotides (specifically A).
The method specifically comprises the following steps: introducing a DNA molecule shown in a sequence 5 of a sequence table or a recombinant plasmid having the DNA molecule shown in the sequence 5 of the sequence table into bacteria.
Any of the above bacteria include, but are not limited to, the following: corynebacterium bacteria, preferably Corynebacterium acetoacidophilum (Corynebacterium acetoacidophilum), Corynebacterium acetoglutamicum (Corynebacterium acetoglutamicum), Corynebacterium melallum (Corynebacterium glutamicum), Brevibacterium flavum (Brevibacterium lactofermentum), Corynebacterium ammoniagenes (Corynebacterium ammoniagenes), Corynebacterium pekinense (Corynebacterium pekinense), Brevibacterium saccharolyticum (Brevibacterium saccharolyticum), Brevibacterium roseum (Brevibacterium roseum), Brevibacterium thiolyticum (Brevibacterium thiogenitum), Brevibacterium thiogenitalis (Brevibacterium thiogeniticum), and Brevibacterium thiogenitalis (Brevibacterium thiogenitalis).
Any of the above-mentioned bacteria is a bacterium having an ability to produce valine.
"bacterium having an ability to produce valine" means that the bacterium has the following ability: the ability to produce and accumulate valine in the culture medium and/or in the cells of the bacterium. Thus, valine can be collected when the bacterium is cultured in a medium.
The bacteria may be naturally collected wild-type bacteria or modified bacteria.
"modified bacterium" refers to an engineered bacterium obtained by artificially mutating and/or mutagenizing a naturally collected wild-type bacterium.
Specifically, the corynebacterium glutamicum can be corynebacterium glutamicum CGMCC 21260.
Corynebacterium glutamicum YPFV1, which has been deposited in China general microbiological culture Collection center (CGMCC, address: No. 3, institute of microbiology, China academy of sciences, North West Lu 1, Kyoho, Beijing, Inc.) at 11 months and 30 days of 2020, with the deposition registration number of CGMCC No. 21260. Corynebacterium glutamicum YPFV1 (also called Corynebacterium glutamicum CGMCC 21260).
Any valine mentioned above is meant to be taken in the broad sense of valine including valine in free form, a salt of valine or a mixture of both.
Specifically, the valine is L-valine.
Any of the above methods or applications may also be used for the production of downstream products of valine.
YH66-RS10865 protein in Corynebacterium glutamicum is shown as sequence 3 in a sequence table, and coding genes thereof are shown as sequence 4 in the sequence table. In the invention, YH66-RS1 shown in sequence 1 of a sequence table is obtained by introducing point mutation0865A844C,C846AProtein, YH66-RS10865A844C,C846AThe coding gene of the protein is shown as a sequence 2 in a sequence table. Compared with YH66-RS10865 gene, YH66-RS10865A844C,C846AThe difference of the genes is that the 844 th nucleotide is mutated from A to C and the 846 th nucleotide is mutated from C to A. Compared with YH66-RS10865 protein, YH66-RS10865A844C,C846AThe difference between the proteins is that the 282 th amino acid residue is mutated from N to Q.
The invention finds that YH66-RS10865 protein has negative control on the valine yield of bacteria, namely YH66-RS10865 protein content is increased, valine yield is reduced, YH66-RS10865 protein content is reduced, and valine yield is increased. The suppression of the YH66-RS10865 gene expression can improve the valine yield, and the overexpression of the YH66-RS10865 gene can reduce the valine yield. Further, YH66-RS10865 was found in the present inventionA844C,C846AProtein and its coding gene and application. The invention has great application value for industrial production of valine.
Detailed Description
The present invention is described in further detail below with reference to specific embodiments, which are given for the purpose of illustration only and are not intended to limit the scope of the invention. The examples provided below serve as a guide for further modifications by a person skilled in the art and do not constitute a limitation of the invention in any way.
The experimental procedures in the following examples, unless otherwise indicated, are conventional and are carried out according to the techniques or conditions described in the literature in the field or according to the instructions of the products. Materials, reagents and the like used in the following examples are commercially available unless otherwise specified. pK18mobsacB plasmid: addgene, Inc.; the plasmid pK18mobsacB has the kanamycin resistance gene as a selection marker. plasmid pXMJ 19: biovector plasmid vector strain cell gene collection center; the plasmid pXMJ19 has a chloramphenicol resistance gene as a selection marker. NEBuilder enzyme: NEB corporation. Unless otherwise specified, the medium in the examples was a medium having the formulation shown in Table 1 (balance water, pH 7.0). The kanamycin-free medium was the medium shown in Table 1. The kanamycin-containing medium consisted of the medium shown in Table 1 and kanamycin at a content of 50. mu.g/ml. Unless otherwise specified, the culture in the examples refers to a static culture at 32 ℃. Single-stranded conformational polymorphic polyacrylamide gel electrophoresis (sscp-PAGE) in the examples: the concentration of the gel used was 8%, and the composition of the electrophoretic gel is shown in table 2; the electrophoresis conditions are as follows: 1 XTBE buffer, 120V voltage, electrophoresis time 10 h.
Unless otherwise stated, the quantitative tests in the following examples were performed in triplicate, and the results were averaged.
TABLE 1
Components | Concentration in the culture Medium |
Sucrose | 10g/L |
Polypeptone | 10g/L |
Beef extract | 10g/L |
Yeast powder | 5g/L |
Urea | 2g/L |
Sodium chloride | 2.5g/L |
Agar powder | 20g/L |
TABLE 2
Components | Amount of addition |
40% acrylamide | 8mL |
ddH2O | 26mL |
Glycerol | 4mL |
10×TBE | 2mL |
TEMED | 40μL |
10%AP | 600μL |
Example 1 obtaining of Corynebacterium glutamicum CGMCC21260
Corynebacterium glutamicum ATCC 15168: corynebacterium glutamicum (Corynebacterium glutamicum) No. 15168 in ATCC.
Corynebacterium glutamicum ATCC15168 was subjected to mutagenesis to obtain Corynebacterium glutamicum (Corynebacterium glutamicum) YPFV 1.
Corynebacterium glutamicum YPFV1, which has been deposited in China general microbiological culture Collection center (CGMCC, address: No. 3, institute of microbiology, China academy of sciences, North West Lu 1, Kyoho, Beijing, Inc.) at 11 months and 30 days of 2020, with the deposition registration number of CGMCC No. 21260. Corynebacterium glutamicum YPFV1 (also called Corynebacterium glutamicum CGMCC 21260).
Example 2 construction of recombinant bacterium YPV-031
P1:5'-CAGTGCCAAGCTTGCATGCCTGCAGGTCGACTCTAGGCTTCGTTTGCTTGAGCGTG-3';
P2:5'-CGCTGCAGGTTGCATTGAACCACGAAGGTCAGGT-3';
P3:5'-ACCTGACCTTCGTGGTTCAATGCAACCTGCAGCG-3';
P4:5'-CAGCTATGACCATGATTACGAATTCGAGCTCGGTACCCTAGCCACCGAGCGCTGCGCG-3'。
P5:5'-ACGTTTCAACCGCTACCTCG-3';
P6:5'-ATCCCACTCGCGACCCCAAAC-3'。
Construction of recombinant plasmid
1. The Corynebacterium glutamicum ATCC15168 was used as a template, and a primer pair consisting of primer P1 and primer P2 was used for PCR amplification to recover an amplification product (720 bp).
2. The Corynebacterium glutamicum ATCC15168 was used as a template, and a primer pair consisting of primer P3 and primer P4 was used for PCR amplification to recover an amplification product (678 bp).
3. Meanwhile, the amplification product recovered in the step 1 and the amplification product recovered in the step 2 are used as templates, and PCR amplification (Overlap PCR) is carried out by adopting a primer pair consisting of a primer P1 and a primer P4, and the amplification product (1364bp) is recovered. And after sequencing, the amplification product is shown as a sequence 5 in the sequence table.
4. The pK18mobsacB plasmid was digested with restriction enzyme Xba I, and the linearized plasmid was recovered.
5. Co-incubating the amplification product recovered in step 3 with the linearized plasmid recovered in step 4 (with NEBuilder enzyme, incubation at 50 ℃ for 30min) to obtain recombinant plasmid pK18-YH66-RS10865A844C,C846A. Measured bySequence verification, recombinant plasmid pK18-YH66-RS10865A844C,C846AHas a DNA molecule shown in a sequence 5 of a sequence table.
Secondly, constructing recombinant bacteria YPV-031
1. Adopts a recombinant plasmid pK18-YH66-RS10865A844C,C846ACorynebacterium glutamicum CGMCC21260 was transformed by electric shock and then cultured.
2. The strain in step 1 is picked up and cultured by using a culture medium containing 15% of sucrose, then a single colony is picked up and cultured by using a culture medium containing kanamycin and a culture medium not containing kanamycin respectively, and strains which cannot grow on the culture medium containing kanamycin and can grow on the culture medium not containing kanamycin are screened.
3. And (3) taking the strain screened in the step (2), carrying out PCR amplification by adopting a primer pair consisting of a primer P5 and a primer P6, and then recovering an amplification product (283 bp).
4. Taking the amplification product in the step 3, firstly, denaturing at 95 ℃ for 10min, then, carrying out ice bath for 5min, and then, carrying out sscp-PAGE. During electrophoresis, the recombinant plasmid pK18-YH66-RS10865 is adoptedA844C,C846AThe amplified fragment of (1) (i.e., the recombinant plasmid pK18-YH66-RS 10865)A844C,C846AThe primer pair consisting of the primer P5 and the primer P6 is used as a template for PCR amplification, the amplification product) is used as a positive control, the amplification fragment of the Corynebacterium glutamicum CGMCC21260 (namely, the amplification product of the Corynebacterium glutamicum CGMCC21260 is used as a template and the primer pair consisting of the primer P5 and the primer P6 is used for PCR amplification) is used as a negative control, and water is used as a blank control. Due to different fragment structures and different electrophoresis positions, the strains with the electrophoresis positions inconsistent with the negative control and consistent with the positive control are the target strains for screening (recombinant strains with successful allelic replacement).
5. And (4) according to the result of the step (4), performing sequencing verification on the amplified product of the step (3) of the screened strain to obtain the recombinant bacterium YPV-031. Compared with Corynebacterium glutamicum CGMCC21260, the recombinant bacterium YPV-031 has the difference that: the YH66-RS10865 gene shown in sequence 4 of the sequence table in the Corynebacterium glutamicum CGMCC21260 genome is replaced by YH66-RS10865 gene shown in sequence 2 of the sequence tableA844C,C846AA gene. Sequence 2 and sequence 4 present only two nucleiA nucleotide difference between 844 th and 846 th. The recombinant strain YPV-031 is an engineering strain obtained by mutating (mutating at two sites) YH66-RS10865 gene in Corynebacterium glutamicum CGMCC 21260.
Example 2 construction of recombinant bacterium YPV-032 and recombinant bacterium YPV-033
P7:5'-CAGTGCCAAGCTTGCATGCCTGCAGGTCGACTCTAGCATGACGGCTGACTGGACTC-3';
P8:5'-ACTGGGATTTCAGGTATCCAAATCGGACTCCTTAAATGGG-3';
P9:5'-CCCATTTAAGGAGTCCGATTTGGATACCTGAAATCCCAGT-3';
P10:5'-CTATGTGAGTAGTCGATTTATTATTCCTCAGGAGCGTTTG-3';
P11:5'-CAAACGCTCCTGAGGAATAATAAATCGACTACTCACATAG-3';
P12:5'-CAGCTATGACCATGATTACGAATTCGAGCTCGGTACCCTGCATAAGAAACAACCACTT-3'。
P13:5'-GTCCGCTCTGTTGGTGTTCA-3';
P14:5'-TGTTACCGCGGACAGGTCCG-3'。
P15:5'-GCAAGCAGCAGATCGCTACC-3';
P16:5'-TGGAGGAATATTCGGCCCAG-3'。
Firstly, constructing recombinant bacteria YPV-033
1. The recombinant bacterium YPV-031 is used as a template, a primer pair consisting of a primer P7 and a primer P8 is adopted for PCR amplification, and an amplification product (806bp) is recovered.
2. The recombinant bacterium YPV-031 is used as a template, a primer pair consisting of a primer P9 and a primer P10 is adopted for PCR amplification, and an amplification product (3014bp) is recovered.
3. The recombinant bacterium YPV-031 is used as a template, a primer pair consisting of a primer P11 and a primer P12 is adopted for PCR amplification, and an amplification product (783bp) is recovered.
4. The pK18mobsacB plasmid was digested with restriction enzyme Xba I, and the linearized plasmid was recovered.
5. And (3) co-incubating the amplification product recovered in the step (1), the amplification product recovered in the step (2), the amplification product recovered in the step (3) and the linearized plasmid recovered in the step (4) (incubating for 30min at 50 ℃ by using NEBuilder enzyme) to obtain a recombinant plasmid 033. Through sequencing verification, the recombinant plasmid 033 has a DNA molecule shown as a sequence 6 in a sequence table.
6. The corynebacterium glutamicum CGMCC21260 is subjected to electric shock transformation by adopting the recombinant plasmid 033, then cultured, and then each single colony is subjected to PCR identification (by adopting a primer pair consisting of a primer P13 and a primer P14), so that the strain capable of amplifying a 1921bp band is a positive strain.
7. And (3) selecting the positive strain in the step 6, culturing the positive strain by using a culture medium containing 15% of sucrose, then selecting a single colony, culturing the single colony by using a culture medium containing kanamycin and a culture medium not containing kanamycin respectively, and screening the strain which can not grow on the culture medium containing kanamycin and can grow on the culture medium not containing kanamycin.
8. Taking the strain screened in the step 7, carrying out PCR amplification by adopting a primer pair consisting of a primer P15 and a primer P16, and obtaining the strain YH66-RS10865 with 2013bp bandsA844C,C846AThe positive strain with gene integrated into the genome of Corynebacterium glutamicum CGMCC21260 was named recombinant strain YPV-033. Recombinant bacterium YPV-033 is overexpressed YH66-RS10865 on genomeA844C,C846AEngineering strain of gene.
Secondly, constructing recombinant bacteria YPV-032
Replacing the templates with recombinant bacteria YPV-031 to Corynebacterium glutamicum ATCC15168, and carrying out the same steps.
The positive strain integrating the YH66-RS10865 gene into the Corynebacterium glutamicum CGMCC21260 genome is obtained and named recombinant strain YPV-032. The recombinant strain YPV-032 is an engineering strain with YH66-RS10865 gene overexpression on genome. Compared with the recombinant bacterium YPV-033, the recombinant bacterium YPV-032 has the following differences: the sequence 4 replaces the sequence 2 in the sequence of the foreign DNA integrated into the genome of Corynebacterium glutamicum CGMCC 21260.
Example 3 construction of recombinant bacterium YPV-035 and recombinant bacterium YPV-034
Firstly, constructing recombinant bacteria YPV-035
1. The recombinant bacterium YPV-031 is used as a template, a primer pair consisting of a primer P17 and a primer P18 is adopted for PCR amplification, and an amplification product (3044bp) is recovered. And after sequencing, the amplification product is shown as a sequence 7 in the sequence table.
P17:5'-GCTTGCATGCCTGCAGGTCGACTCTAGAGGATCCCCTGGATACCTGAAATCCCAGT-3';
P18:5'-ATCAGGCTGAAAATCTTCTCTCATCCGCCAAAACTTATTCCTCAGGAGCGTTTG-3'。
2. Taking pXMJ19 plasmid, adopting restriction enzyme EcoR I to perform single enzyme digestion, and recovering the linearized plasmid.
3. Co-incubating the amplification product recovered in step 1 with the linearized plasmid recovered in step 2 (with NEBuilder enzyme, incubation at 50 ℃ for 30min) to obtain recombinant plasmid pXMJ19-YH66-RS10865A844C,C846A. Sequencing verification shows that the recombinant plasmid pXMJ19-YH66-RS10865A844C,C846AHas a DNA molecule shown in a sequence 7 of a sequence table.
4. The recombinant plasmid pXMJ19-YH66-RS10865A844C,C846AThe Corynebacterium glutamicum CGMCC21260 is electrically transduced to obtain recombinant bacteria YPV-035. The recombinant bacterium YPV-035 is prepared by over-expressing pXMJ19-YH66-RS10865 by a plasmidA844C,C846AEngineering strain of gene.
Secondly, constructing recombinant bacteria YPV-034
The template is replaced by the recombinant bacterium YPV-031 to the Corynebacterium glutamicum ATCC15168, and the steps are the same as the first step.
The recombinant strain YPV-034 is obtained. The recombinant strain YPV-034 is an engineering strain for overexpressing YH66-RS10865 gene by a plasmid. Compared with the recombinant bacteria YPV-035, the recombinant bacteria YPV-034 only have the following differences: sequence 4 replaces sequence 2 in the sequence of the foreign DNA over-expressed by the plasmid.
Example 4 construction of engineered Strain with deletion of YH66-RS10865 Gene on genome
P19:5'-CAGTGCCAAGCTTGCATGCCTGCAGGTCGACTCTAGTCGGCCACTTTAATAATCCT-3';
P20:5'-TATCTGTCCCTTGAGGTGATTTCCACACCTCCTGTTGGAA-3';
P21:5'-TTCCAACAGGAGGTGTGGAAATCACCTCAAGGGACAGATA-3';
P22:5'-CAGCTATGACCATGATTACGAATTCGAGCTCGGTACCCAACAATGTAAACAAGATGTC-3'。
P23:5'-TCGGCCACTTTAATAATCCT-3';
P24:5'-AACAATGTAAACAAGATGTC-3'。
Construction of recombinant plasmid
1. Using Corynebacterium glutamicum ATCC15168 as a template, PCR amplification was carried out using a primer pair consisting of primer P19 and primer P20, and an amplification product (upstream homology arm fragment, 730bp) was recovered.
2. Using Corynebacterium glutamicum ATCC15168 as a template, PCR amplification was carried out using a primer pair consisting of primer P21 and primer P22, and an amplification product (downstream homology arm fragment, 725bp) was recovered.
3. Meanwhile, the amplification product recovered in the step 1 and the amplification product recovered in the step 2 are used as templates, and PCR amplification (Overlap PCR) is carried out by using a primer pair consisting of a primer P19 and a primer P22, so that the amplification product is recovered. And after sequencing, the amplification product is shown as a sequence 8 in the sequence table.
4. The pK18mobsacB plasmid was digested with restriction enzyme Xba I, and the linearized plasmid was recovered.
5. And (3) co-incubating the amplification product recovered in the step (3) with the linearized plasmid recovered in the step (4) (by adopting NEBuilder enzyme, incubating at 50 ℃ for 30min) to obtain the recombinant plasmid pK 18-delta YH66-RS 10865. Through sequencing, the recombinant plasmid pK 18-delta YH66-RS10865 has a DNA molecule shown as a sequence 8 in a sequence table.
Secondly, constructing recombinant bacteria YPV-036
1. The Corynebacterium glutamicum CGMCC21260 is subjected to electric shock transformation by adopting a recombinant plasmid pK 18-delta YH66-RS10865, then cultured, and then PCR identification is respectively carried out on each single colony (by adopting a primer pair consisting of a primer P23 and a primer P24). The bacterial strain capable of amplifying bands of 1341bp and 4110bp simultaneously is a positive bacterial strain. The strain only amplifying the 4110bp band is a starting strain which fails in transformation, wherein the 4110bp fragment is shown as a sequence 9 in a sequence table.
2. Selecting the positive strain in the step 1, culturing the positive strain by using a culture medium containing 15% of sucrose, then selecting a single colony, culturing the single colony by using a culture medium containing kanamycin and a culture medium not containing kanamycin respectively, and screening the strain which can not grow on the culture medium containing kanamycin and can grow on the culture medium not containing kanamycin.
3. And (3) taking the strain screened in the step (2), carrying out PCR amplification by adopting a primer pair consisting of a primer P23 and a primer P24, wherein the strain only shows a single amplification product and has the size of 1341bp and is a positive strain with the YH66-RS10865 gene coding region knocked out.
4. And (3) taking the strain obtained by screening in the step (3), performing PCR amplification and sequencing by using a primer pair consisting of a primer P23 and a primer P24 again, and naming the strain with correct sequencing as recombinant bacteria YPV-036. Compared with the genome DNA of Corynebacterium glutamicum CGMCC21260, the recombinant bacterium YPV-036 is different only in that the DNA molecule shown in sequence 4 in the sequence table is deleted.
Example 5 fermentative preparation of L-valine
The test strains are respectively as follows: corynebacterium glutamicum CGMCC21260, recombinant bacteria YPV-031, recombinant bacteria YPV-032, recombinant bacteria YPV-033, recombinant bacteria YPV-034, recombinant bacteria YPV-035 and recombinant bacteria YPV-036.
Fermentation tank: BLBIO-5GC-4-H model fermenter (Shanghai Bailun Biotech Co., Ltd.).
The formulation of the fermentation medium is shown in Table 3, with the balance being water.
TABLE 3 fermentation Medium formulation
Composition (I) | Content (wt.) |
Ammonium sulfate | 14g/L |
Potassium dihydrogen phosphate | 1g/L |
Dipotassium hydrogen phosphate | 1g/L |
Magnesium sulfate | 0.5g/L |
Yeast powder | 2g/L |
Ferrous sulfate | 18mg/L |
Manganese sulfate | 4.2mg/L |
Biotin | 0.02mg/L |
Vitamin B1 | 2mg/L |
Antifoaming agent antidioam (CB-442) | 0.5mL/L |
Glucose (base candy) | 40g/L |
The fermentation control process is shown in Table 4.
At the initial moment of completion of inoculation, the OD value of the system is 0.3-0.5.
In the fermentation process: ammonia water is used for adjusting the pH value; when foam exists in the fermentation system, adding a proper amount of antifoaming agent anifoam (CB-442); the sugar content (residual sugar) of the system is controlled by supplementing 70% glucose aqueous solution.
TABLE 4 fermentation control Process
After completion of the fermentation, the supernatant was collected, and the L-valine production in the supernatant was measured by HPLC.
The results are shown in Table 5. The L-valine yield of the recombinant bacteria YPV-031 and YPV-036 is obviously higher than that of Corynebacterium glutamicum CGMCC 21260. The results show that the suppression of YH66-RS10865 gene expression can increase L-valine production, and the overexpression of YH66-RS10865 gene can decrease L-valine production.
TABLE 5 results of L-valine fermentation experiments
Bacterial strains | OD610 | L-valine yield (g/L) |
Corynebacterium glutamicum CGMCC21260 | 97.8 | 83.2 |
Recombinant bacterium YPV-031 | 98.1 | 85.6 |
Recombinant bacterium YPV-032 | 98.4 | 82.3 |
Recombinant bacterium YPV-033 | 98.2 | 83.1 |
Recombinant bacterium YPV-034 | 98.6 | 82.4 |
Recombinant bacterium YPV-035 | 98.3 | 82.6 |
Recombinant bacterium YPV-036 | 97.9 | 84.3 |
The present invention has been described in detail above. It will be apparent to those skilled in the art that the invention can be practiced in a wide range of equivalent parameters, concentrations, and conditions without departing from the spirit and scope of the invention and without undue experimentation. While the invention has been described with reference to specific embodiments, it will be appreciated that the invention can be further modified. In general, this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. The use of some of the essential features is possible within the scope of the claims attached below.
Sequence listing
<110> Ningxia Yipin Biotechnology Ltd
Engineering bacterium obtained by YH66-RS10865 gene modification and application thereof in preparation of valine
<130> GNCYX211995
<160> 9
<170> SIPOSequenceListing 1.0
<210> 1
<211> 922
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 1
Met Ala Asp Gln Ala Lys Leu Gly Gly Lys Pro Ser Asp Asp Ser Asn
1 5 10 15
Phe Ala Met Ile Arg Asp Gly Val Ala Ser Tyr Leu Asn Asp Ser Asp
20 25 30
Pro Glu Glu Thr Asn Glu Trp Met Asp Ser Leu Asp Gly Leu Leu Gln
35 40 45
Glu Ser Ser Pro Glu Arg Ala Arg Tyr Leu Met Leu Arg Leu Leu Glu
50 55 60
Arg Ala Ser Ala Lys Arg Val Ser Leu Pro Pro Met Thr Ser Thr Asp
65 70 75 80
Tyr Val Asn Thr Ile Pro Thr Ser Met Glu Pro Glu Phe Pro Gly Asp
85 90 95
Glu Glu Met Glu Lys Arg Tyr Arg Arg Trp Ile Arg Trp Asn Ala Ala
100 105 110
Ile Met Val His Arg Ala Gln Arg Pro Gly Ile Gly Val Gly Gly His
115 120 125
Ile Ser Thr Tyr Ala Gly Ala Ala Pro Leu Tyr Glu Val Gly Phe Asn
130 135 140
His Phe Phe Arg Gly Lys Asp His Pro Gly Gly Gly Asp Gln Ile Phe
145 150 155 160
Phe Gln Gly His Ala Ser Pro Gly Met Tyr Ala Arg Ala Phe Met Glu
165 170 175
Gly Arg Leu Ser Glu Asp Asp Leu Asp Gly Phe Arg Gln Glu Val Ser
180 185 190
Arg Glu Gln Gly Gly Ile Pro Ser Tyr Pro His Pro His Gly Met Lys
195 200 205
Asp Phe Trp Glu Phe Pro Thr Val Ser Met Gly Leu Gly Pro Met Asp
210 215 220
Ala Ile Tyr Gln Ala Arg Phe Asn Arg Tyr Leu Glu Asn Arg Gly Ile
225 230 235 240
Lys Asp Thr Ser Asp Gln His Val Trp Ala Phe Leu Gly Asp Gly Glu
245 250 255
Met Asp Glu Pro Glu Ser Arg Gly Leu Ile Gln Gln Ala Ala Leu Asn
260 265 270
Asn Leu Asp Asn Leu Thr Phe Val Val Gln Cys Asn Leu Gln Arg Leu
275 280 285
Asp Gly Pro Val Arg Gly Asn Thr Lys Ile Ile Gln Glu Leu Glu Ser
290 295 300
Phe Phe Arg Gly Ala Gly Trp Ser Val Ile Lys Val Val Trp Gly Arg
305 310 315 320
Glu Trp Asp Glu Leu Leu Glu Lys Asp Gln Asp Gly Ala Leu Val Glu
325 330 335
Ile Met Asn Asn Thr Ser Asp Gly Asp Tyr Gln Thr Phe Lys Ala Asn
340 345 350
Asp Gly Ala Tyr Val Arg Glu His Phe Phe Gly Arg Asp Pro Arg Thr
355 360 365
Ala Lys Leu Val Glu Asn Met Thr Asp Glu Glu Ile Trp Lys Leu Pro
370 375 380
Arg Gly Gly His Asp Tyr Arg Lys Val Tyr Ala Ala Tyr Lys Arg Ala
385 390 395 400
Leu Glu Thr Lys Asp Arg Pro Thr Val Ile Leu Ala His Thr Ile Lys
405 410 415
Gly Tyr Gly Leu Gly His Asn Phe Glu Gly Arg Asn Ala Thr His Gln
420 425 430
Met Lys Lys Leu Thr Leu Asp Asp Leu Lys Leu Phe Arg Asp Lys Gln
435 440 445
Gly Ile Pro Ile Thr Asp Glu Gln Leu Glu Lys Asp Pro Tyr Leu Pro
450 455 460
Pro Tyr Tyr His Pro Gly Glu Asp Ala Pro Glu Ile Lys Tyr Met Lys
465 470 475 480
Glu Arg Arg Ala Ala Leu Gly Gly Tyr Leu Pro Glu Arg Arg Glu Asn
485 490 495
Tyr Asp Pro Ile Gln Val Pro Pro Leu Asp Lys Leu Arg Ser Val Arg
500 505 510
Lys Gly Ser Gly Lys Gln Gln Ile Ala Thr Thr Met Ala Thr Val Arg
515 520 525
Thr Phe Lys Glu Leu Met Arg Asp Lys Gly Leu Ala Asp Arg Leu Val
530 535 540
Pro Ile Ile Pro Asp Glu Ala Arg Thr Phe Gly Leu Asp Ser Trp Phe
545 550 555 560
Pro Thr Leu Lys Ile Tyr Asn Pro His Gly Gln Asn Tyr Val Pro Val
565 570 575
Asp His Asp Leu Met Leu Ser Tyr Arg Glu Ala Pro Glu Gly Gln Ile
580 585 590
Leu His Glu Gly Ile Asn Glu Ala Gly Ser Met Ala Ser Phe Ile Ala
595 600 605
Ala Gly Thr Ser Tyr Ala Thr His Gly Lys Ala Met Ile Pro Leu Tyr
610 615 620
Ile Phe Tyr Ser Met Phe Gly Phe Gln Arg Thr Gly Asp Ser Ile Trp
625 630 635 640
Ala Ala Ala Asp Gln Met Ala Arg Gly Phe Leu Leu Gly Ala Thr Ala
645 650 655
Gly Arg Thr Thr Leu Thr Gly Glu Gly Leu Gln His Met Asp Gly His
660 665 670
Ser Pro Val Leu Ala Ser Thr Asn Glu Gly Val Glu Thr Tyr Asp Pro
675 680 685
Ser Phe Ala Tyr Glu Ile Ala His Leu Val His Arg Gly Ile Asp Arg
690 695 700
Met Tyr Gly Pro Gly Lys Gly Glu Asp Val Ile Tyr Tyr Ile Thr Ile
705 710 715 720
Tyr Asn Glu Pro Thr Pro Gln Pro Ala Glu Pro Glu Gly Leu Asp Val
725 730 735
Glu Gly Leu His Lys Gly Ile Tyr Leu Tyr Ser Arg Gly Glu Gly Thr
740 745 750
Gly His Glu Ala Asn Ile Leu Ala Ser Gly Val Gly Met Gln Trp Ala
755 760 765
Leu Lys Ala Ala Ser Ile Leu Glu Ala Asp Tyr Gly Val Arg Ala Asn
770 775 780
Ile Tyr Ser Ala Thr Ser Trp Val Asn Leu Ala Arg Asp Gly Ala Ala
785 790 795 800
Arg Asn Lys Ala Gln Leu Arg Asn Pro Gly Ala Asp Ala Gly Glu Ala
805 810 815
Phe Val Thr Thr Gln Leu Lys Gln Thr Ser Gly Pro Tyr Val Ala Val
820 825 830
Ser Asp Phe Ser Thr Asp Leu Pro Asn Gln Ile Arg Glu Trp Val Pro
835 840 845
Gly Asp Tyr Thr Val Leu Gly Ala Asp Gly Phe Gly Phe Ser Asp Thr
850 855 860
Arg Pro Ala Ala Arg Arg Phe Phe Asn Ile Asp Ala Glu Ser Ile Val
865 870 875 880
Val Ala Val Leu Asn Ser Leu Ala Arg Glu Gly Lys Ile Asp Val Ser
885 890 895
Val Ala Ala Gln Ala Ala Glu Lys Phe Lys Leu Asp Asp Pro Thr Ser
900 905 910
Val Ser Val Asp Pro Asn Ala Pro Glu Glu
915 920
<210> 2
<211> 2769
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 2
atggccgatc aagcaaaact tggtggcaag ccctcggatg actctaactt cgcgatgatc 60
cgcgatggcg tggcatctta tttgaacgac tcagatccgg aggagaccaa cgagtggatg 120
gattcactcg acggattact ccaggagtct tctccagaac gtgctcgtta cctcatgctt 180
cgtttgcttg agcgtgcatc tgcaaagcgc gtatctcttc ccccaatgac gtcaaccgac 240
tacgtcaaca ccattccaac ctctatggaa cctgaattcc caggcgatga ggaaatggag 300
aagcgttacc gtcgttggat tcgctggaac gcagccatca tggttcaccg cgctcagcga 360
ccaggcatcg gcgtcggcgg gcacatttcc acttacgcag gcgcagcccc tctgtacgaa 420
gttggcttca accacttctt ccgcggcaag gatcacccag gcggcggcga ccagatcttc 480
ttccagggcc acgcatcacc aggtatgtac gcacgtgcat tcatggaggg tcgcctttct 540
gaagacgatc tcgatggctt ccgtcaggaa gtttcccgtg agcagggtgg tattccgtcc 600
taccctcacc cacacggtat gaaggacttc tgggagttcc caactgtgtc catgggtctt 660
ggcccaatgg atgccattta ccaggcacgt ttcaaccgct acctcgaaaa ccgtggcatc 720
aaggacacct ctgaccagca cgtctgggcc ttccttggcg acggcgaaat ggacgagcca 780
gaatcacgtg gtctcatcca gcaggctgca ctgaacaacc tggacaacct gaccttcgtg 840
gttcaatgca acctgcagcg tctcgacgga cctgtccgcg gtaacaccaa gatcatccag 900
gaactcgagt ccttcttccg tggcgcaggc tggtctgtga tcaaggttgt ttggggtcgc 960
gagtgggatg aacttctgga gaaggaccag gatggtgcac ttgttgagat catgaacaac 1020
acctccgatg gtgactacca gaccttcaag gctaacgacg gcgcatatgt tcgtgagcac 1080
ttcttcggac gtgacccacg caccgcaaag ctcgttgaga acatgaccga cgaagaaatc 1140
tggaagctgc cacgtggcgg ccacgattac cgcaaggttt acgcagccta caagcgagct 1200
cttgagacca aggatcgccc aaccgtcatc cttgctcaca ccattaaggg ctacggactc 1260
ggccacaatt tcgaaggccg taacgcaacc caccagatga agaagctgac gcttgatgat 1320
ctgaagttgt tccgcgacaa gcagggcatc ccaatcaccg atgagcagct ggagaaggat 1380
ccttaccttc ctccttacta ccacccaggt gaagacgctc ctgaaatcaa gtacatgaag 1440
gaacgtcgcg cagcgctcgg tggctacctg ccagagcgtc gtgagaacta cgatccaatt 1500
caggttccac cactggataa gcttcgctct gtccgtaagg gctccggcaa gcagcagatc 1560
gctaccacca tggcgactgt tcgcaccttc aaggaactga tgcgcgataa gggcttggct 1620
gatcgccttg tcccaatcat tcctgatgag gcacgtacct tcggtcttga ctcttggttc 1680
ccaaccttga agatctacaa cccgcacggt cagaactacg tgcctgttga ccacgacctg 1740
atgctctcct accgtgaggc acctgaagga cagatcctgc acgaaggcat caacgaggct 1800
ggttccatgg catcgttcat cgctgcgggt acctcctacg ccacccacgg caaggccatg 1860
attccgctgt acatcttcta ctcgatgttc ggattccagc gcaccggtga ctccatctgg 1920
gcagcagccg atcagatggc acgtggcttc ctcttgggcg ctaccgcagg tcgcaccacc 1980
ctgaccggtg aaggcctcca gcacatggat ggacactccc ctgtcttggc ttccaccaac 2040
gagggtgtcg agacctacga cccatccttt gcgtacgaga tcgcacacct ggttcaccgt 2100
ggcatcgacc gcatgtacgg cccaggcaag ggcgaagatg ttatctacta catcaccatc 2160
tacaacgagc caaccccaca gccagctgag ccagaaggac tggacgtaga aggcctgcac 2220
aagggcatct acctctactc ccgcggtgaa ggcaccggcc atgaggcaaa catcttggct 2280
tccggtgttg gtatgcagtg ggctctcaag gctgcatcca tccttgaggc tgactacgga 2340
gttcgtgcaa acatttactc cgctacttct tgggttaact tggctcgcga tggcgctgct 2400
cgtaacaagg cacagctgcg caacccaggt gcagatgctg gcgaggcatt cgtaaccacc 2460
cagctgaagc agacctctgg cccatacgtc gcagtgtctg acttctccac tgatctgcca 2520
aaccagatcc gtgaatgggt cccaggcgac tacaccgttc tcggtgcaga tggcttcggt 2580
ttctctgata cccgcccagc tgctcgtcgc ttcttcaaca tcgacgctga gtccattgtt 2640
gttgcagtgc tgaactccct ggcacgcgaa ggcaagatcg acgtctccgt tgctgctcag 2700
gctgctgaga agttcaagtt ggatgatcct acgagtgttt ccgtagatcc aaacgctcct 2760
gaggaataa 2769
<210> 3
<211> 922
<212> PRT
<213> Corynebacterium glutamicum
<400> 3
Met Ala Asp Gln Ala Lys Leu Gly Gly Lys Pro Ser Asp Asp Ser Asn
1 5 10 15
Phe Ala Met Ile Arg Asp Gly Val Ala Ser Tyr Leu Asn Asp Ser Asp
20 25 30
Pro Glu Glu Thr Asn Glu Trp Met Asp Ser Leu Asp Gly Leu Leu Gln
35 40 45
Glu Ser Ser Pro Glu Arg Ala Arg Tyr Leu Met Leu Arg Leu Leu Glu
50 55 60
Arg Ala Ser Ala Lys Arg Val Ser Leu Pro Pro Met Thr Ser Thr Asp
65 70 75 80
Tyr Val Asn Thr Ile Pro Thr Ser Met Glu Pro Glu Phe Pro Gly Asp
85 90 95
Glu Glu Met Glu Lys Arg Tyr Arg Arg Trp Ile Arg Trp Asn Ala Ala
100 105 110
Ile Met Val His Arg Ala Gln Arg Pro Gly Ile Gly Val Gly Gly His
115 120 125
Ile Ser Thr Tyr Ala Gly Ala Ala Pro Leu Tyr Glu Val Gly Phe Asn
130 135 140
His Phe Phe Arg Gly Lys Asp His Pro Gly Gly Gly Asp Gln Ile Phe
145 150 155 160
Phe Gln Gly His Ala Ser Pro Gly Met Tyr Ala Arg Ala Phe Met Glu
165 170 175
Gly Arg Leu Ser Glu Asp Asp Leu Asp Gly Phe Arg Gln Glu Val Ser
180 185 190
Arg Glu Gln Gly Gly Ile Pro Ser Tyr Pro His Pro His Gly Met Lys
195 200 205
Asp Phe Trp Glu Phe Pro Thr Val Ser Met Gly Leu Gly Pro Met Asp
210 215 220
Ala Ile Tyr Gln Ala Arg Phe Asn Arg Tyr Leu Glu Asn Arg Gly Ile
225 230 235 240
Lys Asp Thr Ser Asp Gln His Val Trp Ala Phe Leu Gly Asp Gly Glu
245 250 255
Met Asp Glu Pro Glu Ser Arg Gly Leu Ile Gln Gln Ala Ala Leu Asn
260 265 270
Asn Leu Asp Asn Leu Thr Phe Val Val Asn Cys Asn Leu Gln Arg Leu
275 280 285
Asp Gly Pro Val Arg Gly Asn Thr Lys Ile Ile Gln Glu Leu Glu Ser
290 295 300
Phe Phe Arg Gly Ala Gly Trp Ser Val Ile Lys Val Val Trp Gly Arg
305 310 315 320
Glu Trp Asp Glu Leu Leu Glu Lys Asp Gln Asp Gly Ala Leu Val Glu
325 330 335
Ile Met Asn Asn Thr Ser Asp Gly Asp Tyr Gln Thr Phe Lys Ala Asn
340 345 350
Asp Gly Ala Tyr Val Arg Glu His Phe Phe Gly Arg Asp Pro Arg Thr
355 360 365
Ala Lys Leu Val Glu Asn Met Thr Asp Glu Glu Ile Trp Lys Leu Pro
370 375 380
Arg Gly Gly His Asp Tyr Arg Lys Val Tyr Ala Ala Tyr Lys Arg Ala
385 390 395 400
Leu Glu Thr Lys Asp Arg Pro Thr Val Ile Leu Ala His Thr Ile Lys
405 410 415
Gly Tyr Gly Leu Gly His Asn Phe Glu Gly Arg Asn Ala Thr His Gln
420 425 430
Met Lys Lys Leu Thr Leu Asp Asp Leu Lys Leu Phe Arg Asp Lys Gln
435 440 445
Gly Ile Pro Ile Thr Asp Glu Gln Leu Glu Lys Asp Pro Tyr Leu Pro
450 455 460
Pro Tyr Tyr His Pro Gly Glu Asp Ala Pro Glu Ile Lys Tyr Met Lys
465 470 475 480
Glu Arg Arg Ala Ala Leu Gly Gly Tyr Leu Pro Glu Arg Arg Glu Asn
485 490 495
Tyr Asp Pro Ile Gln Val Pro Pro Leu Asp Lys Leu Arg Ser Val Arg
500 505 510
Lys Gly Ser Gly Lys Gln Gln Ile Ala Thr Thr Met Ala Thr Val Arg
515 520 525
Thr Phe Lys Glu Leu Met Arg Asp Lys Gly Leu Ala Asp Arg Leu Val
530 535 540
Pro Ile Ile Pro Asp Glu Ala Arg Thr Phe Gly Leu Asp Ser Trp Phe
545 550 555 560
Pro Thr Leu Lys Ile Tyr Asn Pro His Gly Gln Asn Tyr Val Pro Val
565 570 575
Asp His Asp Leu Met Leu Ser Tyr Arg Glu Ala Pro Glu Gly Gln Ile
580 585 590
Leu His Glu Gly Ile Asn Glu Ala Gly Ser Met Ala Ser Phe Ile Ala
595 600 605
Ala Gly Thr Ser Tyr Ala Thr His Gly Lys Ala Met Ile Pro Leu Tyr
610 615 620
Ile Phe Tyr Ser Met Phe Gly Phe Gln Arg Thr Gly Asp Ser Ile Trp
625 630 635 640
Ala Ala Ala Asp Gln Met Ala Arg Gly Phe Leu Leu Gly Ala Thr Ala
645 650 655
Gly Arg Thr Thr Leu Thr Gly Glu Gly Leu Gln His Met Asp Gly His
660 665 670
Ser Pro Val Leu Ala Ser Thr Asn Glu Gly Val Glu Thr Tyr Asp Pro
675 680 685
Ser Phe Ala Tyr Glu Ile Ala His Leu Val His Arg Gly Ile Asp Arg
690 695 700
Met Tyr Gly Pro Gly Lys Gly Glu Asp Val Ile Tyr Tyr Ile Thr Ile
705 710 715 720
Tyr Asn Glu Pro Thr Pro Gln Pro Ala Glu Pro Glu Gly Leu Asp Val
725 730 735
Glu Gly Leu His Lys Gly Ile Tyr Leu Tyr Ser Arg Gly Glu Gly Thr
740 745 750
Gly His Glu Ala Asn Ile Leu Ala Ser Gly Val Gly Met Gln Trp Ala
755 760 765
Leu Lys Ala Ala Ser Ile Leu Glu Ala Asp Tyr Gly Val Arg Ala Asn
770 775 780
Ile Tyr Ser Ala Thr Ser Trp Val Asn Leu Ala Arg Asp Gly Ala Ala
785 790 795 800
Arg Asn Lys Ala Gln Leu Arg Asn Pro Gly Ala Asp Ala Gly Glu Ala
805 810 815
Phe Val Thr Thr Gln Leu Lys Gln Thr Ser Gly Pro Tyr Val Ala Val
820 825 830
Ser Asp Phe Ser Thr Asp Leu Pro Asn Gln Ile Arg Glu Trp Val Pro
835 840 845
Gly Asp Tyr Thr Val Leu Gly Ala Asp Gly Phe Gly Phe Ser Asp Thr
850 855 860
Arg Pro Ala Ala Arg Arg Phe Phe Asn Ile Asp Ala Glu Ser Ile Val
865 870 875 880
Val Ala Val Leu Asn Ser Leu Ala Arg Glu Gly Lys Ile Asp Val Ser
885 890 895
Val Ala Ala Gln Ala Ala Glu Lys Phe Lys Leu Asp Asp Pro Thr Ser
900 905 910
Val Ser Val Asp Pro Asn Ala Pro Glu Glu
915 920
<210> 4
<211> 2769
<212> DNA
<213> Corynebacterium glutamicum
<400> 4
atggccgatc aagcaaaact tggtggcaag ccctcggatg actctaactt cgcgatgatc 60
cgcgatggcg tggcatctta tttgaacgac tcagatccgg aggagaccaa cgagtggatg 120
gattcactcg acggattact ccaggagtct tctccagaac gtgctcgtta cctcatgctt 180
cgtttgcttg agcgtgcatc tgcaaagcgc gtatctcttc ccccaatgac gtcaaccgac 240
tacgtcaaca ccattccaac ctctatggaa cctgaattcc caggcgatga ggaaatggag 300
aagcgttacc gtcgttggat tcgctggaac gcagccatca tggttcaccg cgctcagcga 360
ccaggcatcg gcgtcggcgg gcacatttcc acttacgcag gcgcagcccc tctgtacgaa 420
gttggcttca accacttctt ccgcggcaag gatcacccag gcggcggcga ccagatcttc 480
ttccagggcc acgcatcacc aggtatgtac gcacgtgcat tcatggaggg tcgcctttct 540
gaagacgatc tcgatggctt ccgtcaggaa gtttcccgtg agcagggtgg tattccgtcc 600
taccctcacc cacacggtat gaaggacttc tgggagttcc caactgtgtc catgggtctt 660
ggcccaatgg atgccattta ccaggcacgt ttcaaccgct acctcgaaaa ccgtggcatc 720
aaggacacct ctgaccagca cgtctgggcc ttccttggcg acggcgaaat ggacgagcca 780
gaatcacgtg gtctcatcca gcaggctgca ctgaacaacc tggacaacct gaccttcgtg 840
gttaactgca acctgcagcg tctcgacgga cctgtccgcg gtaacaccaa gatcatccag 900
gaactcgagt ccttcttccg tggcgcaggc tggtctgtga tcaaggttgt ttggggtcgc 960
gagtgggatg aacttctgga gaaggaccag gatggtgcac ttgttgagat catgaacaac 1020
acctccgatg gtgactacca gaccttcaag gctaacgacg gcgcatatgt tcgtgagcac 1080
ttcttcggac gtgacccacg caccgcaaag ctcgttgaga acatgaccga cgaagaaatc 1140
tggaagctgc cacgtggcgg ccacgattac cgcaaggttt acgcagccta caagcgagct 1200
cttgagacca aggatcgccc aaccgtcatc cttgctcaca ccattaaggg ctacggactc 1260
ggccacaatt tcgaaggccg taacgcaacc caccagatga agaagctgac gcttgatgat 1320
ctgaagttgt tccgcgacaa gcagggcatc ccaatcaccg atgagcagct ggagaaggat 1380
ccttaccttc ctccttacta ccacccaggt gaagacgctc ctgaaatcaa gtacatgaag 1440
gaacgtcgcg cagcgctcgg tggctacctg ccagagcgtc gtgagaacta cgatccaatt 1500
caggttccac cactggataa gcttcgctct gtccgtaagg gctccggcaa gcagcagatc 1560
gctaccacca tggcgactgt tcgcaccttc aaggaactga tgcgcgataa gggcttggct 1620
gatcgccttg tcccaatcat tcctgatgag gcacgtacct tcggtcttga ctcttggttc 1680
ccaaccttga agatctacaa cccgcacggt cagaactacg tgcctgttga ccacgacctg 1740
atgctctcct accgtgaggc acctgaagga cagatcctgc acgaaggcat caacgaggct 1800
ggttccatgg catcgttcat cgctgcgggt acctcctacg ccacccacgg caaggccatg 1860
attccgctgt acatcttcta ctcgatgttc ggattccagc gcaccggtga ctccatctgg 1920
gcagcagccg atcagatggc acgtggcttc ctcttgggcg ctaccgcagg tcgcaccacc 1980
ctgaccggtg aaggcctcca gcacatggat ggacactccc ctgtcttggc ttccaccaac 2040
gagggtgtcg agacctacga cccatccttt gcgtacgaga tcgcacacct ggttcaccgt 2100
ggcatcgacc gcatgtacgg cccaggcaag ggcgaagatg ttatctacta catcaccatc 2160
tacaacgagc caaccccaca gccagctgag ccagaaggac tggacgtaga aggcctgcac 2220
aagggcatct acctctactc ccgcggtgaa ggcaccggcc atgaggcaaa catcttggct 2280
tccggtgttg gtatgcagtg ggctctcaag gctgcatcca tccttgaggc tgactacgga 2340
gttcgtgcaa acatttactc cgctacttct tgggttaact tggctcgcga tggcgctgct 2400
cgtaacaagg cacagctgcg caacccaggt gcagatgctg gcgaggcatt cgtaaccacc 2460
cagctgaagc agacctctgg cccatacgtc gcagtgtctg acttctccac tgatctgcca 2520
aaccagatcc gtgaatgggt cccaggcgac tacaccgttc tcggtgcaga tggcttcggt 2580
ttctctgata cccgcccagc tgctcgtcgc ttcttcaaca tcgacgctga gtccattgtt 2640
gttgcagtgc tgaactccct ggcacgcgaa ggcaagatcg acgtctccgt tgctgctcag 2700
gctgctgaga agttcaagtt ggatgatcct acgagtgttt ccgtagatcc aaacgctcct 2760
gaggaataa 2769
<210> 5
<211> 1364
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 5
cagtgccaag cttgcatgcc tgcaggtcga ctctaggctt cgtttgcttg agcgtgcatc 60
tgcaaagcgc gtatctcttc ccccaatgac gtcaaccgac tacgtcaaca ccattccaac 120
ctctatggaa cctgaattcc caggcgatga ggaaatggag aagcgttacc gtcgttggat 180
tcgctggaac gcagccatca tggttcaccg cgctcagcga ccaggcatcg gcgtcggcgg 240
gcacatttcc acttacgcag gcgcagcccc tctgtacgaa gttggcttca accacttctt 300
ccgcggcaag gatcacccag gcggcggcga ccagatcttc ttccagggcc acgcatcacc 360
aggtatgtac gcacgtgcat tcatggaggg tcgcctttct gaagacgatc tcgatggctt 420
ccgtcaggaa gtttcccgtg agcagggtgg tattccgtcc taccctcacc cacacggtat 480
gaaggacttc tgggagttcc caactgtgtc catgggtctt ggcccaatgg atgccattta 540
ccaggcacgt ttcaaccgct acctcgaaaa ccgtggcatc aaggacacct ctgaccagca 600
cgtctgggcc ttccttggcg acggcgaaat ggacgagcca gaatcacgtg gtctcatcca 660
gcaggctgca ctgaacaacc tggacaacct gaccttcgtg gttcaatgca acctgcagcg 720
tctcgacgga cctgtccgcg gtaacaccaa gatcatccag gaactcgagt ccttcttccg 780
tggcgcaggc tggtctgtga tcaaggttgt ttggggtcgc gagtgggatg aacttctgga 840
gaaggaccag gatggtgcac ttgttgagat catgaacaac acctccgatg gtgactacca 900
gaccttcaag gctaacgacg gcgcatatgt tcgtgagcac ttcttcggac gtgacccacg 960
caccgcaaag ctcgttgaga acatgaccga cgaagaaatc tggaagctgc cacgtggcgg 1020
ccacgattac cgcaaggttt acgcagccta caagcgagct cttgagacca aggatcgccc 1080
aaccgtcatc cttgctcaca ccattaaggg ctacggactc ggccacaatt tcgaaggccg 1140
taacgcaacc caccagatga agaagctgac gcttgatgat ctgaagttgt tccgcgacaa 1200
gcagggcatc ccaatcaccg atgagcagct ggagaaggat ccttaccttc ctccttacta 1260
ccacccaggt gaagacgctc ctgaaatcaa gtacatgaag gaacgtcgcg cagcgctcgg 1320
tggctagggt accgagctcg aattcgtaat catggtcata gctg 1364
<210> 6
<211> 4523
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 6
cagtgccaag cttgcatgcc tgcaggtcga ctctagcatg acggctgact ggactcgact 60
tccatacgag gttctggaga agatctccac ccgcatcacc aacgaagttc cagatgtgaa 120
ccgcgtggtt ttggacgtaa cctccaagcc accaggaacc atcgaatggg agtaggcctt 180
aaatgagcct tcgttaagcg gcaatcacct tattggagat tgtcgctttt cccatttctc 240
cgggttttct ggaacttttt gggcgtatgc tgggaatgat tctattattg ccaaatcaga 300
aagcaggaga gacccgatga gcgaaatcct agaaacctat tgggcacccc actttggaaa 360
aaccgaagaa gccacagcac tcgtttcata cctggcacaa gcttccggcg atcccattga 420
ggttcacacc ctgttcgggg atttaggttt agacggactc tcgggaaact acaccgacac 480
tgagattgac ggctacggcg acgcattcct gctggttgca gcgctatccg tgttgatggc 540
tgaaaacaaa gcaacaggtg gcgtgaatct gggtgagctt gggggagctg ataaatcgat 600
ccggctgcat gttgaatcca aggagaacac ccaaatcaac accgcattga agtattttgc 660
gctctcccca gaagaccacg cagcagcaga tcgcttcgat gaggatgacc tgtctgagct 720
tgccaacttg agtgaagagc tgcgcggaca gctggactaa ttgtctccca tttaaggagt 780
ccgatttgga tacctgaaat cccagtgagc gcaccactcc ccttacgtca cagtctgtaa 840
aacaaatctt cggtgttgcg tatccttgtt aataacttat gcgttgactc attcgtgcac 900
ttcggcgtgt cacaattagg tacgaccaag aatgggaccg ggaaaccggg acgtataaac 960
gaaataaaac attccaacag gaggtgtgga aatggccgat caagcaaaac ttggtggcaa 1020
gccctcggat gactctaact tcgcgatgat ccgcgatggc gtggcatctt atttgaacga 1080
ctcagatccg gaggagacca acgagtggat ggattcactc gacggattac tccaggagtc 1140
ttctccagaa cgtgctcgtt acctcatgct tcgtttgctt gagcgtgcat ctgcaaagcg 1200
cgtatctctt cccccaatga cgtcaaccga ctacgtcaac accattccaa cctctatgga 1260
acctgaattc ccaggcgatg aggaaatgga gaagcgttac cgtcgttgga ttcgctggaa 1320
cgcagccatc atggttcacc gcgctcagcg accaggcatc ggcgtcggcg ggcacatttc 1380
cacttacgca ggcgcagccc ctctgtacga agttggcttc aaccacttct tccgcggcaa 1440
ggatcaccca ggcggcggcg accagatctt cttccagggc cacgcatcac caggtatgta 1500
cgcacgtgca ttcatggagg gtcgcctttc tgaagacgat ctcgatggct tccgtcagga 1560
agtttcccgt gagcagggtg gtattccgtc ctaccctcac ccacacggta tgaaggactt 1620
ctgggagttc ccaactgtgt ccatgggtct tggcccaatg gatgccattt accaggcacg 1680
tttcaaccgc tacctcgaaa accgtggcat caaggacacc tctgaccagc acgtctgggc 1740
cttccttggc gacggcgaaa tggacgagcc agaatcacgt ggtctcatcc agcaggctgc 1800
actgaacaac ctggacaacc tgaccttcgt ggttcaatgc aacctgcagc gtctcgacgg 1860
acctgtccgc ggtaacacca agatcatcca ggaactcgag tccttcttcc gtggcgcagg 1920
ctggtctgtg atcaaggttg tttggggtcg cgagtgggat gaacttctgg agaaggacca 1980
ggatggtgca cttgttgaga tcatgaacaa cacctccgat ggtgactacc agaccttcaa 2040
ggctaacgac ggcgcatatg ttcgtgagca cttcttcgga cgtgacccac gcaccgcaaa 2100
gctcgttgag aacatgaccg acgaagaaat ctggaagctg ccacgtggcg gccacgatta 2160
ccgcaaggtt tacgcagcct acaagcgagc tcttgagacc aaggatcgcc caaccgtcat 2220
ccttgctcac accattaagg gctacggact cggccacaat ttcgaaggcc gtaacgcaac 2280
ccaccagatg aagaagctga cgcttgatga tctgaagttg ttccgcgaca agcagggcat 2340
cccaatcacc gatgagcagc tggagaagga tccttacctt cctccttact accacccagg 2400
tgaagacgct cctgaaatca agtacatgaa ggaacgtcgc gcagcgctcg gtggctacct 2460
gccagagcgt cgtgagaact acgatccaat tcaggttcca ccactggata agcttcgctc 2520
tgtccgtaag ggctccggca agcagcagat cgctaccacc atggcgactg ttcgcacctt 2580
caaggaactg atgcgcgata agggcttggc tgatcgcctt gtcccaatca ttcctgatga 2640
ggcacgtacc ttcggtcttg actcttggtt cccaaccttg aagatctaca acccgcacgg 2700
tcagaactac gtgcctgttg accacgacct gatgctctcc taccgtgagg cacctgaagg 2760
acagatcctg cacgaaggca tcaacgaggc tggttccatg gcatcgttca tcgctgcggg 2820
tacctcctac gccacccacg gcaaggccat gattccgctg tacatcttct actcgatgtt 2880
cggattccag cgcaccggtg actccatctg ggcagcagcc gatcagatgg cacgtggctt 2940
cctcttgggc gctaccgcag gtcgcaccac cctgaccggt gaaggcctcc agcacatgga 3000
tggacactcc cctgtcttgg cttccaccaa cgagggtgtc gagacctacg acccatcctt 3060
tgcgtacgag atcgcacacc tggttcaccg tggcatcgac cgcatgtacg gcccaggcaa 3120
gggcgaagat gttatctact acatcaccat ctacaacgag ccaaccccac agccagctga 3180
gccagaagga ctggacgtag aaggcctgca caagggcatc tacctctact cccgcggtga 3240
aggcaccggc catgaggcaa acatcttggc ttccggtgtt ggtatgcagt gggctctcaa 3300
ggctgcatcc atccttgagg ctgactacgg agttcgtgca aacatttact ccgctacttc 3360
ttgggttaac ttggctcgcg atggcgctgc tcgtaacaag gcacagctgc gcaacccagg 3420
tgcagatgct ggcgaggcat tcgtaaccac ccagctgaag cagacctctg gcccatacgt 3480
cgcagtgtct gacttctcca ctgatctgcc aaaccagatc cgtgaatggg tcccaggcga 3540
ctacaccgtt ctcggtgcag atggcttcgg tttctctgat acccgcccag ctgctcgtcg 3600
cttcttcaac atcgacgctg agtccattgt tgttgcagtg ctgaactccc tggcacgcga 3660
aggcaagatc gacgtctccg ttgctgctca ggctgctgag aagttcaagt tggatgatcc 3720
tacgagtgtt tccgtagatc caaacgctcc tgaggaataa taaatcgact actcacatag 3780
ggtcgggcta gtcattctga tcagcgaatt ccacgttcac atcgccaatt ccagagttca 3840
caaccagatt cagcattgga ccttctagat cagcattgtg ggcggtgaga tctccaacat 3900
cacagcgcgc tgtgcccaca ccggcggtac aacttaggct cacgggcaca tcatcgggca 3960
gggtgaccat gacttcgccg atccctgagg tgatttggat gttttgttcc tgatccaatt 4020
gggtgaggtg gctgaaatcg aggttcattt cacccacgcc agaggtgtag ctgctgagga 4080
gttcatcgtt ggtggggatg agattgacat cgccgattcc agggtcgtct tcaaagtaga 4140
tgggatcgat atttgaaata aacaggcctg cgagggcgct catgacaact ccggtaccaa 4200
ctacaccgcc gacaatccat ggccacacat ggcgcttttt ctgaggcttt tgtggaggga 4260
cttgtacatc ccaggtgttg tattggtttt gggcaagtgg atcccaatga ggcgcttcgg 4320
gggtttgttg cgcgaagggt gcatagtagc cctcaacggg ggtgatagtg cttagatctg 4380
gttggggttg tgggtagaga tcttcgtttt tcatggtggc atcctcagaa acagtgaatt 4440
cagtggtgag tagtccgcgg ggtggaagtg gttgtttctt atgcagggta ccgagctcga 4500
attcgtaatc atggtcatag ctg 4523
<210> 7
<211> 3044
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 7
gcttgcatgc ctgcaggtcg actctagagg atcccctgga tacctgaaat cccagtgagc 60
gcaccactcc ccttacgtca cagtctgtaa aacaaatctt cggtgttgcg tatccttgtt 120
aataacttat gcgttgactc attcgtgcac ttcggcgtgt cacaattagg tacgaccaag 180
aatgggaccg ggaaaccggg acgtataaac gaaataaaac attccaacag gaggtgtgga 240
aatggccgat caagcaaaac ttggtggcaa gccctcggat gactctaact tcgcgatgat 300
ccgcgatggc gtggcatctt atttgaacga ctcagatccg gaggagacca acgagtggat 360
ggattcactc gacggattac tccaggagtc ttctccagaa cgtgctcgtt acctcatgct 420
tcgtttgctt gagcgtgcat ctgcaaagcg cgtatctctt cccccaatga cgtcaaccga 480
ctacgtcaac accattccaa cctctatgga acctgaattc ccaggcgatg aggaaatgga 540
gaagcgttac cgtcgttgga ttcgctggaa cgcagccatc atggttcacc gcgctcagcg 600
accaggcatc ggcgtcggcg ggcacatttc cacttacgca ggcgcagccc ctctgtacga 660
agttggcttc aaccacttct tccgcggcaa ggatcaccca ggcggcggcg accagatctt 720
cttccagggc cacgcatcac caggtatgta cgcacgtgca ttcatggagg gtcgcctttc 780
tgaagacgat ctcgatggct tccgtcagga agtttcccgt gagcagggtg gtattccgtc 840
ctaccctcac ccacacggta tgaaggactt ctgggagttc ccaactgtgt ccatgggtct 900
tggcccaatg gatgccattt accaggcacg tttcaaccgc tacctcgaaa accgtggcat 960
caaggacacc tctgaccagc acgtctgggc cttccttggc gacggcgaaa tggacgagcc 1020
agaatcacgt ggtctcatcc agcaggctgc actgaacaac ctggacaacc tgaccttcgt 1080
ggttcaatgc aacctgcagc gtctcgacgg acctgtccgc ggtaacacca agatcatcca 1140
ggaactcgag tccttcttcc gtggcgcagg ctggtctgtg atcaaggttg tttggggtcg 1200
cgagtgggat gaacttctgg agaaggacca ggatggtgca cttgttgaga tcatgaacaa 1260
cacctccgat ggtgactacc agaccttcaa ggctaacgac ggcgcatatg ttcgtgagca 1320
cttcttcgga cgtgacccac gcaccgcaaa gctcgttgag aacatgaccg acgaagaaat 1380
ctggaagctg ccacgtggcg gccacgatta ccgcaaggtt tacgcagcct acaagcgagc 1440
tcttgagacc aaggatcgcc caaccgtcat ccttgctcac accattaagg gctacggact 1500
cggccacaat ttcgaaggcc gtaacgcaac ccaccagatg aagaagctga cgcttgatga 1560
tctgaagttg ttccgcgaca agcagggcat cccaatcacc gatgagcagc tggagaagga 1620
tccttacctt cctccttact accacccagg tgaagacgct cctgaaatca agtacatgaa 1680
ggaacgtcgc gcagcgctcg gtggctacct gccagagcgt cgtgagaact acgatccaat 1740
tcaggttcca ccactggata agcttcgctc tgtccgtaag ggctccggca agcagcagat 1800
cgctaccacc atggcgactg ttcgcacctt caaggaactg atgcgcgata agggcttggc 1860
tgatcgcctt gtcccaatca ttcctgatga ggcacgtacc ttcggtcttg actcttggtt 1920
cccaaccttg aagatctaca acccgcacgg tcagaactac gtgcctgttg accacgacct 1980
gatgctctcc taccgtgagg cacctgaagg acagatcctg cacgaaggca tcaacgaggc 2040
tggttccatg gcatcgttca tcgctgcggg tacctcctac gccacccacg gcaaggccat 2100
gattccgctg tacatcttct actcgatgtt cggattccag cgcaccggtg actccatctg 2160
ggcagcagcc gatcagatgg cacgtggctt cctcttgggc gctaccgcag gtcgcaccac 2220
cctgaccggt gaaggcctcc agcacatgga tggacactcc cctgtcttgg cttccaccaa 2280
cgagggtgtc gagacctacg acccatcctt tgcgtacgag atcgcacacc tggttcaccg 2340
tggcatcgac cgcatgtacg gcccaggcaa gggcgaagat gttatctact acatcaccat 2400
ctacaacgag ccaaccccac agccagctga gccagaagga ctggacgtag aaggcctgca 2460
caagggcatc tacctctact cccgcggtga aggcaccggc catgaggcaa acatcttggc 2520
ttccggtgtt ggtatgcagt gggctctcaa ggctgcatcc atccttgagg ctgactacgg 2580
agttcgtgca aacatttact ccgctacttc ttgggttaac ttggctcgcg atggcgctgc 2640
tcgtaacaag gcacagctgc gcaacccagg tgcagatgct ggcgaggcat tcgtaaccac 2700
ccagctgaag cagacctctg gcccatacgt cgcagtgtct gacttctcca ctgatctgcc 2760
aaaccagatc cgtgaatggg tcccaggcga ctacaccgtt ctcggtgcag atggcttcgg 2820
tttctctgat acccgcccag ctgctcgtcg cttcttcaac atcgacgctg agtccattgt 2880
tgttgcagtg ctgaactccc tggcacgcga aggcaagatc gacgtctccg ttgctgctca 2940
ggctgctgag aagttcaagt tggatgatcc tacgagtgtt tccgtagatc caaacgctcc 3000
tgaggaataa gttttggcgg atgagagaag attttcagcc tgat 3044
<210> 8
<211> 1415
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 8
cagtgccaag cttgcatgcc tgcaggtcga ctctagtcgg ccactttaat aatcctcctc 60
gtgtgggccc cgatgtgttt ttcgattaca tggattcaac atgaaaccgc ggggctattg 120
atatatccga attgcacatt accgtccaac cggtactttg aaccaccttt ccctggaatt 180
ttttcctttt cctccccctt tacgctcaag aatcaatgaa ttcaatcact ggccagcgat 240
taacttttcg agttttcagt cttggatttc cacaattctc ttcaaaataa tggtggctag 300
atttttcatc aaaccttcac caaaaggaca tcagacctgt agttttatgc gattcgcgtc 360
aaacgtgaga gaaacatcac atctcgcggg aaactacccg ataattcttt gcaaaacttt 420
gcaaagcgga atgaacatgc agctagtttc cgtagaaatg ttctttaaaa aatccacaac 480
aattgccagg aagcacaccg attgatggat acctgaaatc ccagtgagcg caccactccc 540
cttacgtcac agtctgtaaa acaaatcttc ggtgttgcgt atccttgtta ataacttatg 600
cgttgactca ttcgtgcact tcggcgtgtc acaattaggt acgaccaaga atgggaccgg 660
gaaaccggga cgtataaacg aaataaaaca ttccaacagg aggtgtggaa atcacctcaa 720
gggacagata aatcccgccg ccagacgtta gtctggcggc gggattcgtc gtaaagcaag 780
ctctttttag ccgaggaacg ccttgtcaga caatgttgcg cccttgatgt tggcgaactc 840
ctgcagcaaa tcgcgcacag tcaacttcga cttggtagcc tgatctgcct ggtagacaat 900
ctggccttca tgcatcatga tcaggcgatt gcccaggcga attgcctgtt ccatgttgtg 960
cgtgaccata agcgtagtca gagttccatc tgccacgatc ttttcggtca aggtggtcac 1020
aagctctgca cgctgtggat caagtgctgc ggtgtgctca tccaacagca tgattttagg 1080
ttgagtaaaa ccagccatca gcagggacaa tgcctgacgc tgaccgccag agagcaaacc 1140
aactttggca gtgagcctgt tttccagacc cagctcgagg cgctcaagtt cctgcttgaa 1200
ttgctcacgg cgcttcgagg tcagtgcaaa gcccaatcca cggcgcttgc cgcgcagcaa 1260
cgcgatggcc agattctctt caatggtgag attcggcgcg gtgccggcca gaggatcctg 1320
gaaaacgcgg ccgatgtagc gggcacgctt gtgctctgac atcttgttta cattgttggg 1380
taccgagctc gaattcgtaa tcatggtcat agctg 1415
<210> 9
<211> 4110
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 9
tcggccactt taataatcct cctcgtgtgg gccccgatgt gtttttcgat tacatggatt 60
caacatgaaa ccgcggggct attgatatat ccgaattgca cattaccgtc caaccggtac 120
tttgaaccac ctttccctgg aattttttcc ttttcctccc cctttacgct caagaatcaa 180
tgaattcaat cactggccag cgattaactt ttcgagtttt cagtcttgga tttccacaat 240
tctcttcaaa ataatggtgg ctagattttt catcaaacct tcaccaaaag gacatcagac 300
ctgtagtttt atgcgattcg cgtcaaacgt gagagaaaca tcacatctcg cgggaaacta 360
cccgataatt ctttgcaaaa ctttgcaaag cggaatgaac atgcagctag tttccgtaga 420
aatgttcttt aaaaaatcca caacaattgc caggaagcac accgattgat ggatacctga 480
aatcccagtg agcgcaccac tccccttacg tcacagtctg taaaacaaat cttcggtgtt 540
gcgtatcctt gttaataact tatgcgttga ctcattcgtg cacttcggcg tgtcacaatt 600
aggtacgacc aagaatggga ccgggaaacc gggacgtata aacgaaataa aacattccaa 660
caggaggtgt ggaaatggcc gatcaagcaa aacttggtgg caagccctcg gatgactcta 720
acttcgcgat gatccgcgat ggcgtggcat cttatttgaa cgactcagat ccggaggaga 780
ccaacgagtg gatggattca ctcgacggat tactccagga gtcttctcca gaacgtgctc 840
gttacctcat gcttcgtttg cttgagcgtg catctgcaaa gcgcgtatct cttcccccaa 900
tgacgtcaac cgactacgtc aacaccattc caacctctat ggaacctgaa ttcccaggcg 960
atgaggaaat ggagaagcgt taccgtcgtt ggattcgctg gaacgcagcc atcatggttc 1020
accgcgctca gcgaccaggc atcggcgtcg gcgggcacat ttccacttac gcaggcgcag 1080
cccctctgta cgaagttggc ttcaaccact tcttccgcgg caaggatcac ccaggcggcg 1140
gcgaccagat cttcttccag ggccacgcat caccaggtat gtacgcacgt gcattcatgg 1200
agggtcgcct ttctgaagac gatctcgatg gcttccgtca ggaagtttcc cgtgagcagg 1260
gtggtattcc gtcctaccct cacccacacg gtatgaagga cttctgggag ttcccaactg 1320
tgtccatggg tcttggccca atggatgcca tttaccaggc acgtttcaac cgctacctcg 1380
aaaaccgtgg catcaaggac acctctgacc agcacgtctg ggccttcctt ggcgacggcg 1440
aaatggacga gccagaatca cgtggtctca tccagcaggc tgcactgaac aacctggaca 1500
acctgacctt cgtggttaac tgcaacctgc agcgtctcga cggacctgtc cgcggtaaca 1560
ccaagatcat ccaggaactc gagtccttct tccgtggcgc aggctggtct gtgatcaagg 1620
ttgtttgggg tcgcgagtgg gatgaacttc tggagaagga ccaggatggt gcacttgttg 1680
agatcatgaa caacacctcc gatggtgact accagacctt caaggctaac gacggcgcat 1740
atgttcgtga gcacttcttc ggacgtgacc cacgcaccgc aaagctcgtt gagaacatga 1800
ccgacgaaga aatctggaag ctgccacgtg gcggccacga ttaccgcaag gtttacgcag 1860
cctacaagcg agctcttgag accaaggatc gcccaaccgt catccttgct cacaccatta 1920
agggctacgg actcggccac aatttcgaag gccgtaacgc aacccaccag atgaagaagc 1980
tgacgcttga tgatctgaag ttgttccgcg acaagcaggg catcccaatc accgatgagc 2040
agctggagaa ggatccttac cttcctcctt actaccaccc aggtgaagac gctcctgaaa 2100
tcaagtacat gaaggaacgt cgcgcagcgc tcggtggcta cctgccagag cgtcgtgaga 2160
actacgatcc aattcaggtt ccaccactgg ataagcttcg ctctgtccgt aagggctccg 2220
gcaagcagca gatcgctacc accatggcga ctgttcgcac cttcaaggaa ctgatgcgcg 2280
ataagggctt ggctgatcgc cttgtcccaa tcattcctga tgaggcacgt accttcggtc 2340
ttgactcttg gttcccaacc ttgaagatct acaacccgca cggtcagaac tacgtgcctg 2400
ttgaccacga cctgatgctc tcctaccgtg aggcacctga aggacagatc ctgcacgaag 2460
gcatcaacga ggctggttcc atggcatcgt tcatcgctgc gggtacctcc tacgccaccc 2520
acggcaaggc catgattccg ctgtacatct tctactcgat gttcggattc cagcgcaccg 2580
gtgactccat ctgggcagca gccgatcaga tggcacgtgg cttcctcttg ggcgctaccg 2640
caggtcgcac caccctgacc ggtgaaggcc tccagcacat ggatggacac tcccctgtct 2700
tggcttccac caacgagggt gtcgagacct acgacccatc ctttgcgtac gagatcgcac 2760
acctggttca ccgtggcatc gaccgcatgt acggcccagg caagggcgaa gatgttatct 2820
actacatcac catctacaac gagccaaccc cacagccagc tgagccagaa ggactggacg 2880
tagaaggcct gcacaagggc atctacctct actcccgcgg tgaaggcacc ggccatgagg 2940
caaacatctt ggcttccggt gttggtatgc agtgggctct caaggctgca tccatccttg 3000
aggctgacta cggagttcgt gcaaacattt actccgctac ttcttgggtt aacttggctc 3060
gcgatggcgc tgctcgtaac aaggcacagc tgcgcaaccc aggtgcagat gctggcgagg 3120
cattcgtaac cacccagctg aagcagacct ctggcccata cgtcgcagtg tctgacttct 3180
ccactgatct gccaaaccag atccgtgaat gggtcccagg cgactacacc gttctcggtg 3240
cagatggctt cggtttctct gatacccgcc cagctgctcg tcgcttcttc aacatcgacg 3300
ctgagtccat tgttgttgca gtgctgaact ccctggcacg cgaaggcaag atcgacgtct 3360
ccgttgctgc tcaggctgct gagaagttca agttggatga tcctacgagt gtttccgtag 3420
atccaaacgc tcctgaggaa taaatcacct caagggacag ataaatcccg ccgccagacg 3480
ttagtctggc ggcgggattc gtcgtaaagc aagctctttt tagccgagga acgccttgtc 3540
agacaatgtt gcgcccttga tgttggcgaa ctcctgcagc aaatcgcgca cagtcaactt 3600
cgacttggta gcctgatctg cctggtagac aatctggcct tcatgcatca tgatcaggcg 3660
attgcccagg cgaattgcct gttccatgtt gtgcgtgacc ataagcgtag tcagagttcc 3720
atctgccacg atcttttcgg tcaaggtggt cacaagctct gcacgctgtg gatcaagtgc 3780
tgcggtgtgc tcatccaaca gcatgatttt aggttgagta aaaccagcca tcagcaggga 3840
caatgcctga cgctgaccgc cagagagcaa accaactttg gcagtgagcc tgttttccag 3900
acccagctcg aggcgctcaa gttcctgctt gaattgctca cggcgcttcg aggtcagtgc 3960
aaagcccaat ccacggcgct tgccgcgcag caacgcgatg gccagattct cttcaatggt 4020
gagattcggc gcggtgccgg ccagaggatc ctggaaaacg cggccgatgt agcgggcacg 4080
cttgtgctct gacatcttgt ttacattgtt 4110
Claims (10)
1. The application of a substance for inhibiting YH66-RS10865 gene expression or reducing YH66-RS10865 protein abundance or reducing YH66-RS10865 protein activity;
the application is as follows (I) or (II):
the application of (I) in improving the yield of the bacterial valine;
(II) use in the production of valine;
the YH66-RS10865 gene is a gene encoding YH66-RS10865 protein;
the YH66-RS10865 protein is (a1) or (a2) or (a 3):
(a1) protein shown in a sequence 3 in a sequence table;
(a2) a protein derived from a bacterium, having 95% or more identity to (a1), and being associated with valine production by the bacterium;
(a3) and (b) a protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues to the protein shown in (a1) and is related to the production of valine by bacteria and derived from (a 1).
2. A recombinant bacterium is obtained by inhibiting YH66-RS10865 gene expression in bacteria; the YH66-RS10865 gene is the YH66-RS10865 gene described in claim 1.
3. Use of the recombinant bacterium of claim 2 for producing valine.
4. A process for producing valine, comprising the steps of: fermenting the recombinant bacterium of claim 2.
5. A method for increasing valine production in a bacterium, comprising the steps of: inhibiting YH66-RS10865 gene expression or reducing YH66-RS10865 protein abundance or reducing YH66-RS10865 protein activity in bacteria;
the YH66-RS10865 gene described in claim 1;
the YH66-RS10865 protein as set forth in claim 1, which is YH66-RS10865 protein.
Use of the YH66-RS10865 protein for regulating valine production in a bacterium; the YH66-RS10865 protein as set forth in claim 1, which is YH66-RS10865 protein.
7. The mutant protein is obtained by mutating the 282 th amino acid residue of YH66-RS10865 protein from N to other amino acid residues; the YH66-RS10865 protein as set forth in claim 1, which is YH66-RS10865 protein.
8. The mutant protein coding gene or claim 7 with the mutant protein coding gene expression cassettes or claim 7 with the mutant protein coding gene recombinant vector or claim 7 with the mutant protein coding gene recombinant bacteria.
9. Use of the mutein of claim 7, the coding gene of claim 8, the expression cassette of claim 8, the recombinant vector of claim 8 or the recombinant bacterium of claim 8 for producing valine.
10. A method for increasing valine production in a bacterium, comprising the steps of: mutating the codon of 282 th amino acid residue of YH66-RS10865 protein encoded in bacterial genomic DNA from the codon encoding N to the codon encoding other amino acid residue; the YH66-RS10865 protein as set forth in claim 1, which is YH66-RS10865 protein.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110980059.1A CN113683667B (en) | 2021-08-25 | 2021-08-25 | Engineering bacterium obtained by modification of YH66-RS10865 gene and application thereof in valine preparation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110980059.1A CN113683667B (en) | 2021-08-25 | 2021-08-25 | Engineering bacterium obtained by modification of YH66-RS10865 gene and application thereof in valine preparation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113683667A true CN113683667A (en) | 2021-11-23 |
CN113683667B CN113683667B (en) | 2024-06-25 |
Family
ID=78582324
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110980059.1A Active CN113683667B (en) | 2021-08-25 | 2021-08-25 | Engineering bacterium obtained by modification of YH66-RS10865 gene and application thereof in valine preparation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113683667B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114292315A (en) * | 2021-12-31 | 2022-04-08 | 宁夏伊品生物科技股份有限公司 | ppc mutant and application thereof in preparation of L-valine |
CN114540399A (en) * | 2022-02-15 | 2022-05-27 | 宁夏伊品生物科技股份有限公司 | Method for preparing L-valine and gene mutant and biological material used by same |
CN114540262A (en) * | 2022-02-15 | 2022-05-27 | 宁夏伊品生物科技股份有限公司 | Method for constructing recombinant microorganism producing L-valine and nucleic acid molecule and biomaterial used therefor |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1767616A2 (en) * | 2005-09-22 | 2007-03-28 | Degussa GmbH | Method for producing L-amino acids or ketoacids employing coryneform bacteria with attenuated aceE (pdhA) expression |
US20090053779A1 (en) * | 2007-01-17 | 2009-02-26 | Korea Advanced Institute Of Science And Technology | Mutant microorganism having improved production ability of branched amino acid and method for preparing branched amino acid using the same |
CN106715687A (en) * | 2015-03-18 | 2017-05-24 | Cj第制糖株式会社 | Pyruvate dehydrogenase mutant, microorganism comprising mutant, and method for producing L-amino acid by using microorganism |
-
2021
- 2021-08-25 CN CN202110980059.1A patent/CN113683667B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1767616A2 (en) * | 2005-09-22 | 2007-03-28 | Degussa GmbH | Method for producing L-amino acids or ketoacids employing coryneform bacteria with attenuated aceE (pdhA) expression |
US20090053779A1 (en) * | 2007-01-17 | 2009-02-26 | Korea Advanced Institute Of Science And Technology | Mutant microorganism having improved production ability of branched amino acid and method for preparing branched amino acid using the same |
CN106715687A (en) * | 2015-03-18 | 2017-05-24 | Cj第制糖株式会社 | Pyruvate dehydrogenase mutant, microorganism comprising mutant, and method for producing L-amino acid by using microorganism |
Non-Patent Citations (2)
Title |
---|
EMBL: "TR:A4QFV6 A4QFV6_CORGB (http://www.uniprot.org/uniprot/A4QFV6_CORGB) Pyruvate dehydrogenase E1 component OS=Corynebacterium glutamicum (strain R) OX=340322 GN=cgR_2120 PE=4 SV=1, Length=922, Identities = 921/922 (99%)", Retrieved from the Internet <URL:http://www.uniprot.org/uniprot/A4QFV6_CORGB> * |
KALINOWSKI等: "The complete Corynebacterium glutamicum ATCC 13032 genome sequence and its impact on the production of L-aspartate-derived amino acids and vitamins.", JOURNAL OF BIOTECHNOLOGY, vol. 104, no. 1, pages 3 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114292315A (en) * | 2021-12-31 | 2022-04-08 | 宁夏伊品生物科技股份有限公司 | ppc mutant and application thereof in preparation of L-valine |
CN114540399A (en) * | 2022-02-15 | 2022-05-27 | 宁夏伊品生物科技股份有限公司 | Method for preparing L-valine and gene mutant and biological material used by same |
CN114540262A (en) * | 2022-02-15 | 2022-05-27 | 宁夏伊品生物科技股份有限公司 | Method for constructing recombinant microorganism producing L-valine and nucleic acid molecule and biomaterial used therefor |
CN114540262B (en) * | 2022-02-15 | 2024-05-03 | 宁夏伊品生物科技股份有限公司 | Method for constructing recombinant microorganism producing L-valine and nucleic acid molecule and biological material used in same |
CN114540399B (en) * | 2022-02-15 | 2024-05-03 | 宁夏伊品生物科技股份有限公司 | Method for preparing L-valine, and gene mutant and biological material used by same |
Also Published As
Publication number | Publication date |
---|---|
CN113683667B (en) | 2024-06-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113683667B (en) | Engineering bacterium obtained by modification of YH66-RS10865 gene and application thereof in valine preparation | |
CN113666991B (en) | Engineering bacterium obtained by YH66-RS07015 gene modification and application thereof in valine preparation | |
CN113683666B (en) | Engineering bacterium obtained by YH66-RS07020 gene modification and application thereof in valine preparation | |
CN113667682A (en) | YH66-RS11190 gene mutant and application thereof in preparation of L-valine | |
EP4273228A1 (en) | Strain having enhanced l-glutamic acid productivity, construction method therefor and application thereof | |
CN113563436B (en) | Engineering bacterium obtained by modification of YH66-RS03880 gene and application thereof in valine preparation | |
CN114181288B (en) | Process for producing L-valine, gene used therefor and protein encoded by the gene | |
CN114409751B (en) | YH 66-04470 gene mutant recombinant bacterium and application thereof in preparation of arginine | |
CN114605509B (en) | YH 66-01475 protein and application of encoding gene thereof in regulating and controlling bacterial arginine yield | |
CN114540399B (en) | Method for preparing L-valine, and gene mutant and biological material used by same | |
CN114317583B (en) | Method for constructing recombinant microorganism producing L-valine and nucleic acid molecule used in method | |
CN112725253B (en) | Recombinant strain for modifying gene BBD29_14900 and construction method and application thereof | |
CN114410615B (en) | YH66_00525 protein and application of encoding gene thereof in regulating and controlling bacterial arginine yield | |
CN114507273B (en) | YH66_07020 protein and application of related biological material thereof in improving arginine yield | |
CN114292315B (en) | Ppc mutant and application thereof in preparation of L-valine | |
CN114751966B (en) | YH66_04585 protein and application of related biological material thereof in improving arginine yield | |
CN115073569B (en) | YH 66-09125 gene mutant recombinant bacterium and application thereof in preparation of arginine | |
CN114410614B (en) | Application of YH66_05415 protein or mutant thereof in preparation of L-arginine | |
CN114560918B (en) | Application of YH 66-14275 protein or mutant thereof in preparation of L-arginine | |
CN114315998B (en) | CEY17_RS00300 gene mutant and application thereof in preparation of L-valine | |
CN114249803B (en) | Engineering bacterium for high-yield arginine and construction method and application thereof | |
CN114277069B (en) | Method for preparing L-valine and biological material used by same | |
CN114907459B (en) | Engineering bacterium for high-yield arginine and construction method and application thereof | |
CN118147035A (en) | Recombinant microorganism and application thereof in preparation of L-lysine | |
CN114315999A (en) | LeuD protein mutant and related biological material and application thereof in preparation of L-valine |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |