CN113832120A - Formaldehyde conversion mutant protein and application thereof
- Publication number
- CN113832120A, CN202111161482.5A
- Authority
- CN
- China
- Prior art keywords
- ala
- formaldehyde
- leu
- amino acid
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Concepts
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 title claims abstract description 197
- 238000006243 chemical reaction Methods 0.000 title claims description 54
- 102000008300 Mutant Proteins Human genes 0.000 title abstract description 20
- 108010021466 Mutant Proteins Proteins 0.000 title abstract description 20
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 claims abstract description 38
- RXKJFZQQPQGTFL-UHFFFAOYSA-N dihydroxyacetone Chemical compound OCC(=O)CO RXKJFZQQPQGTFL-UHFFFAOYSA-N 0.000 claims abstract description 32
- 239000004310 lactic acid Substances 0.000 claims abstract description 19
- 235000014655 lactic acid Nutrition 0.000 claims abstract description 19
- 230000035772 mutation Effects 0.000 claims description 62
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 49
- 241000894006 Bacteria Species 0.000 claims description 13
- 102220230156 rs1064795442 Human genes 0.000 claims description 11
- 102200149451 rs1800361 Human genes 0.000 claims description 10
- 230000009261 transgenic effect Effects 0.000 claims description 10
- 230000014509 gene expression Effects 0.000 claims description 9
- 102220559792 Notchless protein homolog 1_L43Q_mutation Human genes 0.000 claims description 8
- 102220334150 rs1334099693 Human genes 0.000 claims description 8
- 102220121667 rs368580280 Human genes 0.000 claims description 8
- 150000008044 alkali metal hydroxides Chemical class 0.000 claims description 7
- 229910001860 alkaline earth metal hydroxide Inorganic materials 0.000 claims description 7
- 238000004519 manufacturing process Methods 0.000 claims description 7
- 108091033319 polynucleotide Proteins 0.000 claims description 7
- 102000040430 polynucleotide Human genes 0.000 claims description 7
- 239000002157 polynucleotide Substances 0.000 claims description 7
- 102220218735 rs1060501868 Human genes 0.000 claims description 7
- 238000003786 synthesis reaction Methods 0.000 claims description 7
- 230000015572 biosynthetic process Effects 0.000 claims description 5
- 102200108867 rs199473559 Human genes 0.000 claims description 3
- 102200004837 rs587777008 Human genes 0.000 claims description 3
- 102200031193 rs62514957 Human genes 0.000 claims description 3
- 238000009833 condensation Methods 0.000 claims description 2
- 230000005494 condensation Effects 0.000 claims description 2
- 102220091529 rs148370267 Human genes 0.000 claims 1
- 230000009466 transformation Effects 0.000 abstract description 5
- 230000003197 catalytic effect Effects 0.000 abstract description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 31
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 27
- 101100218648 Toxoplasma gondii (strain ATCC 50611 / Me49) BFD1 gene Proteins 0.000 description 27
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 26
- 239000002773 nucleotide Substances 0.000 description 25
- 125000003729 nucleotide group Chemical group 0.000 description 25
- 108090000623 proteins and genes Proteins 0.000 description 25
- 230000000694 effects Effects 0.000 description 24
- 108020004414 DNA Proteins 0.000 description 22
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 22
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 22
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 22
- 125000000539 amino acid group Chemical group 0.000 description 21
- 102000004190 Enzymes Human genes 0.000 description 20
- 108090000790 Enzymes Proteins 0.000 description 20
- 235000018102 proteins Nutrition 0.000 description 19
- 102000004169 proteins and genes Human genes 0.000 description 19
- 235000001014 amino acid Nutrition 0.000 description 17
- 239000000243 solution Substances 0.000 description 17
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 15
- 229940024606 amino acid Drugs 0.000 description 15
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 14
- 235000004279 alanine Nutrition 0.000 description 14
- 150000001413 amino acids Chemical class 0.000 description 14
- 238000000034 method Methods 0.000 description 14
- 239000006228 supernatant Substances 0.000 description 12
- 238000004128 high performance liquid chromatography Methods 0.000 description 10
- 239000008057 potassium phosphate buffer Substances 0.000 description 10
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 9
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 9
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 9
- 239000000126 substance Substances 0.000 description 9
- 238000006467 substitution reaction Methods 0.000 description 9
- 239000004475 Arginine Substances 0.000 description 8
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 8
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 8
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 8
- 239000004473 Threonine Substances 0.000 description 8
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 8
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 8
- 239000003054 catalyst Substances 0.000 description 8
- 239000004471 Glycine Substances 0.000 description 7
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 7
- 238000001514 detection method Methods 0.000 description 7
- 239000004474 valine Substances 0.000 description 7
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 6
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 6
- 239000007853 buffer solution Substances 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 6
- 102220253152 rs1557150036 Human genes 0.000 description 6
- 108010071778 Benzoylformate decarboxylase Proteins 0.000 description 5
- 241000589776 Pseudomonas putida Species 0.000 description 5
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 5
- 239000008098 formaldehyde solution Substances 0.000 description 5
- WSFSSNUMVMOOMR-NJFSPNSNSA-N methanone Chemical compound O=[14CH2] WSFSSNUMVMOOMR-NJFSPNSNSA-N 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 4
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 4
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 4
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 239000002994 raw material Substances 0.000 description 4
- PKAUICCNAWQPAU-UHFFFAOYSA-N 2-(4-chloro-2-methylphenoxy)acetic acid;n-methylmethanamine Chemical compound CNC.CC1=CC(Cl)=CC=C1OCC(O)=O PKAUICCNAWQPAU-UHFFFAOYSA-N 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- AXCZMVOFGPJBDE-UHFFFAOYSA-L calcium dihydroxide Chemical compound [OH-].[OH-].[Ca+2] AXCZMVOFGPJBDE-UHFFFAOYSA-L 0.000 description 3
- 239000000920 calcium hydroxide Substances 0.000 description 3
- 229910001861 calcium hydroxide Inorganic materials 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 235000018417 cysteine Nutrition 0.000 description 3
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 229930182817 methionine Natural products 0.000 description 3
- 238000013048 microbiological method Methods 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 230000002194 synthesizing effect Effects 0.000 description 3
- 229960002363 thiamine pyrophosphate Drugs 0.000 description 3
- 235000008170 thiamine pyrophosphate Nutrition 0.000 description 3
- 239000011678 thiamine pyrophosphate Substances 0.000 description 3
- YXVCLPJQTZXJLH-UHFFFAOYSA-N thiamine(1+) diphosphate chloride Chemical compound [Cl-].CC1=C(CCOP(O)(=O)OP(O)(O)=O)SC=[N+]1CC1=CN=C(C)N=C1N YXVCLPJQTZXJLH-UHFFFAOYSA-N 0.000 description 3
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 2
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- NEBFIUZIGRTIFY-BJDJZHNGSA-N Ala-Met-Ser-Arg Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NEBFIUZIGRTIFY-BJDJZHNGSA-N 0.000 description 2
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- ORXCYAFUCSTQGY-FXQIFTODSA-N Asn-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N ORXCYAFUCSTQGY-FXQIFTODSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 2
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 2
- ICDDSTLEMLGSTB-GUBZILKMSA-N Asn-Met-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ICDDSTLEMLGSTB-GUBZILKMSA-N 0.000 description 2
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- KGAJCJXBEWLQDZ-UBHSHLNASA-N Asp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N KGAJCJXBEWLQDZ-UBHSHLNASA-N 0.000 description 2
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 2
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 2
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 2
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 2
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical class [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 2
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 2
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 2
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 2
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- RVNOXPZHMUWCLW-GMOBBJLQSA-N Ile-Met-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVNOXPZHMUWCLW-GMOBBJLQSA-N 0.000 description 2
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 2
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 2
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 2
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 2
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 2
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 2
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 2
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 2
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 2
- ZBAGOWGNNAXMOY-IHRRRGAJSA-N Pro-Cys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZBAGOWGNNAXMOY-IHRRRGAJSA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 2
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 2
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 2
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 2
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 2
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 2
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 2
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 2
- YCQXZDHDSUHUSG-FJHTZYQYSA-N Trp-Thr-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 YCQXZDHDSUHUSG-FJHTZYQYSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 2
- 108010064997 VPY tripeptide Proteins 0.000 description 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 2
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 2
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 2
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 2
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 2
- KJFBXCFOPAKPTM-BZSNNMDCSA-N Val-Trp-Val Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 KJFBXCFOPAKPTM-BZSNNMDCSA-N 0.000 description 2
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010027371 asparaginyl-leucyl-prolyl-arginine Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 239000002585 base Substances 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 108010024302 benzaldehyde lyase Proteins 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000002537 cosmetic Substances 0.000 description 2
- 238000011033 desalting Methods 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000003912 environmental pollution Methods 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 239000012467 final product Substances 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 239000000446 fuel Substances 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 239000002054 inoculum Substances 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L magnesium sulphate Substances [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 108010034507 methionyltryptophan Proteins 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 238000002708 random mutagenesis Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000012916 structural analysis Methods 0.000 description 2
- KDYFGRWQOYBRFD-UHFFFAOYSA-N succinic acid Chemical compound OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 2
- 238000009210 therapy by ultrasound Methods 0.000 description 2
- 108010077037 tyrosyl-tyrosyl-phenylalanine Proteins 0.000 description 2
- 238000000108 ultra-filtration Methods 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- NTXNUXPCNRDMAF-WFBYXXMGSA-N Asn-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC(N)=O)C)C(O)=O)=CNC2=C1 NTXNUXPCNRDMAF-WFBYXXMGSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 108090000489 Carboxy-Lyases Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000672609 Escherichia coli BL21 Species 0.000 description 1
- 108010015133 Galactose oxidase Proteins 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- 208000007976 Ketosis Diseases 0.000 description 1
- LKDRXBCSQODPBY-AMVSKUEXSA-N L-(-)-Sorbose Chemical compound OCC1(O)OC[C@H](O)[C@@H](O)[C@@H]1O LKDRXBCSQODPBY-AMVSKUEXSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000589540 Pseudomonas fluorescens Species 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- OLWFDNLLBWQWCP-STQMWFEESA-N Tyr-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OLWFDNLLBWQWCP-STQMWFEESA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- OHDRQQURAXLVGJ-HLVWOLMTSA-N azane;(2e)-3-ethyl-2-[(e)-(3-ethyl-6-sulfo-1,3-benzothiazol-2-ylidene)hydrazinylidene]-1,3-benzothiazole-6-sulfonic acid Chemical compound [NH4+].[NH4+].S/1C2=CC(S([O-])(=O)=O)=CC=C2N(CC)C\1=N/N=C1/SC2=CC(S([O-])(=O)=O)=CC=C2N1CC OHDRQQURAXLVGJ-HLVWOLMTSA-N 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 238000010364 biochemical engineering Methods 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 150000001722 carbon compounds Chemical class 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000014155 detection of activity Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- UQPHVQVXLPRNCX-UHFFFAOYSA-N erythrulose Chemical compound OCC(O)C(=O)CO UQPHVQVXLPRNCX-UHFFFAOYSA-N 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000002803 fossil fuel Substances 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000013067 intermediate product Substances 0.000 description 1
- -1 isopropyl- Chemical group 0.000 description 1
- OOYGSFOGFJDDHP-KMCOLRRFSA-N kanamycin A sulfate Chemical compound OS(O)(=O)=O.O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N OOYGSFOGFJDDHP-KMCOLRRFSA-N 0.000 description 1
- 229960002064 kanamycin sulfate Drugs 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 230000002906 microbiologic effect Effects 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 239000003208 petroleum Substances 0.000 description 1
- 229920000728 polyester Polymers 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 239000001384 succinic acid Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 229920002994 synthetic fiber Polymers 0.000 description 1
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J23/00—Catalysts comprising metals or metal oxides or hydroxides, not provided for in group B01J21/00
- B01J23/02—Catalysts comprising metals or metal oxides or hydroxides, not provided for in group B01J21/00 of the alkali- or alkaline earth metals or beryllium
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07C—ACYCLIC OR CARBOCYCLIC COMPOUNDS
- C07C51/00—Preparation of carboxylic acids or their salts, halides or anhydrides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/24—Preparation of oxygen-containing organic compounds containing a carbonyl group
- C12P7/26—Ketones
- C12P7/28—Acetone-containing products
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/11—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors (1.14.11)
- C12Y114/11023—Flavonol synthase (1.14.11.23)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biotechnology (AREA)
- General Chemical & Material Sciences (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Materials Engineering (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Organic Low-Molecular-Weight Compounds And Preparation Thereof (AREA)
Abstract
The invention discloses a formaldehyde conversion mutant protein and its application. The mutant protein catalyzes the conversion of formaldehyde to 1,3-dihydroxyacetone, from which lactic acid can be further synthesized. The mutant obtained by the invention greatly improves the catalytic efficiency of converting formaldehyde to 1,3-dihydroxyacetone, and the reaction can be carried out under mild conditions, so the mutant has good application prospects.
Description
The present application is a divisional application of Chinese patent application 202010983978.X, entitled "Formaldehyde conversion mutant protein and application thereof", filed on 9/17/2020.
Technical Field
The invention belongs to the technical field of biochemical engineering, and particularly relates to a formaldehyde conversion mutant protein and application thereof.
Background
1,3-Dihydroxyacetone is the simplest three-carbon ketose found in nature and has a wide range of uses. It serves as a raw material for cosmetics, where it has a good protective effect on the skin, and as a base material for synthesizing polyester compounds. In addition, 1,3-dihydroxyacetone is an important metabolic intermediate that can be further converted into high value-added chemicals and fuel molecules such as ethanol, butanol, lactic acid and succinic acid, and higher-value sugars such as erythrulose and sorbose can be synthesized from it by microbial fermentation. 1,3-Dihydroxyacetone is therefore an important chemical synthesis intermediate, widely used in cosmetics manufacture, food development, medicine, chemical synthesis and other industries.
At present, 1,3-dihydroxyacetone is produced mainly by two methods, a microbiological method and a chemical method. The microbiological method mainly uses microbial fermentation to convert glycerol into 1,3-dihydroxyacetone. However, its production capacity is low, the culture conditions of the microorganisms must be strictly controlled, the requirements on the production environment are strict, and the later purification of 1,3-dihydroxyacetone is complex, costly and polluting. The chemical method mainly uses a metal catalyst to oxidize glycerol to 1,3-dihydroxyacetone, but the metal catalyst causes environmental pollution, which is not conducive to environmental protection.
In recent years, one-carbon compounds have attracted attention as green energy substances with promising application prospects; they can be used to synthesize basic organic chemical raw materials, fuels and other high value-added chemicals. Because they are inexpensive and readily available, one-carbon compounds have become the compounds with the greatest development prospects for replacing petroleum in the preparation of high-value compounds, and they have important scientific significance and development value in the fields of medicine, food and the chemical industry. The utilization and conversion of one-carbon compounds can also significantly reduce the negative impact on the ecological environment caused by the use of fossil fuels and synthetic materials. Formaldehyde can be obtained from other one-carbon compounds and further converted into biologically usable intermediates; it is also widely available and inexpensive, so synthesizing high-value compounds with formaldehyde as a precursor has important application prospects.
Disclosure of Invention
In order to solve the above problems, a first object of the present invention is to provide a formaldehyde conversion mutant protein. The protein is selected by subjecting the gene of BFD1, a mutant of benzoylformate decarboxylase derived from Pseudomonas putida, to random mutagenesis by error-prone PCR. It has activity in catalyzing the synthesis of 1,3-dihydroxyacetone from formaldehyde, and this activity is thermally stable.
A second object of the present invention is to provide applications of the formaldehyde conversion mutant protein.
The invention achieves these objects through the following technical solutions:
First, the present invention provides a formaldehyde conversion mutant protein capable of catalyzing the synthesis of 1,3-dihydroxyacetone from formaldehyde. Its amino acid sequence is one in which the amino acid residue at a position corresponding to at least one of positions S26, L43, F66, R86, T87, G109, A204, H281, A322, F397, M460, W463, V467, V473 and S525 of SEQ ID NO: 1 is mutated; or it is an amino acid sequence that retains the mutation site(s) of the mutated amino acid sequence and has 80% or more homology with the mutated amino acid sequence, preferably 90% or more, 95% or more, or 98% or more.
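The homology thresholds above (80%, 90%, 95%, 98%) can be illustrated with a minimal sketch. It assumes that percent identity over an existing pairwise alignment is an acceptable stand-in for "homology"; the text does not say which alignment method or scoring scheme is intended. The function name `percent_identity` and the two toy sequences are hypothetical and are not taken from the sequence listing.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two sequences that are ALREADY aligned.

    Assumption: both strings have the same length and use `gap` for
    alignment gaps. Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # column carries no information
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# Toy 10-residue sequences differing at one position (H -> Y at position 5).
reference = "MASVHGTTYE"
candidate = "MASVYGTTYE"
identity = percent_identity(reference, candidate)
meets_80_percent = identity >= 80.0
```

In practice the alignment itself would come from a dedicated tool; this sketch only shows how a threshold such as 80% would be checked once the alignment exists.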
In one embodiment, the amino acid sequence of the above formaldehyde conversion mutant protein comprises at least one of the following mutations, at positions corresponding to SEQ ID NO: 1: serine S at position 26 mutated to phenylalanine F; leucine L at position 43 mutated to glutamine Q; phenylalanine F at position 66 mutated to leucine L; arginine R at position 86 mutated to cysteine C; threonine T at position 87 mutated to alanine A; glycine G at position 109 mutated to serine S; alanine A at position 204 mutated to valine V; histidine H at position 281 mutated to tyrosine Y; alanine A at position 322 mutated to threonine T; phenylalanine F at position 397 mutated to either leucine L or serine S; methionine M at position 460 mutated to threonine T; tryptophan W at position 463 mutated to arginine R; valine V at position 467 mutated to alanine A; valine V at position 473 mutated to alanine A; and serine S at position 525 mutated to alanine A. Specifically, the formaldehyde conversion mutant protein comprises a substitution, relative to SEQ ID NO: 1, at any one of the following sites or at a combination of two or more different sites: S26F, L43Q, F66L, R86C, T87A, G109S, A204V, H281Y, A322T, F397L/F397S, M460T, W463R, V467A, V473A, S525A.
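The substitution codes listed above follow the pattern "original residue, position, new residue" (for example, H281Y means histidine at position 281 replaced by tyrosine). The following minimal sketch shows how such codes might be parsed and applied to a sequence. The function names are hypothetical, and the toy sequence is not SEQ ID NO: 1; it only places a serine at position 26 so that S26F can be demonstrated.

```python
import re

# Substitution codes exactly as listed in the text (positions are 1-based).
LISTED_SUBSTITUTIONS = [
    "S26F", "L43Q", "F66L", "R86C", "T87A", "G109S", "A204V", "H281Y",
    "A322T", "F397L", "F397S", "M460T", "W463R", "V467A", "V473A", "S525A",
]

_CODE = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitution(code):
    """Split a code such as 'H281Y' into (original, position, replacement)."""
    match = _CODE.match(code)
    if match is None:
        raise ValueError(f"not a substitution code: {code!r}")
    original, position, replacement = match.groups()
    return original, int(position), replacement


def apply_substitutions(sequence, codes):
    """Return a copy of `sequence` with every substitution in `codes` applied.

    Raises ValueError if a position is out of range, is targeted twice
    (e.g. F397L together with F397S), or does not hold the expected residue.
    """
    residues = list(sequence)
    seen_positions = set()
    for code in codes:
        original, position, replacement = parse_substitution(code)
        if position in seen_positions:
            raise ValueError(f"position {position} is substituted twice")
        seen_positions.add(position)
        if position < 1 or position > len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        found = residues[position - 1]
        if found != original:
            raise ValueError(f"{code}: expected {original}, found {found}")
        residues[position - 1] = replacement
    return "".join(residues)


# Toy 30-residue sequence (NOT SEQ ID NO: 1) with serine at position 26.
toy_sequence = "A" * 25 + "S" + "A" * 4
toy_variant = apply_substitutions(toy_sequence, ["S26F"])
```

Checking the original residue before substituting guards against applying a code to the wrong reference sequence or with an off-by-one position.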
In one embodiment, the amino acid sequence of the above formaldehyde conversion mutant protein comprises at least a mutation of histidine H at the position corresponding to position 281 of SEQ ID NO: 1, for example to tyrosine Y. Further, the formaldehyde conversion mutant protein may also comprise a mutation at one or more other sites, for example any one or a combination of S26F, L43Q, F66L, T87A, G109S, A204V, A322T, F397L/F397S, M460T, W463R, V467A, V473A, S525A.
In one embodiment, the amino acid sequence of the above formaldehyde conversion mutant protein comprises at least a mutation of serine S at the position corresponding to position 26 of SEQ ID NO: 1, for example to phenylalanine F. Further, the formaldehyde conversion mutant protein may also comprise a mutation at one or more other sites, for example any one or a combination of L43Q, F66L, T87A, G109S, A204V, H281Y, A322T, F397L/F397S, M460T, W463R, V467A, V473A, S525A.
In one embodiment, the amino acid sequence of the above formaldehyde conversion mutant protein comprises at least a mutation of phenylalanine F at the position corresponding to position 397 of SEQ ID NO: 1, for example to either leucine L or serine S. Further, the formaldehyde conversion mutant protein may also comprise a mutation at one or more other sites, for example any one or a combination of S26F, L43Q, F66L, T87A, G109S, A204V, H281Y, A322T, M460T, W463R, V467A, V473A, S525A.
In one embodiment, the amino acid sequence of the above formaldehyde conversion mutant protein comprises at least a mutation of tryptophan W at the position corresponding to position 463 of SEQ ID NO: 1, for example to arginine R. Further, the formaldehyde conversion mutant protein may also comprise a mutation at one or more other sites, for example any one or a combination of S26F, L43Q, F66L, T87A, G109S, A204V, H281Y, A322T, F397L/F397S, M460T, V467A, V473A, S525A.
In one embodiment, in the amino acid sequence of the above formaldehyde conversion mutant protein, histidine H at the position corresponding to position 281 of SEQ ID NO: 1 is mutated to tyrosine Y and serine S at the position corresponding to position 26 is mutated to phenylalanine F. Further, the formaldehyde conversion mutant protein may also comprise a mutation at one or more other sites, for example any one or a combination of W463R, L43Q, G109S, F397S/F397L, M460T, S525A, V194I, V467A, F66L, A204V, A322T. In one embodiment, the amino acid sequence of the above formaldehyde conversion mutant protein comprises the mutations S26F, H281Y and F397S relative to SEQ ID NO: 1; in yet another embodiment, it comprises the mutations S26F, H281Y and W463R relative to SEQ ID NO: 1.
As an illustrative embodiment of the present invention, the amino acid sequence of the above-mentioned formaldehyde-converting mutein is specifically any one of 1) to 18):
1) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 3);
2) in SEQ ID NO: 1, the arginine at position 86 is mutated to cysteine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 4);
3) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine and the serine at position 26 is mutated to phenylalanine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 5);
4) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine and the phenylalanine at position 397 is mutated to leucine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 6);
5) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine and the valine at position 473 is mutated to alanine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 7);
6) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, and the leucine at position 43 is mutated to glutamine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 8);
7) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, and the phenylalanine at position 66 is mutated to leucine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 9);
8) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, and the glycine at position 109 is mutated to serine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 10);
9) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, and the alanine at position 204 is mutated to valine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 11);
10) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, and the phenylalanine at position 397 is mutated to serine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 12);
11) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, and the tryptophan at position 463 is mutated to arginine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 13);
12) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, and the valine at position 467 is mutated to alanine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 14);
13) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the threonine at position 87 is mutated to alanine, and the alanine at position 322 is mutated to threonine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 15);
14) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, the tryptophan at position 463 is mutated to arginine, and the glycine at position 109 is mutated to serine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 16);
15) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, the phenylalanine at position 397 is mutated to serine, and the glycine at position 109 is mutated to serine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 17);
16) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, the methionine at position 460 is mutated to threonine, and the serine at position 525 is mutated to alanine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 18);
17) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, the phenylalanine at position 397 is mutated to serine, the glycine at position 109 is mutated to serine, and the alanine at position 322 is mutated to threonine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 19);
18) in SEQ ID NO: 1, the histidine at position 281 is mutated to tyrosine, the serine at position 26 is mutated to phenylalanine, the tryptophan at position 463 is mutated to arginine, the glycine at position 109 is mutated to serine, and the phenylalanine at position 397 is mutated to serine, with the other amino acid residues unchanged (the nucleotide sequence is SEQ ID NO: 20).
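Each amino acid position in the list above pairs with a codon in the corresponding nucleotide sequence: residue a occupies nucleotides 3a-2 to 3a. The following is a minimal sketch of that arithmetic, applied to the sequence-listing rows of SEQ ID NO: 2 and SEQ ID NO: 3 that cover nucleotides 841 to 900. The helper names (`codon_span`, `codon_in_row`) and the two-entry codon table are illustrative assumptions, not part of the patent.

```python
def codon_span(residue):
    """Return the 1-based (start, end) nucleotide positions of the codon
    that encodes the given 1-based amino acid position."""
    start = 3 * (residue - 1) + 1
    return start, start + 2


def codon_in_row(row, row_start, residue):
    """Extract the codon for `residue` from a sequence-listing row whose
    first nucleotide has the 1-based position `row_start`."""
    start, _ = codon_span(residue)
    offset = start - row_start
    if offset < 0 or offset + 3 > len(row):
        raise ValueError("codon is not contained in this row")
    return row[offset:offset + 3]


# Rows covering nucleotides 841-900, copied from the sequence listing
# (spaces removed). Only the first codon differs between the two rows.
ROW_START = 841
ROW_SEQ2 = "caccagtacgacccgggtcagtacctgaaaccgggtacccgtctgatctctgttacctgc"
ROW_SEQ3 = "taccagtacgacccgggtcagtacctgaaaccgggtacccgtctgatctctgttacctgc"

# Only the two codons needed here, from the standard genetic code.
CODON_TO_RESIDUE = {"cac": "His", "tac": "Tyr"}

if __name__ == "__main__":
    print(codon_span(281))
    for name, row in (("SEQ ID NO: 2", ROW_SEQ2), ("SEQ ID NO: 3", ROW_SEQ3)):
        codon = codon_in_row(row, ROW_START, 281)
        print(name, codon, CODON_TO_RESIDUE[codon])
```

Under these assumptions, the codon for position 281 reads as histidine in SEQ ID NO: 2 and as tyrosine in SEQ ID NO: 3, which is consistent with item 1) above.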
The present invention also provides a polynucleotide encoding the above formaldehyde-converting mutein.
The invention also provides a recombinant vector, an expression cassette, a transgenic cell line and/or a recombinant bacterium containing the formaldehyde-converting mutein and/or the polynucleotide.
The invention also provides the application of the formaldehyde conversion mutant protein as a catalyst, preferably a catalyst for catalyzing and preparing 1,3-dihydroxyacetone and/or lactic acid, for example a catalyst for catalyzing and preparing 1,3-dihydroxyacetone and/or lactic acid by formaldehyde.
The invention also provides the use of the recombinant vector, expression cassette, transgenic cell line and/or recombinant bacterium containing the polynucleotide as a catalyst, preferably a catalyst for catalyzing the preparation of 1,3-dihydroxyacetone and/or lactic acid, such as a catalyst for catalyzing the preparation of 1,3-dihydroxyacetone and/or lactic acid from formaldehyde.
When the formaldehyde-converting mutein of the present invention is used to catalyze the preparation of a target compound from formaldehyde, the method comprises: contacting the formaldehyde-converting mutein with formaldehyde so that the mutein catalyzes the formation of 1,3-dihydroxyacetone from formaldehyde; further, an alkali metal hydroxide and/or an alkaline earth metal hydroxide may optionally be added to catalyze the production of lactic acid from the 1,3-dihydroxyacetone.
When the recombinant vector, transgenic cell line or recombinant strain comprising the formaldehyde-converting mutein and/or the polynucleotide is used to catalyze the preparation of a target compound from formaldehyde, the method comprises: contacting the recombinant vector, transgenic cell line, or recombinant strain comprising the formaldehyde-converting mutein and/or the polynucleotide with formaldehyde to catalyze the production of 1,3-dihydroxyacetone; further, an alkali metal hydroxide and/or an alkaline earth metal hydroxide may optionally be added to catalyze the production of lactic acid from the 1,3-dihydroxyacetone.
Advantageous effects
The invention obtains the formaldehyde conversion mutant protein by carrying out molecular modification on the formaldehyde conversion protein, greatly improves the efficiency of generating 1,3-dihydroxyacetone by formaldehyde condensation, and provides raw materials for the synthesis of lactic acid.
Drawings
FIG. 1: the product, 1,3-dihydroxyacetone, was detected by HPLC in example 7.
FIG. 2: the final product lactic acid produced by alkali metal hydroxide catalysis was examined by HPLC in example 8.
FIG. 3: the final product lactic acid produced by alkaline earth metal hydroxide catalysis was tested by HPLC in example 8.
FIG. 4: formaldehyde conversion mutant protein structure diagram.
Terms and definitions
In the context of the present invention, "formaldehyde-converting protein" means a protein capable of catalyzing the synthesis of 1,3-dihydroxyacetone from formaldehyde. As long as the protein has the function of catalyzing the conversion of formaldehyde into 1,3-dihydroxyacetone, there is no particular limitation on its amino acid sequence or source. By way of example and not limitation, it may be benzoylformate decarboxylase (BFD) derived from Pseudomonas putida or benzaldehyde lyase (BAL) derived from Pseudomonas fluorescens biovar I. "Formaldehyde-converting mutein" refers to a protein having the above function that is obtained by amino acid mutation of a formaldehyde-converting protein.
Amino acids in the present invention are represented by a single-letter or three-letter code with the following meanings: A: Ala (alanine); R: Arg (arginine); N: Asn (asparagine); D: Asp (aspartic acid); C: Cys (cysteine); Q: Gln (glutamine); E: Glu (glutamic acid); G: Gly (glycine); H: His (histidine); L: Leu (leucine); K: Lys (lysine); M: Met (methionine); F: Phe (phenylalanine); S: Ser (serine); T: Thr (threonine); W: Trp (tryptophan); Y: Tyr (tyrosine); V: Val (valine).
In the present invention, "homology" has the conventional meaning in the art and refers to "identity" between two nucleic acid or amino acid sequences, the percentage of which represents the statistically significant percentage of identical nucleotides or amino acid residues between the two sequences to be compared, obtained after optimal alignment (best alignment), the differences between the two sequences being randomly distributed over their entire length.
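As an illustration of the identity percentage described above, the following minimal sketch counts identical residues between two sequences that have already been aligned, with "-" marking gaps. The function name `percent_identity` and the choice of denominator (all alignment columns except those where both sequences have a gap) are assumptions made for this example; the patent does not specify an alignment tool or a denominator convention, and other conventions give slightly different values.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of alignment columns holding the same residue in both
    sequences. Inputs must already be aligned and of equal length, with
    '-' marking gaps. Columns that are gaps in both sequences are skipped."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" and y == "-":
            continue  # column carries no information
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


if __name__ == "__main__":
    # First ten residues of SEQ ID NO: 1 against a hypothetical variant
    # with one substitution.
    print(percent_identity("MASVHGTTYE", "MASVHGTSYE"))
    # A hypothetical alignment with a one-residue gap.
    print(round(percent_identity("MAS-VH", "MASTVH"), 2))
```

In practice the alignment itself would come from a dedicated alignment program; this sketch only covers the counting step that follows it.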
In the present invention, the terms "mutant", "variant", and "mutein" are used interchangeably, as are "modification" and "mutation". These expressions mean that the amino acid sequence, relative to that of the unmodified or starting protein (for example, the formaldehyde-converting mutein of SEQ ID NO: 1 used as the starting sequence), or the nucleotide sequence derived from such a protein, comprises alterations, i.e. substitutions, insertions and/or deletions, at one or more positions while the protein still retains its activity. Muteins can be obtained by various techniques known in the art. In particular, exemplary techniques for modifying a DNA sequence encoding a wild-type protein include, but are not limited to, site-directed mutagenesis, random mutagenesis, and construction of synthetic oligonucleotides.
The term "substitution" with respect to an amino acid position or residue means that the amino acid at the specified position has been replaced with another amino acid. Substitutions may be conservative or non-conservative.
Mutations are described by the residue at which they occur; the position of that residue is determined by alignment with the amino acid sequence of SEQ ID NO: 1, used as the starting protein, or by reference to SEQ ID NO: 1. The present invention also relates to any variant carrying these same mutations at functionally equivalent residues.
The term "corresponding to" as used herein has the meaning commonly understood by a person of ordinary skill in the art. Specifically, "corresponding to" means the position in one sequence that matches a specified position in another sequence after the two sequences have been aligned by homology or sequence identity. In the present invention, "corresponding to SEQ ID NO: 1" indicates that the position of the mutation site is determined by comparison with SEQ ID NO: 1. Thus, for example, in the case of "the amino acid residue corresponding to position 40 of the amino acid sequence shown in SEQ ID NO: 1", if a 6×His tag is added to one end of the amino acid sequence shown in SEQ ID NO: 1, the position in the resulting mutant that corresponds to position 40 of SEQ ID NO: 1 may be position 46 of the mutant.
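The 6×His example above can be sketched as a walk along a pairwise alignment. The following is a minimal illustration, assuming the alignment is already available as two equal-length strings with "-" for gaps. The function name `map_position`, the bare six-histidine N-terminal tag (real constructs usually also carry a start methionine and linker residues), and the use of only the first 48 residues of SEQ ID NO: 1 are all simplifications made for this example.

```python
def map_position(aligned_ref, aligned_var, ref_pos):
    """Map a 1-based position in the reference sequence to the 1-based
    position of the aligned residue in the variant. Returns None when the
    reference residue is aligned to a gap in the variant."""
    if len(aligned_ref) != len(aligned_var):
        raise ValueError("aligned sequences must have the same length")
    ref_index = 0
    var_index = 0
    for r, v in zip(aligned_ref, aligned_var):
        if r != "-":
            ref_index += 1
        if v != "-":
            var_index += 1
        if r != "-" and ref_index == ref_pos:
            return var_index if v != "-" else None
    raise ValueError("reference position lies outside the alignment")


# First 48 residues of SEQ ID NO: 1 in single-letter code.
REF_48 = "MASVHGTTYELLRRQGIDTVFGNPGSNELPFLKDFPEDFRYILALQEA"
# Hypothetical variant: six histidines placed in front of the same residues.
TAGGED = "HHHHHH" + REF_48
# The tag has no counterpart in the reference, so the reference row is
# padded with six gap characters.
ALIGNED_REF = "------" + REF_48

if __name__ == "__main__":
    pos = map_position(ALIGNED_REF, TAGGED, 40)
    print(pos, REF_48[40 - 1], TAGGED[pos - 1])
```

With the tag at the N-terminus every reference position shifts by six, so position 40 of the reference lands on position 46 of the tagged variant, and both hold the same residue.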
Those skilled in the art will appreciate that the expression "corresponding to SEQ ID NO: 1" merely indicates that SEQ ID NO: 1 is used as the reference for determining the position of a mutation site; it does not mean that the mutein of the invention can only be obtained by modifying the amino acid sequence shown in SEQ ID NO: 1. In one embodiment, a person skilled in the art can obtain the formaldehyde-converting mutein of the invention starting from the amino acid sequence of any formaldehyde-converting enzyme known in the art, as long as the resulting mutein is mutated, relative to SEQ ID NO: 1, at the sites referred to in the present invention and retains the catalytic activity of the formaldehyde-converting enzyme; that is, within the scope of the invention, the sites other than the mutated sites need not be identical to SEQ ID NO: 1. In one embodiment, the mutation or substitution of the invention is carried out using the protein shown in SEQ ID NO: 1 as the starting sequence to obtain the formaldehyde-converting mutein of the invention. In one embodiment, the formaldehyde-converting mutein of the invention is obtained by carrying out the mutation or substitution of the invention starting from a sequence homologous to SEQ ID NO: 1; this homologous sequence is identical to SEQ ID NO: 1 at the sites S26, L43, F66, R86, T87, G109, A204, H281, A322, F397, M460, W463, V467, V473, and S525, so that the mutation or substitution of the invention can occur, while the other sites may be identical or different. In one embodiment, the starting sequence has more than 80% homology to SEQ ID NO: 1, such as more than 85%, more than 90%, or more than 95%, so that the specific amino acid positions corresponding to SEQ ID NO: 1 can be identified.
In yet another embodiment, the corresponding formaldehyde-converting mutein is obtained by using the wild-type formaldehyde converting enzyme (amino acid sequence SEQ ID NO:23, nucleotide sequence SEQ ID NO:24) as the starting sequence and performing the mutation or substitution of the invention still at the corresponding site corresponding to SEQ ID NO: 1.
The "starting protein" or "starting sequence" in the present invention refers to the formaldehyde-converting protein, or its amino acid sequence, before the modification or mutation of the present invention is performed. The form "XaY" is used herein to denote a mutation or substitution of an amino acid, wherein a denotes the position of the amino acid in SEQ ID NO: 1, X denotes the amino acid present at position a in SEQ ID NO: 1 before mutation, and Y denotes the amino acid present at position a after mutation. For example, "H281Y" indicates that, in alignment with SEQ ID NO: 1, the histidine (H) corresponding to position 281 of SEQ ID NO: 1 is substituted with tyrosine (Y).
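The "XaY" notation can be read mechanically. The sketch below parses a label such as "S26F" into its three parts and applies it to a sequence, first checking that the starting sequence really carries residue X at position a. The function names (`parse_mutation`, `apply_mutations`) are hypothetical, and the sequence used is only the first 48 residues of SEQ ID NO: 1, so only mutations at positions 1 to 48 can be demonstrated here.

```python
import re

# One uppercase letter, a position, one uppercase letter, e.g. "H281Y".
MUTATION_RE = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutation(label):
    """Split an 'XaY' label into (original residue, 1-based position, new residue)."""
    match = MUTATION_RE.match(label)
    if match is None:
        raise ValueError(f"not an XaY mutation label: {label!r}")
    original, position, replacement = match.groups()
    return original, int(position), replacement


def apply_mutations(sequence, labels):
    """Return a copy of `sequence` with every 'XaY' substitution applied.
    Raises ValueError if a position is out of range or if the residue at
    that position does not match X."""
    residues = list(sequence)
    for label in labels:
        original, position, replacement = parse_mutation(label)
        if not 1 <= position <= len(residues):
            raise ValueError(f"{label}: position outside the sequence")
        if residues[position - 1] != original:
            raise ValueError(
                f"{label}: expected {original} at position {position}, "
                f"found {residues[position - 1]}"
            )
        residues[position - 1] = replacement
    return "".join(residues)


# First 48 residues of SEQ ID NO: 1 in single-letter code.
REF_48 = "MASVHGTTYELLRRQGIDTVFGNPGSNELPFLKDFPEDFRYILALQEA"

if __name__ == "__main__":
    print(parse_mutation("H281Y"))
    print(apply_mutations(REF_48, ["S26F", "L43Q"]))
```

Checking X against the starting sequence catches off-by-one errors and numbering that was taken from a tagged or truncated construct rather than from SEQ ID NO: 1.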
Detailed Description
The technical solution of the present invention will be further described in detail with reference to specific embodiments. It is to be understood that the following examples are only illustrative and explanatory of the present invention and should not be construed as limiting the scope of the present invention. All the technologies realized based on the above-mentioned contents of the present invention are covered in the protection scope of the present invention.
Unless otherwise indicated, the raw materials and reagents used in the following examples are all commercially available products or can be prepared by known methods. The experimental procedures used in the following examples are all conventional procedures unless otherwise specified.
Example 1 construction of Formaldehyde-converting mutein (Formaldehyde-converting mutant)
The starting formaldehyde-converting protein is BFD1 (hereinafter referred to as "original BFD1"), a mutant derived from benzoylformate decarboxylase (BFD) of Pseudomonas putida, with the nucleotide sequence SEQ ID NO: 2. Nde I and Xho I restriction sites were added to the two ends of the nucleotide sequence, and the sequence was ligated into the multiple cloning site region of plasmid pET28a using T4 DNA ligase to construct the recombinant plasmid pET28a-BFD1. Then, using pET28a-BFD1 as the template and the designed primers (forward primer: 5'-CCGCGCGGCAGCCATATG-3' (SEQ ID NO: 21); reverse primer: 5'-GGTGGTGGTGGTGGTGCTCGAGTTATT-3' (SEQ ID NO: 22)), error-prone PCR was performed to randomly mutagenize the gene of interest. After PCR, the target gene was recovered by gel extraction, the mutated target gene was ligated into vector pET28a, the ligation product was transformed into Escherichia coli BL21 Gold (DE3), and the cells were cultured in an incubator at 37 °C until single colonies appeared.
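As a quick consistency check on the primers and the Nde I / Xho I cloning strategy described above, the sketch below searches each primer for the two recognition sites (Nde I: CATATG; Xho I: CTCGAG). The helper names are illustrative assumptions; the primer sequences are those given as SEQ ID NO: 21 and SEQ ID NO: 22.

```python
# Recognition sequences of the two restriction enzymes used for cloning.
SITES = {"NdeI": "CATATG", "XhoI": "CTCGAG"}

PRIMERS = {
    "forward (SEQ ID NO: 21)": "CCGCGCGGCAGCCATATG",
    "reverse (SEQ ID NO: 22)": "GGTGGTGGTGGTGGTGCTCGAGTTATT",
}

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna):
    """Return the reverse complement of an uppercase DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(dna))


def find_sites(primer):
    """Return {enzyme name: 0-based index} for each recognition site
    that occurs in the primer."""
    return {
        name: primer.find(site)
        for name, site in SITES.items()
        if site in primer
    }


if __name__ == "__main__":
    for label, primer in PRIMERS.items():
        print(label, find_sites(primer))
    # Both sites are palindromic, so they read the same on either strand.
    for name, site in SITES.items():
        print(name, site == reverse_complement(site))
```

Because both recognition sequences are palindromic, finding the site in the primer as written is enough; there is no need to search the reverse complement separately.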
Example 2 screening of Formaldehyde-converting muteins
For the mutant clones obtained in Example 1, single colonies were picked into 96-well plates of LB medium containing kanamycin sulfate and cultured overnight in a shaker. The mutants were then replicated into 96-well plates of LB medium containing isopropyl-β-D-thiogalactoside (IPTG) and cultured for a period of time. After culture, the cells were collected by centrifugation, 200 μL of potassium phosphate buffer (50 mM K2HPO4 and KH2PO4, 5 mM MgSO4, pH 7.4) was added, and the cells were collected again.
The cells were resuspended in 50 μL of potassium phosphate buffer, and an equal volume of formaldehyde solution (containing 50 mM, 30 mM, 20 mM, or 200 mM formaldehyde and 1 mM thiamine pyrophosphate (TPP)) was added; the reaction was carried out at 30 °C for 3 hours. After the reaction, the plate was centrifuged and 90 μL of the supernatant was transferred to a 96-well plate; 60 μL of tool enzyme buffer 1 (0.3 mg/mL galactose oxidase, 36 U/mL horseradish peroxidase) and then 50 μL of tool enzyme buffer 2 (3.2 mM 2,2'-azino-bis(3-ethylbenzothiazoline-6-sulfonic acid) (ABTS)) were added, and the absorbance at 410 nm was monitored for 20 min.
A strain with higher catalytic efficiency than the original BFD1 was regarded as a beneficial mutant, and the corresponding mutation sites and amino acids were then identified by gene sequencing. After three rounds of mutant library screening, eighteen mutants with markedly improved and stable enzyme activity were finally obtained. The names of the eighteen mutants and the corresponding mutated amino acids identified by gene sequencing are shown in Table 1.
TABLE 1 mutant names and corresponding mutant amino acids
Example 3 expression of original BFD1 and its mutants in E.coli
Positive clones of the original BFD1 and of the 18 mutants were picked with an inoculating needle, inoculated separately into 5 mL of LB medium, and cultured overnight at 37 °C; the cultures were then inoculated at 1% (V/V) into 25 mL of LB medium and cultured at 37 °C and 200 r/min. When the OD600 reached 0.6, 0.1 mM IPTG was added and expression was induced at 30 °C. After induction, the cultured cells were collected in a centrifuge tube, resuspended in potassium phosphate buffer, and washed. After centrifugation, the cells were stored in a refrigerator at -80 °C.
Example 4 detection of Activity of original BFD1 and its mutants
The cells obtained in Example 3 were taken out, resuspended in potassium phosphate buffer, disrupted by sonication in an ice bath, and centrifuged to collect the supernatant. 50 μL of cell suspension or supernatant was mixed with 50 μL of formaldehyde solution and reacted at 30 °C for 3 h. After the reaction, 90 μL of the reaction solution was taken out, 60 μL of tool enzyme buffer 1 and then 50 μL of tool enzyme buffer 2 were added, and the absorbance at 410 nm was monitored for 20 min. The whole-cell enzyme activities of the original BFD1 and of the mutants were calculated, and the mutant activities as a percentage of the original BFD1 activity are shown in the following tables (taking the whole-cell enzyme activity of the original BFD1 as 100%).
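The percentages reported in the tables are relative activities. The patent does not state how activity is derived from the 20 min absorbance trace at 410 nm, so the following is only a sketch under a stated assumption: activity is taken as the least-squares slope of absorbance against time, and the mutant slope is divided by the slope for the original BFD1. All absorbance readings below are hypothetical and are not data from the patent.

```python
def slope(times, values):
    """Least-squares slope of `values` against `times`."""
    if len(times) != len(values) or len(times) < 2:
        raise ValueError("need at least two paired readings")
    n = len(times)
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    numerator = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    denominator = sum((t - mean_t) ** 2 for t in times)
    return numerator / denominator


def relative_activity(rate, reference_rate):
    """Activity as a percentage of the reference (reference = 100%)."""
    return 100.0 * rate / reference_rate


if __name__ == "__main__":
    # Hypothetical A410 readings taken every 5 min over 20 min.
    minutes = [0, 5, 10, 15, 20]
    original_bfd1 = [0.10, 0.15, 0.20, 0.25, 0.30]
    mutant = [0.10, 0.20, 0.30, 0.40, 0.50]

    reference_rate = slope(minutes, original_bfd1)
    mutant_rate = slope(minutes, mutant)
    print(round(relative_activity(mutant_rate, reference_rate), 1))
```

A slope-based rate is only meaningful while the colour development is still linear in time; an endpoint reading would be an equally plausible interpretation of the described assay.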
TABLE 2 Whole cell activity of original BFD1 and mutant at 30mM formaldehyde concentration
TABLE 3 Whole cell activity of mutant V3 and further mutants thereof at 20mM formaldehyde concentration
TABLE 4 Activity of cell disruption supernatants of mutant V10 and further mutants thereof at 20mM formaldehyde concentration
TABLE 5 Activity of cell disruption supernatants of mutant V11 and further mutants thereof at 20mM formaldehyde concentration
TABLE 6 Activity of cell disruption supernatants of mutant V11 and further mutants thereof at 200mM Formaldehyde concentration
Example 5 detection of thermal stability of original BFD1 and its mutants
The cells obtained in Example 3 were taken out, resuspended in potassium phosphate buffer, disrupted by sonication in an ice bath, and centrifuged to collect the supernatant. 50 μL of cell supernatant was heated at 55 °C for 30 min, 50 μL of 50 mM formaldehyde solution was added, and the reaction was carried out at 30 °C for 3 h. Then 90 μL of the reaction solution was taken out, 60 μL of tool enzyme buffer 1 and 50 μL of tool enzyme buffer 2 were added, and the absorbance at 410 nm was monitored for 20 min. After the cell supernatant was heated, the mutants retained more activity than the original BFD1, i.e. they were more thermostable. The relative residual activities of the original BFD1 and its mutants are shown in Table 7 (taking the relative residual activity of the original BFD1 as 100%).
TABLE 7 relative residual Activity of original BFD1 and its mutants
Example 6 purification of original BFD1 and mutants thereof
The cells of the original BFD1 and mutein V8 collected in Example 3 were resuspended in 25 mL of potassium phosphate buffer, disrupted by sonication in an ice bath, and centrifuged to collect the supernatant. The expressed protein was then purified by affinity chromatography on a Ni2+ column, eluted with imidazole, and desalted on a desalting column, and the purified protein was stored in potassium phosphate buffer for later use.
EXAMPLE 7 Formaldehyde conversion to 1,3-dihydroxyacetone
The original BFD1 and mutant V8 purified in Example 6 were diluted with potassium phosphate buffer to a protein concentration of 15 mg/mL. A certain volume of pure enzyme was placed in the reaction system, an equal volume of formaldehyde solution (600 mM formaldehyde, 1 mM TPP, 50 mM K2HPO4 and KH2PO4, 5 mM MgSO4, pH 7.4) was added, and the reaction was carried out at 30 °C for 1 h. After the reaction, a certain volume of the reaction solution was added to an equal volume of acetonitrile to terminate the reaction, and the mixture was centrifuged, passed through a 0.22 μm filter membrane, and analyzed by HPLC under the following conditions: chromatographic column: Aminex HPX-87H, 300 mm × 7.8 mm (Bio-Rad); mobile phase: 50% 0.005 M H2SO4 solution, 50% H2O; ultraviolet absorption wavelength: 210 nm; flow rate: 0.5 mL/min; column temperature: 65 °C; injection volume: 5 μL.
The HPLC detection result is shown in FIG. 1. Analysis shows that both the original BFD1 and the mutant V8 can catalyze the conversion of formaldehyde into 1,3-dihydroxyacetone.
The calculated conversion of formaldehyde to 1,3-dihydroxyacetone catalyzed by the original BFD1 and mutant V8 is shown in Table 8 below.
TABLE 8 conversion of original BFD1 and mutant V8 to catalyze formaldehyde to 1,3-dihydroxyacetone
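The patent does not spell out the formula behind the conversion values. One plausible reading, sketched below as an assumption, uses the reaction stoichiometry: three molecules of formaldehyde (CH2O) condense into one molecule of 1,3-dihydroxyacetone (C3H6O3), so conversion is three times the product concentration divided by the initial formaldehyde concentration. Mixing equal volumes of enzyme and 600 mM formaldehyde solution gives an initial concentration of 300 mM in the reaction. The function names and the product concentration used in the example are hypothetical.

```python
def concentration_after_mixing(stock_mM, stock_volume, other_volume):
    """Concentration of a solute after its stock solution is mixed with
    another solution that does not contain it (volumes in the same unit)."""
    return stock_mM * stock_volume / (stock_volume + other_volume)


def formaldehyde_conversion(dha_mM, formaldehyde_initial_mM):
    """Percent of the initial formaldehyde found as 1,3-dihydroxyacetone,
    assuming 3 CH2O -> 1 C3H6O3. Both concentrations must refer to the
    same solution (same dilution)."""
    return 100.0 * 3.0 * dha_mM / formaldehyde_initial_mM


if __name__ == "__main__":
    # Equal volumes of enzyme and 600 mM formaldehyde solution.
    initial = concentration_after_mixing(600.0, 1.0, 1.0)
    print(initial)
    # Hypothetical measured product concentration in the reaction mixture.
    print(formaldehyde_conversion(50.0, initial))
```

If the product is quantified after the acetonitrile quench, the equal-volume dilution from that step has to be applied to the measured value before it is compared with the initial concentration.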
EXAMPLE 8 Alkali metal hydroxide- and alkaline earth metal hydroxide-catalyzed conversion of 1,3-dihydroxyacetone produced by the mutein into lactic acid
The reaction solution from Example 7 was centrifuged in an ultrafiltration tube to remove the enzyme from the reaction system, and 17.5 mL of the solution was slowly added to 17.5 mL of 2 M NaOH or KOH solution. The reaction was carried out at room temperature for 24 hours, after which dilute sulfuric acid was added to adjust the pH of the solution to 1.0-2.0. The solution was passed through a 0.22 μm organic filter membrane, and the target product lactic acid was detected by HPLC under the following conditions: chromatographic column: Aminex HPX-87H, 300 mm × 7.8 mm (Bio-Rad); mobile phase: 0.010 M H2SO4; detector: differential refractive index detector; flow rate: 0.5 mL/min; column temperature: 35 °C; injection volume: 20 μL.
As shown in FIG. 2, alkali metal hydroxides (sodium hydroxide and potassium hydroxide) can further convert the 1,3-dihydroxyacetone produced from formaldehyde by mutant V8 into lactic acid.
The reaction solution from Example 7 was centrifuged in an ultrafiltration tube to remove the enzyme from the reaction system, and 17.5 mL of the solution was slowly added to 17.5 mL of a calcium hydroxide suspension (250 mM). The reaction was carried out at room temperature for 24 hours, after which dilute sulfuric acid was added to adjust the pH of the solution to 1.0-2.0. The solution was passed through a 0.22 μm organic filter membrane, and the target product lactic acid was detected by HPLC under the following conditions: chromatographic column: Aminex HPX-87H, 300 mm × 7.8 mm (Bio-Rad); mobile phase: 50% 0.005 M H2SO4 solution, 50% H2O; ultraviolet absorption wavelength: 210 nm; flow rate: 0.5 mL/min; column temperature: 65 °C; injection volume: 5 μL.
The HPLC detection result is shown in FIG. 3. Analysis shows that an alkaline earth metal hydroxide (calcium hydroxide) can further convert the 1,3-dihydroxyacetone produced from formaldehyde by mutant V8 into lactic acid.
The calculated conversion of formaldehyde into lactic acid by mutant V8 followed by an alkali metal hydroxide (sodium hydroxide, potassium hydroxide) or an alkaline earth metal hydroxide (calcium hydroxide) is shown in Table 9 below.
TABLE 9 conversion of formaldehyde to lactic acid
Example 9 Activity detection of wild-type protein of original BFD1 and its single amino acid site mutants
For the amino acid mutation sites S26F, L43Q, F66L, R86C, T87A, G109S, A204V, H281Y, A322T, F397L, F397S, M460T, W463R, V467A, V473A and S525A screened in Example 1, primers were designed and single-point mutations were introduced into the sequence of the wild-type protein of the original BFD1 (amino acid sequence SEQ ID NO: 23, nucleotide sequence SEQ ID NO: 24). After mutation and confirmation by sequencing, the mutants were streaked onto plates.
Positive clones of the above mutants and of the strain carrying the wild-type protein of the original BFD1 (17 strains in total) were picked, inoculated into 5 mL of LB medium, and cultured overnight at 37 °C; the cultures were then inoculated at 1% (V/V) into 25 mL of LB medium and cultured at 37 °C and 200 r/min. When the OD600 reached 0.6, 0.1 mM IPTG was added and expression was induced at 30 °C. After induction, the cultured cells were collected in a centrifuge tube, resuspended in potassium phosphate buffer, and washed. After centrifugation, the cells were stored in a refrigerator at -80 °C.
The cells were taken out, resuspended in potassium phosphate buffer, disrupted by sonication in an ice bath, and centrifuged to collect the supernatant. 50 μL of the supernatant was mixed with 50 μL of 50 mM formaldehyde solution and reacted at 30 °C for 3 h. After the reaction, 90 μL of the reaction solution was taken out, 60 μL of tool enzyme buffer 1 and then 50 μL of tool enzyme buffer 2 were added, and the absorbance at 410 nm was monitored for 20 min. The enzyme activities of the wild-type protein of the original BFD1 and of the mutants were calculated, and the mutant activities as a percentage of the activity of the wild-type protein of the original BFD1 are shown in the following table (taking the activity of the wild-type protein of the original BFD1 as 100%).
TABLE 10 mutant names and corresponding mutant amino acids and Activity at 50mM Formaldehyde concentration
Example 10 structural analysis of Formaldehyde-converting mutein
The crystal structure of mutant V8 was solved in the present invention (PDB ID: 6M2Y). Structural analysis (FIG. 4) shows that the combined mutations H281Y and S26F form a π-π interaction, which contributes greatly to the improved activity of formaldehyde-converting muteins containing the H281Y and S26F mutations.
The embodiments of the present invention have been described above. However, the present invention is not limited to the above embodiment. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
SEQUENCE LISTING
<110> Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences
<120> formaldehyde conversion mutant protein and application thereof
<130> CPCN21111638
<150> 2020101134418
<151> 2020-02-24
<160> 24
<170> PatentIn version 3.3
<210> 1
<211> 528
<212> PRT
<213> Artificial sequence
<400> 1
Met Ala Ser Val His Gly Thr Thr Tyr Glu Leu Leu Arg Arg Gln Gly
1 5 10 15
Ile Asp Thr Val Phe Gly Asn Pro Gly Ser Asn Glu Leu Pro Phe Leu
20 25 30
Lys Asp Phe Pro Glu Asp Phe Arg Tyr Ile Leu Ala Leu Gln Glu Ala
35 40 45
Cys Val Val Gly Ile Ala Asp Gly Tyr Ala Gln Ala Ser Arg Lys Pro
50 55 60
Ala Phe Ile Asn Leu His Ser Ala Ala Gly Thr Gly Asn Ala Met Gly
65 70 75 80
Ala Leu Ser Asn Ala Arg Thr Ser His Ser Pro Leu Ile Val Thr Ala
85 90 95
Gly Gln Gln Thr Arg Ala Met Ile Gly Val Glu Ala Gly Glu Thr Asn
100 105 110
Val Asp Ala Ala Asn Leu Pro Arg Pro Leu Val Lys Trp Ser Tyr Glu
115 120 125
Pro Ala Ser Ala Ala Glu Val Pro His Ala Met Ser Arg Ala Ile His
130 135 140
Met Ala Ser Met Ala Pro Gln Gly Pro Val Tyr Leu Ser Val Pro Tyr
145 150 155 160
Asp Asp Trp Asp Lys Asp Ala Asp Pro Gln Ser His His Leu Phe Asp
165 170 175
Arg His Val Ser Ser Ser Val Arg Leu Asn Asp Gln Asp Leu Asp Ile
180 185 190
Leu Val Lys Ala Leu Asn Ser Ala Ser Asn Pro Ala Ile Val Leu Gly
195 200 205
Pro Asp Val Asp Ala Ala Asn Ala Asn Ala Asp Cys Val Met Leu Ala
210 215 220
Glu Arg Leu Lys Ala Pro Val Trp Val Ala Pro Ser Ala Pro Arg Cys
225 230 235 240
Pro Phe Pro Thr Arg His Pro Cys Phe Arg Gly Leu Met Pro Ala Gly
245 250 255
Ile Ala Ala Ile Ser Gln Leu Leu Glu Gly His Asp Val Val Leu Val
260 265 270
Ile Gly Ala Pro Val Phe Arg Tyr His Gln Tyr Asp Pro Gly Gln Tyr
275 280 285
Leu Lys Pro Gly Thr Arg Leu Ile Ser Val Thr Cys Asp Pro Leu Glu
290 295 300
Ala Ala Arg Ala Pro Met Gly Asp Ala Ile Val Ala Asp Ile Gly Ala
305 310 315 320
Met Ala Ser Ala Leu Ala Asn Leu Val Glu Glu Ser Ser Arg Gln Leu
325 330 335
Pro Thr Ala Ala Pro Glu Pro Ala Lys Val Asp Gln Asp Ala Gly Arg
340 345 350
Leu His Pro Glu Thr Val Phe Asp Thr Leu Asn Asp Met Ala Pro Glu
355 360 365
Asn Ala Ile Tyr Leu Asn Glu Ser Thr Ser Thr Thr Ala Gln Met Trp
370 375 380
Gln Arg Leu Asn Met Arg Asn Pro Gly Ser Tyr Tyr Phe Cys Ala Ala
385 390 395 400
Gly Gly Leu Gly Phe Ala Leu Pro Ala Ala Ile Gly Val Gln Leu Ala
405 410 415
Glu Pro Glu Arg Gln Val Ile Ala Val Ile Gly Asp Gly Ser Ala Asn
420 425 430
Tyr Ser Ile Ser Ala Leu Trp Thr Ala Ala Gln Tyr Asn Ile Pro Thr
435 440 445
Ile Phe Val Ile Met Asn Asn Gly Thr Tyr Gly Met Leu Arg Trp Phe
450 455 460
Ala Gly Val Leu Glu Ala Glu Asn Val Pro Gly Leu Asp Val Pro Gly
465 470 475 480
Ile Asp Phe Arg Ala Leu Ala Lys Gly Tyr Gly Val Gln Ala Leu Lys
485 490 495
Ala Asp Asn Leu Glu Gln Leu Lys Gly Ser Leu Gln Glu Ala Leu Ser
500 505 510
Ala Lys Gly Pro Val Leu Ile Glu Val Ser Thr Val Ser Pro Val Lys
515 520 525
<210> 2
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 2
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttctaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
caccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 3
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 3
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttctaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
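The 1587-nt sequences in this listing differ from one another at only a few positions, which are hard to spot by eye. The following is a minimal sketch, not part of the original filing, of how two codon-aligned lines could be compared to report the resulting amino acid substitution. It assumes the standard genetic code and that the reading frame starts at nucleotide 1. The helper names (`clean`, `substitutions`) are hypothetical. The two input strings are the 841-900 lines of SEQ ID NO: 2 and SEQ ID NO: 3 as printed above.

```python
# Standard genetic code, built in the conventional t/c/a/g ordering (assumption:
# the listing's coding sequences use the standard code, frame starting at nt 1).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE = dict(zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(block):
    """Keep only nucleotide letters, dropping spaces and trailing position counts."""
    return "".join(ch for ch in block.lower() if ch in "acgt")


def substitutions(ref, var, first_codon=1):
    """Return substitutions such as 'H281Y' between two codon-aligned fragments.

    first_codon is the 1-based codon number of the first triplet in the fragment.
    """
    ref, var = clean(ref), clean(var)
    if len(ref) != len(var) or len(ref) % 3:
        raise ValueError("fragments must be equal length and a multiple of 3")
    found = []
    for i in range(0, len(ref), 3):
        a, b = TABLE[ref[i:i + 3]], TABLE[var[i:i + 3]]
        if a != b:
            found.append(f"{a}{first_codon + i // 3}{b}")
    return found


# Nucleotides 841-900 (codons 281-300) of SEQ ID NO: 2 and SEQ ID NO: 3.
seq2_841 = "caccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900"
seq3_841 = "taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900"
print(substitutions(seq2_841, seq3_841, first_codon=281))
```

Because nucleotide 841 opens codon 281, passing `first_codon=281` keeps the reported position in the numbering of the full-length protein. The same call can be applied to any other pair of aligned lines from the listing.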
<210> 4
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 4
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttctaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgcttgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
caccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 5
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 5
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 6
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 6
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttctaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactacct ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 7
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 7
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttctaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgctc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 8
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 8
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatccagg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 9
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 9
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctctcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 10
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 10
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctagtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 11
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 11
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ttatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 12
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 12
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactc ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 13
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 13
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgtcggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtccgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 14
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 14
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgc tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 15
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 15
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttctaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtgc ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atgacttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 16
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 16
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctagtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgtcggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtccgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 17
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 17
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctagtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactc ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 18
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 18
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctggtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtacg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg ttgctccggt taaataa 1587
<210> 19
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 19
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctagtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atgacttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactc ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 20
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 20
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttttaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgctcgtac ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctagtgaa accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
taccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactc ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtatg 1380
ctgcgtcggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtccgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
<210> 21
<211> 18
<212> DNA
<213> Artificial sequence
<400> 21
ccgcgcggca gccatatg 18
<210> 22
<211> 27
<212> DNA
<213> Artificial sequence
<400> 22
ggtggtggtg gtggtgctcg agttatt 27
<210> 23
<211> 528
<212> PRT
<213> Artificial sequence
<400> 23
Met Ala Ser Val His Gly Thr Thr Tyr Glu Leu Leu Arg Arg Gln Gly
1 5 10 15
Ile Asp Thr Val Phe Gly Asn Pro Gly Ser Asn Glu Leu Pro Phe Leu
20 25 30
Lys Asp Phe Pro Glu Asp Phe Arg Tyr Ile Leu Ala Leu Gln Glu Ala
35 40 45
Cys Val Val Gly Ile Ala Asp Gly Tyr Ala Gln Ala Ser Arg Lys Pro
50 55 60
Ala Phe Ile Asn Leu His Ser Ala Ala Gly Thr Gly Asn Ala Met Gly
65 70 75 80
Ala Leu Ser Asn Ala Trp Asn Ser His Ser Pro Leu Ile Val Thr Ala
85 90 95
Gly Gln Gln Thr Arg Ala Met Ile Gly Val Glu Ala Leu Leu Thr Asn
100 105 110
Val Asp Ala Ala Asn Leu Pro Arg Pro Leu Val Lys Trp Ser Tyr Glu
115 120 125
Pro Ala Ser Ala Ala Glu Val Pro His Ala Met Ser Arg Ala Ile His
130 135 140
Met Ala Ser Met Ala Pro Gln Gly Pro Val Tyr Leu Ser Val Pro Tyr
145 150 155 160
Asp Asp Trp Asp Lys Asp Ala Asp Pro Gln Ser His His Leu Phe Asp
165 170 175
Arg His Val Ser Ser Ser Val Arg Leu Asn Asp Gln Asp Leu Asp Ile
180 185 190
Leu Val Lys Ala Leu Asn Ser Ala Ser Asn Pro Ala Ile Val Leu Gly
195 200 205
Pro Asp Val Asp Ala Ala Asn Ala Asn Ala Asp Cys Val Met Leu Ala
210 215 220
Glu Arg Leu Lys Ala Pro Val Trp Val Ala Pro Ser Ala Pro Arg Cys
225 230 235 240
Pro Phe Pro Thr Arg His Pro Cys Phe Arg Gly Leu Met Pro Ala Gly
245 250 255
Ile Ala Ala Ile Ser Gln Leu Leu Glu Gly His Asp Val Val Leu Val
260 265 270
Ile Gly Ala Pro Val Phe Arg Tyr His Gln Tyr Asp Pro Gly Gln Tyr
275 280 285
Leu Lys Pro Gly Thr Arg Leu Ile Ser Val Thr Cys Asp Pro Leu Glu
290 295 300
Ala Ala Arg Ala Pro Met Gly Asp Ala Ile Val Ala Asp Ile Gly Ala
305 310 315 320
Met Ala Ser Ala Leu Ala Asn Leu Val Glu Glu Ser Ser Arg Gln Leu
325 330 335
Pro Thr Ala Ala Pro Glu Pro Ala Lys Val Asp Gln Asp Ala Gly Arg
340 345 350
Leu His Pro Glu Thr Val Phe Asp Thr Leu Asn Asp Met Ala Pro Glu
355 360 365
Asn Ala Ile Tyr Leu Asn Glu Ser Thr Ser Thr Thr Ala Gln Met Trp
370 375 380
Gln Arg Leu Asn Met Arg Asn Pro Gly Ser Tyr Tyr Phe Cys Ala Ala
385 390 395 400
Gly Gly Leu Gly Phe Ala Leu Pro Ala Ala Ile Gly Val Gln Leu Ala
405 410 415
Glu Pro Glu Arg Gln Val Ile Ala Val Ile Gly Asp Gly Ser Ala Asn
420 425 430
Tyr Ser Ile Ser Ala Leu Trp Thr Ala Ala Gln Tyr Asn Ile Pro Thr
435 440 445
Ile Phe Val Ile Met Asn Asn Gly Thr Tyr Gly Ala Leu Arg Trp Phe
450 455 460
Ala Gly Val Leu Glu Ala Glu Asn Val Pro Gly Leu Asp Val Pro Gly
465 470 475 480
Ile Asp Phe Arg Ala Leu Ala Lys Gly Tyr Gly Val Gln Ala Leu Lys
485 490 495
Ala Asp Asn Leu Glu Gln Leu Lys Gly Ser Leu Gln Glu Ala Leu Ser
500 505 510
Ala Lys Gly Pro Val Leu Ile Glu Val Ser Thr Val Ser Pro Val Lys
515 520 525
<210> 24
<211> 1587
<212> DNA
<213> Artificial sequence
<400> 24
atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt 60
ttcggtaacc cgggttctaa cgaactgccg ttcctgaaag acttcccgga agacttccgt 120
tacatcctgg ctctgcagga agcttgcgtt gttggtatcg ctgacggtta cgctcaggct 180
tctcgtaaac cggctttcat caacctgcac tctgctgctg gtaccggtaa cgctatgggt 240
gctctgtcta acgcttggaa ctctcactct ccgctgatcg ttaccgctgg tcagcagacc 300
cgtgctatga tcggtgttga agctctgctg accaacgttg acgctgctaa cctgccgcgt 360
ccgctggtta aatggtctta cgaaccggct tctgctgctg aagttccgca cgctatgtct 420
cgtgctatcc acatggcttc tatggctccg cagggtccgg tttacctgtc tgttccgtac 480
gacgactggg acaaagacgc tgacccgcag tctcaccacc tgttcgaccg tcacgtttct 540
tcttctgttc gtctgaacga ccaggacctg gacatcctgg ttaaagctct gaactctgct 600
tctaacccgg ctatcgttct gggtccggac gttgacgctg ctaacgctaa cgctgactgc 660
gttatgctgg ctgaacgtct gaaagctccg gtttgggttg ctccgtctgc tccgcgttgc 720
ccgttcccga cccgtcaccc gtgcttccgt ggtctgatgc cggctggtat cgctgctatc 780
tctcagctgc tggaaggtca cgacgttgtt ctggttatcg gtgctccggt tttccgttac 840
caccagtacg acccgggtca gtacctgaaa ccgggtaccc gtctgatctc tgttacctgc 900
gacccgctgg aagctgctcg tgctccgatg ggtgacgcta tcgttgctga catcggtgct 960
atggcttctg ctctggctaa cctggttgaa gaatcttctc gtcagctgcc gaccgctgct 1020
ccggaaccgg ctaaagttga ccaggacgct ggtcgtctgc acccggaaac cgttttcgac 1080
accctgaacg acatggctcc ggaaaacgct atctacctga acgaatctac ctctaccacc 1140
gctcagatgt ggcagcgtct gaacatgcgt aacccgggtt cttactactt ctgcgctgct 1200
ggtggtctgg gtttcgctct gccggctgct atcggtgttc agctggctga accggaacgt 1260
caggttatcg ctgttatcgg tgacggttct gctaactact ctatctctgc tctgtggacc 1320
gctgctcagt acaacatccc gaccatcttc gttatcatga acaacggtac ctacggtgct 1380
ctgcgttggt tcgctggtgt tctggaagct gaaaacgttc cgggtctgga cgttccgggt 1440
atcgacttcc gtgctctggc taaaggttac ggtgttcagg ctctgaaagc tgacaacctg 1500
gaacagctga aaggttctct gcaggaagct ctgtctgcta aaggtccggt tctgatcgaa 1560
gtttctaccg tttctccggt taaataa 1587
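The lengths in this listing are consistent with one another: 1587 nt is 529 codons, that is, 528 residues plus one stop codon. The following Python sketch is illustrative only and is not part of the patent. It translates the first 60 nt of SEQ ID NO. 24 with the standard genetic code and compares the result with the first 20 residues of SEQ ID NO. 23. The names `CODON_TABLE`, `translate`, `SEQ24_HEAD` and `SEQ23_HEAD` are hypothetical, and the assumption that SEQ ID NO. 24 encodes SEQ ID NO. 23 is inferred from the matching lengths rather than stated in this section.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard code applies to the sequences in this listing.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) into one-letter amino acids."""
    cleaned = "".join(dna.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3))


# First 60 nt of SEQ ID NO. 24, copied from the listing above.
SEQ24_HEAD = "atggcttctg ttcacggtac cacctacgaa ctgctgcgtc gtcagggtat cgacaccgtt"
# First 20 residues of SEQ ID NO. 23, converted to one-letter code.
SEQ23_HEAD = "MASVHGTTYELLRRQGIDTV"

if __name__ == "__main__":
    print(translate(SEQ24_HEAD) == SEQ23_HEAD)
```

The same function can be applied to the full 1587 nt sequences to compare any of the DNA entries against the protein entry residue by residue.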
Claims (6)
1. A formaldehyde conversion mutein capable of catalyzing formaldehyde conversion, wherein the amino acid sequence of the mutein is the sequence of SEQ ID NO. 23 carrying any one of the following single-site mutations: S26F, L43Q, F66L, R86C, T87A, G109S, A204V, H281Y, A322T, F397L, F397S, M460T, W463R, V467A, V473A, S525A.
2. A polynucleotide encoding the formaldehyde-converting mutein of claim 1.
3. A transgenic cell line or recombinant bacterium comprising the formaldehyde-converting mutein of claim 1.
4. A recombinant vector, expression cassette, transgenic cell line or recombinant bacterium comprising the polynucleotide of claim 2.
5. Use of the formaldehyde-converting mutein of claim 1, the transgenic cell line or recombinant bacterium of claim 3 or the recombinant vector, expression cassette, transgenic cell line or recombinant bacterium of claim 4 for catalyzing the synthesis of 1,3-dihydroxyacetone by formaldehyde condensation.
6. Use of the formaldehyde-converting mutein of claim 1, the transgenic cell line or recombinant bacterium of claim 3 or the recombinant vector, expression cassette, transgenic cell line or recombinant bacterium of claim 4, respectively, in combination with an alkali metal hydroxide or an alkaline earth metal hydroxide for catalyzing the production of lactic acid from formaldehyde.
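The mutation labels in claim 1 follow the usual convention of original residue, 1-based position, replacement residue, so S26F means that the serine at position 26 of SEQ ID NO. 23 is replaced by phenylalanine. The Python sketch below is illustrative only and is not part of the patent. It applies one such label to the first 48 residues of SEQ ID NO. 23 (transcribed into one-letter code from the sequence listing) and rejects a label whose original residue does not match the sequence. The names `SEQ23_PREFIX` and `apply_mutation` are hypothetical.

```python
import re

# First 48 residues of SEQ ID NO. 23 in one-letter code (transcribed from the listing).
SEQ23_PREFIX = (
    "MASVHGTTYELLRRQG"  # residues 1-16
    "IDTVFGNPGSNELPFL"  # residues 17-32
    "KDFPEDFRYILALQEA"  # residues 33-48
)


def apply_mutation(sequence: str, label: str) -> str:
    """Apply a single-site mutation such as 'S26F' to a protein sequence."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", label)
    if match is None:
        raise ValueError(f"unrecognized mutation label: {label!r}")
    original, position, replacement = match.group(1), int(match.group(2)), match.group(3)
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside the sequence")
    # Positions in the label are 1-based; Python indices are 0-based.
    if sequence[position - 1] != original:
        raise ValueError(
            f"expected {original} at position {position}, found {sequence[position - 1]}"
        )
    return sequence[:position - 1] + replacement + sequence[position:]


if __name__ == "__main__":
    for label in ("S26F", "L43Q"):
        mutant = apply_mutation(SEQ23_PREFIX, label)
        print(label, mutant)
```

Checking the original residue before substituting catches off-by-one numbering errors, which are easy to make when positions are read from a three-letter listing.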
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111161482.5A CN113832120B (en) | 2020-02-24 | 2020-09-17 | Formaldehyde conversion mutant protein and application thereof |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2020101134418 | 2020-02-24 | ||
CN202010113441 | 2020-02-24 | ||
CN202010983978.XA CN112852765B (en) | 2020-02-24 | 2020-09-17 | Formaldehyde conversion mutant protein and application thereof |
CN202111161482.5A CN113832120B (en) | 2020-02-24 | 2020-09-17 | Formaldehyde conversion mutant protein and application thereof |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010983978.XA Division CN112852765B (en) | 2020-02-24 | 2020-09-17 | Formaldehyde conversion mutant protein and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113832120A (en) | 2021-12-24 |
CN113832120B (en) | 2024-04-26 |
Family
ID=75995370
Family Applications (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010983978.XA Active CN112852765B (en) | 2020-02-24 | 2020-09-17 | Formaldehyde conversion mutant protein and application thereof |
CN202111161482.5A Active CN113832120B (en) | 2020-02-24 | 2020-09-17 | Formaldehyde conversion mutant protein and application thereof |
CN202010984037.8A Active CN112852766B (en) | 2020-02-24 | 2020-09-17 | Method for synthesizing lactic acid |
CN202410069167.7A Pending CN118127089A (en) | 2020-02-24 | 2021-02-09 | Preparation method of glycollic acid |
CN202180005890.3A Active CN114616327B (en) | 2020-02-24 | 2021-02-09 | Formaldehyde conversion mutant protein and application thereof |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010983978.XA Active CN112852765B (en) | 2020-02-24 | 2020-09-17 | Formaldehyde conversion mutant protein and application thereof |
Family Applications After (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010984037.8A Active CN112852766B (en) | 2020-02-24 | 2020-09-17 | Method for synthesizing lactic acid |
CN202410069167.7A Pending CN118127089A (en) | 2020-02-24 | 2021-02-09 | Preparation method of glycollic acid |
CN202180005890.3A Active CN114616327B (en) | 2020-02-24 | 2021-02-09 | Formaldehyde conversion mutant protein and application thereof |
Country Status (2)
Country | Link |
---|---|
CN (5) | CN112852765B (en) |
WO (1) | WO2021169814A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114591938A (en) * | 2022-04-07 | 2022-06-07 | 山东金城医药研究院有限公司 | Carboxylase mutant and preparation method and application thereof |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115109770B (en) * | 2022-06-30 | 2023-09-05 | 中国科学院天津工业生物技术研究所 | Benzaldehyde lyase mutant and application thereof in preparation of 1,4-dihydroxyl-2-butanone |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106916794A (en) * | 2017-02-22 | 2017-07-04 | 中国科学院天津工业生物技术研究所 | Enzyme catalyzing the synthesis of hydroxyacetaldehyde from formaldehyde and application thereof |
CN110551701A (en) * | 2018-05-31 | 2019-12-10 | 中国科学院天津工业生物技术研究所 | Carbonyl reductase mutant and application thereof in reduction of cyclopentadione compounds |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB9123354D0 (en) * | 1991-11-04 | 1991-12-18 | Bp Chem Int Ltd | Production of hydroxy carboxylic compounds |
JP2007228927A (en) * | 2006-03-02 | 2007-09-13 | Kaneka Corp | Method for producing glycolic acid |
AT503802B1 (en) * | 2006-07-26 | 2008-01-15 | Vtu Engineering Planungs Und B | PROCESS FOR THE PREPARATION OF LACTIC ACID OR A SALT THEREOF |
WO2011137192A1 (en) * | 2010-04-27 | 2011-11-03 | The Regents Of The University Of California | Production of 1,4-butanediol by recombinant microorganisms |
EP3325608B1 (en) * | 2015-07-21 | 2021-11-24 | The Governing Council of the University of Toronto | Methods and microorganisms for the production of 1,3-butanediol |
CN105132400B (en) * | 2015-07-24 | 2018-10-12 | 中国科学院天津工业生物技术研究所 | Enzyme having the function of catalyzing the synthesis of C3H6O3 from formaldehyde, and preparation method thereof |
CN105777523B (en) * | 2016-04-07 | 2018-05-25 | 农业部环境保护科研监测所 | Method for preparing lactic acid from carbohydrates under mild conditions |
WO2018038667A1 (en) * | 2016-08-25 | 2018-03-01 | Medivir Ab | Respiratory syncytial virus inhibitors |
WO2018095973A1 (en) * | 2016-11-24 | 2018-05-31 | Haldor Topsøe A/S | A method and a system for producing glycolic acid and/or glycolate |
CN108118037B (en) * | 2016-11-28 | 2021-08-31 | 青岛蔚蓝生物集团有限公司 | Glucose oxidase mutant with improved heat resistance |
CN107699536B (en) * | 2017-11-27 | 2021-02-05 | 南京工业大学 | Genetically engineered bacterium and application thereof in production of D-1,2,4-butanetriol |
2020
- 2020-09-17 CN CN202010983978.XA patent/CN112852765B/en active Active
- 2020-09-17 CN CN202111161482.5A patent/CN113832120B/en active Active
- 2020-09-17 CN CN202010984037.8A patent/CN112852766B/en active Active
2021
- 2021-02-09 CN CN202410069167.7A patent/CN118127089A/en active Pending
- 2021-02-09 WO PCT/CN2021/076379 patent/WO2021169814A1/en active Application Filing
- 2021-02-09 CN CN202180005890.3A patent/CN114616327B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106916794A (en) * | 2017-02-22 | 2017-07-04 | 中国科学院天津工业生物技术研究所 | Enzyme catalyzing the synthesis of hydroxyacetaldehyde from formaldehyde and application thereof |
US20200024590A1 (en) * | 2017-02-22 | 2020-01-23 | Tianjin Institute Of Industrial Biotechnology, Chinese Academy Of Sciences | Enzyme for synthesizing hydroxyl acetaldehyde and/or 1,3-dihydroxyacetone by catalyzing formaldehyde, and applications thereof |
CN110551701A (en) * | 2018-05-31 | 2019-12-10 | 中国科学院天津工业生物技术研究所 | Carbonyl reductase mutant and application thereof in reduction of cyclopentadione compounds |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114591938A (en) * | 2022-04-07 | 2022-06-07 | 山东金城医药研究院有限公司 | Carboxylase mutant and preparation method and application thereof |
CN114591938B (en) * | 2022-04-07 | 2023-07-25 | 山东金城医药研究院有限公司 | Carboxylase mutant and preparation method and application thereof |
Also Published As
Publication number | Publication date |
---|---|
CN114616327A (en) | 2022-06-10 |
CN112852765B (en) | 2021-11-12 |
CN112852765A (en) | 2021-05-28 |
CN112852766B (en) | 2022-03-25 |
CN114616327B (en) | 2024-03-22 |
CN118127089A (en) | 2024-06-04 |
CN112852766A (en) | 2021-05-28 |
CN113832120B (en) | 2024-04-26 |
WO2021169814A1 (en) | 2021-09-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108467857B (en) | PET hydrolase mutant and application thereof | |
CN109593750B (en) | Nitrile hydratase mutant, genetic engineering bacterium containing same and application thereof | |
CN112852765B (en) | Formaldehyde conversion mutant protein and application thereof | |
CN109504645B (en) | Isoleucine dioxygenase, mutant and application in synthesis of 4-hydroxyisoleucine | |
CN113462665B (en) | 7 alpha-HSDH enzyme mutant and coding gene and application thereof | |
CN109825538A (en) | Method for synthesizing chiral 2-amino-1-butanol | |
CN109468291B (en) | Carbonyl reductase EbSDR8 mutant and construction method and application thereof | |
CN109055324B (en) | Improved ketoreductase and application thereof | |
CN113652407B (en) | Carbonyl reductase mutant and application thereof in asymmetric synthesis of chiral compound | |
CN108048494B (en) | Method for synthesizing 1, 3-propylene glycol by using biological enzyme | |
CN110904088B (en) | High-temperature-resistant D-psicose 3-epimerase, mutant and application thereof | |
CN110592035B (en) | Carbonyl reductase mutant, recombinant expression vector and application of carbonyl reductase mutant in production of chiral alcohol | |
CN110331173B (en) | Application of phenylpyruvic acid decarboxylase mutant M538A in production of phenethyl alcohol through biological fermentation | |
CN110358751B (en) | Recombinant lipase mutant, encoding gene, recombinant engineering bacterium and application | |
CN108004225B (en) | Mutant of phenylalanine aminomutase from Pantoea agglomerans | |
CN113122525B (en) | Formaldehyde conversion protein and application thereof | |
CN112322597B (en) | Carbonyl reductase mutant and application thereof | |
CN115896081A (en) | Aspartase mutant and application thereof | |
CN110846288B (en) | Glutathione bifunctional enzyme mutant and application thereof | |
CN111057697B (en) | High-temperature-resistant TIM barrel protein mutant and application thereof | |
CN114540318A (en) | Enzyme with function of catalyzing glycolaldehyde to synthesize glycollic acid and application thereof | |
CN109182286B (en) | Improved cyano reductase and application thereof in synthesis of 3-chloropyrazine-2-methylamine | |
CN110804602A (en) | L-aspartic acid β-decarboxylase mutant and application thereof | |
CN115786296B (en) | Meso-diaminopimelate dehydrogenase mutant and production method thereof | |
CN110747190B (en) | Maleic acid hydratase mutant and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||