CN115418358B - 一种糖基转移酶及其应用 - Google Patents
一种糖基转移酶及其应用 Download PDFInfo
- Publication number
- CN115418358B CN115418358B CN202110610098.2A CN202110610098A CN115418358B CN 115418358 B CN115418358 B CN 115418358B CN 202110610098 A CN202110610098 A CN 202110610098A CN 115418358 B CN115418358 B CN 115418358B
- Authority
- CN
- China
- Prior art keywords
- leu
- glycosyltransferase
- rebaudioside
- glu
- arg
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108700023372 Glycosyltransferases Proteins 0.000 title claims abstract description 57
- 102000051366 Glycosyltransferases Human genes 0.000 title claims abstract description 57
- 235000019202 steviosides Nutrition 0.000 claims abstract description 68
- RPYRMTHVSUWHSV-CUZJHZIBSA-N rebaudioside D Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RPYRMTHVSUWHSV-CUZJHZIBSA-N 0.000 claims abstract description 53
- 238000006243 chemical reaction Methods 0.000 claims abstract description 47
- UEDUENGHJMELGK-HYDKPPNVSA-N Stevioside Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O UEDUENGHJMELGK-HYDKPPNVSA-N 0.000 claims abstract description 45
- OHHNJQXIOPOJSC-UHFFFAOYSA-N stevioside Natural products CC1(CCCC2(C)C3(C)CCC4(CC3(CCC12C)CC4=C)OC5OC(CO)C(O)C(O)C5OC6OC(CO)C(O)C(O)C6O)C(=O)OC7OC(CO)C(O)C(O)C7O OHHNJQXIOPOJSC-UHFFFAOYSA-N 0.000 claims abstract description 45
- 229940013618 stevioside Drugs 0.000 claims abstract description 45
- HELXLJCILKEWJH-NCGAPWICSA-N rebaudioside A Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HELXLJCILKEWJH-NCGAPWICSA-N 0.000 claims abstract description 36
- HELXLJCILKEWJH-UHFFFAOYSA-N entered according to Sigma 01432 Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC(C1OC2C(C(O)C(O)C(CO)O2)O)OC(CO)C(O)C1OC1OC(CO)C(O)C(O)C1O HELXLJCILKEWJH-UHFFFAOYSA-N 0.000 claims abstract description 34
- 235000019203 rebaudioside A Nutrition 0.000 claims abstract description 33
- 239000001512 FEMA 4601 Substances 0.000 claims abstract description 32
- HELXLJCILKEWJH-SEAGSNCFSA-N Rebaudioside A Natural products O=C(O[C@H]1[C@@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1)[C@@]1(C)[C@@H]2[C@](C)([C@H]3[C@@]4(CC(=C)[C@@](O[C@H]5[C@H](O[C@H]6[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O6)[C@@H](O[C@H]6[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O6)[C@H](O)[C@@H](CO)O5)(C4)CC3)CC2)CCC1 HELXLJCILKEWJH-SEAGSNCFSA-N 0.000 claims abstract description 32
- GSGVXNMGMKBGQU-PHESRWQRSA-N rebaudioside M Chemical compound C[C@@]12CCC[C@](C)([C@H]1CC[C@@]13CC(=C)[C@@](C1)(CC[C@@H]23)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O[C@@H]2O[C@H](CO)[C@@H](O)[C@H](O)[C@H]2O)[C@H]1O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O)C(=O)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O[C@@H]2O[C@H](CO)[C@@H](O)[C@H](O)[C@H]2O)[C@H]1O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O GSGVXNMGMKBGQU-PHESRWQRSA-N 0.000 claims abstract description 19
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 16
- 238000000034 method Methods 0.000 claims description 23
- 239000004383 Steviol glycoside Substances 0.000 claims description 22
- 235000019411 steviol glycoside Nutrition 0.000 claims description 22
- 229930182488 steviol glycoside Natural products 0.000 claims description 22
- 150000008144 steviol glycosides Chemical class 0.000 claims description 20
- 239000000758 substrate Substances 0.000 claims description 20
- 229930006000 Sucrose Natural products 0.000 claims description 17
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 claims description 17
- 108010043934 Sucrose synthase Proteins 0.000 claims description 17
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical group O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 claims description 17
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 claims description 17
- 239000005720 sucrose Substances 0.000 claims description 16
- 239000000348 glycosyl donor Substances 0.000 claims description 13
- 238000002360 preparation method Methods 0.000 claims description 13
- 238000004519 manufacturing process Methods 0.000 claims description 9
- 108020004707 nucleic acids Proteins 0.000 claims description 9
- 102000039446 nucleic acids Human genes 0.000 claims description 9
- 150000007523 nucleic acids Chemical class 0.000 claims description 9
- 125000000539 amino acid group Chemical group 0.000 claims description 8
- 238000006206 glycosylation reaction Methods 0.000 claims description 5
- 239000013604 expression vector Substances 0.000 claims description 4
- 230000013595 glycosylation Effects 0.000 claims description 4
- 239000000203 mixture Substances 0.000 claims description 4
- 238000003259 recombinant expression Methods 0.000 claims description 4
- 239000007810 chemical reaction solvent Substances 0.000 claims description 3
- 102200006534 rs104894365 Human genes 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 2
- 239000002002 slurry Substances 0.000 claims description 2
- 241000588722 Escherichia Species 0.000 claims 1
- 108090000790 Enzymes Proteins 0.000 abstract description 53
- 102000004190 Enzymes Human genes 0.000 abstract description 52
- 230000000694 effects Effects 0.000 abstract description 16
- 230000003197 catalytic effect Effects 0.000 abstract description 5
- 238000009776 industrial production Methods 0.000 abstract description 3
- 239000000243 solution Substances 0.000 description 27
- 238000012216 screening Methods 0.000 description 20
- 101710204244 Processive diacylglycerol beta-glucosyltransferase Proteins 0.000 description 17
- 108090000623 proteins and genes Proteins 0.000 description 12
- 239000002609 medium Substances 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 239000006228 supernatant Substances 0.000 description 10
- 241000588724 Escherichia coli Species 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 239000012634 fragment Substances 0.000 description 8
- 239000007788 liquid Substances 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- QSRAJVGDWKFOGU-WBXIDTKBSA-N rebaudioside c Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]1(CC[C@H]2[C@@]3(C)[C@@H]([C@](CCC3)(C)C(=O)O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)CC3)C(=C)C[C@]23C1 QSRAJVGDWKFOGU-WBXIDTKBSA-N 0.000 description 8
- XCCTYIAWTASOJW-XVFCMESISA-N Uridine-5'-Diphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-XVFCMESISA-N 0.000 description 7
- 238000004128 high performance liquid chromatography Methods 0.000 description 7
- QFVOYBUQQBFCRH-UHFFFAOYSA-N Steviol Natural products C1CC2(C3)CC(=C)C3(O)CCC2C2(C)C1C(C)(C(O)=O)CCC2 QFVOYBUQQBFCRH-UHFFFAOYSA-N 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 229930182470 glycoside Natural products 0.000 description 6
- 229930027917 kanamycin Natural products 0.000 description 6
- 229960000318 kanamycin Drugs 0.000 description 6
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 6
- 229930182823 kanamycin A Natural products 0.000 description 6
- 239000008055 phosphate buffer solution Substances 0.000 description 6
- 239000010802 sludge Substances 0.000 description 6
- 229940032084 steviol Drugs 0.000 description 6
- -1 steviol glycoside compounds Chemical class 0.000 description 6
- 235000019640 taste Nutrition 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 244000228451 Stevia rebaudiana Species 0.000 description 5
- 230000002210 biocatalytic effect Effects 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000006555 catalytic reaction Methods 0.000 description 5
- 150000002338 glycosides Chemical class 0.000 description 5
- 108010050848 glycylleucine Proteins 0.000 description 5
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 239000001776 FEMA 4720 Substances 0.000 description 4
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- 235000006092 Stevia rebaudiana Nutrition 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 150000001413 amino acids Chemical class 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 235000019658 bitter taste Nutrition 0.000 description 4
- 238000007036 catalytic synthesis reaction Methods 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 230000014759 maintenance of location Effects 0.000 description 4
- RLLCWNUIHGPAJY-SFUUMPFESA-N rebaudioside E Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RLLCWNUIHGPAJY-SFUUMPFESA-N 0.000 description 4
- QFVOYBUQQBFCRH-VQSWZGCSSA-N steviol Chemical compound C([C@@]1(O)C(=C)C[C@@]2(C1)CC1)C[C@H]2[C@@]2(C)[C@H]1[C@](C)(C(O)=O)CCC2 QFVOYBUQQBFCRH-VQSWZGCSSA-N 0.000 description 4
- DRSKVOAJKLUMCL-MMUIXFKXSA-N u2n4xkx7hp Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(O)=O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O DRSKVOAJKLUMCL-MMUIXFKXSA-N 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 3
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 3
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 3
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- RLLCWNUIHGPAJY-RYBZXKSASA-N Rebaudioside E Natural products O=C(O[C@H]1[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O2)[C@@H](O)[C@@H](O)[C@H](CO)O1)[C@]1(C)[C@@H]2[C@@](C)([C@@H]3[C@@]4(CC(=C)[C@@](O[C@@H]5[C@@H](O[C@@H]6[C@@H](O)[C@H](O)[C@@H](O)[C@H](CO)O6)[C@H](O)[C@@H](O)[C@H](CO)O5)(C4)CC3)CC2)CCC1 RLLCWNUIHGPAJY-RYBZXKSASA-N 0.000 description 3
- 102000018120 Recombinases Human genes 0.000 description 3
- 108010091086 Recombinases Proteins 0.000 description 3
- 241000544066 Stevia Species 0.000 description 3
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 108010036413 histidylglycine Proteins 0.000 description 3
- 239000002054 inoculum Substances 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- HYLAUKAHEAUVFE-AVBZULRRSA-N rebaudioside f Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)CO1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HYLAUKAHEAUVFE-AVBZULRRSA-N 0.000 description 3
- SKPQXOSVPKPXML-ULQDDVLXSA-N 2-[[(2s)-1-[(2s)-3-phenyl-2-[[(2s)-pyrrolidine-2-carbonyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H]1NCCC1)CC1=CC=CC=C1 SKPQXOSVPKPXML-ULQDDVLXSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 2
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 2
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 2
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- FOWOZYAWODIRFZ-JYJNAYRXSA-N Arg-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCN=C(N)N)N FOWOZYAWODIRFZ-JYJNAYRXSA-N 0.000 description 2
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 2
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 2
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 2
- 108010090461 DFG peptide Proteins 0.000 description 2
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 2
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 2
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 2
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 2
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 2
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 2
- 108010055629 Glucosyltransferases Proteins 0.000 description 2
- 102000000340 Glucosyltransferases Human genes 0.000 description 2
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 2
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 2
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 2
- 244000303040 Glycyrrhiza glabra Species 0.000 description 2
- 235000006200 Glycyrrhiza glabra Nutrition 0.000 description 2
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 2
- SRGRINJFBHKHAC-NAKRPEOUSA-N Ile-Cys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N SRGRINJFBHKHAC-NAKRPEOUSA-N 0.000 description 2
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 2
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 2
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 2
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 2
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 2
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 2
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 2
- LXGSOEPHQJONMG-PMVMPFDFSA-N Leu-Trp-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N LXGSOEPHQJONMG-PMVMPFDFSA-N 0.000 description 2
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 2
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 2
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 2
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 2
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 2
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 2
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 2
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 2
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 2
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- HQCSLJFGZYOXHW-KKUMJFAQSA-N Phe-His-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N HQCSLJFGZYOXHW-KKUMJFAQSA-N 0.000 description 2
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 2
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 2
- HQPWNHXERZCIHP-PMVMPFDFSA-N Phe-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 HQPWNHXERZCIHP-PMVMPFDFSA-N 0.000 description 2
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 2
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 2
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 2
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- OMHUCGDTACNQEX-OSHKXICASA-N Steviolbioside Natural products O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(O)=O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O OMHUCGDTACNQEX-OSHKXICASA-N 0.000 description 2
- 241001052560 Thallis Species 0.000 description 2
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 2
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 2
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 2
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 2
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 2
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 2
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 2
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 2
- OSMTVLSRTQDWHJ-JBACZVJFSA-N Tyr-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 OSMTVLSRTQDWHJ-JBACZVJFSA-N 0.000 description 2
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 2
- WGHVMKFREWGCGR-SRVKXCTJSA-N Val-Arg-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WGHVMKFREWGCGR-SRVKXCTJSA-N 0.000 description 2
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 2
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- JLPRGBMUVNVSKP-AHUXISJXSA-M chembl2368336 Chemical compound [Na+].O([C@H]1[C@@H](O)[C@H](O)[C@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C([O-])=O)[C@@H]1O[C@@H](CO)[C@@H](O)[C@H](O)[C@@H]1O JLPRGBMUVNVSKP-AHUXISJXSA-M 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 235000009508 confectionery Nutrition 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000001351 cycling effect Effects 0.000 description 2
- 108010060455 des-Tyr- beta-casomorphin Proteins 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 239000001177 diphosphate Substances 0.000 description 2
- 235000011180 diphosphates Nutrition 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 125000003147 glycosyl group Chemical group 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 235000021096 natural sweeteners Nutrition 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 239000002777 nucleoside Substances 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 2
- 239000008363 phosphate buffer Substances 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 102000004169 proteins and genes Human genes 0.000 description 2
- QRGRAFPOLJOGRV-UHFFFAOYSA-N rebaudioside F Natural products CC12CCCC(C)(C1CCC34CC(=C)C(CCC23)(C4)OC5OC(CO)C(O)C(OC6OCC(O)C(O)C6O)C5OC7OC(CO)C(O)C(O)C7O)C(=O)OC8OC(CO)C(O)C(O)C8O QRGRAFPOLJOGRV-UHFFFAOYSA-N 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 239000008107 starch Substances 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 238000006491 synthase reaction Methods 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- YSCJAYPKBYRXEZ-HZPINHDXSA-N (2s,3s,4s,5r,6r)-6-[[(3s,4ar,6ar,6bs,8as,12as,14ar,14br)-4,4,6a,6b,11,11,14b-heptamethyl-8a-[(2s,3r,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxycarbonyl-1,2,3,4a,5,6,7,8,9,10,12,12a,14,14a-tetradecahydropicen-3-yl]oxy]-3-hydroxy-4-[(2s,3r,4s, Chemical compound O([C@H]1[C@H](O)[C@H](O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@H]1CC[C@]2(C)[C@H]3CC=C4[C@@]([C@@]3(CC[C@H]2C1(C)C)C)(C)CC[C@]1(CCC(C[C@H]14)(C)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)C(O)=O)[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O YSCJAYPKBYRXEZ-HZPINHDXSA-N 0.000 description 1
- TWCMVXMQHSVIOJ-UHFFFAOYSA-N Aglycone of yadanzioside D Natural products COC(=O)C12OCC34C(CC5C(=CC(O)C(O)C5(C)C3C(O)C1O)C)OC(=O)C(OC(=O)C)C24 TWCMVXMQHSVIOJ-UHFFFAOYSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- ANGAOPNEPIDLPO-XVYDVKMFSA-N Ala-His-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N ANGAOPNEPIDLPO-XVYDVKMFSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- 241000208838 Asteraceae Species 0.000 description 1
- PLMKQQMDOMTZGG-UHFFFAOYSA-N Astrantiagenin E-methylester Natural products CC12CCC(O)C(C)(CO)C1CCC1(C)C2CC=C2C3CC(C)(C)CCC3(C(=O)OC)CCC21C PLMKQQMDOMTZGG-UHFFFAOYSA-N 0.000 description 1
- 241000954177 Bangana ariza Species 0.000 description 1
- FEJCUYOGOBCFOQ-ACZMJKKPSA-N Cys-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N FEJCUYOGOBCFOQ-ACZMJKKPSA-N 0.000 description 1
- SDWZYDDNSMPBRM-AVGNSLFASA-N Cys-Gln-Phe Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SDWZYDDNSMPBRM-AVGNSLFASA-N 0.000 description 1
- 206010012735 Diarrhoea Diseases 0.000 description 1
- 229930186291 Dulcoside Natural products 0.000 description 1
- 206010013911 Dysgeusia Diseases 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- UMRIXLHPZZIOML-OALUTQOASA-N Gly-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN UMRIXLHPZZIOML-OALUTQOASA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 229920002527 Glycogen Polymers 0.000 description 1
- 235000001453 Glycyrrhiza echinata Nutrition 0.000 description 1
- 235000017382 Glycyrrhiza lepidota Nutrition 0.000 description 1
- YXBRCTXAEYSCHS-XVYDVKMFSA-N His-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YXBRCTXAEYSCHS-XVYDVKMFSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- PBJOQLUVSGXRSW-YTQUADARSA-N His-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N)C(=O)O PBJOQLUVSGXRSW-YTQUADARSA-N 0.000 description 1
- 206010020772 Hypertension Diseases 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- OAQJOXZPGHTJNA-NGTWOADLSA-N Ile-Trp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N OAQJOXZPGHTJNA-NGTWOADLSA-N 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- CHLJXFMOQGYDNH-SZMVWBNQSA-N Met-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 CHLJXFMOQGYDNH-SZMVWBNQSA-N 0.000 description 1
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 1
- RXWPLVRJQNWXRQ-IHRRRGAJSA-N Met-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 RXWPLVRJQNWXRQ-IHRRRGAJSA-N 0.000 description 1
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- LNXGEYIEEUZGGH-JYJNAYRXSA-N Met-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 LNXGEYIEEUZGGH-JYJNAYRXSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- GSPPWVHVBBSPSY-FHWLQOOXSA-N Pro-His-Trp Chemical compound OC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](Cc1cnc[nH]1)NC(=O)[C@@H]1CCCN1 GSPPWVHVBBSPSY-FHWLQOOXSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- ZJXXCGZFYQQETF-CYDGBPFRSA-N Pro-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 ZJXXCGZFYQQETF-CYDGBPFRSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- RPYRMTHVSUWHSV-UHFFFAOYSA-N Rebaudiosid D Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC(C1OC2C(C(O)C(O)C(CO)O2)O)OC(CO)C(O)C1OC1OC(CO)C(O)C(O)C1O RPYRMTHVSUWHSV-UHFFFAOYSA-N 0.000 description 1
- GIPHUOWOTCAJSR-UHFFFAOYSA-N Rebaudioside A. Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC1OC(CO)C(O)C(O)C1OC(C1O)OC(CO)C(O)C1OC1OC(CO)C(O)C(O)C1O GIPHUOWOTCAJSR-UHFFFAOYSA-N 0.000 description 1
- YWPVROCHNBYFTP-UHFFFAOYSA-N Rubusoside Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC1OC(CO)C(O)C(O)C1O YWPVROCHNBYFTP-UHFFFAOYSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- 244000269722 Thea sinensis Species 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- IWAVRIPRTCJAQO-HSHDSVGOSA-N Thr-Pro-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IWAVRIPRTCJAQO-HSHDSVGOSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- HOJPPPKZWFRTHJ-PJODQICGSA-N Trp-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HOJPPPKZWFRTHJ-PJODQICGSA-N 0.000 description 1
- VIWQOOBRKCGSDK-RYQLBKOJSA-N Trp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VIWQOOBRKCGSDK-RYQLBKOJSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- HXNVJPQADLRHGR-JBACZVJFSA-N Trp-Glu-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXNVJPQADLRHGR-JBACZVJFSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 1
- IVBJBFSWJDNQFW-XIRDDKMYSA-N Trp-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IVBJBFSWJDNQFW-XIRDDKMYSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- 239000000370 acceptor Substances 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 235000013361 beverage Nutrition 0.000 description 1
- HXCHCVDVKSCDHU-LULTVBGHSA-N calicheamicin Chemical compound C1[C@H](OC)[C@@H](NCC)CO[C@H]1O[C@H]1[C@H](O[C@@H]2C\3=C(NC(=O)OC)C(=O)C[C@](C/3=C/CSSSC)(O)C#C\C=C/C#C2)O[C@H](C)[C@@H](NO[C@@H]2O[C@H](C)[C@@H](SC(=O)C=3C(=C(OC)C(O[C@H]4[C@@H]([C@H](OC)[C@@H](O)[C@H](C)O4)O)=C(I)C=3C)OC)[C@@H](O)C2)[C@@H]1O HXCHCVDVKSCDHU-LULTVBGHSA-N 0.000 description 1
- 229930195731 calicheamicin Natural products 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 238000010612 desalination reaction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 239000000386 donor Substances 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 235000003599 food sweetener Nutrition 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 229940096919 glycogen Drugs 0.000 description 1
- 239000000937 glycosyl acceptor Substances 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- LPLVUJXQOOQHMX-QWBHMCJMSA-N glycyrrhizinic acid Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@H](O[C@@H]1O[C@@H]1C([C@H]2[C@]([C@@H]3[C@@]([C@@]4(CC[C@@]5(C)CC[C@@](C)(C[C@H]5C4=CC3=O)C(O)=O)C)(C)CC2)(C)CC1)(C)C)C(O)=O)[C@@H]1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O LPLVUJXQOOQHMX-QWBHMCJMSA-N 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 239000008123 high-intensity sweetener Substances 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- PFOARMALXZGCHY-UHFFFAOYSA-N homoegonol Natural products C1=C(OC)C(OC)=CC=C1C1=CC2=CC(CCCO)=CC(OC)=C2O1 PFOARMALXZGCHY-UHFFFAOYSA-N 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 201000001421 hyperglycemia Diseases 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 229940010454 licorice Drugs 0.000 description 1
- 235000011477 liquorice Nutrition 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 230000010534 mechanism of action Effects 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 235000013615 non-nutritive sweetener Nutrition 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 238000001953 recrystallisation Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 108010029895 rubimetide Proteins 0.000 description 1
- YWPVROCHNBYFTP-OSHKXICASA-N rubusoside Chemical compound O([C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O YWPVROCHNBYFTP-OSHKXICASA-N 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000003765 sweetening agent Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 238000006276 transfer reaction Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 108010045348 trehalose synthase Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 239000002351 wastewater Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P33/00—Preparation of steroids
- C12P33/20—Preparation of steroids containing heterocyclic rings
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本发明提供了一种糖基转移酶及其应用,所述糖基转移酶的氨基酸序列如SEQ ID NO:112或与SEQ ID NO:112具有至少99%序列同一性的氨基酸序列所示。本发明的糖基转移酶的酶活高、稳定性好;将其用于制备甜菊糖苷(例如莱鲍迪苷A、莱鲍迪苷D或莱鲍迪苷M)时与糖基转移酶亲本相比,在催化活性方面有了明显的提高,转化率显著提升,从而降低了反应的成本,利于工业化生产。
Description
技术领域
本发明涉及一种糖基转移酶及其在甜菊糖苷的糖基化反应中的应用。
背景技术
甜菊糖苷(Steviol glycosides,又称甜菊醇糖苷)是从菊科草本植物甜叶菊叶中提取的天然甜味剂,是多种糖苷的混和物,不同甜菊糖苷在味质上存在较大的差异。甜菊糖苷具有纯天然(来自纯天然植物甜叶菊)、高甜度(蔗糖的250~450倍)、低热量(仅为白糖的1/300)、使用经济(成本仅为蔗糖的三分之一)、稳定性好(耐热、耐酸、耐碱,不易出现分解现象)、安全性高(无毒副作用)等优点,以及抗高血糖、抗高血压、抗炎症、抗肿瘤、抗腹泻等潜在疗效。
甜菊糖苷(甜菊糖苷类化合物)的结构式如下:
序号 | 化合物 | R1 | R2 |
1 | 甜菊醇 | H | H |
2 | 甜菊醇单糖苷 | H | β-Glc |
3 | 甜菊醇双糖苷 | H | β-Glc-β-Glc(2-1) |
4 | 甜茶苷 | β-Glc | β-Glc |
5 | 甜菊苷(STV) | β-Glc | β-Glc-β-Glc(2-1) |
6 | 莱鲍迪苷A(RA) | β-Glc | β-Glc-β-Glc(2-1)-β-Glc(3-1) |
7 | 莱鲍迪苷B(RB) | H | β-Glc-β-Glc(2-1)-β-Glc(3-1) |
8 | 莱鲍迪苷C(RC) | β-Glc | β-Glc-α-Rha(2-1)-β-Glc(3-1) |
9 | 莱鲍迪苷D(RD) | β-Glc-β-Glc(2-1) | β-Glc-β-Glc(2-1)-β-Glc(3-1) |
10 | 莱鲍迪苷E(RE) | β-Glc-β-Glc(2-1) | β-Glc-β-Glc(2-1) |
11 | 莱鲍迪苷F(RF) | β-Glc | β-Glc-α-Xly(2-1)-β-Glc(3-1) |
12 | 莱鲍迪苷M(RM) | β-Glc-β-Glc(2-1)-β-Glc(3-1) | β-Glc-β-Glc(2-1)-β-Glc(3-1) |
13 | 杜可尔苷A | β-Glc | β-Glc-α-Rha(2-1) |
上述甜菊糖苷类化合物,具有共同的糖苷配基:甜菊醇(Steviol),区别在于C-13和C-19位置连接的糖基的数量和类型,主要包括甜菊苷(Stevioside)、莱鲍迪苷A(Rebaudioside A,Reb A)、莱鲍迪苷B、莱鲍迪苷C、莱鲍迪苷D(Rebaudioside D,Reb D)、莱鲍迪苷E、杜克苷、甜菊双糖苷等八种糖苷。甜菊的叶子能够累积多达10-20%(基于干重)甜菊糖苷。甜菊叶子中发现的主要糖苷是莱鲍迪苷A(2-10%)、甜菊苷(2-10%)和莱鲍迪苷C(1-2%)。其他糖苷,如莱鲍迪苷B、D、E和F,甜菊双糖苷和甜茶苷,以低得多的水平被发现(大约0-0.2%)。
虽然甜菊糖苷是一种高倍甜味剂,但存在后苦涩味这一缺点,严重限制了其在食品、饮料等对口感要求较高的领域中的应用。而引起甜菊糖苷后苦涩味的本质原因是其内在分子结构引起的,甜菊糖苷中的R1和R2基团上连接糖基数量越多口感越好。通常,发现甜菊苷比蔗糖甜110-270倍,莱鲍迪苷A为150至320倍,然而,即使在高度纯化的状态下,甜菊糖苷仍然具有不合需要的味道属性,如苦味、甜的余味、甘草味等。
莱鲍迪苷D是其中最有应用潜力的甜菊糖苷,与其它甜菊糖苷相比,其甜度高,约为蔗糖的300-350倍,且甜味纯正,口感也更接近蔗糖,没有苦味和甘草异味,稳定性好,是一种理想的天然高倍甜味剂产品。甜叶菊叶子中莱鲍迪苷D的含量极少(少于5%),采用提取法生产莱鲍迪苷D需要大量的甜叶菊原料,另外富集莱鲍迪苷D的工艺繁琐,提取后需要多次过柱和脱盐、脱色、重结晶,并在生产过程中产生大量的废水,其生产成本较高,不适合工业化大生产。
目前生物酶法合成莱鲍迪苷D的方法需要外加昂贵的UDP-葡萄糖为底物之一,通过UDP-葡萄糖基转移酶(UDP-glucosyltransferase,简称UGT)的作用,并且以甜菊苷或莱鲍迪苷A为底物,催化生成莱鲍迪苷D。但由于UDP-葡萄糖极高的售价,几乎完全限制了工业化制备莱鲍迪苷D的可行性,经济性较差、缺乏市场竞争力。
莱鲍迪苷M(Rebaudioside M,RebM)具有更好的口感特性,但其占叶子干重的含量小于0.1%,导致分离成本高、价格昂贵。生物催化法获得高浓度的莱鲍迪苷M已引起了学者的关注。目前报道,来源甜叶菊的重组酶能催化莱鲍迪苷D生成莱鲍迪苷M,但产量较低。以莱鲍迪苷D为底物,通过微生物产酶催化法可获得莱鲍迪苷M,该方法较传统的提取法,不仅改善了生产流程,并且降低了对环境的污染,提高了目的产物莱鲍迪苷M的产率。但目前以生物酶催化法主要存在以下几个问题:(1)以生物酶催化莱鲍迪苷D生产莱鲍迪苷M的成本较高,并且酶催化产率有待进一步优化;(2)用于催化的糖基转移酶不易与产物分离并回收利用,且易失活;(3)天然植物中莱鲍迪苷A含量很高,而莱鲍迪苷D含量非常低,以低成本由莱鲍迪苷A直接转化为莱鲍迪苷D也是亟待解决的难题。
葡萄糖基转移酶是在酶反应中只转移葡萄糖基的酶,该酶的作用机理是催化糖基供体的葡萄糖残基转移到糖基受体分子上,从而调节受体分子的活性。UDP-葡萄糖基转移酶是葡萄糖基转移酶中的一种,以UDP-葡萄糖作为糖基供体,几乎存在于所有有机体中。
UDP-葡萄糖是二磷酸尿苷葡糖(uridine diphosphate glucose)的简称,又简称为UDP-葡糖或者UDPG,是由尿苷二磷酸和葡萄糖组成的维生素,可看作“活性葡萄糖”,广泛分布于植物、动物和微生物的细胞内,在蔗糖、淀粉、糖原及其他寡糖和多糖合成中作葡萄糖基的供体,是最常见的一种糖基供体。
如今,随着天然甜味剂甜菊糖的广泛应用,以及生物催化技术的日益发展,UDP-葡萄糖基转移酶被越来越多地应用在甜菊糖苷的生物催化制备的领域中来。UDP-葡萄糖基转移酶的种类很多,目前甜菊糖苷的生物酶法制备领域中使用的酶多为来源于植物细胞中的野生酶,这种野生酶往往存在酶活低、稳定性差等缺点,从而导致应用于工业化大生产制备甜菊糖苷的成本较高。因此,有必要对UDP-葡萄糖基转移酶进行改造,从而获得酶活更高、稳定性更好的改造酶,以便更好地服务于工业化大生产。
发明内容
本发明所要解决的技术问题是现有的UDP-葡萄糖基转移酶被应用于甜菊糖苷的生物催化制备时酶活低、稳定性差因而用于催化甜菊糖苷时转化率不高等缺陷,因此本发明提供一种糖基转移酶(即,UDP-葡萄糖基转移酶)以及其在制备甜菊糖苷中的应用。本发明的糖基转移酶(GT)的酶活高、稳定性好;将其用于制备甜菊糖苷(例如莱鲍迪苷A、莱鲍迪苷D或莱鲍迪苷M)时与糖基转移酶亲本相比,在催化活性方面有了明显的提高,转化率显著提升,从而降低了反应的成本,利于工业化生产。
为了解决上述技术问题,本发明提供了一种糖基转移酶,所述糖基转移酶的氨基酸序列如SEQ ID NO:112或与SEQ ID NO:112具有至少99%序列同一性的氨基酸序列所示。
较佳地,所述“与SEQ ID NO:112具有至少99%序列同一性的氨基酸序列”为与SEQID NO:112相比包含选自以下一个或多个的残基位置处的氨基酸残基差异:
V14I;
E99L;
T254G;
L257A;
Q451E;
Q265E;
P271A;
R333K。
更佳地,在上述基础上,所述糖基转移酶的氨基酸序列与SEQ ID NO:112相比还可进一步包含选自以下残基位置处的氨基酸残基差异:K347P。
在某一较佳实施例中,所述糖基转移酶的氨基酸序列与SEQ ID NO:112相比包含选自以下一个残基位置处的氨基酸残基差异:V14I、E99L、T254G、L257A、Q451E、Q265E、P271A、或、R333K。
在某一较佳实施例中,所述糖基转移酶的氨基酸序列与SEQ ID NO:112相比包含选自以下两个残基位置处的氨基酸残基差异:Q265E和P271A。
在某一较佳实施例中,所述糖基转移酶的氨基酸序列与SEQ ID NO:112相比包含选自以下两个残基位置处的氨基酸残基差异:R333K和K347P。
在某一较佳实施例中,编码所述糖基转移酶的核苷酸序列可选自以下的序列:SEQID NO:41、SEQ ID NO:93、SEQ ID NO:94、SEQ ID NO:97、SEQ ID NO:98、SEQ ID NO:100、SEQ ID NO:102、SEQ ID NO:103、SEQ ID NO:104、SEQ ID NO:107、SEQ ID NO:108。
在一些实施方案中,糖基转移酶的氨基酸序列可在上述的特定位置以外的位置另外具有1-2、1-3、1-4、1-5、1-6、1-7、1-8、1-9、1-10、1-11、1-12、1-13、1-14、1-15、1-16、1-17、1-18、1-19、1-20个氨基酸残基的差异;这些残基的差异包括以保守氨基酸残基取代。通常这些取得不影响所述糖基转移酶的酶活。
为了解决上述技术问题,本发明提供了一种分离的核酸,所述核酸编码如上所述的糖基转移酶。
较佳地,所述核酸的核苷酸序列可选自以下的序列:SEQ ID NO:41、SEQ ID NO:93、SEQ ID NO:94、SEQ ID NO:97、SEQ ID NO:98、SEQ ID NO:100、SEQ ID NO:102、SEQ IDNO:103、SEQ ID NO:104、SEQ ID NO:107、SEQ ID NO:108。
为了解决上述技术问题,本发明提供了一种重组表达载体,其包含如上所述的核酸。
为了解决上述技术问题,本发明提供了一种转化体,其为包含如上所述的核酸或如上所述的重组表达载体的宿主细胞。
所述宿主细胞可为本领域常规,较佳地为埃希氏大肠杆菌(Escherichia coli)。
为了解决上述技术问题,本发明提供了一种制备如上所述的糖基转移酶的方法,所述方法包括在适于表达所述糖基转移酶的条件下培养如上所述的转化体。
所述转化体表达糖基转移酶后,可采用本领域常规技术手段进行提取,例如可制备粗酶液,粗酶液制备后可进行常规的浓缩、置换,也可将粗酶液进一步经离子交换层析、亲和层析、疏水层析和分子筛层析等纯化步骤中的一种或多种以提纯所述糖基转移酶。在某一较佳实施例中,可采用以下步骤:(1)将含所述糖基转移酶的转化体接种至含抗生素的培养基例如LB培养基中振荡培养,得种子液;(2)将(1)中的种子液转接至含抗生素的培养基例如TB培养基中振荡培养;(3)向(2)中的培养基中加入IPTG诱导过夜,离心后收集菌体;(4)洗涤并重悬(3)中收集的菌体,破碎后离心,即得含所述糖基转移酶的粗酶液。
为了解决上述技术问题,本发明提供了一种组合物,其包含如上所述的糖基转移酶。所述组合物例如可以酶制剂的形式存在。
为了解决上述技术问题,本发明提供了一种用于底物的糖基化的方法,所述方法包括提供至少一种底物、如上所述的糖基转移酶,并在使得所述底物被糖基化以产生至少一种糖基化产物的条件下使所述底物与所述糖基转移酶接触;所述底物优选包括至少一种甜菊糖苷例如甜菊苷、莱鲍迪苷A或莱鲍迪苷D等。
为了解决上述技术问题,本发明提供了一种莱鲍迪苷A的制备方法,其特征在于,所述制备方法包括以下步骤:在如上所述的糖基转移酶的存在下,将甜菊苷(Stevioside)和糖基供体进行反应(例如在使得甜菊苷被糖基化以产生莱鲍迪苷A的条件下),即得莱鲍迪苷A。
较佳地,所述糖基转移酶以糖基转移酶菌泥的形式存在。
较佳地,所述甜菊苷的浓度1-150g/L,优选100g/L。
较佳地,所述糖基供体与甜菊苷的摩尔比为1:1~5:1例如2.5:1。
较佳地,所述糖基供体为UDP-葡萄糖。在某一较佳实施例中,使用的糖基供体优选通过蔗糖和UDP在蔗糖合成酶的存在下制得,所述蔗糖的浓度优选为100-300g/L例如200g/L,所述UDP的浓度优选为0.05-0.2g/L例如0.1g/L。合酶(例如蔗糖合酶或海藻糖合酶)通常以相反的方向作用,以从核苷二磷酸和葡萄糖供体(例如蔗糖、海藻糖或淀粉)形成核苷二磷酸葡萄糖化合物。本发明中,所述的蔗糖合成酶(又称为蔗糖合酶,Sucrose synthase,简称为SuSy或SUS)催化可逆的葡萄糖基转移反应,通过蔗糖合成酶催化蔗糖和UDP产生UDP-葡萄糖,有效地为UDP-糖基转移酶提供了UDP-糖供体。在蔗糖合成酶和UDP-糖基转移酶构建的反应体系中,产生UDP循环,实现UDP-葡萄糖的再生,避免直接使用昂贵的UDP-葡萄糖,降低了生产成本。
较佳地,所述反应的反应溶剂的pH为5-8,优选6。
较佳地,所述反应时的转速为500-1000rpm,优选600rpm。
较佳地,所述反应的反应体系的温度为20-90℃,优选60℃。
为了解决上述技术问题,本发明提供了一种莱鲍迪苷D或莱鲍迪苷M的制备方法,其包括根据如上所述的制备方法制备莱鲍迪苷A的步骤。
在某一较佳实施例中,所述方法包括提供甜菊苷底物、糖基供体和如前所述的糖基转移酶,在使得产生莱鲍迪苷D或莱鲍迪苷M的条件下将甜菊苷底物、糖基供体和如前所述的糖基转移酶反应。
为了解决上述技术问题,本发明提供了一种如上所述的糖基转移酶在制备甜菊糖苷中的用途;所述甜菊糖苷优选为莱鲍迪苷A、莱鲍迪苷D或莱鲍迪苷M。
本发明中,所述的甜菊糖苷(Steviol glycosides)又可称为甜菊醇糖苷,其结构式和所包含的化合物种类具体可参见本发明的背景技术部分。
本发明的积极进步效果在于:
本发明的糖基转移酶的酶活高、稳定性好;将其用于制备甜菊糖苷(例如莱鲍迪苷A、莱鲍迪苷D或莱鲍迪苷M)时与糖基转移酶亲本相比,在催化活性方面有了明显的提高,转化率显著提升,从而降低了反应的成本,利于工业化生产。
附图说明
图1显示了本发明的实施例中由甜菊苷制备莱鲍迪苷A、莱鲍迪苷D、莱鲍迪苷M的路线示意图。
图2显示了本发明的实施例中的酶催化反应体系合成莱鲍迪苷A,实现UDPG在生物催化反应中的循环。
图3为第二轮筛选时所用底物甜菊苷的图谱,甜菊苷的保留时间为12.76min。
图4为产物莱鲍迪苷A对照品的图谱,保留时间12.38min。
图5是表10中复筛Enz.11催化合成RA活性的HPLC图;
图6是表10中复筛Enz.24催化合成RA活性的HPLC图。
具体实施方式
下面通过实施例的方式进一步说明本发明,但并不因此将本发明限制在所述的实施例范围之中。下列实施例中未注明具体条件的实验方法,按照常规方法和条件,或按照商品说明书选择。
本发明中的实验方法如无特别说明均为常规方法,基因克隆操作具体可参考J.萨姆布鲁克等编的《分子克隆实验指南》。
本发明中的氨基酸简写符号如无特殊说明均为本领域常规,具体简写符号对应的氨基酸如表1所示。
表1
所述氨基酸对应的密码子也为本领域常规,具体氨基酸与密码子的对应关系如表2所示。
表2
KOD Mix酶购自TOYOBO CO.,LTD.,DpnI酶购买自英潍捷基(上海)贸易有限公司;E.coli Trans10感受态细胞购买自北京鼎国昌盛生物技术有限责任公司,E.coli BL21(DE3)感受态细胞购买自北京鼎国昌盛生物技术有限责任公司。蔗糖购自生工生物。第一轮筛选所用反应底物RA60(甜菊苷,其中RA含量60%,晨光生物,产品规格TSG90/RA60)。第二轮筛选所用反应底物甜菊苷购自毕得医药(纯度95%)。Reb A对照品购自麦克林。
转化率HPLC检测方法:色谱柱:Agilent 5TC-C18(2)(250×4.6mm)。流动相:0.1%FA水溶液为流动相A,0.1%FA乙腈溶液为流动相B,按下表3进行梯度洗脱。检测波长:210nm;流速:1ml/min;进样体积:20μl;柱温:40℃。甜菊苷出峰时间:12.76min;Reb A出峰时间:12.38min。
表3
时间(分钟) | 流动相A% | 流动相B% |
0.00 | 70 | 30 |
15.00 | 60 | 40 |
20.00 | 30 | 70 |
25.00 | 30 | 70 |
25.10 | 70 | 30 |
32.00 | 70 | 30 |
本发明的实施例主要以UDP-葡萄糖基转移酶亲本Enz.1为模板,第一轮筛选出了308位点突变为N的糖基转移酶突变体Enz.11,以Enz.11为模板进行第二轮筛选,筛选出来了双位点或三位点突变的UDP-葡萄糖基转移酶。
实施例1第一轮GT010-308突变体文库的构建
全合成SEQ ID NO:1所示的编号为Enz.1的β-1,3-糖基转移酶(β-1,3-GT酶)酶基因,该基因已连接在pET28a质粒载体上,得到重组质粒pET28a-GT010,基因合成公司为生工生物工程(上海)股份有限公司(上海市松江区香闵路698号)。
以pET28a-GT010质粒为模板,采用表4所示的引物序列,分别以GT010-L308X-F/ET-R和ET-F/GT010-L308X-R为引物(其中X为:A、D、E、G、H、I、K、M、N、P、S、V、W),采用KOD酶进行PCR扩增目标DNA片段和载体片段。
表4
PCR扩增反应体系为:
试剂 | 用量(μL) |
KOD Mix酶 | 25 |
引物F | 2 |
引物R | 2 |
模板 | 1 |
去离子水 | 20 |
扩增程序如下:
对PCR产物进行DpnI消化并进行跑胶及胶回收得到目标DNA片段。通过诺唯赞的两片段同源重组酶(Exnase II)连接。连接后转化至E.coli Trans10感受态细胞,涂布在含有50μg/mL卡纳霉素的LB培养基,37℃培养过夜;挑点培养并进行测序,得到含有突变体基因的重组质粒。
实施例2UDP-葡萄糖基转移酶突变体的制备
1.进行突变载体的蛋白表达:
将测序正确的上述重组质粒转化至宿主E.coli BL21(DE3)感受态细胞,得到含有点突变的基因工程菌株。挑单菌落接种至含50μg/ml卡那霉素的5ml LB液体培养基中,37℃震荡培养4h。按2v/v%接种量转接至50ml同样含50μg/ml卡那霉素的新鲜TB液体培养基中,37℃震荡培养至OD600达到0.6-0.8时,加入IPTG(异丙基-β-D-硫代半乳糖苷,Isopropylβ-D-thiogalactoside)至其终浓度为0.1mM,25℃诱导培养20h。培养结束后,将培养液10000rpm离心10min,弃上清液,收集菌体(即,菌泥)。-20℃保存备用。
2.粗酶液的获取:
配制50mM pH6.0的磷酸缓冲液(PBS),将上述所得菌泥按照(M/V)1:5进行悬浮,之后,进行均质获得粗酶液,粗酶液经离心,取上清获得UDP-葡萄糖基转移酶突变体的粗酶液。
实施例3蔗糖合成酶SUS的制备
全合成SEQ ID NO:31所示的编号为Enz.2的蔗糖合成酶(SUS)基因,该基因已连接在pET28a质粒载体上得到重组质粒pET28a-SUS。基因合成公司为生工生物工程(上海)股份有限公司(上海市松江区香闵路698号)。
将质粒pET28a-SUS转化至宿主E.coli BL21(DE3)感受态细胞,得到Enz.2基因工程菌株。挑单菌落接种至含50μg/ml卡那霉素的5ml LB液体培养基中,37℃震荡培养4h。按2v/v%接种量转接至50ml同样含50μg/ml卡那霉素的新鲜TB液体培养基中,37℃震荡培养至OD600达到0.6-0.8时,加入IPTG至其终浓度为0.1mM,25℃诱导培养20h。培养结束后,将培养液10000rpm离心10min,弃上清液,收集菌体(即,GT012菌泥)。-20℃保存备用。
配制50mM pH6.0的磷酸缓冲液(PBS),将上述所得Enz.2菌泥按照(M/V)1:5进行悬浮,之后,进行均质获得粗酶液,粗酶液经离心,取上清获得蔗糖合成酶SUS(酶编号Enz.2,氨基酸序列如SEQ ID NO:32所示)的粗酶液。
实施例4第一轮突变体的筛选
将实施例2和实施例3中得到的粗酶液分别进行80℃恒温孵育20min,离心取上清即分别获得UDP-葡萄糖基转移酶突变体反应酶液和蔗糖合成酶反应酶液。
以RA60为底物,1mL反应体系中,加入UDP-葡萄糖基转移酶突变体的反应酶液150μL,RA60终浓度为100g/L,UDP终浓度为0.1g/L,蔗糖终浓度为200g/L,蔗糖合成酶反应酶液30μL,最后加入50mM pH6.0磷酸缓冲液至终体积1mL。将配制好的反应体系置于金属浴中,60℃,600rpm下反应60min,反应液稀释50倍,进行HPLC分析Reb A的浓度(详见表5的RebA%)。(蔗糖合成酶是用于将蔗糖上葡萄糖基转移至UDP上合成UDPG)。初筛结果如表5所示。
表5
酶编号 | 突变位点 | Reb A% | DNA SEQ ID NO: |
Enz.1 | / | 93.787 | 2 |
Enz.3 | L308A | 87.771% | 33 |
Enz.4 | L308D | 72.570% | 34 |
Enz.5 | L308E | 74.487% | 35 |
Enz.6 | L308G | 81.027% | 36 |
Enz.7 | L308H | 89.885% | 37 |
Enz.8 | L308I | 83.863% | 38 |
Enz.9 | L308K | 75.395% | 39 |
Enz.10 | L308M | 84.221% | 40 |
Enz.11 | L308N | 96.157% | 41 |
Enz.12 | L308P | 75.643% | 42 |
Enz.13 | L308S | 72.799% | 43 |
Enz.14 | L308V | 87.985% | 44 |
Enz.15 | L308W | 91.806% | 45 |
由表5中的初筛结果可知:相对于Enz.1,Enz.11催化效果较好,所得Reb A的产率较高。
2.复筛
采用Boil和UnBoil两种反应条件进行复筛,Boil复筛反应条件和初筛反应条件相同,UnBoil指不加热条件下反应,其余反应条件和Boil反应条件相同。复筛结果如表6所示(表中%数值对应反应液中Reb A的含量)。
表6
酶编号 | Enz.3 | Enz.7 | Enz.1 | Enz.11 | Enz.14 | Enz.15 |
Boil | 84.212% | 85.939% | 92.314% | 92.100% | 83.793% | 87.168% |
UnBoil | 79.577% | 82.266% | 84.522% | 83.475% | 80.686% | 81.180% |
由表6中的复筛结果可知:相对于Enz.1,Enz.11效果最好,其反应效果与Enz.1相当。
实施例5第二轮突变体文库的构建
将第一轮得到的编码糖基转移酶(β-1,3-GT酶)Enz.11的基因连接载体pET28a,得到pET28a-Enz.11重组质粒,以pET28a-Enz.11为模板,采用表7所示的引物序列,采用KOD酶进行PCR扩增,获得目标突变体Enz.16-Enz.34的基因片段和载体片段。
以pET28a-Enz.34质粒为模板,GT029-L378G-F/Km-R和Km-F/GT029-L378G-R为引物序列,PCR扩增目标突变体Enz.35的基因片段和载体片段。
表7
PCR扩增反应体系为表8-1所示:
表8-1
扩增程序如下表8-2所示:
表8-2
对PCR产物进行DpnI消化并进行跑胶及胶回收。采用诺唯赞两片段同源重组酶进行连接。连接完成转化至E.coli Trans10感受态细胞,涂布在含有50μg/mL卡纳霉素的LB培养基,37℃培养过夜;挑点培养并进行测序,得到含有突变体基因的重组质粒。
实施例6UDP-葡萄糖基转移酶突变体的制备
1.进行突变载体的蛋白表达:
将实施例5中测序正确的重组质粒转化至宿主E.coli BL21(DE3)感受态细胞,得到含有点突变的基因工程菌株。挑单菌落接种至含50μg/ml卡那霉素的5ml LB液体培养基中,37℃震荡培养4h。按2v/v%接种量转接至50ml同样含50μg/ml卡那霉素的新鲜TB液体培养基中,37℃震荡培养至OD600达到0.6-0.8时,加入IPTG至其终浓度为0.1mM,25℃诱导培养20h。培养结束后,将培养液4000rpm离心20min,弃上清液,收集菌体(即菌泥)。-20℃保存备用。
2.粗酶液的获取:
配制50mM pH6.0的磷酸缓冲液(PBS),将上述所得菌泥按照(M/V)1:10进行悬浮,之后,进行均质获得粗酶液,粗酶液经离心,取上清获得UDP-葡萄糖基转移酶突变体的粗酶液。-4℃保存备用。
实施例7第二轮突变体的筛选
1.初筛
将实施例6和实施例2中得到的粗酶液分别进行80℃恒温孵育15min,离心取上清即分别获得UDP-葡萄糖基转移酶反应酶液和蔗糖合成酶反应酶液。
以甜菊苷(甜菊苷含量95%,毕得医药)为底物,1mL反应体系中,加入UDP-葡萄糖基转移酶突变体的反应酶液150μL,甜菊苷终浓度为100g/L,UDP终浓度为0.1g/L,蔗糖终浓度为200g/L,蔗糖合成酶30μL,最后加入50mM pH6.0磷酸缓冲液至终体积1mL。将配制好的反应体系置于金属浴中,60℃,600rpm下反应60min,稀释50倍,进行HPLC分析Reb A的浓度。以Enz.1和Enz.11作为双对照对20个突变体进行筛选。初筛结果如表9所示。
表9
由表9中的初筛结果可知:Enz.17、Enz.18、Enz.21、Enz.22、Enz.24、Enz.26、Enz.27、Enz.28、Enz.31、Enz.32均优于对照10%以上,选择这些突变体进行复筛。此外,前述反应是将粗酶液进行80℃恒温孵育15min,离心取上清获得反应酶液再进行的反应,可以看出酶的稳定性很好。
2.复筛
复筛反应条件和初筛反应条件相同。复筛结果如表10所示。
表10
酶 | Reb A% |
Enz.1 | 47.066 |
Enz.11 | 61.639 |
Enz.17 | 73.639 |
Enz.18 | 72.940 |
Enz.21 | 66.063 |
Enz.22 | 74.680 |
Enz.24 | 76.903 |
Enz.26 | 72.557 |
Enz.27 | 63.514 |
Enz.28 | 61.659 |
Enz.31 | 66.672 |
Enz.32 | 59.815 |
由表10中的复筛结果确认Enz.17、Enz.18、Enz.21、Enz.22、Enz.24、Enz.26、Enz.27、Enz.28、Enz.31、Enz.32均优于亲本对照10%以上。与UDP-葡萄糖基转移酶亲本Enz.1相比,上述所得UDP-葡萄糖基转移酶突变体在催化活性方面有了明显的提高。
图1显示了本发明的实施例中由甜菊苷制备莱鲍迪苷A、莱鲍迪苷D、莱鲍迪苷M的路线示意图;图2显示了本发明的实施例中的酶催化反应体系合成莱鲍迪苷A,实现UDPG在生物催化反应中的循环。图3为第二轮筛选时所用底物甜菊苷的图谱,保留时间12.76min。图4为产物莱鲍迪苷A对照品的图谱,保留时间12.38min。图5是表10中复筛Enz.11催化合成RA活性的HPLC图;
图6是表10中复筛Enz.24催化合成RA活性的HPLC图。
SEQUENCE LISTING
<110> 弈柯莱生物科技(上海)股份有限公司
<120> 一种糖基转移酶及其应用
<130> P21014606C
<160> 112
<170> PatentIn version 3.5
<210> 1
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.1
<400> 1
atgccgaaca ctaacccaac taccgtgcgt cgtcgtcgtg ttattatgtt tccggttccg 60
ttcccgggcc acttaaaccc gatgctgcaa ctggcgaacg tgctgtaccg tagaggtttt 120
gaaatcacca ttctgcacac caacttcaac gccccgaaaa ccagccttta tccgcacttc 180
cagtttcgtt ttatcttgga caacgatccg caaccggagt ggttacgcaa cctgccgacg 240
actggtccgg gcgtgggtgc aagaatcccg gtaattaaca aacacggcgc ggatgaattc 300
cgtaaggagc tggaaatctg catgcgggat actccgagtg acgaggaagt tgcttgcgtg 360
attaccgatg cgctgtggta cttcgcgcaa ccggtggcgg acagcctgaa tctgaaacgt 420
ctggttctgc agaccgggag cctgtttaac ttccactgcc tggtgtgtct gccgaaattt 480
ctggagttgg gctacctgga tccggaaact aaacatcgtc cggatgaacc ggtggtaggt 540
ttcccgatgc tgaaggttaa agatatccgt cgcgcgtatt cgcacattca agaatcgaaa 600
ccaattctga tgaagatggt tgaagaaacc cgtgccagca gcggtgtgat ttggaacagc 660
gctaaagagc tggaggaaag cgagctggaa accattcagc gtgaaattcc ggcgccgagc 720
ttcctgcttc cgctgccgaa gcattatagg gcttcgagca ctagcctgct ggatactgat 780
ccgagcaccg cccaatggct ggaccagcag ccgccgagca gcgtgctgta cgttggcttt 840
ggcagccaga gctcgctgga ccccgcagat ttcctggaga ttgcgcgtgg tctggttgcg 900
agcaaacaaa gctttctgtg gctggttcgt ccgggcttcg tgaagggtta tgagtggatt 960
gagctgctgc cggatggttt tctgggtgaa aaaggtcgta tcgtgaagtc tgctccgcaa 1020
caagaagtgc tggcgcacaa ggcgattggt gcgttctgga cccacggcgg ttggaacggc 1080
accatggagg ccgtgtgcga aggcgtgccg atgatcttta gcgatttcgg tctggatcag 1140
ccgctgaacg cgcgttacat gagcgaggtt ctgcatgtgg gcgtttatct ggagaacggc 1200
ttcatccgtg gtgagatcat taatgcggtt aggcgtgtga tggttgaccc tgagggtgag 1260
gttatgcgcc aaaacgcgcg taaattgaag gataagttgg atcgaagcat tgctcccggt 1320
ggcagcagct acgagagcct ggaacgcctg cagagctata ttagcagcct g 1371
<210> 2
<211> 457
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.1
<400> 2
Met Pro Asn Thr Asn Pro Thr Thr Val Arg Arg Arg Arg Val Ile Met
1 5 10 15
Phe Pro Val Pro Phe Pro Gly His Leu Asn Pro Met Leu Gln Leu Ala
20 25 30
Asn Val Leu Tyr Arg Arg Gly Phe Glu Ile Thr Ile Leu His Thr Asn
35 40 45
Phe Asn Ala Pro Lys Thr Ser Leu Tyr Pro His Phe Gln Phe Arg Phe
50 55 60
Ile Leu Asp Asn Asp Pro Gln Pro Glu Trp Leu Arg Asn Leu Pro Thr
65 70 75 80
Thr Gly Pro Gly Val Gly Ala Arg Ile Pro Val Ile Asn Lys His Gly
85 90 95
Ala Asp Glu Phe Arg Lys Glu Leu Glu Ile Cys Met Arg Asp Thr Pro
100 105 110
Ser Asp Glu Glu Val Ala Cys Val Ile Thr Asp Ala Leu Trp Tyr Phe
115 120 125
Ala Gln Pro Val Ala Asp Ser Leu Asn Leu Lys Arg Leu Val Leu Gln
130 135 140
Thr Gly Ser Leu Phe Asn Phe His Cys Leu Val Cys Leu Pro Lys Phe
145 150 155 160
Leu Glu Leu Gly Tyr Leu Asp Pro Glu Thr Lys His Arg Pro Asp Glu
165 170 175
Pro Val Val Gly Phe Pro Met Leu Lys Val Lys Asp Ile Arg Arg Ala
180 185 190
Tyr Ser His Ile Gln Glu Ser Lys Pro Ile Leu Met Lys Met Val Glu
195 200 205
Glu Thr Arg Ala Ser Ser Gly Val Ile Trp Asn Ser Ala Lys Glu Leu
210 215 220
Glu Glu Ser Glu Leu Glu Thr Ile Gln Arg Glu Ile Pro Ala Pro Ser
225 230 235 240
Phe Leu Leu Pro Leu Pro Lys His Tyr Arg Ala Ser Ser Thr Ser Leu
245 250 255
Leu Asp Thr Asp Pro Ser Thr Ala Gln Trp Leu Asp Gln Gln Pro Pro
260 265 270
Ser Ser Val Leu Tyr Val Gly Phe Gly Ser Gln Ser Ser Leu Asp Pro
275 280 285
Ala Asp Phe Leu Glu Ile Ala Arg Gly Leu Val Ala Ser Lys Gln Ser
290 295 300
Phe Leu Trp Leu Val Arg Pro Gly Phe Val Lys Gly Tyr Glu Trp Ile
305 310 315 320
Glu Leu Leu Pro Asp Gly Phe Leu Gly Glu Lys Gly Arg Ile Val Lys
325 330 335
Ser Ala Pro Gln Gln Glu Val Leu Ala His Lys Ala Ile Gly Ala Phe
340 345 350
Trp Thr His Gly Gly Trp Asn Gly Thr Met Glu Ala Val Cys Glu Gly
355 360 365
Val Pro Met Ile Phe Ser Asp Phe Gly Leu Asp Gln Pro Leu Asn Ala
370 375 380
Arg Tyr Met Ser Glu Val Leu His Val Gly Val Tyr Leu Glu Asn Gly
385 390 395 400
Phe Ile Arg Gly Glu Ile Ile Asn Ala Val Arg Arg Val Met Val Asp
405 410 415
Pro Glu Gly Glu Val Met Arg Gln Asn Ala Arg Lys Leu Lys Asp Lys
420 425 430
Leu Asp Arg Ser Ile Ala Pro Gly Gly Ser Ser Tyr Glu Ser Leu Glu
435 440 445
Arg Leu Gln Ser Tyr Ile Ser Ser Leu
450 455
<210> 3
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308A-F
<400> 3
gctttctgtg ggcggttcgt ccgggcttcg tg 32
<210> 4
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308A-R
<400> 4
ggacgaaccg cccacagaaa gctttgtttg 30
<210> 5
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308D-F
<400> 5
gctttctgtg ggatgttcgt ccgggcttcg tg 32
<210> 6
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308D-R
<400> 6
ggacgaacat cccacagaaa gctttgtttg 30
<210> 7
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308E-F
<400> 7
gctttctgtg ggaggttcgt ccgggcttcg tg 32
<210> 8
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308E-R
<400> 8
ggacgaacct cccacagaaa gctttgtttg 30
<210> 9
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308G-F
<400> 9
gctttctgtg gggtgttcgt ccgggcttcg tg 32
<210> 10
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308G-R
<400> 10
ggacgaacac cccacagaaa gctttgtttg 30
<210> 11
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308H-F
<400> 11
gctttctgtg gcatgttcgt ccgggcttcg tg 32
<210> 12
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308H-R
<400> 12
ggacgaacat cccacagaaa gctttgtttg 30
<210> 13
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308I-F
<400> 13
gctttctgtg gattgttcgt ccgggcttcg tg 32
<210> 14
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308I-R
<400> 14
ggacgaacaa tccacagaaa gctttgtttg 30
<210> 15
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308K-F
<400> 15
gctttctgtg gaaggttcgt ccgggcttcg tg 32
<210> 16
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308K-R
<400> 16
ggacgaacct tccacagaaa gctttgtttg 30
<210> 17
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308M-F
<400> 17
gctttctgtg gatggttcgt ccgggcttcg tg 32
<210> 18
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308M-R
<400> 18
ggacgaacca tccacagaaa gctttgtttg 30
<210> 19
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308N-F
<400> 19
gctttctgtg gaacgttcgt ccgggcttcg tg 32
<210> 20
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308N-R
<400> 20
ggacgaacgt tccacagaaa gctttgtttg 30
<210> 21
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308P-F
<400> 21
gctttctgtg gccggttcgt ccgggcttcg tg 32
<210> 22
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308P-R
<400> 22
ggacgaaccg gccacagaaa gctttgtttg 30
<210> 23
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308S-F
<400> 23
gctttctgtg gagcgttcgt ccgggcttcg tg 32
<210> 24
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308S-R
<400> 24
ggacgaacgc tccacagaaa gctttgtttg 30
<210> 25
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308V-F
<400> 25
gctttctgtg ggtggttcgt ccgggcttcg tg 32
<210> 26
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308V-R
<400> 26
ggacgaacca cccacagaaa gctttgtttg 30
<210> 27
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308W-F
<400> 27
gctttctgtg gtgggttcgt ccgggcttcg tg 32
<210> 28
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT010-L308W-R
<400> 28
ggacgaaccc accacagaaa gctttgtttg 30
<210> 29
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> ET-F
<400> 29
cttgtctgct cccggcatc 19
<210> 30
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> ET-R
<400> 30
cttgtctgta agcggatgcc 20
<210> 31
<211> 2412
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.2
<400> 31
atgcaccatc atcatcatca tggcggtagc ggcatgattg aagtactgcg ccaacagctg 60
ctggatagcc cgcgttcatg gcgtgcattc ctgcgtcatt tagtcgcatc tcagcgtgac 120
tcatggctac ataccgattt acagcacgcg tgcaagacgt ttcgtgaaca gcctccggaa 180
ggctatcctg aagatattgg ttggctggca gattttattg cgcattgcca ggaagcgatc 240
ttccgggatc cgtggatggt ttttgcgtgg cgtctacgtc caggtgtttg ggagtatgtg 300
cgcatacatg tagaacagct ggcggtggag gagctgagca ctgatgaata tctgcaagcc 360
aaagaacaac ttgttggctt aggtgcagaa ggtgaagctg ttctgacggt ggatttcgaa 420
gattttcgtc cggtgagcca gcgtttaaaa gacgagagca ccattggtga tggtcttacc 480
catctgaatc gtcatttagc aggtcgcatc tggactgatt tagcagcagg tcgtagtgct 540
attctggaat ttctgggcct gcatcgtctg gataaccaga atctgatgct gagcaacggc 600
aataccgatt ttgactcttt acgtcaaacc gtacaatatc tgggcacctt accaagagaa 660
actccgtggg cagagtttcg tgaagacatg cgtcgtcgtg gttttgaacc cggttggggc 720
aacaccgcgg gccgtgttcg cgaaaccatg cgtctgctga tggatctgct tgactctccg 780
agcccagctg ccctggagag cttcctggat cgcatcccga tgattagcaa cgttctgatc 840
gtgagcattc acggatggtt tgcgcaggac aaggttctgg gtcgtccgga cactggtggt 900
caggtcgtgt atattctgga tcaggcccgt gcactggaac gcgaaatgcg taaccgcctg 960
cgccaacagg gtgttgatgt ggagccgcgc attttgattg cgacccgttt aatcccggaa 1020
agtgatggca cgacttgtga ccagcgtctg gagcctgtcc atggtgccga gaatgtgcag 1080
attctgcgcg ttccgtttcg ctatgaggat ggtcgtattc acccgcattg gatctcacgc 1140
ttcaaggttt ggccgtatct tgaacgctat gcaagggatc tggaacgcga agttaaggcc 1200
gaattaggta gtcgtccaga tctgatcatc ggcaactata gcgacggtgg gctggttgca 1260
accatcctgt cagaaaaatt aggtgttacg cagtgcaaca ttgcacatgc cctggagaaa 1320
agcaagtacc cggggtccga tctgcattgg ccgctgtatg aacaggacca tcactttgcg 1380
tgtcagttta ccgcggatct gatcgcgatg aatgcagcag acatcatcgt gacgagcaca 1440
taccaggaaa ttgcaggtaa tgaccgcgag gttggtcaat atgaatctca ccaggactat 1500
actttaccgg gcttgtatcg tgtcgagaat ggtattgacg tgttcgatag caagtttaac 1560
attgtgagtc cgggcgcaga tccgagtacg tattttagct atgcccgtca tgaagaacgc 1620
ttctcgtcgc tgtggccaga aatcgaaagt ctgctgtttg gccgcgaacc aggtccggat 1680
attcgtggtg ttctcgaaga tcctcagaaa ccgattattc tgtcggtggc ccgtatggat 1740
cgcatcaaga acctgagcgg tctggccgaa ctgtatggtc ggagtgcgcg cttacgtagc 1800
ctggccaatt tggtgatcat cggtggtcat gttgatgtac aggccagtat ggatgcagaa 1860
gaacgcgaag aaatccgtcg tatgcacgag atcatggacc gctaccagct ggatggtcag 1920
atgcgttggg tgggatcgca tctggataaa cgcgtcgtgg gcgaattgta tcgtgtagtg 1980
gcggatggac gtggcgtttt tgtgcaacca gccctgtttg aggcgttcgg cctgaccgtg 2040
attgaggcaa tgagcagtgg cctgccagtg tttgcgaccc gccacggtgg tccgctggaa 2100
atcatcgaag acggcgttag cggcttccat attgatccca acgaccctga agcggtagca 2160
gaaaaactgg ccgacttcct ggaagcagcg cgtgaacgtc cgaagtattg ggaggaaatt 2220
agccaggcgg ctcttgcgcg cgtcagcgaa cgttacacgt gggagcgcta tgcggaacgc 2280
ttgatgacca tcgcgcgttg cttcggcttt tggcgcttcg ttctgtcacg cgaatcacag 2340
gtcatggaac gctatctgca aatgttccgc cacctgcaat ggcgcccgct ggctcatgcc 2400
gtaccgatgg ag 2412
<210> 32
<211> 804
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.2
<400> 32
Met His His His His His His Gly Gly Ser Gly Met Ile Glu Val Leu
1 5 10 15
Arg Gln Gln Leu Leu Asp Ser Pro Arg Ser Trp Arg Ala Phe Leu Arg
20 25 30
His Leu Val Ala Ser Gln Arg Asp Ser Trp Leu His Thr Asp Leu Gln
35 40 45
His Ala Cys Lys Thr Phe Arg Glu Gln Pro Pro Glu Gly Tyr Pro Glu
50 55 60
Asp Ile Gly Trp Leu Ala Asp Phe Ile Ala His Cys Gln Glu Ala Ile
65 70 75 80
Phe Arg Asp Pro Trp Met Val Phe Ala Trp Arg Leu Arg Pro Gly Val
85 90 95
Trp Glu Tyr Val Arg Ile His Val Glu Gln Leu Ala Val Glu Glu Leu
100 105 110
Ser Thr Asp Glu Tyr Leu Gln Ala Lys Glu Gln Leu Val Gly Leu Gly
115 120 125
Ala Glu Gly Glu Ala Val Leu Thr Val Asp Phe Glu Asp Phe Arg Pro
130 135 140
Val Ser Gln Arg Leu Lys Asp Glu Ser Thr Ile Gly Asp Gly Leu Thr
145 150 155 160
His Leu Asn Arg His Leu Ala Gly Arg Ile Trp Thr Asp Leu Ala Ala
165 170 175
Gly Arg Ser Ala Ile Leu Glu Phe Leu Gly Leu His Arg Leu Asp Asn
180 185 190
Gln Asn Leu Met Leu Ser Asn Gly Asn Thr Asp Phe Asp Ser Leu Arg
195 200 205
Gln Thr Val Gln Tyr Leu Gly Thr Leu Pro Arg Glu Thr Pro Trp Ala
210 215 220
Glu Phe Arg Glu Asp Met Arg Arg Arg Gly Phe Glu Pro Gly Trp Gly
225 230 235 240
Asn Thr Ala Gly Arg Val Arg Glu Thr Met Arg Leu Leu Met Asp Leu
245 250 255
Leu Asp Ser Pro Ser Pro Ala Ala Leu Glu Ser Phe Leu Asp Arg Ile
260 265 270
Pro Met Ile Ser Asn Val Leu Ile Val Ser Ile His Gly Trp Phe Ala
275 280 285
Gln Asp Lys Val Leu Gly Arg Pro Asp Thr Gly Gly Gln Val Val Tyr
290 295 300
Ile Leu Asp Gln Ala Arg Ala Leu Glu Arg Glu Met Arg Asn Arg Leu
305 310 315 320
Arg Gln Gln Gly Val Asp Val Glu Pro Arg Ile Leu Ile Ala Thr Arg
325 330 335
Leu Ile Pro Glu Ser Asp Gly Thr Thr Cys Asp Gln Arg Leu Glu Pro
340 345 350
Val His Gly Ala Glu Asn Val Gln Ile Leu Arg Val Pro Phe Arg Tyr
355 360 365
Glu Asp Gly Arg Ile His Pro His Trp Ile Ser Arg Phe Lys Val Trp
370 375 380
Pro Tyr Leu Glu Arg Tyr Ala Arg Asp Leu Glu Arg Glu Val Lys Ala
385 390 395 400
Glu Leu Gly Ser Arg Pro Asp Leu Ile Ile Gly Asn Tyr Ser Asp Gly
405 410 415
Gly Leu Val Ala Thr Ile Leu Ser Glu Lys Leu Gly Val Thr Gln Cys
420 425 430
Asn Ile Ala His Ala Leu Glu Lys Ser Lys Tyr Pro Gly Ser Asp Leu
435 440 445
His Trp Pro Leu Tyr Glu Gln Asp His His Phe Ala Cys Gln Phe Thr
450 455 460
Ala Asp Leu Ile Ala Met Asn Ala Ala Asp Ile Ile Val Thr Ser Thr
465 470 475 480
Tyr Gln Glu Ile Ala Gly Asn Asp Arg Glu Val Gly Gln Tyr Glu Ser
485 490 495
His Gln Asp Tyr Thr Leu Pro Gly Leu Tyr Arg Val Glu Asn Gly Ile
500 505 510
Asp Val Phe Asp Ser Lys Phe Asn Ile Val Ser Pro Gly Ala Asp Pro
515 520 525
Ser Thr Tyr Phe Ser Tyr Ala Arg His Glu Glu Arg Phe Ser Ser Leu
530 535 540
Trp Pro Glu Ile Glu Ser Leu Leu Phe Gly Arg Glu Pro Gly Pro Asp
545 550 555 560
Ile Arg Gly Val Leu Glu Asp Pro Gln Lys Pro Ile Ile Leu Ser Val
565 570 575
Ala Arg Met Asp Arg Ile Lys Asn Leu Ser Gly Leu Ala Glu Leu Tyr
580 585 590
Gly Arg Ser Ala Arg Leu Arg Ser Leu Ala Asn Leu Val Ile Ile Gly
595 600 605
Gly His Val Asp Val Gln Ala Ser Met Asp Ala Glu Glu Arg Glu Glu
610 615 620
Ile Arg Arg Met His Glu Ile Met Asp Arg Tyr Gln Leu Asp Gly Gln
625 630 635 640
Met Arg Trp Val Gly Ser His Leu Asp Lys Arg Val Val Gly Glu Leu
645 650 655
Tyr Arg Val Val Ala Asp Gly Arg Gly Val Phe Val Gln Pro Ala Leu
660 665 670
Phe Glu Ala Phe Gly Leu Thr Val Ile Glu Ala Met Ser Ser Gly Leu
675 680 685
Pro Val Phe Ala Thr Arg His Gly Gly Pro Leu Glu Ile Ile Glu Asp
690 695 700
Gly Val Ser Gly Phe His Ile Asp Pro Asn Asp Pro Glu Ala Val Ala
705 710 715 720
Glu Lys Leu Ala Asp Phe Leu Glu Ala Ala Arg Glu Arg Pro Lys Tyr
725 730 735
Trp Glu Glu Ile Ser Gln Ala Ala Leu Ala Arg Val Ser Glu Arg Tyr
740 745 750
Thr Trp Glu Arg Tyr Ala Glu Arg Leu Met Thr Ile Ala Arg Cys Phe
755 760 765
Gly Phe Trp Arg Phe Val Leu Ser Arg Glu Ser Gln Val Met Glu Arg
770 775 780
Tyr Leu Gln Met Phe Arg His Leu Gln Trp Arg Pro Leu Ala His Ala
785 790 795 800
Val Pro Met Glu
<210> 33
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.3
<400> 33
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg ggctgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 34
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.4
<400> 34
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg ggacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 35
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.5
<400> 35
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg ggaagttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 36
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.6
<400> 36
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gggtgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 37
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.7
<400> 37
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gcacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 38
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.8
<400> 38
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gatcgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 39
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.9
<400> 39
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaaagttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 40
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.10
<400> 40
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gatggttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 41
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.11
<400> 41
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 42
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.12
<400> 42
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gccggttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 43
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.13
<400> 43
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gtctgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 44
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.14
<400> 44
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg ggttgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 45
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.15
<400> 45
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gtgggttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 46
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-R12Q-F
<400> 46
gcgtcgtcag cgtgttatta tgtttccgg 29
<210> 47
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-R12Q-R
<400> 47
cataataaca cgctgacgac gcacggtagt tgg 33
<210> 48
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-V14I-F
<400> 48
cgtcgtatca ttatgtttcc ggttccgttc 30
<210> 49
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-V14I-R
<400> 49
gaaacataat gatacgacga cgacgcacgg tag 33
<210> 50
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-E99L-F
<400> 50
cgcggatcta ttccgtaagg agctggaaat c 31
<210> 51
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-E99L-R
<400> 51
ccttacggaa tagatccgcg ccgtgtttgt ta 32
<210> 52
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-A118S-F
<400> 52
ggaagttagt tgcgtgatta ccgatgcgc 29
<210> 53
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-A118S-R
<400> 53
gtaatcacgc aactaacttc ctcgtcactc ggag 34
<210> 54
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-S194P-F
<400> 54
cgcgcgtatc cgcacattca agaatcgaaa c 31
<210> 55
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-S194P-R
<400> 55
cttgaatgtg cggatacgcg cgacggatat c 31
<210> 56
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-T254G-F
<400> 56
cttcgagcgg tagcctgctg gatactgatc c 31
<210> 57
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-T254G-R
<400> 57
cagcaggcta ccgctcgaag ccctataatg c 31
<210> 58
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-L257A-F
<400> 58
gcactagcct ggcggatact gatccgagca ccg 33
<210> 59
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-L257A-R
<400> 59
cggatcagta tccgccaggc tagtgctcga agcc 34
<210> 60
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-E418D-F
<400> 60
gttgaccctg acggtgaggt tatgcgccaa aac 33
<210> 61
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-E418D-R
<400> 61
cctcaccgtc agggtcaacc atcacac 27
<210> 62
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-Q451E-F
<400> 62
gaacgcctgg agagctatat tagcagcctg 30
<210> 63
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-Q451E-R
<400> 63
ctaatatagc tctccaggcg ttccaggctc tcg 33
<210> 64
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-S455R-F
<400> 64
gagctatatt cgcagcctgt aagcttgcgg 30
<210> 65
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-S455R-R
<400> 65
cttacaggct gcgaatatag ctctgcaggc gttc 34
<210> 66
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-Q265E-F
<400> 66
ccgccgaatg gctggaccag cagccgcc 28
<210> 67
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-Q265E-R
<400> 67
gtccagccat tcggcggtgc tcggatcagt atc 33
<210> 68
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-P271A-F
<400> 68
gaccagcagg cgccgagcag cgtgctgtac g 31
<210> 69
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-P271A-R
<400> 69
ctgctcggcg cctgctggtc cagccattgg g 31
<210> 70
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-R333K-F
<400> 70
gaaaaaggta agatcgtgaa gtctgctccg 30
<210> 71
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-R333K-R
<400> 71
cttcacgatc ttaccttttt cacccagaaa ac 32
<210> 72
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-K347P-F
<400> 72
ctggcgcacc ctgcgattgg tgcgttctgg ac 32
<210> 73
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-K347P-R
<400> 73
accaatcgca gggtgcgcca gcacttcttg 30
<210> 74
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-L378G-F
<400> 74
gatttcggtg gtgatcagcc gctgaacgcg cg 32
<210> 75
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-L378G-R
<400> 75
cggctgatca ccaccgaaat cgctaaagat c 31
<210> 76
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-Q265E-P271A-F
<400> 76
cgaatggctg gaccagcagg cgccgagcag cgtgctgtac 40
<210> 77
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-Q265E-P271A-R
<400> 77
gcgcctgctg gtccagccat tcggcggtgc tcggatcagt at 42
<210> 78
<211> 65
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-R333K-K347P-F
<400> 78
aagatcgtga agtctgctcc gcaacaagaa gtgctggcgc accctgcgat tggtgcgttc 60
tggac 65
<210> 79
<211> 60
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-R333K-K347P-R
<400> 79
agggtgcgcc agcacttctt gttgcggagc agacttcacg atcttacctt tttcacccag 60
<210> 80
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-Q265E-P271A-F
<400> 80
cgaatggctg gaccagcagg cgccgagcag cgtgctgtac 40
<210> 81
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-L378G-R
<400> 81
cggctgatca ccaccgaaat cgctaaagat c 31
<210> 82
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-L378G-F
<400> 82
gatttcggtg gtgatcagcc gctgaacgcg cg 32
<210> 83
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-Q265E-P271A-R
<400> 83
gcgcctgctg gtccagccat tcggcggtgc tcggatcagt at 42
<210> 84
<211> 65
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-R333K-K347P-F
<400> 84
aagatcgtga agtctgctcc gcaacaagaa gtgctggcgc accctgcgat tggtgcgttc 60
tggac 65
<210> 85
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-Q265E-P271A-R
<400> 85
gcgcctgctg gtccagccat tcggcggtgc tcggatcagt at 42
<210> 86
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-Q265E-P271A-F
<400> 86
cgaatggctg gaccagcagg cgccgagcag cgtgctgtac 40
<210> 87
<211> 60
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-R333K-K347P-R
<400> 87
agggtgcgcc agcacttctt gttgcggagc agacttcacg atcttacctt tttcacccag 60
<210> 88
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-L378G-F
<400> 88
gatttcggtg gtgatcagcc gctgaacgcg cg 32
<210> 89
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> GT029-L378G-R
<400> 89
cggctgatca ccaccgaaat cgctaaagat c 31
<210> 90
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Km-F
<400> 90
gcccgacatt atcgcgagc 19
<210> 91
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Km-R
<400> 91
gggtataaat gggctcgcg 19
<210> 92
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.16
<400> 92
atgccgaaca ccaacccgac caccgttcgt cgtcagcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 93
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.17
<400> 93
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgta tcatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 94
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.18
<400> 94
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacctgttc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 95
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.19
<400> 95
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt ttcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 96
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.20
<400> 96
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttacc cgcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 97
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.21
<400> 97
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttctg gttctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 98
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.22
<400> 98
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctggc tgacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 99
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.23
<400> 99
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggacggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 100
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.24
<400> 100
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg gaatcttaca tctcttctct g 1371
<210> 101
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.25
<400> 101
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tccgttctct g 1371
<210> 102
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.26
<400> 102
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctgaatggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 103
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.27
<400> 103
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag gctccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 104
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.28
<400> 104
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtaaaa tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 105
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.29
<400> 105
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcaccc ggctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 106
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.30
<400> 106
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tggtgaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 107
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.31
<400> 107
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctgaatggct ggaccagcag gctccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 108
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.32
<400> 108
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctcagtggct ggaccagcag ccgccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtaaaa tcgttaaatc tgctccgcag 1020
caggaagttc tggctcaccc ggctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 109
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.33
<400> 109
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctgaatggct ggaccagcag gctccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtcgta tcgttaaatc tgctccgcag 1020
caggaagttc tggctcacaa agctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tggtgaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 110
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.34
<400> 110
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctgaatggct ggaccagcag gctccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtaaaa tcgttaaatc tgctccgcag 1020
caggaagttc tggctcaccc ggctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tctggaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 111
<211> 1371
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.35
<400> 111
atgccgaaca ccaacccgac caccgttcgt cgtcgtcgtg ttatcatgtt cccggttccg 60
ttcccgggtc acctgaaccc gatgctgcag ctggctaacg ttctgtaccg tcgtggtttc 120
gaaatcacca tcctgcacac caacttcaac gctccgaaaa cctctctgta cccgcacttc 180
cagttccgtt tcatcctgga caacgacccg cagccggaat ggctgcgtaa cctgccgacc 240
accggtccgg gtgttggtgc tcgtatcccg gttatcaaca aacacggtgc tgacgaattc 300
cgtaaagaac tggaaatctg catgcgtgac accccgtctg acgaagaagt tgcttgcgtt 360
atcaccgacg ctctgtggta cttcgctcag ccggttgctg actctctgaa cctgaaacgt 420
ctggttctgc agaccggttc tctgttcaac ttccactgcc tggtttgcct gccgaaattc 480
ctggaactgg gttacctgga cccggaaacc aaacaccgtc cggacgaacc ggttgttggt 540
ttcccgatgc tgaaagttaa agacatccgt cgtgcttact ctcacatcca ggaatctaaa 600
ccgatcctga tgaaaatggt tgaagaaacc cgtgcttctt ctggtgttat ctggaactct 660
gctaaagaac tggaagaatc tgaactggaa accatccagc gtgaaatccc ggctccgtct 720
ttcctgctgc cgctgccgaa acactaccgt gcttcttcta cctctctgct ggacaccgac 780
ccgtctaccg ctgaatggct ggaccagcag gctccgtctt ctgttctgta cgttggtttc 840
ggttctcagt cttctctgga cccggctgac ttcctggaaa tcgctcgtgg tctggttgct 900
tctaaacagt ctttcctgtg gaacgttcgt ccgggtttcg ttaaaggtta cgaatggatc 960
gaactgctgc cggacggttt cctgggtgaa aaaggtaaaa tcgttaaatc tgctccgcag 1020
caggaagttc tggctcaccc ggctatcggt gctttctgga cccacggtgg ttggaacggt 1080
accatggaag ctgtttgcga aggtgttccg atgatcttct ctgacttcgg tggtgaccag 1140
ccgctgaacg ctcgttacat gtctgaagtt ctgcacgttg gtgtttacct ggaaaacggt 1200
ttcatccgtg gtgaaatcat caacgctgtt cgtcgtgtta tggttgaccc ggaaggtgaa 1260
gttatgcgtc agaacgctcg taaactgaaa gacaaactgg accgttctat cgctccgggt 1320
ggttcttctt acgaatctct ggaacgtctg cagtcttaca tctcttctct g 1371
<210> 112
<211> 457
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.11的氨基酸序列
<400> 112
Met Pro Asn Thr Asn Pro Thr Thr Val Arg Arg Arg Arg Val Ile Met
1 5 10 15
Phe Pro Val Pro Phe Pro Gly His Leu Asn Pro Met Leu Gln Leu Ala
20 25 30
Asn Val Leu Tyr Arg Arg Gly Phe Glu Ile Thr Ile Leu His Thr Asn
35 40 45
Phe Asn Ala Pro Lys Thr Ser Leu Tyr Pro His Phe Gln Phe Arg Phe
50 55 60
Ile Leu Asp Asn Asp Pro Gln Pro Glu Trp Leu Arg Asn Leu Pro Thr
65 70 75 80
Thr Gly Pro Gly Val Gly Ala Arg Ile Pro Val Ile Asn Lys His Gly
85 90 95
Ala Asp Glu Phe Arg Lys Glu Leu Glu Ile Cys Met Arg Asp Thr Pro
100 105 110
Ser Asp Glu Glu Val Ala Cys Val Ile Thr Asp Ala Leu Trp Tyr Phe
115 120 125
Ala Gln Pro Val Ala Asp Ser Leu Asn Leu Lys Arg Leu Val Leu Gln
130 135 140
Thr Gly Ser Leu Phe Asn Phe His Cys Leu Val Cys Leu Pro Lys Phe
145 150 155 160
Leu Glu Leu Gly Tyr Leu Asp Pro Glu Thr Lys His Arg Pro Asp Glu
165 170 175
Pro Val Val Gly Phe Pro Met Leu Lys Val Lys Asp Ile Arg Arg Ala
180 185 190
Tyr Ser His Ile Gln Glu Ser Lys Pro Ile Leu Met Lys Met Val Glu
195 200 205
Glu Thr Arg Ala Ser Ser Gly Val Ile Trp Asn Ser Ala Lys Glu Leu
210 215 220
Glu Glu Ser Glu Leu Glu Thr Ile Gln Arg Glu Ile Pro Ala Pro Ser
225 230 235 240
Phe Leu Leu Pro Leu Pro Lys His Tyr Arg Ala Ser Ser Thr Ser Leu
245 250 255
Leu Asp Thr Asp Pro Ser Thr Ala Gln Trp Leu Asp Gln Gln Pro Pro
260 265 270
Ser Ser Val Leu Tyr Val Gly Phe Gly Ser Gln Ser Ser Leu Asp Pro
275 280 285
Ala Asp Phe Leu Glu Ile Ala Arg Gly Leu Val Ala Ser Lys Gln Ser
290 295 300
Phe Leu Trp Asn Val Arg Pro Gly Phe Val Lys Gly Tyr Glu Trp Ile
305 310 315 320
Glu Leu Leu Pro Asp Gly Phe Leu Gly Glu Lys Gly Arg Ile Val Lys
325 330 335
Ser Ala Pro Gln Gln Glu Val Leu Ala His Lys Ala Ile Gly Ala Phe
340 345 350
Trp Thr His Gly Gly Trp Asn Gly Thr Met Glu Ala Val Cys Glu Gly
355 360 365
Val Pro Met Ile Phe Ser Asp Phe Gly Leu Asp Gln Pro Leu Asn Ala
370 375 380
Arg Tyr Met Ser Glu Val Leu His Val Gly Val Tyr Leu Glu Asn Gly
385 390 395 400
Phe Ile Arg Gly Glu Ile Ile Asn Ala Val Arg Arg Val Met Val Asp
405 410 415
Pro Glu Gly Glu Val Met Arg Gln Asn Ala Arg Lys Leu Lys Asp Lys
420 425 430
Leu Asp Arg Ser Ile Ala Pro Gly Gly Ser Ser Tyr Glu Ser Leu Glu
435 440 445
Arg Leu Gln Ser Tyr Ile Ser Ser Leu
450 455
Claims (17)
1.一种糖基转移酶,其特征在于,所述糖基转移酶的氨基酸序列如SEQ ID NO: 112或为与SEQ ID NO: 112相比在选自以下一个残基位置处存在氨基酸残基差异的氨基酸序列:V14I、E99L、T254G、L257A、Q451E、Q265E、P271A或R333K;
或,所述糖基转移酶的氨基酸序列与SEQ ID NO: 112相比,差异为Q265E和P271A;
或,所述糖基转移酶的氨基酸序列与SEQ ID NO: 112相比,差异为R333K和K347P。
2.一种分离的核酸,其特征在于,所述核酸编码如权利要求1所述的糖基转移酶。
3. 如权利要求2所述的核酸,其特征在于,所述核酸选自以下的序列:SEQ ID NO: 41、SEQ ID NO: 93、SEQ ID NO: 94、SEQ ID NO: 97、SEQ ID NO: 98、SEQ ID NO: 100、SEQ IDNO: 102、SEQ ID NO: 103、SEQ ID NO: 104、SEQ ID NO: 107、SEQ ID NO: 108。
4.一种重组表达载体,其包含如权利要求2或3所述的核酸。
5.一种转化体,其为包含如权利要求2或3所述的核酸或如权利要求4所述的重组表达载体的宿主细胞。
6. 如权利要求5所述的转化体,其特征在于,所述宿主细胞为埃希氏大肠杆菌(Escherichia coli)。
7.一种制备如权利要求1所述的糖基转移酶的方法,其特征在于,所述方法包括在适于表达所述糖基转移酶的条件下培养如权利要求5或6所述的转化体。
8.一种组合物,其包含如权利要求1所述的糖基转移酶。
9.一种用于底物的糖基化的方法,所述方法包括提供至少一种底物、如权利要求1所述的糖基转移酶,并在使得所述底物被糖基化以产生至少一种糖基化产物的条件下使所述底物与所述糖基转移酶接触。
10.一种莱鲍迪苷A的制备方法,其特征在于,所述制备方法包括以下步骤:在如权利要求1所述的糖基转移酶的存在下,将甜菊苷和糖基供体进行反应,即得莱鲍迪苷A。
11.如权利要求10所述的制备方法,其特征在于,所述糖基转移酶以糖基转移酶菌泥的形式存在;
所述甜菊苷的浓度1-150 g/L;
所述糖基供体与甜菊苷的摩尔比为1:1~5:1;
所述糖基供体为UDP-葡萄糖;
所述反应的反应溶剂的pH为5-8;
所述反应时的转速为500-1000 rpm;
所述反应的反应体系的温度为20-90℃。
12. 如权利要求11所述的制备方法,其特征在于,所述甜菊苷的浓度为100 g/L;
所述UDP-葡萄糖通过蔗糖和UDP在蔗糖合成酶的存在下制得;
所述反应的反应溶剂的pH为6;
所述反应时的转速为600 rpm;
所述反应的反应体系的温度为60℃。
13. 如权利要求12所述的制备方法,其特征在于,所述蔗糖的浓度为100-300 g/L,所述UDP的浓度为0.05-0.2 g/L。
14. 如权利要求13所述的制备方法,其特征在于,所述蔗糖的浓度为200 g/L,所述UDP的浓度为0.1 g/L。
15.一种莱鲍迪苷D或莱鲍迪苷M的制备方法,其特征在于,其包括根据如权利要求10-14任一项所述的制备方法制备莱鲍迪苷A的步骤。
16.一种如权利要求1所述的糖基转移酶在制备甜菊糖苷中的用途。
17.如权利要求16所述的用途,其特征在于,所述甜菊糖苷为莱鲍迪苷A、莱鲍迪苷D或莱鲍迪苷M。
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110610098.2A CN115418358B (zh) | 2021-06-01 | 2021-06-01 | 一种糖基转移酶及其应用 |
JP2023573659A JP2024520118A (ja) | 2021-06-01 | 2022-06-01 | 糖転移酵素及びその使用 |
PCT/CN2022/096683 WO2022253282A1 (zh) | 2021-06-01 | 2022-06-01 | 一种糖基转移酶及其应用 |
EP22815324.3A EP4349989A1 (en) | 2021-06-01 | 2022-06-01 | Glycosyltransferase and application thereof |
US18/565,361 US20240263152A1 (en) | 2021-06-01 | 2022-06-01 | Glycosyltransferase and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110610098.2A CN115418358B (zh) | 2021-06-01 | 2021-06-01 | 一种糖基转移酶及其应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115418358A CN115418358A (zh) | 2022-12-02 |
CN115418358B true CN115418358B (zh) | 2024-01-30 |
Family
ID=84230413
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110610098.2A Active CN115418358B (zh) | 2021-06-01 | 2021-06-01 | 一种糖基转移酶及其应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115418358B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115725528B (zh) * | 2021-08-30 | 2024-01-12 | 弈柯莱生物科技(集团)股份有限公司 | 一种糖基转移酶及其应用 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109750071A (zh) * | 2019-01-31 | 2019-05-14 | 南京工业大学 | 一种生物催化合成莱鲍迪苷m的方法 |
CN112375750A (zh) * | 2020-12-02 | 2021-02-19 | 南京工业大学 | 一种糖基转移酶突变体及其催化合成莱鲍迪苷a的方法 |
CN112760302A (zh) * | 2020-12-23 | 2021-05-07 | 中化健康产业发展有限公司 | 一种能够催化莱鲍迪苷A生成莱鲍迪苷D的糖基转移酶StUGT |
CN112805295A (zh) * | 2018-07-30 | 2021-05-14 | 科德克希思公司 | 工程化糖基转移酶和甜菊醇糖苷葡糖基化方法 |
-
2021
- 2021-06-01 CN CN202110610098.2A patent/CN115418358B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112805295A (zh) * | 2018-07-30 | 2021-05-14 | 科德克希思公司 | 工程化糖基转移酶和甜菊醇糖苷葡糖基化方法 |
CN109750071A (zh) * | 2019-01-31 | 2019-05-14 | 南京工业大学 | 一种生物催化合成莱鲍迪苷m的方法 |
CN112375750A (zh) * | 2020-12-02 | 2021-02-19 | 南京工业大学 | 一种糖基转移酶突变体及其催化合成莱鲍迪苷a的方法 |
CN112760302A (zh) * | 2020-12-23 | 2021-05-07 | 中化健康产业发展有限公司 | 一种能够催化莱鲍迪苷A生成莱鲍迪苷D的糖基转移酶StUGT |
Also Published As
Publication number | Publication date |
---|---|
CN115418358A (zh) | 2022-12-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109750072B (zh) | 一种酶法制备莱鲍迪苷e的方法 | |
CN112080480B (zh) | 糖基转移酶突变体及其应用 | |
CN104726523B (zh) | 一种酶法制备莱鲍迪苷m的方法 | |
CN111518782B (zh) | 一种糖基转移酶ugtzj1突变体及其应用 | |
US20240287479A1 (en) | Glycosyltransferase mutant, and method for catalytic synthesis of rebaudioside m using glycosyltransferase mutant | |
CN110699373B (zh) | 尿苷二磷酸葡萄糖高产菌株及其应用 | |
CN112063678A (zh) | 一种Siamenoside I的生物合成方法 | |
CN115418358B (zh) | 一种糖基转移酶及其应用 | |
WO2022253282A1 (zh) | 一种糖基转移酶及其应用 | |
CN115449514B (zh) | 一种β-1,2-糖基转移酶及其应用 | |
WO2023005779A1 (zh) | 一种蔗糖合成酶及其应用 | |
CN116790541A (zh) | 一种酶活提高的蔗糖合酶突变体 | |
CN110892068B (zh) | Udp-糖基转移酶 | |
CN116355874A (zh) | 糖基转移酶突变体及其在制备槲皮素-3-o鼠李糖苷中的应用 | |
CN115725528B (zh) | 一种糖基转移酶及其应用 | |
CN115478060B (zh) | 一种糖基转移酶及其应用 | |
CN115404226B (zh) | 一种蔗糖合成酶及其在催化糖基化反应中的应用 | |
KR101499138B1 (ko) | 스테비오사이드의 효소적 전환을 이용한 루부소사이드의 생산방법 | |
CN114875054B (zh) | 一种酶法制备糖基化甜菊糖苷类化合物的方法及其衍生物 | |
CN116004417B (zh) | 一株枯草芽孢杆菌及其应用 | |
CN116555210A (zh) | 糖基转移酶及其在制备莱鲍迪苷e中的应用 | |
CN116334162A (zh) | 一种莱鲍迪苷i的制备方法及应用 | |
CN117187321A (zh) | 利用生物酶法高效制备莱鲍迪苷m的方法 | |
CN117050965A (zh) | 糖基转移酶及其在制备莱鲍迪苷m中的应用 | |
CN118240787A (zh) | 糖基转移酶、蔗糖合酶及其在制备莱鲍迪苷m中的应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Address after: Room 3114, Building B, 555 Dongchuan Road, Minhang District, Shanghai, 200241 Applicant after: Yikelai Biotechnology (Group) Co.,Ltd. Address before: Room 3114, Building B, 555 Dongchuan Road, Minhang District, Shanghai, 200241 Applicant before: Ecolab Biotechnology (Shanghai) Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant |