CN111718916A - GGPPS directed single-point mutant protein and application thereof - Google Patents
GGPPS directed single-point mutant protein and application thereof Download PDFInfo
- Publication number
- CN111718916A CN111718916A CN202010814751.2A CN202010814751A CN111718916A CN 111718916 A CN111718916 A CN 111718916A CN 202010814751 A CN202010814751 A CN 202010814751A CN 111718916 A CN111718916 A CN 111718916A
- Authority
- CN
- China
- Prior art keywords
- amino acid
- ggpps
- site
- mutated
- muteins
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108010021466 Mutant Proteins Proteins 0.000 title claims abstract description 7
- 102000008300 Mutant Proteins Human genes 0.000 title claims abstract description 7
- 101000895629 Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6) Geranylgeranyl pyrophosphate synthase Proteins 0.000 title claims abstract 8
- 150000001413 amino acids Chemical class 0.000 claims abstract description 34
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 20
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 20
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims abstract description 15
- 235000001014 amino acid Nutrition 0.000 claims description 36
- 229940024606 amino acid Drugs 0.000 claims description 34
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 10
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 8
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 6
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 6
- -1 Val amino acid Chemical class 0.000 claims description 6
- 230000007935 neutral effect Effects 0.000 claims description 4
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 4
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 3
- 235000007516 Chrysanthemum Nutrition 0.000 claims description 3
- 244000189548 Chrysanthemum x morifolium Species 0.000 claims description 3
- 244000241235 Citrullus lanatus Species 0.000 claims description 3
- 235000012828 Citrullus lanatus var citroides Nutrition 0.000 claims description 3
- 240000007154 Coffea arabica Species 0.000 claims description 3
- 240000008067 Cucumis sativus Species 0.000 claims description 3
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 claims description 3
- 244000000626 Daucus carota Species 0.000 claims description 3
- 235000002767 Daucus carota Nutrition 0.000 claims description 3
- 244000194101 Ginkgo biloba Species 0.000 claims description 3
- 235000008100 Ginkgo biloba Nutrition 0.000 claims description 3
- 244000043261 Hevea brasiliensis Species 0.000 claims description 3
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 3
- 235000011430 Malus pumila Nutrition 0.000 claims description 3
- 244000070406 Malus silvestris Species 0.000 claims description 3
- 235000015103 Malus silvestris Nutrition 0.000 claims description 3
- 240000005561 Musa balbisiana Species 0.000 claims description 3
- 235000018290 Musa x paradisiaca Nutrition 0.000 claims description 3
- 240000007594 Oryza sativa Species 0.000 claims description 3
- 235000007164 Oryza sativa Nutrition 0.000 claims description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 3
- 235000004443 Ricinus communis Nutrition 0.000 claims description 3
- 240000003768 Solanum lycopersicum Species 0.000 claims description 3
- 244000061456 Solanum tuberosum Species 0.000 claims description 3
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 3
- 235000021307 Triticum Nutrition 0.000 claims description 3
- 244000098338 Triticum aestivum Species 0.000 claims description 3
- 235000009754 Vitis X bourquina Nutrition 0.000 claims description 3
- 235000012333 Vitis X labruscana Nutrition 0.000 claims description 3
- 240000006365 Vitis vinifera Species 0.000 claims description 3
- 235000014787 Vitis vinifera Nutrition 0.000 claims description 3
- 240000008042 Zea mays Species 0.000 claims description 3
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 3
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 3
- 230000002378 acidificating effect Effects 0.000 claims description 3
- 235000004279 alanine Nutrition 0.000 claims description 3
- 235000009582 asparagine Nutrition 0.000 claims description 3
- 229960001230 asparagine Drugs 0.000 claims description 3
- 235000003704 aspartic acid Nutrition 0.000 claims description 3
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 3
- 235000016213 coffee Nutrition 0.000 claims description 3
- 235000013353 coffee beverage Nutrition 0.000 claims description 3
- 235000005822 corn Nutrition 0.000 claims description 3
- 235000009566 rice Nutrition 0.000 claims description 3
- 235000002566 Capsicum Nutrition 0.000 claims description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 2
- 244000061176 Nicotiana tabacum Species 0.000 claims description 2
- 239000006002 Pepper Substances 0.000 claims description 2
- 235000016761 Piper aduncum Nutrition 0.000 claims description 2
- 240000003889 Piper guineense Species 0.000 claims description 2
- 235000017804 Piper guineense Nutrition 0.000 claims description 2
- 235000008184 Piper nigrum Nutrition 0.000 claims description 2
- 240000000528 Ricinus communis Species 0.000 claims description 2
- 125000001931 aliphatic group Chemical group 0.000 claims description 2
- 125000003118 aryl group Chemical group 0.000 claims description 2
- 235000018417 cysteine Nutrition 0.000 claims description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 claims description 2
- 239000000049 pigment Substances 0.000 claims description 2
- 235000011201 Ginkgo Nutrition 0.000 claims 1
- 235000017276 Salvia Nutrition 0.000 claims 1
- 240000007164 Salvia officinalis Species 0.000 claims 1
- 230000035772 mutation Effects 0.000 abstract description 23
- 238000002474 experimental method Methods 0.000 abstract description 21
- 102000004190 Enzymes Human genes 0.000 abstract description 19
- 108090000790 Enzymes Proteins 0.000 abstract description 19
- 241000208125 Nicotiana Species 0.000 abstract description 13
- OENHQHLEOONYIE-UKMVMLAPSA-N all-trans beta-carotene Natural products CC=1CCCC(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C OENHQHLEOONYIE-UKMVMLAPSA-N 0.000 abstract description 11
- TUPZEYHYWIEDIH-WAIFQNFQSA-N beta-carotene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CCCC1(C)C)C=CC=C(/C)C=CC2=CCCCC2(C)C TUPZEYHYWIEDIH-WAIFQNFQSA-N 0.000 abstract description 11
- 235000013734 beta-carotene Nutrition 0.000 abstract description 11
- 239000011648 beta-carotene Substances 0.000 abstract description 11
- 229960002747 betacarotene Drugs 0.000 abstract description 11
- OENHQHLEOONYIE-JLTXGRSLSA-N β-Carotene Chemical compound CC=1CCCC(C)(C)C=1\C=C\C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C OENHQHLEOONYIE-JLTXGRSLSA-N 0.000 abstract description 11
- 238000005516 engineering process Methods 0.000 abstract description 10
- 238000012216 screening Methods 0.000 abstract description 8
- 238000004458 analytical method Methods 0.000 abstract description 7
- 230000003197 catalytic effect Effects 0.000 abstract description 5
- 238000010353 genetic engineering Methods 0.000 abstract description 5
- 108010007508 Farnesyltranstransferase Proteins 0.000 description 36
- 102000007317 Farnesyltranstransferase Human genes 0.000 description 31
- 108020004414 DNA Proteins 0.000 description 27
- 230000001580 bacterial effect Effects 0.000 description 18
- 239000013612 plasmid Substances 0.000 description 15
- 108090000623 proteins and genes Proteins 0.000 description 14
- 238000000034 method Methods 0.000 description 10
- 239000000047 product Substances 0.000 description 10
- OINNEUNVOZHBOX-QIRCYJPOSA-N 2-trans,6-trans,10-trans-geranylgeranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\COP(O)(=O)OP(O)(O)=O OINNEUNVOZHBOX-QIRCYJPOSA-N 0.000 description 9
- OINNEUNVOZHBOX-XBQSVVNOSA-N Geranylgeranyl diphosphate Natural products [P@](=O)(OP(=O)(O)O)(OC/C=C(\CC/C=C(\CC/C=C(\CC/C=C(\C)/C)/C)/C)/C)O OINNEUNVOZHBOX-XBQSVVNOSA-N 0.000 description 8
- 235000021466 carotenoid Nutrition 0.000 description 8
- 150000001747 carotenoids Chemical class 0.000 description 8
- 241000588724 Escherichia coli Species 0.000 description 7
- 238000012408 PCR amplification Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 241000196324 Embryophyta Species 0.000 description 5
- 238000009395 breeding Methods 0.000 description 5
- 230000001488 breeding effect Effects 0.000 description 5
- 102000004169 proteins and genes Human genes 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- 230000037361 pathway Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- 101710173432 Phytoene synthase Proteins 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 210000004027 cell Anatomy 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 230000007547 defect Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- NUHSROFQTUXZQQ-UHFFFAOYSA-N isopentenyl diphosphate Chemical compound CC(=C)CCO[P@](O)(=O)OP(O)(O)=O NUHSROFQTUXZQQ-UHFFFAOYSA-N 0.000 description 3
- CBIDRCWHNCKSTO-UHFFFAOYSA-N prenyl diphosphate Chemical compound CC(C)=CCO[P@](O)(=O)OP(O)(O)=O CBIDRCWHNCKSTO-UHFFFAOYSA-N 0.000 description 3
- 235000018102 proteins Nutrition 0.000 description 3
- 150000003505 terpenes Chemical class 0.000 description 3
- 235000007586 terpenes Nutrition 0.000 description 3
- GVJHHUAWPYXKBD-UHFFFAOYSA-N (±)-α-Tocopherol Chemical compound OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 2
- YVLPJIGOMTXXLP-UHFFFAOYSA-N 15-cis-phytoene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CC=CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C YVLPJIGOMTXXLP-UHFFFAOYSA-N 0.000 description 2
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 2
- 241000304195 Salvia miltiorrhiza Species 0.000 description 2
- 235000011135 Salvia miltiorrhiza Nutrition 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000001976 enzyme digestion Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 230000031700 light absorption Effects 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- YVLPJIGOMTXXLP-UUKUAVTLSA-N 15,15'-cis-Phytoene Natural products C(=C\C=C/C=C(\CC/C=C(\CC/C=C(\CC/C=C(\C)/C)/C)/C)/C)(\CC/C=C(\CC/C=C(\CC/C=C(\C)/C)/C)/C)/C YVLPJIGOMTXXLP-UUKUAVTLSA-N 0.000 description 1
- YVLPJIGOMTXXLP-BAHRDPFUSA-N 15Z-phytoene Natural products CC(=CCCC(=CCCC(=CCCC(=CC=C/C=C(C)/CCC=C(/C)CCC=C(/C)CCC=C(C)C)C)C)C)C YVLPJIGOMTXXLP-BAHRDPFUSA-N 0.000 description 1
- WIIZWVCIJKGZOK-IUCAKERBSA-N 2,2-dichloro-n-[(1s,2s)-1,3-dihydroxy-1-(4-nitrophenyl)propan-2-yl]acetamide Chemical compound ClC(Cl)C(=O)N[C@@H](CO)[C@@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-IUCAKERBSA-N 0.000 description 1
- 125000003903 2-propenyl group Chemical group [H]C([*])([H])C([H])=C([H])[H] 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 240000008574 Capsicum frutescens Species 0.000 description 1
- 235000002568 Capsicum frutescens Nutrition 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 241000672609 Escherichia coli BL21 Species 0.000 description 1
- 229930191978 Gibberellin Natural products 0.000 description 1
- 101001094048 Homo sapiens Pendrin Proteins 0.000 description 1
- 101100235016 Oncidium hybrid cultivar LCY-B gene Proteins 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- 229930003427 Vitamin E Natural products 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 229930002875 chlorophyll Natural products 0.000 description 1
- 235000019804 chlorophyll Nutrition 0.000 description 1
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 229930004069 diterpene Natural products 0.000 description 1
- 150000004141 diterpene derivatives Chemical class 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- WIGCFUFOHFEKBI-UHFFFAOYSA-N gamma-tocopherol Natural products CC(C)CCCC(C)CCCC(C)CCCC1CCC2C(C)C(O)C(C)C(C)C2O1 WIGCFUFOHFEKBI-UHFFFAOYSA-N 0.000 description 1
- 238000012214 genetic breeding Methods 0.000 description 1
- 239000003448 gibberellin Substances 0.000 description 1
- IXORZMNAPKEEDV-OBDJNFEBSA-N gibberellin A3 Chemical class C([C@@]1(O)C(=C)C[C@@]2(C1)[C@H]1C(O)=O)C[C@H]2[C@]2(C=C[C@@H]3O)[C@H]1[C@]3(C)C(=O)O2 IXORZMNAPKEEDV-OBDJNFEBSA-N 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 230000029553 photosynthesis Effects 0.000 description 1
- 238000010672 photosynthesis Methods 0.000 description 1
- 235000017807 phytochemicals Nutrition 0.000 description 1
- 235000011765 phytoene Nutrition 0.000 description 1
- BOTWFXYSPFMFNR-PYDDKJGSSA-N phytol Chemical group CC(C)CCC[C@@H](C)CCC[C@@H](C)CCC\C(C)=C\CO BOTWFXYSPFMFNR-PYDDKJGSSA-N 0.000 description 1
- 229930000223 plant secondary metabolite Natural products 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000004064 recycling Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 235000019165 vitamin E Nutrition 0.000 description 1
- 229940046009 vitamin E Drugs 0.000 description 1
- 239000011709 vitamin E Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1085—Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y205/00—Transferases transferring alkyl or aryl groups, other than methyl groups (2.5)
- C12Y205/01—Transferases transferring alkyl or aryl groups, other than methyl groups (2.5) transferring alkyl or aryl groups, other than methyl groups (2.5.1)
- C12Y205/01029—Geranylgeranyl diphosphate synthase (2.5.1.29)
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The invention belongs to the technical field of tobacco genetic engineering, and particularly relates to GGPPS directional mutant protein patent application matters. The three sites 154, 161 and 218 of the existing GGPPS protein are positioned in the catalytic pocket of the enzyme, and the two sites 209 and 233 are positioned on the surface of the enzyme molecule; based on these five sites, the present application provides GGPPS series of targeted muteins, including: a series of single-site muteins, two-site muteins, three-site muteins, four-site muteins, and five-site muteins. The inventor utilizes CAST technology to construct a small and fine mutant library, and the inventor conducts detailed analysis on the mutation type of the amino acid at a specific position through further screening and based on the requirement of directed evolution. Preliminary experiment results show that after the amino acid at a specific site is mutated, the synthesis amount of the beta-carotene is obviously improved, and a certain technical basis is laid for further cultivating new varieties of crops.
Description
Technical Field
The invention belongs to the technical field of tobacco genetic engineering, and particularly relates to GGPPS directional mutant protein.
Background
The carotenoid is an important plastid pigment, has important physiological action, is closely related to the growth and development and photosynthesis of plants, and influences the quality and the characters of crops. Geranylgeranyl diphosphate (GGPP) is a common precursor for carotenoids, the phytol side chains of chlorophyll and vitamin E, gibberellins, and diterpene phytochemicals. GGPP is catalytically produced by geranylgeranyl diphosphate synthase (GGPP synthsase, GGPPS), and 3 molecules of isopentenyl pyrophosphate (IPP) and 1 molecule of allyl isomer dimethylallyl pyrophosphate (DMAPP) are condensed under the action of GGPPS to produce C20 GGPP.
GGPP is a starting substrate for carotenoid synthesis, and is catalyzed by Phytoene Synthase (PSY), a terminal enzyme of the carotenoid synthesis pathway, to form phytoene, which is used for carotenoid synthesis. A great deal of research demonstration on plant GGPPS gene family shows that the family members not only encode important enzyme proteins at the upstream of a terpene synthesis pathway, but also directly participate in regulating various pathways of plant terpene synthesis, and play a central role in regulation.
In the existing research, although the synthesis routes of carotenoids have been studied more, from the aspect of plant improvement, if the oriented breeding and cultivation can be realized by using the genetic engineering technology, the method is an important technical premise for realizing the maximum utilization of plants.
As one of the key technologies in synthetic biology, directed biological evolution technology has acquired the 2018 nobel prize on chemistry. The technology is widely used for the activity design of enzyme, and can effectively overcome the defects of natural enzyme in the aspects of environmental tolerance, stereo/regioselectivity, substrate specificity, catalytic efficiency, product inhibition and the like through directed evolution, so that the evolution process of thousands of years in the nature is completed in a laboratory in a short time.
In the development process of the directed evolution technology of the enzyme, when the traditional directed evolution is carried out, such as error-prone PCR, DNA mixed group, sequence saturation mutation, random initiation of in vitro recombination and other technologies, the defects of low mutation efficiency, large screening workload and the like exist, and the application of the in vitro directed evolution of enzyme molecules is restricted. A combined active-site mutation strategy (CAST) is a relatively new directed evolution mutation strategy, and the technology is based on structural information of protein, firstly, on the basis of simulation of a computer, then, amino acid residues which have direct interaction with a substrate are selected around an enzyme catalytic active center, and a small and fine mutant library is constructed, so that the screening scale of the mutant library is reduced.
Generally, although the directed evolution technology can accelerate the evolution process of crop genes and provide high-quality genes for improving the quality of crops, the directed evolution technology is not used for modifying plant genes at present due to short development time and other technical difficulties.
Tobacco is one of model crops for genetic engineering research, and meanwhile, the carotenoid content in tobacco directly influences the quality and the character of the tobacco. Therefore, if the directional evolution can be combined to carry out the directional transformation on the terpene synthetic genes of the tobacco, the method has very important application significance for accelerating the tobacco breeding, and simultaneously, a certain technical foundation can be laid for the genetic breeding of other crops.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: provides a plurality of GGPPS genes with better enzyme activity and tobacco GGPPS genes, thereby providing a new technical idea for tobacco breeding and new tobacco variety cultivation and laying a certain technical foundation for breeding and variety improvement of other crops.
The technical scheme of the invention is as follows: .
The invention has the beneficial effects that: GGPP is used as a precursor substance of downstream carotenoid, and the inventors show that 154, 161, 209, 218 and 233 are key sites through structural analysis of key amino acid sites of an enzyme active pocket of GGPPS. Furthermore, the inventor utilizes CAST technology to construct a small and fine mutant library, thereby overcoming the defects of low mutation rate, large screening amount and the like in the traditional directed evolution screening process. Through further screening and based on the requirement of directed evolution, the inventors have conducted detailed analysis on the types of amino acid mutations at specific sites. Preliminary experiment results show that after the amino acid at a specific site is mutated, the synthesis amount of the beta-carotene is obviously improved, and a certain technical basis is laid for further cultivating new varieties of crops.
Drawings
FIG. 1 is a bacterial color experiment of GGPPS-154 mutant, in which: a double plasmid bacteria color experiment, PAC-94N plasmid contains PSY, PDS and LCY-B three genes, IPP and DMAPP can be synthesized in Escherichia coli, but GGPP can not be produced, after GGPPS plasmid with enzyme activity and PAC-94N are jointly transformed into Escherichia coli, beta-carotene can be catalytically produced in Escherichia coli, and Escherichia coli is transformed from white to yellow; B. in the GGPPS-154 single-site mutation bacterial color experiment, the 154 th amino acid site is mutated, after the 154 th amino acid site and PAC-94N double plasmids are jointly transformed into escherichia coli BL21 (DE) 3, the color of the bacterial liquid is obviously changed after IPTG induction, the lower graph is the light absorption value at OD440, and empty pET32b is used as negative control, so that the light absorption value of the mutant at OD440 is obviously improved compared with wild type GGPPS;
FIG. 2 shows bacterial color experiments of GGPPS-161 mutant, wherein A is the same as that in FIG. 1A and is a synthetic schematic diagram, and B is bacterial color experiments of single-site mutation of GGPPS-161;
FIG. 3 shows bacterial color experiments of GGPPS-218 mutant, wherein A is the same as that in FIG. 1A and is a synthetic schematic diagram, and B is bacterial color experiments of single-site mutation of GGPPS-218;
FIG. 4 shows bacterial color experiments of GGPPS-209 mutant, wherein A is the same as that in FIG. 1A and is a synthetic principle diagram, and B is bacterial color experiments of single-site mutation of GGPPS-209;
FIG. 5 shows bacterial color experiments of GGPPS-233 mutant, wherein A is the same as FIG. 1A and is a synthetic schematic diagram, and B is bacterial color experiments of single-site mutation of GGPPS-233;
FIG. 6 shows bacterial color experiments of GGPPS-154/161/218 mutant, wherein A is the same as that in FIG. 1A and is a synthetic schematic diagram, and B is bacterial color experiments of GGPPS-154/161/218 three-site combined mutation;
FIG. 7 shows bacterial color experiments of GGPPS-209/233 mutant, wherein A is the same as that in FIG. 1A and is a synthetic schematic diagram, and B is bacterial color experiments of two-site combined mutation of GGPPS-209/233;
FIG. 8 shows bacterial color experiments of GGPPS-154/161/209/218/233 mutant, wherein A is the same as that in FIG. 1A and is a synthetic principle diagram, and B is bacterial color experiments of combined mutation of five sites of GGPPS-154/161/209/218/233.
FIG. 9 is a comparison of GGPPS-154 and GGPPS locus of the mainstream crop, in which: hot pepper (cagggpps 1), potato (StGGPPS 1), wheat (tagggpps 1), corn (ZmGGPPS 1), salvia miltiorrhiza (SmGGPPS 1), coffee (CanGGPPS 1), carrot (DcGGPPS 1), grape (VvGGPPS), cucumber (csagpps), watermelon (ClGGPPS), apple (MdGGPPS 1), orange (csigpps), rubber tree (HbGGPPS), chrysanthemum (CmGGPPS), rice (OsGGPPS 1), tomato (SlGGPPS 1), castor (rcgggpps), banana (magggpps) and ginkgo biloba (GbGGPPS).
Detailed Description
The present application is further explained below with reference to the drawings and examples.
Example 1
Since the acquisition of the existing geranylgeranyl diphosphate synthase (GGPPS) gene is the basis of the analysis of the relevant sequences and the directed evolution mutation, this example is summarized below with respect to the cloning acquisition of the existing geranylgeranyl diphosphate synthase (GGPPS) gene.
First, based on the gene sequence shown in GenBank accession No. NM _001325177.1, primer sequences for PCR amplification were designed as follows:
a forward primer: 5'-atgagatctatgaatcttgt-3' the flow of the air in the air conditioner,
reverse primer: 5'-attttcacgataagcaatgt-3', respectively;
subsequently, PCR amplification is carried out;
carrying out electrophoresis detection on the PCR amplification product, recovering and purifying, and connecting the recovered and purified PCR product with pGEMT plasmid;
subsequently, the ligation product was transformed into E.coli DH 5. alpha. competent cells, and after overnight culture, positive clones were selected and identified to obtain the recombinant correct cloning plasmid pGEMT-GGPPS.
In order to facilitate the subsequent expression of GGPPS, the inventor further constructs a recombinant expression vector by utilizing pET-32b (+) plasmid, and the specific process is as follows:
first, a pair of primer sequences containing Nde I and Xho I cleavage sites was designed as follows:
a forward primer: 5'-gctaatccatatgGAGCAATTCAATTTCAAAACT-3' the flow of the air in the air conditioner,
reverse primer: 5'-cagctcgagATTTTCACGATAAGCAATGTAAT-3', respectively;
then, using the clone plasmid pGEMT-GGPPS as a template to carry out PCR amplification so as to obtain a GGPPS gene, and recycling and purifying PCR amplification products;
subsequently, carrying out Nde I and Xho I double enzyme digestion on the GGPPS gene product obtained by PCR amplification and the pET-32b (+) plasmid respectively, and recovering enzyme digestion products for connection;
finally, the ligation product was transformed into DH5 α competent cells, cultured overnight, and positive clones were selected for identification, and the correctly identified recombinant expression plasmid vector was renamed: pET-32b (+) -GGPPS.
Example 2
In order to facilitate the detection and analysis of relevant mutation sites, the present application has been experimentally verified using a geranylgeranyl diphosphate synthase (GGPPS) recombinant engineered strain, and therefore, the construction process of this engineered strain is briefly described below in this example.
First, the recombinant vector pET-32b (+) -GGPPS constructed in example 1 was co-transformed with the PAC-94N plasmid to express E.coli BL21(DE3), and the empty vector pET-32b (+) was co-transformed with the PAC-94N plasmid to serve as a negative control strain;
subsequently, after overnight culture, positive clones were selected for identification, and strains of positive clones that were sequenced correctly were either preserved or subjected to further beta-carotene content testing.
The principle of detecting the content of the beta-carotene comprises the following steps: coli cannot produce GGPP by itself, and PAC-94N plasmid contains all genes of the beta-carotene synthesis pathway but does not contain geranylgeranyl diphosphate synthase encoding gene (FIG. 1A), so that when the recombinant vector pET-32b (+) -GGPPS and PAC-94N plasmid are co-transformed, the strain which can catalytically produce GGPP is changed from white to yellow, and the stronger the enzyme activity is, the darker the yellow is, and further the absorbance of OD440 is measured by a spectrophotometer, so that the content of the beta-carotene in the final product can be measured.
Example 3
Since no crystal structure report is available for tobacco GGPPS, the inventors have performed homologous modeling on the enzyme based on its amino acid sequence in order to analyze the protein. During modeling, the optimal template was 3kro, the score was 0.993 (TM-score was used to measure the degree of matching between two protein structure models, the score from 0 to 1, 1 means perfect match), the Root Mean Square Deviation (RMSD) was 0.36 a, the sequence Identity (IDEN) was 74.9%, and the protein structure coverage (Cov) was 99.7%.
Substrates C5-DMAPP, C10-GPP, C15-FPP were docked to the GGPPS catalytic pocket using the Rosetta _ docking program, respectively. The binding mode of the receptor small molecule compound is predicted by searching the optimal binding position for the action of the receptor small molecule compound and the enzyme. The final analysis results show that: the 154 th site, the 161 th site and the 218 th site are positioned in a catalytic pocket of the enzyme, the binding capacity of a substrate and an enzyme activity pocket can be further improved after amino acid substitution, and the sites participate in enzyme activity regulation and control; and the 209 th and 233 th amino acid sites are positioned on the surface of the enzyme molecule, which is favorable for the protein to form a dimer.
It should be noted that, the existing GGPPS is composed of 296 amino acids, the specific sequence is shown in SEQ ID No.1, the gene coding GGPPS is composed of 888 nucleotides, the specific sequence is shown in SEQ ID No.2, based on the above analysis, the inventor further constructs the GGPPS saturation mutant library aiming at the 5 sites, and the specific construction process is briefly described as follows.
First, degenerate primer sequences were designed based on the above 5 sites as follows:
V154-F:5’-ggaactgaagggttannkgctggacaagtagcg-3’,
I161-F: 5’-ggacaagtagcggatnnkgcttgtactggtaac-3’,
I209-F: 5’-gtggagaaattgaggnnkttcgcgagatgtatt-3’,
F218-F: 5’-tgtattggattattgnnkcaagtagtagatgat-3’,
V233-F: 5’-acaaagtcgtcggagnnkctcggaaaaaccgcc-3’,
general-R: 5’-cacgataagcaatgtaatccg-3’;
subsequently, whole plasmid PCR was carried out using pET-32b (+) -GGPPS constructed in example 1 as a template, and then V154-F, I161-F, I209-F, F218-F and V233-F as forward primers and general-R as reverse primers, respectively;
then, the PCR amplification product was subjected to DpnI digestion, and the digested product was co-transformed with PAC-94N double plasmid into E.coli BL21(DE3) competent cells, and after the transformation, the bacterial solution was spread on a plate containing 100. mu.g/L ampicillin and 34. mu.g/L chloramphenicol, to finally obtain a saturated mutation library.
For further screening, positive monoclonals were selected, cultured overnight in 20ml of LB liquid medium containing ampicillin (100. mu.g/L) and chloramphenicol (34. mu.g/L), induced at 18 ℃ for expression with IPTG at a final concentration of 0.1mM, centrifuged after completion of the culture, the supernatant was discarded, photographed, and resuspended in 3ml of acetone, and the OD440 value was measured.
And finally screening and determining the mutant with obviously enhanced absorbance by adopting the high-throughput screening method and further sequencing. The results show that when: val (V) at position 154 mutated to the neutral amino acid Ala (A) or to cysteine Cys (C); ile (I) amino acid at position 161 is mutated into neutral aliphatic amino acid Leu (L) or is mutated into amino acid Met (M) containing sulfhydryl; ile (I) amino acid at position 209 is mutated to a basic amino acid Lys (K), an amino acid Ser (S) containing a hydroxyl group, aspartic acid Asp (D), asparagine Asn (N), alanine Ala (A) or proline Pro (P); phe (F) amino acid 218 mutated to the aromatic amino acid tyrosine Tyr (Y) or to leucine Leu (L); when the 233 th amino acid Val is mutated into acidic amino acid, the acid is Glu (E) or tyrosine Tyr (Y); the geranylgeranyl diphosphate synthase GGPPS of tobacco has obvious change.
For position 154:
when the 154 site is mutated into A, the amino acid sequence is SEQ ID No. 13;
the corresponding base coding sequence is SEQ ID No. 14;
when the 154 site is mutated to C, the base sequence encoded by the mutant is SEQ ID No. 15.
For position 161:
when the 161 site is mutated into L, the encoding base sequence is SEQ ID No. 16;
when the 161 site is mutated to M, the coding base sequence is SEQ ID No. 17.
For position 218:
when the 218 site is mutated into L, the coding base sequence is SEQ ID No. 18;
when the 218 site is mutated to Y, the encoding base sequence is SEQ ID No. 19.
For site 209:
when the 209 site is mutated into P, the coding base sequence is SEQ ID No. 20;
when the 209 site is mutated into K, the coding base sequence is SEQ ID No. 21;
when the 209 site is mutated into D, the encoding base sequence is SEQ ID No. 22;
when the 209 site is mutated into N, the coding base sequence is SEQ ID No. 23;
when the 209 site is mutated into S, the coding base sequence is SEQ ID No. 24;
when the 209 th site is mutated to A, the coding base sequence is SEQ ID No. 25.
For position 233:
when the 233 site is mutated to E, the coding base sequence is SEQ ID No. 26;
when the 233 site is mutated to Y, the coding base sequence is SEQ ID No. 27.
Other specific base sequences of the double sites, the three sites or the five sites are referred to above, for example, the amino acid sequence of the five-site mutant protein GGPPS-154A/161L/209S/218Y/233E is SEQ ID No. 28;
in this case, the corresponding coding base sequence is SEQ ID No. 29.
Furthermore, the inventors have performed different combination type mutations at amino acid positions 154, 161, 209, 218 and 233, synthesized expression vectors with different combination mutations by using the existing genetic engineering technology, further constructed different strains by using the construction method of the geranylgeranyl diphosphate synthase (GGPPS) recombinant engineering strain in example 2, and examined the synthetic amount of beta-carotene. The specific test results are summarized below.
The results of the single-site experiments shown in FIGS. 1 to 5 show that:
when the 154 th Val is mutated into Ala or Cys, the 161 th Ile is mutated into Leu or Met, the 209 th Ile is mutated into Lys, Ser, aspartic acid Asp, asparagine Asn, alanine Ala and proline Pro, the 218 th Phe is mutated into Leu or Tyr, and the 233 th Val is mutated into Glu or Tyr, the color of the thallus or the synthetic amount of the beta-carotene is obviously improved;
as shown in FIGS. 6 to 8, the results of the two-site, three-site and five-site experiments show that the color of the cells or the amount of synthesized beta-carotene is significantly increased under the condition of different site combinations in a specific mutation direction.
The results show that the GGPPS enzyme activity can be better provided by aiming at the directional mutation evolution mode of a specific site, so that a good foundation is laid for further providing the beta-carotene content in crops.
Example 4
Based on the analysis and verification of example 3, the inventors further analyzed the GGPPS modification sites of the existing mainstream crops. The results of comparative analysis of amino acid sequences (as shown in FIG. 9, FIG. 9 shows only the 154-position comparison results) indicate that: similar to GGPPS of tobacco, the mainstream crops of pepper (cagggpps 1), potato (StGGPPS 1), wheat (tagggpps 1), corn (ZmGGPPS 1), salvia miltiorrhiza (SmGGPPS 1), coffee (CanGGPPS 1), carrot (DcGGPPS 1), grape (VvGGPPS), cucumber (csagpps), watermelon (ClGGPPS), apple (MdGGPPS 1), orange (csigpps), rubber tree (HbGGPPS), chrysanthemum (CmGGPPS), rice (OsGGPPS 1), tomato (SlGGPPS 1), castor bean (rcgggpps), banana (magggpps) and ginkgo biloba (GbGGPPS) all have 5 sites of GGPPS enzyme conserved, so that mutation modes based on the same site are feasible, or directional muteins obtained by the present application are directly utilized, and a new idea is provided for breeding of new varieties of these crops.
Sequence listing
<110> Guizhou province tobacco science research institute
<120>
<130>1
<160>29
<210>1
<211>296
<212> amino acid
<213>GGPPS
<400>1
EQFNFKTYVA EKAISVNKAL DEAVIVKDPP VIHEAMRYSL LAGGKRVRPM LCLAACELVG 60
GDQSNAMPAA CAVEMIHTMS LIHDDLPCMD NDDLRRGKPT NHKVYGEDVA VLAGDSLLAF 120
AFEYIATATA GVSPSRILAA IGELAKSIGT EGLVAGQVAD IACTGNPNVG LDTLEFIHIH 180
KTAALLEASV VLGAILGGGT DEEVEKLRIF ARCIGLLFQV VDDILDVTKS SEVLGKTAGK 240
DLAVDKTTYP KLLGLEKAKE FAAELNRDAK QQLVEFDPHK AAPLIALADY IAYREN 296
<210>2
<211>888
<212>DNA
<213>GGPPS
<400>2
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGATATTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>3
<211>20
<212>DNA
<213> Artificial Synthesis
<400>3
atgagatctatgaatcttgt
<210>4
<211>20
<212>DNA
<213> Artificial Synthesis
<400>4
attttcacgataagcaatgt
<210>5
<211>34
<212>DNA
<213> Artificial Synthesis
<400>5
gctaatccatatgGAGCAATTCAATTTCAAAACT
<210>6
<211>32
<212>DNA
<213> Artificial Synthesis
<400>6
cagctcgagATTTTCACGATAAGCAATGTAAT
<210>7
<211>33
<212>DNA
<213> Artificial Synthesis
<400>7
ggaactgaagggttannkgctggacaagtagcg
<210>8
<211>33
<212>DNA
<213> Artificial Synthesis
<400>8
ggacaagtagcggatnnkgcttgtactggtaac
<210>9
<211>33
<212>DNA
<213> Artificial Synthesis
<400>9
gtggagaaattgaggnnkttcgcgagatgtatt
<210>10
<211>30
<212>DNA
<213> Artificial Synthesis
<400>10
tgtattggattattgnnkcaagtagtagatgat
<210>11
<211>30
<212>DNA
<213> Artificial Synthesis
<400>11
acaaagtcgtcggagnnkctcggaaaaaccgcc
<210>12
<211>21
<212>DNA
<213> Artificial Synthesis
<400>12
cacgataagcaatgtaatccg
<210>13
<211>296
<212> amino acid
<213>GGPPS-154A
<400>13
EQFNFKTYVA EKAISVNKAL DEAVIVKDPP VIHEAMRYSL LAGGKRVRPM LCLAACELVG 60
GDQSNAMPAA CAVEMIHTMS LIHDDLPCMD NDDLRRGKPT NHKVYGEDVA VLAGDSLLAF 120
AFEYIATATA GVSPSRILAA IGELAKSIGT EGLAAGQVAD IACTGNPNVG LDTLEFIHIH 180
KTAALLEASV VLGAILGGGT DEEVEKLRIF ARCIGLLFQV VDDILDVTKS SEVLGKTAGK 240
DLAVDKTTYP KLLGLEKAKE FAAELNRDAK QQLVEFDPHK AAPLIALADY IAYREN 296
<210>14
<211>888
<212>DNA
<213>GGPPS-154A
<400>14
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG CGGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGATATTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>15
<211>888
<212>DNA
<213>GGPPS-154C
<400>15
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAT GCGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGATATTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>16
<211>888
<212>DNA
<213>GGPPS-161L
<400>16
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
CTGGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGATATTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>17
<211>888
<212>DNA
<213>GGPPS-161M
<400>17
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATGGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGATATTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>18
<211>888
<212>DNA
<213>GGPPS-218L
<400>18
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGATATTC GCGAGATGTA TTGGATTATT GCTGCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>19
<211>888
<212>DNA
<213>GGPPS-218Y
<400>19
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGATATTC GCGAGATGTA TTGGATTATT GTATCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>20
<211>888
<212>DNA
<213>GGPPS-209P
<400>20
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGCCGTTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>21
<211>888
<212>DNA
<213>GGPPS-209K
<400>21
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGAAATTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>22
<211>888
<212>DNA
<213>GGPPS-209D
<400>22
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGGATTTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>23
<211>888
<212>DNA
<213>GGPPS-209N
<400>23
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACGAAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGAATTTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>24
<211>888
<212>DNA
<213>GGPPS-209S
<400>24
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGAGCTTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>25
<211>888
<212>DNA
<213>GGPPS-209A
<400>25
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGGCGTTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGTGC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>26
<211>888
<212>DNA
<213>GGPPS-233E
<400>26
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGATATTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGAAC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>27
<211>888
<212>DNA
<213>GGPPS-233Y
<400>27
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG TAGCTGGACA AGTAGCGGAT 480
ATAGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGATATTC GCGAGATGTA TTGGATTATT GTTTCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGTATC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 888
<210>28
<211>296
<212> amino acid
<213>GGPPS-154A/161L/209S/218Y/233E
<400>28
EQFNFKTYVA EKAISVNKAL DEAVIVKDPP VIHEAMRYSL LAGGKRVRPM LCLAACELVG 60
GDQSNAMPAA CAVEMIHTMS LIHDDLPCMD NDDLRRGKPT NHKVYGEDVA VLAGDSLLAF 120
AFEYIATATA GVSPSRILAA IGELAKSIGT EGLAAGQVAD LACTGNPNVG LDTLEFIHIH 180
KTAALLEASV VLGAILGGGT DEEVEKLRSF ARCIGLLYQV VDDILDVTKS SEELGKTAGK 240
DLAVDKTTYP KLLGLEKAKE FAAELNRDAK QQLVEFDPHK AAPLIALADY IAYREN 296
<210>29
<211>888
<212>DNA
<213>GGPPS-154A/161L/209S/218Y/233E
<400>29
GAGCAATTCA ATTTCAAAAC TTACGTAGCT GAAAAGGCTA TTTCTGTAAA TAAAGCTTTA 60
GATGAGGCTG TTATAGTAAA AGACCCACCT GTGATCCACG AAGCAATGCG CTATTCACTT 120
CTCGCCGGCG GCAAAAGAGT CCGACCGATG CTCTGCCTCG CCGCCTGCGA GCTCGTCGGC 180
GGCGACCAAT CCAACGCCAT GCCGGCTGCT TGCGCCGTCG AGATGATCCA CACTATGTCC 240
CTCATTCACG ACGATTTACC TTGTATGGAT AACGACGATC TCCGCCGTGG AAAGCCGACG 300
AACCACAAAG TCTACGGCGA GGACGTGGCG GTCCTCGCCG GAGACTCGCT CCTCGCTTTC 360
GCCTTCGAGT ACATCGCCAC CGCTACCGCC GGAGTTTCAC CGTCGAGGAT CCTCGCCGCC 420
ATCGGCGAAC TGGCGAAATC CATCGGAACT GAAGGGTTAG CGGCTGGACA AGTAGCGGAT 480
CTGGCTTGTA CTGGTAACCC TAATGTTGGA CTCGACACAC TCGAATTCAT TCACATACAC 540
AAAACGGCGG CGCTTCTAGA AGCTTCCGTA GTTCTCGGAG CAATCCTCGG CGGCGGAACA 600
GATGAAGAAG TGGAGAAATT GAGGAGCTTC GCGAGATGTA TTGGATTATT GTATCAAGTA 660
GTAGATGATA TACTCGATGT TACAAAGTCG TCGGAGGAAC TCGGAAAAAC CGCCGGAAAA 720
GATTTGGCAG TAGATAAAAC GACGTATCCA AAACTGCTGG GATTGGAAAA GGCTAAGGAA 780
TTTGCGGCGG AGCTCAACCG AGATGCTAAA CAACAGCTGG TGGAATTTGA TCCACACAAA 840
GCTGCTCCCT TGATTGCTTT GGCGGATTAC ATTGCTTATC GTGAAAAT 880
Claims (4)
- GGPPS directed mutant protein, characterized in that: in the GGPPS protein base sequence, the 154 th Val amino acid is mutated into a neutral amino acid Ala or a cysteine Cys (C);or the Ile amino acid at the 161 th position is mutated into neutral aliphatic amino acid Leu or amino acid Met containing sulfydryl;or the Ile amino acid at the 209 th position is mutated into any one of basic amino acid Lys, amino acid Ser containing hydroxyl, aspartic acid Asp, asparagine Asn, alanine Ala or proline Pro;or, the 218 th Phe amino acid is mutated into aromatic amino acid tyrosine Tyr or leucine Leu;or, the 233 th amino acid Val is mutated into the acidic amino acid, and the acidic amino acid is Glu or tyrosine Tyr.
- 2. The GGPPS directed mutein of claim 1, wherein: the mutant protein is formed by combining any two, three or five unit sites of the mutant site.
- 3. The use of the GGPPS directed mutein according to claim 1 or 2 for the synthesis of crop pigments.
- 4. Use according to claim 3, characterized in that: the crops are as follows: tobacco, pepper, tomato, rice, corn, wheat, potato, castor-oil plant, apple, cucumber, watermelon, carrot, banana, orange, grape, coffee, ginkgo, salvia, chrysanthemum or rubber tree.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2020105100961 | 2020-06-08 | ||
CN202010510096 | 2020-06-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111718916A true CN111718916A (en) | 2020-09-29 |
Family
ID=72574275
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010814751.2A Pending CN111718916A (en) | 2020-06-08 | 2020-08-13 | GGPPS directed single-point mutant protein and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111718916A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5882909A (en) * | 1995-09-01 | 1999-03-16 | Toyota Jidosha Kabushiki Kaisha | Nucleic acid encoding mutant geranylgeranyl diphosphate synthase |
CN1252838A (en) * | 1997-12-16 | 2000-05-10 | 丰田自动车株式会社 | Geranyl diphosphate synthase genes |
CN101475946A (en) * | 2009-01-16 | 2009-07-08 | 上海师范大学 | Geranylgeranyl diphosphate synthase gene in salvia root, and encoding protein and use thereof |
CN111593032A (en) * | 2020-05-26 | 2020-08-28 | 中国烟草总公司郑州烟草研究院 | Directional five-site mutant protein GGPPS on enzyme pocket and enzyme molecule surface |
-
2020
- 2020-08-13 CN CN202010814751.2A patent/CN111718916A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5882909A (en) * | 1995-09-01 | 1999-03-16 | Toyota Jidosha Kabushiki Kaisha | Nucleic acid encoding mutant geranylgeranyl diphosphate synthase |
CN1252838A (en) * | 1997-12-16 | 2000-05-10 | 丰田自动车株式会社 | Geranyl diphosphate synthase genes |
CN101475946A (en) * | 2009-01-16 | 2009-07-08 | 上海师范大学 | Geranylgeranyl diphosphate synthase gene in salvia root, and encoding protein and use thereof |
CN111593032A (en) * | 2020-05-26 | 2020-08-28 | 中国烟草总公司郑州烟草研究院 | Directional five-site mutant protein GGPPS on enzyme pocket and enzyme molecule surface |
Non-Patent Citations (3)
Title |
---|
IRINA ORLOVA ET AL.: "The Small Subunit of Snapdragon Geranyl Diphosphate Synthase Modifies the Chain Length Specificity of Tobacco Geranylgeranyl Diphosphate Synthase in Planta", 《THE PLANT CELL》 * |
唐美琼等: "植物GGPPS基因研究进展", 《湖北农业科学》 * |
魏攀等: "烟草牻牛儿基牻牛儿基焦磷酸合成酶基因NtGGPPS1的克隆和功能分析", 《烟草科技》 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107686840B (en) | Pears transcription factor PyERF3 and its recombinant expression carrier and application | |
CN101993482B (en) | Protein associated with long grain foliaceous of paddy rice and coding gene and application thereof | |
CN113957086B (en) | Application of SlBIN2 gene in regulation and control of tomato fruit ripening and carotenoid synthesis | |
Gabruk et al. | The origin, evolution and diversification of multiple isoforms of light-dependent protochlorophyllide oxidoreductase (LPOR): focus on angiosperms | |
CN111593032B (en) | Directional five-site mutant protein GGPPS on enzyme pocket and enzyme molecule surface | |
CN109517812A (en) | Azalea chalcone synthase RsCHS albumen and its encoding gene | |
CN111500551B (en) | GGPPS directed single point mutant protein GGPPS-218 | |
CN113265381B (en) | Separated CYP450 protein, coding gene thereof and application thereof | |
CN107326035B (en) | Deubiquitinating enzyme gene UBP5 for regulating rice grain shape and leaf color and application thereof | |
EP2922961B1 (en) | Nucleotide sequence encoding wuschel-related homeobox4 (wox4) protein from corchorus olitorius and corchorus capsularis and methods of use for same | |
CN111534496B (en) | GGPPS Directional single-point mutant protein GGPPS-154 | |
CN111549009B (en) | Directional three-point mutant protein GGPPS-154-161-218 | |
CN111549010B (en) | GGPPS directional double-site mutant protein 209-233 | |
CN111534497B (en) | GGPPS directed single point mutant protein GGPPS-161 | |
CN111718916A (en) | GGPPS directed single-point mutant protein and application thereof | |
CN111534499A (en) | GGPPS directed single point mutant protein GGPPS-233 | |
Slugina et al. | Structural and functional diversification of SEPALLATA genes TM5 and RIN in tomato species (section Lycopersicon) | |
CN108395473A (en) | Carotenoid in Plants synthesis associated protein and its encoding gene and application | |
CN114196678A (en) | Hedgehog interacting protein OsHIPL1 gene for controlling rice seed vigor and engineering application thereof | |
CN108504665B (en) | Cauliflower internal reference gene and application thereof | |
CN104388446B (en) | Pinellia ternate squalene synthase gene as well as coded protein and application thereof | |
CN117737026A (en) | GGPPS muteins related to carotenoid content in wheat kernel | |
CN106811448B (en) | Cotton tyrosine phosphatase GhPTP1 and its encoding gene and application | |
CN104694557B (en) | A kind of Mang ox base Mang ox base diphosphate synthase genes and its encoding proteins and application | |
CN108004258A (en) | The protein of cape jasmine lycopene beta cyclase b2 genes and its coding, the gene of optimization and their application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200929 |
|
RJ01 | Rejection of invention patent application after publication |