CN113683712A - 甜菊醇糖苷 - Google Patents
甜菊醇糖苷 Download PDFInfo
- Publication number
- CN113683712A CN113683712A CN202110909254.5A CN202110909254A CN113683712A CN 113683712 A CN113683712 A CN 113683712A CN 202110909254 A CN202110909254 A CN 202110909254A CN 113683712 A CN113683712 A CN 113683712A
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- ser
- val
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 235000019202 steviosides Nutrition 0.000 title claims abstract description 173
- 239000004383 Steviol glycoside Substances 0.000 title claims abstract description 160
- 229930182488 steviol glycoside Natural products 0.000 title claims abstract description 160
- 235000019411 steviol glycoside Nutrition 0.000 title claims abstract description 160
- 150000008144 steviol glycosides Chemical class 0.000 title claims abstract description 160
- 235000000346 sugar Nutrition 0.000 claims abstract description 65
- QFVOYBUQQBFCRH-UHFFFAOYSA-N Steviol Natural products C1CC2(C3)CC(=C)C3(O)CCC2C2(C)C1C(C)(C(O)=O)CCC2 QFVOYBUQQBFCRH-UHFFFAOYSA-N 0.000 claims abstract description 32
- QFVOYBUQQBFCRH-VQSWZGCSSA-N steviol Chemical compound C([C@@]1(O)C(=C)C[C@@]2(C1)CC1)C[C@H]2[C@@]2(C)[C@H]1[C@](C)(C(O)=O)CCC2 QFVOYBUQQBFCRH-VQSWZGCSSA-N 0.000 claims abstract description 32
- 229940032084 steviol Drugs 0.000 claims abstract description 32
- TWCMVXMQHSVIOJ-UHFFFAOYSA-N Aglycone of yadanzioside D Natural products COC(=O)C12OCC34C(CC5C(=CC(O)C(O)C5(C)C3C(O)C1O)C)OC(=O)C(OC(=O)C)C24 TWCMVXMQHSVIOJ-UHFFFAOYSA-N 0.000 claims abstract description 9
- PLMKQQMDOMTZGG-UHFFFAOYSA-N Astrantiagenin E-methylester Natural products CC12CCC(O)C(C)(CO)C1CCC1(C)C2CC=C2C3CC(C)(C)CCC3(C(=O)OC)CCC21C PLMKQQMDOMTZGG-UHFFFAOYSA-N 0.000 claims abstract description 9
- PFOARMALXZGCHY-UHFFFAOYSA-N homoegonol Natural products C1=C(OC)C(OC)=CC=C1C1=CC2=CC(CCCO)=CC(OC)=C2O1 PFOARMALXZGCHY-UHFFFAOYSA-N 0.000 claims abstract description 9
- 239000000203 mixture Substances 0.000 claims description 64
- 238000000034 method Methods 0.000 claims description 55
- 229920001184 polypeptide Polymers 0.000 claims description 54
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 54
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 54
- 238000000855 fermentation Methods 0.000 claims description 37
- 230000004151 fermentation Effects 0.000 claims description 37
- 235000003599 food sweetener Nutrition 0.000 claims description 26
- 239000003765 sweetening agent Substances 0.000 claims description 26
- 235000013305 food Nutrition 0.000 claims description 25
- 235000013361 beverage Nutrition 0.000 claims description 24
- 150000007523 nucleic acids Chemical group 0.000 claims description 18
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 17
- 210000005253 yeast cell Anatomy 0.000 claims description 17
- 239000000796 flavoring agent Substances 0.000 claims description 13
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 claims description 13
- 235000019634 flavors Nutrition 0.000 claims description 11
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 125000000837 carbohydrate group Chemical group 0.000 claims 1
- 241000282326 Felis catus Species 0.000 description 51
- 230000000694 effects Effects 0.000 description 42
- 108010050848 glycylleucine Proteins 0.000 description 40
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 34
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 34
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 30
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 29
- 229930188195 rebaudioside Natural products 0.000 description 29
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 28
- 108020004414 DNA Proteins 0.000 description 27
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 26
- 229960001031 glucose Drugs 0.000 description 26
- 239000008103 glucose Substances 0.000 description 26
- 108010038633 aspartylglutamate Proteins 0.000 description 24
- 108010049041 glutamylalanine Proteins 0.000 description 24
- 210000004027 cell Anatomy 0.000 description 21
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 20
- 108010003700 lysyl aspartic acid Proteins 0.000 description 20
- 108010018006 histidylserine Proteins 0.000 description 19
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 18
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 18
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 18
- 235000001727 glucose Nutrition 0.000 description 18
- 108010034529 leucyl-lysine Proteins 0.000 description 18
- 239000002609 medium Substances 0.000 description 18
- 108010031719 prolyl-serine Proteins 0.000 description 18
- 108010026333 seryl-proline Proteins 0.000 description 18
- 102000004190 Enzymes Human genes 0.000 description 16
- 108090000790 Enzymes Proteins 0.000 description 16
- UEDUENGHJMELGK-HYDKPPNVSA-N Stevioside Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O UEDUENGHJMELGK-HYDKPPNVSA-N 0.000 description 16
- 108010068265 aspartyltyrosine Proteins 0.000 description 16
- 229940088598 enzyme Drugs 0.000 description 16
- GSGVXNMGMKBGQU-PHESRWQRSA-N rebaudioside M Chemical compound C[C@@]12CCC[C@](C)([C@H]1CC[C@@]13CC(=C)[C@@](C1)(CC[C@@H]23)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O[C@@H]2O[C@H](CO)[C@@H](O)[C@H](O)[C@H]2O)[C@H]1O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O)C(=O)O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O[C@@H]2O[C@H](CO)[C@@H](O)[C@H](O)[C@H]2O)[C@H]1O[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O GSGVXNMGMKBGQU-PHESRWQRSA-N 0.000 description 16
- 108010061238 threonyl-glycine Proteins 0.000 description 16
- 101000640793 Homo sapiens UDP-galactose translocator Proteins 0.000 description 15
- 101000672037 Homo sapiens UDP-glucose:glycoprotein glucosyltransferase 2 Proteins 0.000 description 15
- 102100023897 NADPH-cytochrome P450 reductase Human genes 0.000 description 15
- 102100040361 UDP-glucose:glycoprotein glucosyltransferase 2 Human genes 0.000 description 15
- 238000006243 chemical reaction Methods 0.000 description 15
- 239000012634 fragment Substances 0.000 description 15
- 230000008569 process Effects 0.000 description 15
- 108090000623 proteins and genes Proteins 0.000 description 15
- HELXLJCILKEWJH-NCGAPWICSA-N rebaudioside A Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HELXLJCILKEWJH-NCGAPWICSA-N 0.000 description 15
- RPYRMTHVSUWHSV-CUZJHZIBSA-N rebaudioside D Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RPYRMTHVSUWHSV-CUZJHZIBSA-N 0.000 description 15
- 108010073969 valyllysine Proteins 0.000 description 15
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 14
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 14
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 14
- 108010047495 alanylglycine Proteins 0.000 description 14
- 150000001875 compounds Chemical class 0.000 description 14
- 108010078144 glutaminyl-glycine Proteins 0.000 description 14
- 108010081551 glycylphenylalanine Proteins 0.000 description 14
- OHHNJQXIOPOJSC-UHFFFAOYSA-N stevioside Natural products CC1(CCCC2(C)C3(C)CCC4(CC3(CCC12C)CC4=C)OC5OC(CO)C(O)C(O)C5OC6OC(CO)C(O)C(O)C6O)C(=O)OC7OC(CO)C(O)C(O)C7O OHHNJQXIOPOJSC-UHFFFAOYSA-N 0.000 description 13
- 229940013618 stevioside Drugs 0.000 description 13
- 238000012546 transfer Methods 0.000 description 13
- 108010020532 tyrosyl-proline Proteins 0.000 description 13
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 12
- 240000008415 Lactuca sativa Species 0.000 description 12
- 235000003228 Lactuca sativa Nutrition 0.000 description 12
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 12
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 12
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 12
- 241000235648 Pichia Species 0.000 description 12
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 12
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 12
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 12
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 12
- 108010085325 histidylproline Proteins 0.000 description 12
- 230000010354 integration Effects 0.000 description 12
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 12
- 238000002156 mixing Methods 0.000 description 12
- OMHUCGDTACNQEX-OSHKXICASA-N steviolbioside Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(O)=O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O OMHUCGDTACNQEX-OSHKXICASA-N 0.000 description 12
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 11
- 108010093581 aspartyl-proline Proteins 0.000 description 11
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 11
- 108010064235 lysylglycine Proteins 0.000 description 11
- 239000002773 nucleotide Substances 0.000 description 11
- 125000003729 nucleotide group Chemical group 0.000 description 11
- 230000002018 overexpression Effects 0.000 description 11
- 230000009466 transformation Effects 0.000 description 11
- 102100034689 2-hydroxyacylsphingosine 1-beta-galactosyltransferase Human genes 0.000 description 10
- 101100371757 Dactylopius coccus UGT4 gene Proteins 0.000 description 10
- 239000001512 FEMA 4601 Substances 0.000 description 10
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 10
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 10
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 10
- 108010047562 NGR peptide Proteins 0.000 description 10
- HELXLJCILKEWJH-SEAGSNCFSA-N Rebaudioside A Natural products O=C(O[C@H]1[C@@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1)[C@@]1(C)[C@@H]2[C@](C)([C@H]3[C@@]4(CC(=C)[C@@](O[C@H]5[C@H](O[C@H]6[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O6)[C@@H](O[C@H]6[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O6)[C@H](O)[C@@H](CO)O5)(C4)CC3)CC2)CCC1 HELXLJCILKEWJH-SEAGSNCFSA-N 0.000 description 10
- 241000235070 Saccharomyces Species 0.000 description 10
- 101710148271 UDP-glucose:glycoprotein glucosyltransferase 1 Proteins 0.000 description 10
- 102100029151 UDP-glucuronosyltransferase 1A10 Human genes 0.000 description 10
- 101150105569 Ugt8 gene Proteins 0.000 description 10
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 10
- 230000008878 coupling Effects 0.000 description 10
- 238000010168 coupling process Methods 0.000 description 10
- 238000005859 coupling reaction Methods 0.000 description 10
- HELXLJCILKEWJH-UHFFFAOYSA-N entered according to Sigma 01432 Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC(C1OC2C(C(O)C(O)C(CO)O2)O)OC(CO)C(O)C1OC1OC(CO)C(O)C(O)C1O HELXLJCILKEWJH-UHFFFAOYSA-N 0.000 description 10
- 108010015792 glycyllysine Proteins 0.000 description 10
- 108010057821 leucylproline Proteins 0.000 description 10
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 10
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 10
- 108010051242 phenylalanylserine Proteins 0.000 description 10
- 235000019203 rebaudioside A Nutrition 0.000 description 10
- 108010071207 serylmethionine Proteins 0.000 description 10
- 239000000126 substance Substances 0.000 description 10
- 101100048055 Arabidopsis thaliana UGT85A5 gene Proteins 0.000 description 9
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 9
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 9
- 108010079364 N-glycylalanine Proteins 0.000 description 9
- 101100371750 Pleuronectes platessa ugt3 gene Proteins 0.000 description 9
- 108010044940 alanylglutamine Proteins 0.000 description 9
- BDAGIHXWWSANSR-PFUFQJKNSA-N formic acid-d2 Chemical compound [2H]OC([2H])=O BDAGIHXWWSANSR-PFUFQJKNSA-N 0.000 description 9
- 230000014509 gene expression Effects 0.000 description 9
- -1 hydroxyl hydrogen Chemical compound 0.000 description 9
- 108010012058 leucyltyrosine Proteins 0.000 description 9
- 235000018102 proteins Nutrition 0.000 description 9
- 102000004169 proteins and genes Human genes 0.000 description 9
- QSIDJGUAAUSPMG-CULFPKEHSA-N steviolmonoside Chemical compound O([C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(O)=O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O QSIDJGUAAUSPMG-CULFPKEHSA-N 0.000 description 9
- 239000000758 substrate Substances 0.000 description 9
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 8
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 8
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 8
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 8
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 8
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 8
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 8
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 8
- 101150084072 ERG20 gene Proteins 0.000 description 8
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 8
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 8
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 8
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 8
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 8
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 8
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 8
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 8
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 8
- 238000005481 NMR spectroscopy Methods 0.000 description 8
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 8
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 8
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 8
- 244000228451 Stevia rebaudiana Species 0.000 description 8
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 8
- 108090000992 Transferases Proteins 0.000 description 8
- 102000004357 Transferases Human genes 0.000 description 8
- LDMUNXDDIDAPJH-VMBFOHBNSA-N Trp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LDMUNXDDIDAPJH-VMBFOHBNSA-N 0.000 description 8
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 8
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 8
- 108010070944 alanylhistidine Proteins 0.000 description 8
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 8
- 150000001413 amino acids Chemical group 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 235000015203 fruit juice Nutrition 0.000 description 8
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 8
- 238000003919 heteronuclear multiple bond coherence Methods 0.000 description 8
- 108010092114 histidylphenylalanine Proteins 0.000 description 8
- 108010027338 isoleucylcysteine Proteins 0.000 description 8
- 108010017391 lysylvaline Proteins 0.000 description 8
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 8
- 108010005942 methionylglycine Proteins 0.000 description 8
- 108010012581 phenylalanylglutamate Proteins 0.000 description 8
- 108010029020 prolylglycine Proteins 0.000 description 8
- 108010015796 prolylisoleucine Proteins 0.000 description 8
- 108010048818 seryl-histidine Proteins 0.000 description 8
- 108010038745 tryptophylglycine Proteins 0.000 description 8
- 101150080339 BTS1 gene Proteins 0.000 description 7
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 7
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 7
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 7
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 7
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 7
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 7
- 229930006000 Sucrose Natural products 0.000 description 7
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 7
- 108010064739 ent-kaurene synthetase B Proteins 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 7
- 108010056582 methionylglutamic acid Proteins 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 239000005720 sucrose Substances 0.000 description 7
- 108010080629 tryptophan-leucine Proteins 0.000 description 7
- 239000003643 water by type Substances 0.000 description 7
- IVZWRQBQDVHDNG-UHFFFAOYSA-N (-)-Kauran; alpha-Dihydrokauren Natural products C1CC2C3(C)CCCC(C)(C)C3CCC22CC(C)C1C2 IVZWRQBQDVHDNG-UHFFFAOYSA-N 0.000 description 6
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 6
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 6
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 6
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 6
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 6
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 6
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 6
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 6
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 6
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 6
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 6
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 6
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 6
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 6
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 6
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 6
- 241000954177 Bangana ariza Species 0.000 description 6
- 108090000201 Carboxypeptidase B2 Proteins 0.000 description 6
- 108010051219 Cre recombinase Proteins 0.000 description 6
- GFMJUESGWILPEN-MELADBBJSA-N Cys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CS)N)C(=O)O GFMJUESGWILPEN-MELADBBJSA-N 0.000 description 6
- 108010090461 DFG peptide Proteins 0.000 description 6
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 6
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 6
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 6
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 6
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 6
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 6
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 6
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 6
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 6
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 6
- OFIHURVSQXAZIR-SZMVWBNQSA-N Glu-Lys-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OFIHURVSQXAZIR-SZMVWBNQSA-N 0.000 description 6
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 6
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 6
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 6
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 6
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 6
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 6
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 description 6
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 6
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 6
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 6
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 6
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 6
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 6
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 6
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 6
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 6
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 6
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 6
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 6
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 6
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 6
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 6
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 6
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 6
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 6
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 6
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 6
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 6
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 6
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 6
- 101100390535 Mus musculus Fdft1 gene Proteins 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 6
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 6
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 6
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 6
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 6
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 6
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 6
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 6
- QSHKTZVJGDVFEW-GUBZILKMSA-N Ser-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N QSHKTZVJGDVFEW-GUBZILKMSA-N 0.000 description 6
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 6
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 6
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 6
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 6
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 6
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 6
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 6
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 6
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 6
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 6
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 6
- XDGPTBVOSHKDFT-KKUMJFAQSA-N Tyr-Met-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XDGPTBVOSHKDFT-KKUMJFAQSA-N 0.000 description 6
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 6
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 6
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 6
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 6
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 6
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 6
- 229910052799 carbon Inorganic materials 0.000 description 6
- 235000009508 confectionery Nutrition 0.000 description 6
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 6
- 108010087823 glycyltyrosine Proteins 0.000 description 6
- 108010028295 histidylhistidine Proteins 0.000 description 6
- 150000002500 ions Chemical class 0.000 description 6
- 229930001567 kaurane Natural products 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 108010085203 methionylmethionine Proteins 0.000 description 6
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 6
- 102000039446 nucleic acids Human genes 0.000 description 6
- 108020004707 nucleic acids Proteins 0.000 description 6
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 6
- 108010070643 prolylglutamic acid Proteins 0.000 description 6
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 6
- 108010005652 splenotritin Proteins 0.000 description 6
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 6
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 6
- 229940045145 uridine Drugs 0.000 description 6
- 239000011782 vitamin Substances 0.000 description 6
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 5
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 5
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 5
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 5
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 5
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 5
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 5
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 5
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 5
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 5
- 241000235649 Kluyveromyces Species 0.000 description 5
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 5
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 5
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 5
- 108010045510 NADPH-Ferrihemoprotein Reductase Proteins 0.000 description 5
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 5
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 5
- YWPVROCHNBYFTP-UHFFFAOYSA-N Rubusoside Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC1OC(CO)C(O)C(O)C1O YWPVROCHNBYFTP-UHFFFAOYSA-N 0.000 description 5
- 101100427140 Stevia rebaudiana UGT74G1 gene Proteins 0.000 description 5
- 101100262416 Stevia rebaudiana UGT76G1 gene Proteins 0.000 description 5
- 101100048059 Stevia rebaudiana UGT85C2 gene Proteins 0.000 description 5
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 5
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 5
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 description 5
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 5
- 235000001014 amino acid Nutrition 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 150000001720 carbohydrates Chemical group 0.000 description 5
- 229930004069 diterpene Natural products 0.000 description 5
- 235000013399 edible fruits Nutrition 0.000 description 5
- 229930182830 galactose Natural products 0.000 description 5
- 238000004896 high resolution mass spectrometry Methods 0.000 description 5
- 108010025306 histidylleucine Proteins 0.000 description 5
- 108010000761 leucylarginine Proteins 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 5
- 239000006072 paste Substances 0.000 description 5
- 230000037361 pathway Effects 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 238000001896 rotating frame Overhauser effect spectroscopy Methods 0.000 description 5
- 235000013311 vegetables Nutrition 0.000 description 5
- 229940088594 vitamin Drugs 0.000 description 5
- 235000013343 vitamin Nutrition 0.000 description 5
- 229930003231 vitamin Natural products 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- 229920001817 Agar Polymers 0.000 description 4
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 4
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 4
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 4
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 4
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 4
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 4
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 4
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 4
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 4
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 4
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 4
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 4
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 4
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 4
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 4
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 4
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 4
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 4
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 4
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 4
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 4
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 4
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 4
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 4
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 4
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 4
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 4
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 4
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 4
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 4
- JYHIVHINLJUIEG-BVSLBCMMSA-N Arg-Tyr-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYHIVHINLJUIEG-BVSLBCMMSA-N 0.000 description 4
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 4
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 4
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 4
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 4
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 4
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 4
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 4
- FVKHEKVYFTZWDX-GHCJXIJMSA-N Asn-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FVKHEKVYFTZWDX-GHCJXIJMSA-N 0.000 description 4
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 4
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 4
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 description 4
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 4
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 4
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 4
- TZQWZQSMHDVLQL-QEJZJMRPSA-N Asn-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N TZQWZQSMHDVLQL-QEJZJMRPSA-N 0.000 description 4
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 4
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 4
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 4
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 4
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 4
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 4
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 4
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 4
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 4
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 4
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 4
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 4
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 4
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 4
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 4
- CMBDUPIBCOEWNE-BJDJZHNGSA-N Asp-Leu-Asp-Gln Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CMBDUPIBCOEWNE-BJDJZHNGSA-N 0.000 description 4
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 4
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 4
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 4
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 4
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 4
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 4
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 4
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 4
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 241000589174 Bradyrhizobium japonicum Species 0.000 description 4
- 101100246550 Caenorhabditis elegans pyr-1 gene Proteins 0.000 description 4
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 4
- 102000005575 Cellulases Human genes 0.000 description 4
- 108010084185 Cellulases Proteins 0.000 description 4
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 4
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 4
- ATPDEYTYWVMINF-ZLUOBGJFSA-N Cys-Cys-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ATPDEYTYWVMINF-ZLUOBGJFSA-N 0.000 description 4
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 4
- DIHCYBRLTVEPBW-SRVKXCTJSA-N Cys-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N DIHCYBRLTVEPBW-SRVKXCTJSA-N 0.000 description 4
- HPZAJRPYUIHDIN-BZSNNMDCSA-N Cys-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N HPZAJRPYUIHDIN-BZSNNMDCSA-N 0.000 description 4
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 4
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 4
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 4
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 4
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 4
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 4
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 4
- ATTWDCRXQNKRII-GUBZILKMSA-N Gln-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ATTWDCRXQNKRII-GUBZILKMSA-N 0.000 description 4
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 4
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 4
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 4
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 4
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 4
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 4
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 4
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 4
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 4
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 4
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 4
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 4
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 4
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 4
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 4
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 4
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 4
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 4
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 4
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 4
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 4
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 4
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 4
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 4
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 4
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 4
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 4
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 4
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 4
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 4
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 4
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 4
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 4
- 102000000340 Glucosyltransferases Human genes 0.000 description 4
- 108010055629 Glucosyltransferases Proteins 0.000 description 4
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 4
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 4
- YDWZGVCXMVLDQH-WHFBIAKZSA-N Gly-Cys-Asn Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O YDWZGVCXMVLDQH-WHFBIAKZSA-N 0.000 description 4
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 4
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 4
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 4
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 4
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 4
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 4
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 4
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 4
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 4
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 4
- 229920002488 Hemicellulose Polymers 0.000 description 4
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 4
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 4
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 4
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 4
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 4
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 4
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 4
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 4
- SWBUZLFWGJETAO-KKUMJFAQSA-N His-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O SWBUZLFWGJETAO-KKUMJFAQSA-N 0.000 description 4
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 4
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 4
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 4
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 4
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 4
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 4
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 4
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 4
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 4
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 4
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 4
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 4
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 4
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 4
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 4
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 4
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 4
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 4
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 4
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 4
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 4
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 4
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 4
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 4
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 4
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 4
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 4
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 4
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 4
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 4
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 4
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 4
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 4
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 4
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 4
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 4
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 4
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 4
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 4
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 4
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 4
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 4
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 4
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 4
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 4
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 4
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 4
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 4
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 4
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 4
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 4
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 4
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 4
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 4
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 4
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 4
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 4
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 4
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 4
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 4
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 4
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 4
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 4
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 4
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 4
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 4
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 4
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 4
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 4
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 4
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 4
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 4
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 4
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 4
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 4
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 4
- JYCQGAGDJQYEDB-GUBZILKMSA-N Met-Gln-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JYCQGAGDJQYEDB-GUBZILKMSA-N 0.000 description 4
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 4
- CUICVBQQHMKBRJ-LSJOCFKGSA-N Met-His-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O CUICVBQQHMKBRJ-LSJOCFKGSA-N 0.000 description 4
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 4
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 4
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 4
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 4
- JOYFULUKJRJCSX-IUCAKERBSA-N Met-Met-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O JOYFULUKJRJCSX-IUCAKERBSA-N 0.000 description 4
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 4
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 4
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 4
- 108091005461 Nucleic proteins Proteins 0.000 description 4
- 102000004316 Oxidoreductases Human genes 0.000 description 4
- 108090000854 Oxidoreductases Proteins 0.000 description 4
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 4
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 4
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 4
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 4
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 4
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 4
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 4
- NJONQBYLTANINY-IHPCNDPISA-N Phe-Trp-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(N)=O)C(O)=O NJONQBYLTANINY-IHPCNDPISA-N 0.000 description 4
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 4
- 240000000020 Picea glauca Species 0.000 description 4
- 235000008127 Picea glauca Nutrition 0.000 description 4
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 4
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 4
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 4
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 4
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 4
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 4
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 4
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 4
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 4
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 4
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 4
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 4
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 4
- SWRNSCMUXRLHCR-ULQDDVLXSA-N Pro-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 SWRNSCMUXRLHCR-ULQDDVLXSA-N 0.000 description 4
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 4
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 4
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 4
- RJTUIDFUUHPJMP-FHWLQOOXSA-N Pro-Trp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CN=CN4)C(=O)O RJTUIDFUUHPJMP-FHWLQOOXSA-N 0.000 description 4
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 4
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 4
- GIPHUOWOTCAJSR-UHFFFAOYSA-N Rebaudioside A. Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC1OC(CO)C(O)C(O)C1OC(C1O)OC(CO)C(O)C1OC1OC(CO)C(O)C(O)C1O GIPHUOWOTCAJSR-UHFFFAOYSA-N 0.000 description 4
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 4
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 4
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 4
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 4
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 4
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 4
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 4
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 4
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 4
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 4
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 4
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 4
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 4
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 4
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 4
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 4
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 4
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 4
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 4
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 4
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 4
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 4
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 4
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 4
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 4
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 4
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 4
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 4
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 4
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 4
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 4
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 4
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 4
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 4
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 4
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 4
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 4
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 4
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 4
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 4
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 4
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 4
- 229920002472 Starch Polymers 0.000 description 4
- 244000269722 Thea sinensis Species 0.000 description 4
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 4
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 4
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 4
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 4
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 4
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 4
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 4
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 4
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 4
- QFCQNHITJPRQTB-IEGACIPQSA-N Thr-Lys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O QFCQNHITJPRQTB-IEGACIPQSA-N 0.000 description 4
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 4
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 4
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 4
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 4
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 4
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 4
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 4
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 4
- RYXOUTORDIUWNI-BPUTZDHNSA-N Trp-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RYXOUTORDIUWNI-BPUTZDHNSA-N 0.000 description 4
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 4
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 4
- MEZCXKYMMQJRDE-PMVMPFDFSA-N Trp-Leu-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=C(O)C=C1 MEZCXKYMMQJRDE-PMVMPFDFSA-N 0.000 description 4
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 4
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 4
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 4
- SEFNTZYRPGBDCY-IHRRRGAJSA-N Tyr-Arg-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O SEFNTZYRPGBDCY-IHRRRGAJSA-N 0.000 description 4
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 4
- DXUVJJRTVACXSO-KKUMJFAQSA-N Tyr-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DXUVJJRTVACXSO-KKUMJFAQSA-N 0.000 description 4
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 4
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 4
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 4
- YIKDYZDNRCNFQB-KKUMJFAQSA-N Tyr-His-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O YIKDYZDNRCNFQB-KKUMJFAQSA-N 0.000 description 4
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 4
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 4
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 4
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 4
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 4
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 4
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 4
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 4
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 4
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 4
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 4
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 4
- XCCTYIAWTASOJW-UHFFFAOYSA-N UDP-Glc Natural products OC1C(O)C(COP(O)(=O)OP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-UHFFFAOYSA-N 0.000 description 4
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 4
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 4
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 4
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 4
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 4
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 4
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 4
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 4
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 4
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 4
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 4
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 4
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 4
- DLMNFMXSNGTSNJ-PYJNHQTQSA-N Val-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N DLMNFMXSNGTSNJ-PYJNHQTQSA-N 0.000 description 4
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 4
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 4
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 4
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 4
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 4
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 4
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 4
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 4
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 4
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 4
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 4
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 4
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 4
- WFTKOJGOOUJLJV-VKOGCVSHSA-N Val-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O)NC(=O)[C@@H]([NH3+])C(C)C)=CNC2=C1 WFTKOJGOOUJLJV-VKOGCVSHSA-N 0.000 description 4
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 125000004429 atom Chemical group 0.000 description 4
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 4
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 239000001913 cellulose Substances 0.000 description 4
- 229920002678 cellulose Polymers 0.000 description 4
- 235000015218 chewing gum Nutrition 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 238000000132 electrospray ionisation Methods 0.000 description 4
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 4
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 4
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 238000005570 heteronuclear single quantum coherence Methods 0.000 description 4
- NIKHGUQULKYIGE-UHFFFAOYSA-N kaurenoic acid Natural products C1CC2(CC3=C)CC3CCC2C2(C)C1C(C)(C(O)=O)CCC2 NIKHGUQULKYIGE-UHFFFAOYSA-N 0.000 description 4
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- WRUGWIBCXHJTDG-UHFFFAOYSA-L magnesium sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Mg+2].[O-]S([O-])(=O)=O WRUGWIBCXHJTDG-UHFFFAOYSA-L 0.000 description 4
- 230000014759 maintenance of location Effects 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 4
- 235000013336 milk Nutrition 0.000 description 4
- 239000008267 milk Substances 0.000 description 4
- 210000004080 milk Anatomy 0.000 description 4
- 235000013615 non-nutritive sweetener Nutrition 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 239000001301 oxygen Substances 0.000 description 4
- 229910052760 oxygen Inorganic materials 0.000 description 4
- 229920001277 pectin Polymers 0.000 description 4
- 239000001814 pectin Substances 0.000 description 4
- 235000010987 pectin Nutrition 0.000 description 4
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 4
- 108010084572 phenylalanyl-valine Proteins 0.000 description 4
- 108010024607 phenylalanylalanine Proteins 0.000 description 4
- 108010018625 phenylalanylarginine Proteins 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- QSRAJVGDWKFOGU-WBXIDTKBSA-N rebaudioside c Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]1(CC[C@H]2[C@@]3(C)[C@@H]([C@](CCC3)(C)C(=O)O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)CC3)C(=C)C[C@]23C1 QSRAJVGDWKFOGU-WBXIDTKBSA-N 0.000 description 4
- 238000005215 recombination Methods 0.000 description 4
- 238000011084 recovery Methods 0.000 description 4
- YWPVROCHNBYFTP-OSHKXICASA-N rubusoside Chemical compound O([C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O YWPVROCHNBYFTP-OSHKXICASA-N 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 235000014214 soft drink Nutrition 0.000 description 4
- 238000001228 spectrum Methods 0.000 description 4
- 239000008107 starch Substances 0.000 description 4
- 235000019698 starch Nutrition 0.000 description 4
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 4
- 238000001551 total correlation spectroscopy Methods 0.000 description 4
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 150000003722 vitamin derivatives Chemical class 0.000 description 4
- ONVABDHFQKWOSV-UHFFFAOYSA-N 16-Phyllocladene Natural products C1CC(C2)C(=C)CC32CCC2C(C)(C)CCCC2(C)C31 ONVABDHFQKWOSV-UHFFFAOYSA-N 0.000 description 3
- JUJWROOIHBZHMG-QYKNYGDISA-N 2-deuteriopyridine Chemical compound [2H]C1=CC=CC=N1 JUJWROOIHBZHMG-QYKNYGDISA-N 0.000 description 3
- OINNEUNVOZHBOX-QIRCYJPOSA-K 2-trans,6-trans,10-trans-geranylgeranyl diphosphate(3-) Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\COP([O-])(=O)OP([O-])([O-])=O OINNEUNVOZHBOX-QIRCYJPOSA-K 0.000 description 3
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 3
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 3
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 3
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 3
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- OINNEUNVOZHBOX-XBQSVVNOSA-N Geranylgeranyl diphosphate Natural products [P@](=O)(OP(=O)(O)O)(OC/C=C(\CC/C=C(\CC/C=C(\CC/C=C(\C)/C)/C)/C)/C)O OINNEUNVOZHBOX-XBQSVVNOSA-N 0.000 description 3
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 3
- 239000007836 KH2PO4 Substances 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- SHZGCJCMOBCMKK-JFNONXLTSA-N L-rhamnopyranose Chemical group C[C@@H]1OC(O)[C@H](O)[C@H](O)[C@H]1O SHZGCJCMOBCMKK-JFNONXLTSA-N 0.000 description 3
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 3
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 3
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 3
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 3
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 3
- POMXSEDNUXYPGK-IHRRRGAJSA-N Leu-Met-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N POMXSEDNUXYPGK-IHRRRGAJSA-N 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 3
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 3
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 3
- KBTQZYASLSUFJR-KKUMJFAQSA-N Met-Phe-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KBTQZYASLSUFJR-KKUMJFAQSA-N 0.000 description 3
- 101100390536 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) erg-6 gene Proteins 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 3
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 3
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 3
- RLLCWNUIHGPAJY-RYBZXKSASA-N Rebaudioside E Natural products O=C(O[C@H]1[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O2)[C@@H](O)[C@@H](O)[C@H](CO)O1)[C@]1(C)[C@@H]2[C@@](C)([C@@H]3[C@@]4(CC(=C)[C@@](O[C@@H]5[C@@H](O[C@@H]6[C@@H](O)[C@H](O)[C@@H](O)[C@H](CO)O6)[C@H](O)[C@@H](O)[C@H](CO)O5)(C4)CC3)CC2)CCC1 RLLCWNUIHGPAJY-RYBZXKSASA-N 0.000 description 3
- 241001123227 Saccharomyces pastorianus Species 0.000 description 3
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 3
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 3
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 3
- YYXIWHBHTARPOG-HJXMPXNTSA-N Trp-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YYXIWHBHTARPOG-HJXMPXNTSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 3
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 3
- 235000010633 broth Nutrition 0.000 description 3
- 235000012970 cakes Nutrition 0.000 description 3
- 235000014171 carbonated beverage Nutrition 0.000 description 3
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 3
- 229940112822 chewing gum Drugs 0.000 description 3
- 238000005100 correlation spectroscopy Methods 0.000 description 3
- 239000002537 cosmetic Substances 0.000 description 3
- HEDRZPFGACZZDS-MICDWDOJSA-N deuterated chloroform Substances [2H]C(Cl)(Cl)Cl HEDRZPFGACZZDS-MICDWDOJSA-N 0.000 description 3
- 235000014113 dietary fatty acids Nutrition 0.000 description 3
- 235000011180 diphosphates Nutrition 0.000 description 3
- XPPKVPWEQAFLFU-UHFFFAOYSA-N diphosphoric acid Chemical compound OP(O)(=O)OP(O)(O)=O XPPKVPWEQAFLFU-UHFFFAOYSA-N 0.000 description 3
- 150000004141 diterpene derivatives Chemical class 0.000 description 3
- ONVABDHFQKWOSV-YQXATGRUSA-N ent-Kaur-16-ene Natural products C1C[C@@H](C2)C(=C)C[C@@]32CC[C@@H]2C(C)(C)CCC[C@@]2(C)[C@@H]31 ONVABDHFQKWOSV-YQXATGRUSA-N 0.000 description 3
- NIKHGUQULKYIGE-OTCXFQBHSA-N ent-kaur-16-en-19-oic acid Chemical compound C([C@@H]1C[C@]2(CC1=C)CC1)C[C@H]2[C@@]2(C)[C@H]1[C@](C)(C(O)=O)CCC2 NIKHGUQULKYIGE-OTCXFQBHSA-N 0.000 description 3
- ONVABDHFQKWOSV-HPUSYDDDSA-N ent-kaur-16-ene Chemical compound C1C[C@H](C2)C(=C)C[C@@]32CC[C@@H]2C(C)(C)CCC[C@@]2(C)[C@@H]31 ONVABDHFQKWOSV-HPUSYDDDSA-N 0.000 description 3
- UIXMIBNGPQGJJJ-UHFFFAOYSA-N ent-kaurene Natural products CC1CC23CCC4C(CCCC4(C)C)C2CCC1C3 UIXMIBNGPQGJJJ-UHFFFAOYSA-N 0.000 description 3
- 108010067758 ent-kaurene oxidase Proteins 0.000 description 3
- 101150116391 erg9 gene Proteins 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 229930195729 fatty acid Natural products 0.000 description 3
- 239000000194 fatty acid Substances 0.000 description 3
- 150000004665 fatty acids Chemical class 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 238000003197 gene knockdown Methods 0.000 description 3
- IXORZMNAPKEEDV-UHFFFAOYSA-N gibberellic acid GA3 Natural products OC(=O)C1C2(C3)CC(=C)C3(O)CCC2C2(C=CC3O)C1C3(C)C(=O)O2 IXORZMNAPKEEDV-UHFFFAOYSA-N 0.000 description 3
- IXORZMNAPKEEDV-OBDJNFEBSA-N gibberellin A3 Chemical compound C([C@@]1(O)C(=C)C[C@@]2(C1)[C@H]1C(O)=O)C[C@H]2[C@]2(C=C[C@@H]3O)[C@H]1[C@]3(C)C(=O)O2 IXORZMNAPKEEDV-OBDJNFEBSA-N 0.000 description 3
- 229930182470 glycoside Natural products 0.000 description 3
- 150000002338 glycosides Chemical class 0.000 description 3
- WHWDWIHXSPCOKZ-UHFFFAOYSA-N hexahydrofarnesyl acetone Natural products CC(C)CCCC(C)CCCC(C)CCCC(C)=O WHWDWIHXSPCOKZ-UHFFFAOYSA-N 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 239000004615 ingredient Substances 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 239000008101 lactose Substances 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 239000000178 monomer Substances 0.000 description 3
- 235000019796 monopotassium phosphate Nutrition 0.000 description 3
- 235000019533 nutritive sweetener Nutrition 0.000 description 3
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 3
- 239000002994 raw material Substances 0.000 description 3
- RLLCWNUIHGPAJY-SFUUMPFESA-N rebaudioside E Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RLLCWNUIHGPAJY-SFUUMPFESA-N 0.000 description 3
- 235000013580 sausages Nutrition 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 235000019640 taste Nutrition 0.000 description 3
- 108010027345 wheylin-1 peptide Proteins 0.000 description 3
- 235000008924 yoghurt drink Nutrition 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- SPFMQWBKVUQXJV-BTVCFUMJSA-N (2r,3s,4r,5r)-2,3,4,5,6-pentahydroxyhexanal;hydrate Chemical compound O.OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O SPFMQWBKVUQXJV-BTVCFUMJSA-N 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 2
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 2
- ZCPBEAHAVUJKAE-UHTWSYAYSA-N (2s)-2-[[(2s)-2-[[(2r)-2-[(2-aminoacetyl)amino]-3-phenylpropanoyl]amino]propanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](NC(=O)CN)CC1=CC=CC=C1 ZCPBEAHAVUJKAE-UHTWSYAYSA-N 0.000 description 2
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 2
- YEJQWBFDKKTPNO-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylbutanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)C)C(O)=O YEJQWBFDKKTPNO-UHFFFAOYSA-N 0.000 description 2
- 238000005084 2D-nuclear magnetic resonance Methods 0.000 description 2
- ALYNCZNDIQEVRV-UHFFFAOYSA-N 4-aminobenzoic acid Chemical compound NC1=CC=C(C(O)=O)C=C1 ALYNCZNDIQEVRV-UHFFFAOYSA-N 0.000 description 2
- 241000251468 Actinopterygii Species 0.000 description 2
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 2
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 2
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- WYPUMLRSQMKIJU-BPNCWPANSA-N Ala-Arg-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WYPUMLRSQMKIJU-BPNCWPANSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 2
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 2
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 2
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 2
- 108010076441 Ala-His-His Proteins 0.000 description 2
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- VGMNWQOPSFBBBG-XUXIUFHCSA-N Ala-Leu-Leu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VGMNWQOPSFBBBG-XUXIUFHCSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 2
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 2
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 2
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 2
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 2
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 2
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 2
- CKIBTNMWVMKAHB-RWGOJESNSA-N Ala-Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 CKIBTNMWVMKAHB-RWGOJESNSA-N 0.000 description 2
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 2
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 2
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 2
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 2
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 2
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 2
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 2
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 2
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 2
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 2
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 2
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 2
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 2
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 2
- OANWAFQRNQEDSY-DCAQKATOSA-N Arg-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N OANWAFQRNQEDSY-DCAQKATOSA-N 0.000 description 2
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 2
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 2
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 2
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 2
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 2
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 2
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 2
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 2
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 2
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- PSOPJDUQUVFSLS-GUBZILKMSA-N Arg-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PSOPJDUQUVFSLS-GUBZILKMSA-N 0.000 description 2
- GITAWLWBTMJPKH-AVGNSLFASA-N Arg-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GITAWLWBTMJPKH-AVGNSLFASA-N 0.000 description 2
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 2
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 description 2
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 2
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 2
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 2
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 2
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 2
- KSHJMDSNSKDJPU-QTKMDUPCSA-N Arg-Thr-His Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KSHJMDSNSKDJPU-QTKMDUPCSA-N 0.000 description 2
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 2
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 2
- FSPQNLYOFCXUCE-BPUTZDHNSA-N Arg-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FSPQNLYOFCXUCE-BPUTZDHNSA-N 0.000 description 2
- XOZYYXMHMIEJET-XIRDDKMYSA-N Arg-Trp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XOZYYXMHMIEJET-XIRDDKMYSA-N 0.000 description 2
- DBLXNGHFOJZXMS-UHFFFAOYSA-N Arg-Trp-Trp-Trp Chemical compound C1=CC=C2C(CC(NC(=O)C(CC=3C4=CC=CC=C4NC=3)NC(=O)C(CC=3C4=CC=CC=C4NC=3)NC(=O)C(CCCN=C(N)N)N)C(O)=O)=CNC2=C1 DBLXNGHFOJZXMS-UHFFFAOYSA-N 0.000 description 2
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 2
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 2
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 2
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 2
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 2
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 2
- RFLVTVBAESPKKR-ZLUOBGJFSA-N Asn-Cys-Cys Chemical compound N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O RFLVTVBAESPKKR-ZLUOBGJFSA-N 0.000 description 2
- FJIRXKVEDFLLOQ-SRVKXCTJSA-N Asn-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N FJIRXKVEDFLLOQ-SRVKXCTJSA-N 0.000 description 2
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 2
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 2
- MECFLTFREHAZLH-ACZMJKKPSA-N Asn-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N MECFLTFREHAZLH-ACZMJKKPSA-N 0.000 description 2
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 2
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 2
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 2
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 2
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 2
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 2
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 2
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 2
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 2
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 2
- HGGIYWURFPGLIU-FXQIFTODSA-N Asn-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(N)=O HGGIYWURFPGLIU-FXQIFTODSA-N 0.000 description 2
- UYRPHDGXHKBZHJ-CIUDSAMLSA-N Asn-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N UYRPHDGXHKBZHJ-CIUDSAMLSA-N 0.000 description 2
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 2
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 2
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 2
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 2
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 2
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 2
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 2
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 2
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 2
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 2
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 2
- MJKBOVWWADWLHV-ZLUOBGJFSA-N Asp-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)O MJKBOVWWADWLHV-ZLUOBGJFSA-N 0.000 description 2
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 2
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 2
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 2
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 2
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 2
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 2
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 2
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 2
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 2
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 2
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 2
- WDMNFNXKGSLIOB-GUBZILKMSA-N Asp-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N WDMNFNXKGSLIOB-GUBZILKMSA-N 0.000 description 2
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 2
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- UEFODXNXUAVPTC-VEVYYDQMSA-N Asp-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UEFODXNXUAVPTC-VEVYYDQMSA-N 0.000 description 2
- KCOPOPKJRHVGPE-AQZXSJQPSA-N Asp-Thr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O KCOPOPKJRHVGPE-AQZXSJQPSA-N 0.000 description 2
- YODBPLSWNJMZOJ-BPUTZDHNSA-N Asp-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N YODBPLSWNJMZOJ-BPUTZDHNSA-N 0.000 description 2
- KNOGLZBISUBTFW-QRTARXTBSA-N Asp-Trp-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O KNOGLZBISUBTFW-QRTARXTBSA-N 0.000 description 2
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 2
- ZQFZEBRNAMXXJV-KKUMJFAQSA-N Asp-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O ZQFZEBRNAMXXJV-KKUMJFAQSA-N 0.000 description 2
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 2
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 2
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 2
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- 239000002028 Biomass Substances 0.000 description 2
- 241000222178 Candida tropicalis Species 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 240000007154 Coffea arabica Species 0.000 description 2
- 241000235646 Cyberlindnera jadinii Species 0.000 description 2
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 2
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 2
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 2
- DCXGXDGGXVZVMY-GHCJXIJMSA-N Cys-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CS DCXGXDGGXVZVMY-GHCJXIJMSA-N 0.000 description 2
- SQJSYLDKQBZQTG-FXQIFTODSA-N Cys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N SQJSYLDKQBZQTG-FXQIFTODSA-N 0.000 description 2
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 2
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 2
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 2
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 2
- BIVLWXQGXJLGKG-BIIVOSGPSA-N Cys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)C(=O)O BIVLWXQGXJLGKG-BIIVOSGPSA-N 0.000 description 2
- XRJFPHCGGQOORT-JBDRJPRFSA-N Cys-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N XRJFPHCGGQOORT-JBDRJPRFSA-N 0.000 description 2
- ZIKWRNJXFIQECJ-CIUDSAMLSA-N Cys-Cys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZIKWRNJXFIQECJ-CIUDSAMLSA-N 0.000 description 2
- GHUVBPIYQYXXEF-SRVKXCTJSA-N Cys-Cys-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 GHUVBPIYQYXXEF-SRVKXCTJSA-N 0.000 description 2
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 2
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 2
- UDPSLLFHOLGXBY-FXQIFTODSA-N Cys-Glu-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDPSLLFHOLGXBY-FXQIFTODSA-N 0.000 description 2
- GCDLPNRHPWBKJJ-WDSKDSINSA-N Cys-Gly-Glu Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GCDLPNRHPWBKJJ-WDSKDSINSA-N 0.000 description 2
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 2
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 2
- WAJDEKCJRKGRPG-CIUDSAMLSA-N Cys-His-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N WAJDEKCJRKGRPG-CIUDSAMLSA-N 0.000 description 2
- ODDOYXKAHLKKQY-MMWGEVLESA-N Cys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N ODDOYXKAHLKKQY-MMWGEVLESA-N 0.000 description 2
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 2
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 2
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 2
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 2
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 2
- OETOANMAHTWESF-KKUMJFAQSA-N Cys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CS)N OETOANMAHTWESF-KKUMJFAQSA-N 0.000 description 2
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 2
- DQBRIEGWTLXALA-GQGQLFGLSA-N Cys-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N DQBRIEGWTLXALA-GQGQLFGLSA-N 0.000 description 2
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 2
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 2
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 2
- ZXGDAZLSOSYSBA-IHRRRGAJSA-N Cys-Val-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZXGDAZLSOSYSBA-IHRRRGAJSA-N 0.000 description 2
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 2
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 description 2
- YTMBNLHIDIKJIU-HCXYKTFWSA-N D-Arginyl-L-arginyl-D-glutaminyl-L-phenylalanine Chemical compound NC(=N)NCCC[C@@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](CCC(O)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YTMBNLHIDIKJIU-HCXYKTFWSA-N 0.000 description 2
- LKDRXBCSQODPBY-VRPWFDPXSA-N D-fructopyranose Chemical compound OCC1(O)OC[C@@H](O)[C@@H](O)[C@@H]1O LKDRXBCSQODPBY-VRPWFDPXSA-N 0.000 description 2
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 2
- ZAQJHHRNXZUBTE-NQXXGFSBSA-N D-ribulose Chemical compound OC[C@@H](O)[C@@H](O)C(=O)CO ZAQJHHRNXZUBTE-NQXXGFSBSA-N 0.000 description 2
- ZAQJHHRNXZUBTE-UHFFFAOYSA-N D-threo-2-Pentulose Natural products OCC(O)C(O)C(=O)CO ZAQJHHRNXZUBTE-UHFFFAOYSA-N 0.000 description 2
- 101710147220 Ent-copalyl diphosphate synthase, chloroplastic Proteins 0.000 description 2
- 108030000406 Ent-copalyl diphosphate synthases Proteins 0.000 description 2
- 239000001776 FEMA 4720 Substances 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 229930091371 Fructose Natural products 0.000 description 2
- 239000005715 Fructose Substances 0.000 description 2
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 2
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 2
- 241000221778 Fusarium fujikuroi Species 0.000 description 2
- IAJILQKETJEXLJ-UHFFFAOYSA-N Galacturonsaeure Natural products O=CC(O)C(O)C(O)C(O)C(O)=O IAJILQKETJEXLJ-UHFFFAOYSA-N 0.000 description 2
- 239000005980 Gibberellic acid Substances 0.000 description 2
- 229930191978 Gibberellin Natural products 0.000 description 2
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 2
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 2
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 2
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 2
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 2
- OFPWCBGRYAOLMU-AVGNSLFASA-N Gln-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OFPWCBGRYAOLMU-AVGNSLFASA-N 0.000 description 2
- VVWWRZZMPSPVQU-KBIXCLLPSA-N Gln-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VVWWRZZMPSPVQU-KBIXCLLPSA-N 0.000 description 2
- UVAOVENCIONMJP-GUBZILKMSA-N Gln-Cys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O UVAOVENCIONMJP-GUBZILKMSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 2
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 2
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 2
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 2
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 2
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 2
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 2
- JNEITCMDYWKPIW-GUBZILKMSA-N Gln-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JNEITCMDYWKPIW-GUBZILKMSA-N 0.000 description 2
- GLEGHWQNGPMKHO-DCAQKATOSA-N Gln-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GLEGHWQNGPMKHO-DCAQKATOSA-N 0.000 description 2
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 2
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 2
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 2
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 2
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 2
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 2
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 2
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 2
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 2
- GQTNWYFWSUFFRA-KKUMJFAQSA-N Gln-Met-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GQTNWYFWSUFFRA-KKUMJFAQSA-N 0.000 description 2
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 2
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 2
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 2
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 2
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 2
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 2
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 2
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 2
- VOUSELYGTNGEPB-NUMRIWBASA-N Gln-Thr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O VOUSELYGTNGEPB-NUMRIWBASA-N 0.000 description 2
- SYTFJIQPBRJSOK-NKIYYHGXSA-N Gln-Thr-His Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 SYTFJIQPBRJSOK-NKIYYHGXSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 2
- RBSKVTZUFMIWFU-XEGUGMAKSA-N Gln-Trp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O RBSKVTZUFMIWFU-XEGUGMAKSA-N 0.000 description 2
- NSEKYCAADBNQFE-XIRDDKMYSA-N Gln-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 NSEKYCAADBNQFE-XIRDDKMYSA-N 0.000 description 2
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 2
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 2
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 2
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 2
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 2
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 2
- SYDJILXOZNEEDK-XIRDDKMYSA-N Glu-Arg-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SYDJILXOZNEEDK-XIRDDKMYSA-N 0.000 description 2
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 2
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 2
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 2
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 2
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 2
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- LSTFYPOGBGFIPP-FXQIFTODSA-N Glu-Cys-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O LSTFYPOGBGFIPP-FXQIFTODSA-N 0.000 description 2
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 2
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 2
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- QYPKJXSMLMREKF-BPUTZDHNSA-N Glu-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N QYPKJXSMLMREKF-BPUTZDHNSA-N 0.000 description 2
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 2
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 2
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 2
- YDJOULGWHQRPEV-SRVKXCTJSA-N Glu-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N YDJOULGWHQRPEV-SRVKXCTJSA-N 0.000 description 2
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 2
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 2
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 2
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 2
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 2
- HOIPREWORBVRLD-XIRDDKMYSA-N Glu-Met-Trp Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O HOIPREWORBVRLD-XIRDDKMYSA-N 0.000 description 2
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 2
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 2
- HLYCMRDRWGSTPZ-CIUDSAMLSA-N Glu-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O HLYCMRDRWGSTPZ-CIUDSAMLSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 2
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 2
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- JVZLZVJTIXVIHK-SXNHZJKMSA-N Glu-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N JVZLZVJTIXVIHK-SXNHZJKMSA-N 0.000 description 2
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 2
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 2
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 2
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 2
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 2
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 2
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 2
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 2
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 2
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 2
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 2
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 2
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 2
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 2
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 2
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 2
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 2
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 2
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 2
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 2
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 2
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 2
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 2
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 2
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 2
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 2
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- OLIFSFOFKGKIRH-WUJLRWPWSA-N Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CN OLIFSFOFKGKIRH-WUJLRWPWSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 2
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 235000010469 Glycine max Nutrition 0.000 description 2
- 244000068988 Glycine max Species 0.000 description 2
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 2
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 2
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 2
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 2
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 2
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 2
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 2
- QZAFGJNKLMNDEM-DCAQKATOSA-N His-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 QZAFGJNKLMNDEM-DCAQKATOSA-N 0.000 description 2
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 2
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 2
- MWXBCJKQRQFVOO-DCAQKATOSA-N His-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N MWXBCJKQRQFVOO-DCAQKATOSA-N 0.000 description 2
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 2
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 2
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 2
- DVHGLDYMGWTYKW-GUBZILKMSA-N His-Gln-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DVHGLDYMGWTYKW-GUBZILKMSA-N 0.000 description 2
- FMRKUXFLLPKVPG-JYJNAYRXSA-N His-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O FMRKUXFLLPKVPG-JYJNAYRXSA-N 0.000 description 2
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 2
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 2
- WGHJXSONOOTTCZ-JYJNAYRXSA-N His-Glu-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WGHJXSONOOTTCZ-JYJNAYRXSA-N 0.000 description 2
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 2
- CHZRWFUGWRTUOD-IUCAKERBSA-N His-Gly-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N CHZRWFUGWRTUOD-IUCAKERBSA-N 0.000 description 2
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 2
- RAVLQPXCMRCLKT-KBPBESRZSA-N His-Gly-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RAVLQPXCMRCLKT-KBPBESRZSA-N 0.000 description 2
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 2
- IDQNVIWPPWAFSY-AVGNSLFASA-N His-His-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O IDQNVIWPPWAFSY-AVGNSLFASA-N 0.000 description 2
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 2
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 2
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 2
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 2
- WZBLRQQCDYYRTD-SIXJUCDHSA-N His-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N WZBLRQQCDYYRTD-SIXJUCDHSA-N 0.000 description 2
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 2
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 2
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 2
- QEYUCKCWTMIERU-SRVKXCTJSA-N His-Lys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QEYUCKCWTMIERU-SRVKXCTJSA-N 0.000 description 2
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 2
- WSEITRHJRVDTRX-QTKMDUPCSA-N His-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N)O WSEITRHJRVDTRX-QTKMDUPCSA-N 0.000 description 2
- AJTBOTWDSRSUDV-ULQDDVLXSA-N His-Phe-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O AJTBOTWDSRSUDV-ULQDDVLXSA-N 0.000 description 2
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 2
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 2
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 2
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 2
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 2
- FONIDUOGWNWEAX-XIRDDKMYSA-N His-Trp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O FONIDUOGWNWEAX-XIRDDKMYSA-N 0.000 description 2
- LNVILFYCPVOHPV-IHPCNDPISA-N His-Trp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O LNVILFYCPVOHPV-IHPCNDPISA-N 0.000 description 2
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 2
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 2
- 108700039609 IRW peptide Proteins 0.000 description 2
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 2
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 2
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 2
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 2
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 2
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 2
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 2
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 2
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 2
- ZIPOVLBRVPXWJQ-SPOWBLRKSA-N Ile-Cys-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N ZIPOVLBRVPXWJQ-SPOWBLRKSA-N 0.000 description 2
- VCYVLFAWCJRXFT-HJPIBITLSA-N Ile-Cys-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N VCYVLFAWCJRXFT-HJPIBITLSA-N 0.000 description 2
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 2
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 2
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 2
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 2
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 2
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 2
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 2
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 2
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 2
- CCYGNFBYUNHFSC-MGHWNKPDSA-N Ile-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CCYGNFBYUNHFSC-MGHWNKPDSA-N 0.000 description 2
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 2
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 2
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 2
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 2
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 2
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 2
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 2
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 2
- RENBRDSDKPSRIH-HJWJTTGWSA-N Ile-Phe-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O RENBRDSDKPSRIH-HJWJTTGWSA-N 0.000 description 2
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 2
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 2
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 2
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 2
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 2
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 2
- RWHRUZORDWZESH-ZQINRCPSSA-N Ile-Trp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RWHRUZORDWZESH-ZQINRCPSSA-N 0.000 description 2
- MITYXXNZSZLHGG-OBAATPRFSA-N Ile-Trp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N MITYXXNZSZLHGG-OBAATPRFSA-N 0.000 description 2
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 2
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 2
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 2
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 2
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 2
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 2
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 2
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 2
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 2
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- 101710092857 Integrator complex subunit 1 Proteins 0.000 description 2
- 102100024061 Integrator complex subunit 1 Human genes 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 description 2
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 2
- LJKJVTCIRDCITR-SRVKXCTJSA-N Leu-Cys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LJKJVTCIRDCITR-SRVKXCTJSA-N 0.000 description 2
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 2
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 2
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 2
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 2
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 2
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 2
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 2
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 2
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 2
- FIICHHJDINDXKG-IHPCNDPISA-N Leu-Lys-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FIICHHJDINDXKG-IHPCNDPISA-N 0.000 description 2
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 2
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 2
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 2
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 2
- WGAZVKFCPHXZLO-SZMVWBNQSA-N Leu-Trp-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N WGAZVKFCPHXZLO-SZMVWBNQSA-N 0.000 description 2
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- 108090000856 Lyases Proteins 0.000 description 2
- 102000004317 Lyases Human genes 0.000 description 2
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 2
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 2
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 2
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 2
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 2
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 2
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 2
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 2
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 2
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 2
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 2
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 2
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 2
- GOVDTWNJCBRRBJ-DCAQKATOSA-N Lys-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GOVDTWNJCBRRBJ-DCAQKATOSA-N 0.000 description 2
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 2
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 2
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 2
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 2
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 2
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 2
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 2
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 2
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- WAAZECNCPVGPIV-RHYQMDGZSA-N Lys-Thr-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O WAAZECNCPVGPIV-RHYQMDGZSA-N 0.000 description 2
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 2
- ZJSXCIMWLPSTMG-HSCHXYMDSA-N Lys-Trp-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJSXCIMWLPSTMG-HSCHXYMDSA-N 0.000 description 2
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- XBAJINCXDBTJRH-WDSOQIARSA-N Lys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N XBAJINCXDBTJRH-WDSOQIARSA-N 0.000 description 2
- 239000005913 Maltodextrin Substances 0.000 description 2
- 229920002774 Maltodextrin Polymers 0.000 description 2
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 2
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 2
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 2
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 2
- SDTSLIMYROCDNS-FXQIFTODSA-N Met-Cys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O SDTSLIMYROCDNS-FXQIFTODSA-N 0.000 description 2
- IZLCDZDNZFEDHB-DCAQKATOSA-N Met-Cys-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N IZLCDZDNZFEDHB-DCAQKATOSA-N 0.000 description 2
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 2
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 2
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 2
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 2
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 2
- DYTWOWJWJCBFLE-IHRRRGAJSA-N Met-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CNC=N1 DYTWOWJWJCBFLE-IHRRRGAJSA-N 0.000 description 2
- SCKPOOMCTFEVTN-QTKMDUPCSA-N Met-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCSC)N)O SCKPOOMCTFEVTN-QTKMDUPCSA-N 0.000 description 2
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 2
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 description 2
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 2
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 2
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 2
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 2
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 2
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 2
- VBGGTAPDGFQMKF-AVGNSLFASA-N Met-Lys-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O VBGGTAPDGFQMKF-AVGNSLFASA-N 0.000 description 2
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 2
- OXIWIYOJVNOKOV-SRVKXCTJSA-N Met-Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCNC(N)=N OXIWIYOJVNOKOV-SRVKXCTJSA-N 0.000 description 2
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 2
- QTMIXEQWGNIPBL-JYJNAYRXSA-N Met-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N QTMIXEQWGNIPBL-JYJNAYRXSA-N 0.000 description 2
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 2
- WYDFQSJOARJAMM-GUBZILKMSA-N Met-Pro-Asp Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WYDFQSJOARJAMM-GUBZILKMSA-N 0.000 description 2
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 2
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 2
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 2
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 2
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 2
- LIIXIZKVWNYQHB-STECZYCISA-N Met-Tyr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LIIXIZKVWNYQHB-STECZYCISA-N 0.000 description 2
- ANCPZNHGZUCSSC-ULQDDVLXSA-N Met-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 ANCPZNHGZUCSSC-ULQDDVLXSA-N 0.000 description 2
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 2
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 2
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 2
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 2
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 229920000881 Modified starch Polymers 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- PVNIIMVLHYAWGP-UHFFFAOYSA-N Niacin Chemical compound OC(=O)C1=CC=CN=C1 PVNIIMVLHYAWGP-UHFFFAOYSA-N 0.000 description 2
- 101150053185 P450 gene Proteins 0.000 description 2
- 241001557897 Phaeosphaeria sp. Species 0.000 description 2
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 2
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 2
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 description 2
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 2
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 2
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 2
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 2
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 2
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 2
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 2
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 2
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 2
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 2
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 2
- ZZVUXQCQPXSUFH-JBACZVJFSA-N Phe-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ZZVUXQCQPXSUFH-JBACZVJFSA-N 0.000 description 2
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 2
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 2
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 2
- CJAHQEZWDZNSJO-KKUMJFAQSA-N Phe-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CJAHQEZWDZNSJO-KKUMJFAQSA-N 0.000 description 2
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 2
- ACJULKNZOCRWEI-ULQDDVLXSA-N Phe-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O ACJULKNZOCRWEI-ULQDDVLXSA-N 0.000 description 2
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 2
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 2
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 2
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 2
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 2
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 2
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 2
- BSJCSHIAMSGQGN-BVSLBCMMSA-N Phe-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O BSJCSHIAMSGQGN-BVSLBCMMSA-N 0.000 description 2
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 2
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 2
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 2
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 2
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 2
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 2
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 2
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 2
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 2
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 2
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 2
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 241000235645 Pichia kudriavzevii Species 0.000 description 2
- 108010059820 Polygalacturonase Proteins 0.000 description 2
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 2
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 2
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 2
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 2
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 2
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 2
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 2
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 2
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 2
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 2
- LQZZPNDMYNZPFT-KKUMJFAQSA-N Pro-Gln-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LQZZPNDMYNZPFT-KKUMJFAQSA-N 0.000 description 2
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 2
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 2
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 2
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 2
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 2
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 2
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 2
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 2
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 2
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 2
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 2
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 2
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 2
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 2
- INCNPLPRPOYTJI-JBDRJPRFSA-N Ser-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N INCNPLPRPOYTJI-JBDRJPRFSA-N 0.000 description 2
- WKLJLEXEENIYQE-SRVKXCTJSA-N Ser-Cys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WKLJLEXEENIYQE-SRVKXCTJSA-N 0.000 description 2
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 2
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 2
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 2
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 2
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 2
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 2
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 2
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 2
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 2
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- FOOZNBRFRWGBNU-DCAQKATOSA-N Ser-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N FOOZNBRFRWGBNU-DCAQKATOSA-N 0.000 description 2
- HEYZPTCCEIWHRO-IHRRRGAJSA-N Ser-Met-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HEYZPTCCEIWHRO-IHRRRGAJSA-N 0.000 description 2
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 2
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 2
- WOJYIMBIKTWKJO-KKUMJFAQSA-N Ser-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N WOJYIMBIKTWKJO-KKUMJFAQSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 2
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 2
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 2
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 2
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 2
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 2
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 2
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 2
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 2
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 241000227726 Sphaceloma manihoticola Species 0.000 description 2
- 235000006092 Stevia rebaudiana Nutrition 0.000 description 2
- 235000006468 Thea sinensis Nutrition 0.000 description 2
- 244000299461 Theobroma cacao Species 0.000 description 2
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 2
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 2
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 2
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 2
- PAOYNIKMYOGBMR-PBCZWWQYSA-N Thr-Asn-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PAOYNIKMYOGBMR-PBCZWWQYSA-N 0.000 description 2
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 2
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 2
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 2
- RJBFAHKSFNNHAI-XKBZYTNZSA-N Thr-Gln-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O RJBFAHKSFNNHAI-XKBZYTNZSA-N 0.000 description 2
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 2
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 2
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 2
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 2
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 2
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 2
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 2
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 2
- OWQKBXKXZFRRQL-XGEHTFHBSA-N Thr-Met-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N)O OWQKBXKXZFRRQL-XGEHTFHBSA-N 0.000 description 2
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 2
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 2
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 2
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 2
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- GQPQJNMVELPZNQ-GBALPHGKSA-N Thr-Ser-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GQPQJNMVELPZNQ-GBALPHGKSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 2
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 2
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 2
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 2
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 2
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 2
- FOAJSVIXYCLTSC-PJODQICGSA-N Trp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FOAJSVIXYCLTSC-PJODQICGSA-N 0.000 description 2
- SCQBNMKLZVCXNX-ZFWWWQNUSA-N Trp-Arg-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N SCQBNMKLZVCXNX-ZFWWWQNUSA-N 0.000 description 2
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 2
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 2
- IUFQHOCOKQIOMC-XIRDDKMYSA-N Trp-Asn-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N IUFQHOCOKQIOMC-XIRDDKMYSA-N 0.000 description 2
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 2
- IQGJAHMZWBTRIF-UBHSHLNASA-N Trp-Asp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IQGJAHMZWBTRIF-UBHSHLNASA-N 0.000 description 2
- FKAPNDWDLDWZNF-QEJZJMRPSA-N Trp-Asp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FKAPNDWDLDWZNF-QEJZJMRPSA-N 0.000 description 2
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 2
- PKUJMYZNJMRHEZ-XIRDDKMYSA-N Trp-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKUJMYZNJMRHEZ-XIRDDKMYSA-N 0.000 description 2
- HRKOLWXWQSDMSK-XIRDDKMYSA-N Trp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HRKOLWXWQSDMSK-XIRDDKMYSA-N 0.000 description 2
- KDWZQYUTMJSYRJ-BHYGNILZSA-N Trp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O KDWZQYUTMJSYRJ-BHYGNILZSA-N 0.000 description 2
- OGXQLUCMJZSJPW-LYSGOOTNSA-N Trp-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O OGXQLUCMJZSJPW-LYSGOOTNSA-N 0.000 description 2
- KIMOCKLJBXHFIN-YLVFBTJISA-N Trp-Ile-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O)=CNC2=C1 KIMOCKLJBXHFIN-YLVFBTJISA-N 0.000 description 2
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 2
- YLGQHMHKAASRGJ-WDSOQIARSA-N Trp-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YLGQHMHKAASRGJ-WDSOQIARSA-N 0.000 description 2
- WLQRIHCMPFHGKP-PMVMPFDFSA-N Trp-Leu-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=CC=C1 WLQRIHCMPFHGKP-PMVMPFDFSA-N 0.000 description 2
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 2
- VOCHZIJXPRBVSI-XIRDDKMYSA-N Trp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N VOCHZIJXPRBVSI-XIRDDKMYSA-N 0.000 description 2
- RQLNEFOBQAVGSY-WDSOQIARSA-N Trp-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQLNEFOBQAVGSY-WDSOQIARSA-N 0.000 description 2
- VCGOTJGGBXEBFO-FDARSICLSA-N Trp-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VCGOTJGGBXEBFO-FDARSICLSA-N 0.000 description 2
- IQIRAJGHFRVFEL-UBHSHLNASA-N Trp-Ser-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N IQIRAJGHFRVFEL-UBHSHLNASA-N 0.000 description 2
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 2
- HHPSUFUXXBOFQY-AQZXSJQPSA-N Trp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O HHPSUFUXXBOFQY-AQZXSJQPSA-N 0.000 description 2
- VMXLNDRJXVAJFT-JYBASQMISA-N Trp-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O VMXLNDRJXVAJFT-JYBASQMISA-N 0.000 description 2
- RQKMZXSRILVOQZ-GMVOTWDCSA-N Trp-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N RQKMZXSRILVOQZ-GMVOTWDCSA-N 0.000 description 2
- XKTWZYNTLXITCY-QRTARXTBSA-N Trp-Val-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 XKTWZYNTLXITCY-QRTARXTBSA-N 0.000 description 2
- UUZYQOUJTORBQO-ZVZYQTTQSA-N Trp-Val-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UUZYQOUJTORBQO-ZVZYQTTQSA-N 0.000 description 2
- BABINGWMZBWXIX-BPUTZDHNSA-N Trp-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BABINGWMZBWXIX-BPUTZDHNSA-N 0.000 description 2
- SDNVRAKIJVKAGS-LKTVYLICSA-N Tyr-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N SDNVRAKIJVKAGS-LKTVYLICSA-N 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 2
- FBVGQXJIXFZKSQ-GMVOTWDCSA-N Tyr-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FBVGQXJIXFZKSQ-GMVOTWDCSA-N 0.000 description 2
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 2
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 2
- MOCXXGZHHSPNEJ-AVGNSLFASA-N Tyr-Cys-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O MOCXXGZHHSPNEJ-AVGNSLFASA-N 0.000 description 2
- BODHJXJNRVRKFA-BZSNNMDCSA-N Tyr-Cys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BODHJXJNRVRKFA-BZSNNMDCSA-N 0.000 description 2
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 2
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 2
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 2
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 2
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 2
- FIRUOPRJKCBLST-KKUMJFAQSA-N Tyr-His-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O FIRUOPRJKCBLST-KKUMJFAQSA-N 0.000 description 2
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 2
- STTVVMWQKDOKAM-YESZJQIVSA-N Tyr-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O STTVVMWQKDOKAM-YESZJQIVSA-N 0.000 description 2
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 2
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 2
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 2
- AVFGBGGRZOKSFS-KJEVXHAQSA-N Tyr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O AVFGBGGRZOKSFS-KJEVXHAQSA-N 0.000 description 2
- FDKDGFGTHGJKNV-FHWLQOOXSA-N Tyr-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FDKDGFGTHGJKNV-FHWLQOOXSA-N 0.000 description 2
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 2
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 2
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 2
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 2
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 2
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 2
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 description 2
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 2
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 2
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 2
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 2
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 2
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 description 2
- DBMMKEHYWIZTPN-JYJNAYRXSA-N Val-Cys-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N DBMMKEHYWIZTPN-JYJNAYRXSA-N 0.000 description 2
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 2
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 2
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 2
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 2
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 2
- RHYOAUJXSRWVJT-GVXVVHGQSA-N Val-His-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RHYOAUJXSRWVJT-GVXVVHGQSA-N 0.000 description 2
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 2
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 2
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 2
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 2
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- NRAUADCLPJTGSF-ZPGVOIKOSA-N [(2r,3s,4r,5r,6r)-6-[[(3as,7r,7as)-7-hydroxy-4-oxo-1,3a,5,6,7,7a-hexahydroimidazo[4,5-c]pyridin-2-yl]amino]-5-[[(3s)-3,6-diaminohexanoyl]amino]-4-hydroxy-2-(hydroxymethyl)oxan-3-yl] carbamate Chemical compound NCCC[C@H](N)CC(=O)N[C@@H]1[C@@H](O)[C@H](OC(N)=O)[C@@H](CO)O[C@H]1\N=C/1N[C@H](C(=O)NC[C@H]2O)[C@@H]2N\1 NRAUADCLPJTGSF-ZPGVOIKOSA-N 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- 239000003963 antioxidant agent Substances 0.000 description 2
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010007483 arginyl-leucyl-tyrosyl-glutamic acid Proteins 0.000 description 2
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 235000013339 cereals Nutrition 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 238000013375 chromatographic separation Methods 0.000 description 2
- 235000016213 coffee Nutrition 0.000 description 2
- 235000013353 coffee beverage Nutrition 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 239000006071 cream Substances 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 235000013365 dairy product Nutrition 0.000 description 2
- 229960000673 dextrose monohydrate Drugs 0.000 description 2
- 235000013325 dietary fiber Nutrition 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000000469 ethanolic extract Substances 0.000 description 2
- 230000008020 evaporation Effects 0.000 description 2
- 108010093305 exopolygalacturonase Proteins 0.000 description 2
- 229930003935 flavonoid Natural products 0.000 description 2
- 150000002215 flavonoids Chemical class 0.000 description 2
- 235000017173 flavonoids Nutrition 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 235000013355 food flavoring agent Nutrition 0.000 description 2
- 239000004459 forage Substances 0.000 description 2
- 235000011389 fruit/vegetable juice Nutrition 0.000 description 2
- 239000007789 gas Substances 0.000 description 2
- 238000012239 gene modification Methods 0.000 description 2
- 230000005017 genetic modification Effects 0.000 description 2
- 235000013617 genetically modified food Nutrition 0.000 description 2
- 239000003448 gibberellin Substances 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 2
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 108010002430 hemicellulase Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 235000021539 instant coffee Nutrition 0.000 description 2
- 238000004898 kneading Methods 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010056787 lysyl-arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 2
- 235000019341 magnesium sulphate Nutrition 0.000 description 2
- 229940035034 maltodextrin Drugs 0.000 description 2
- 235000013372 meat Nutrition 0.000 description 2
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 235000020124 milk-based beverage Nutrition 0.000 description 2
- 235000019426 modified starch Nutrition 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 238000011330 nucleic acid test Methods 0.000 description 2
- 239000002417 nutraceutical Substances 0.000 description 2
- 235000021436 nutraceutical agent Nutrition 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 238000005554 pickling Methods 0.000 description 2
- 229920005862 polyol Polymers 0.000 description 2
- 150000003077 polyols Chemical class 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- QRGRAFPOLJOGRV-UHFFFAOYSA-N rebaudioside F Natural products CC12CCCC(C)(C1CCC34CC(=C)C(CCC23)(C4)OC5OC(CO)C(O)C(OC6OCC(O)C(O)C6O)C5OC7OC(CO)C(O)C(O)C7O)C(=O)OC8OC(CO)C(O)C(O)C8O QRGRAFPOLJOGRV-UHFFFAOYSA-N 0.000 description 2
- HYLAUKAHEAUVFE-AVBZULRRSA-N rebaudioside f Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)CO1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HYLAUKAHEAUVFE-AVBZULRRSA-N 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000007363 ring formation reaction Methods 0.000 description 2
- 108010029895 rubimetide Proteins 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 235000015067 sauces Nutrition 0.000 description 2
- 235000014102 seafood Nutrition 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 235000013555 soy sauce Nutrition 0.000 description 2
- 238000005507 spraying Methods 0.000 description 2
- 150000005846 sugar alcohols Chemical class 0.000 description 2
- 239000006188 syrup Substances 0.000 description 2
- 235000020357 syrup Nutrition 0.000 description 2
- 230000009897 systematic effect Effects 0.000 description 2
- 235000013616 tea Nutrition 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 239000011573 trace mineral Substances 0.000 description 2
- 235000013619 trace mineral Nutrition 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 108010029384 tryptophyl-histidine Proteins 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- DRSKVOAJKLUMCL-MMUIXFKXSA-N u2n4xkx7hp Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(O)=O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O DRSKVOAJKLUMCL-MMUIXFKXSA-N 0.000 description 2
- 238000004704 ultra performance liquid chromatography Methods 0.000 description 2
- 239000000052 vinegar Substances 0.000 description 2
- 235000021419 vinegar Nutrition 0.000 description 2
- 108010000998 wheylin-2 peptide Proteins 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- 235000013618 yogurt Nutrition 0.000 description 2
- RMLYXMMBIZLGAQ-UHFFFAOYSA-N (-)-monatin Natural products C1=CC=C2C(CC(O)(CC(N)C(O)=O)C(O)=O)=CNC2=C1 RMLYXMMBIZLGAQ-UHFFFAOYSA-N 0.000 description 1
- RMLYXMMBIZLGAQ-HZMBPMFUSA-N (2s,4s)-4-amino-2-hydroxy-2-(1h-indol-3-ylmethyl)pentanedioic acid Chemical compound C1=CC=C2C(C[C@](O)(C[C@H](N)C(O)=O)C(O)=O)=CNC2=C1 RMLYXMMBIZLGAQ-HZMBPMFUSA-N 0.000 description 1
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 description 1
- PAWQVTBBRAZDMG-UHFFFAOYSA-N 2-(3-bromo-2-fluorophenyl)acetic acid Chemical compound OC(=O)CC1=CC=CC(Br)=C1F PAWQVTBBRAZDMG-UHFFFAOYSA-N 0.000 description 1
- 238000012584 2D NMR experiment Methods 0.000 description 1
- MIDXCONKKJTLDX-UHFFFAOYSA-N 3,5-dimethylcyclopentane-1,2-dione Chemical compound CC1CC(C)C(=O)C1=O MIDXCONKKJTLDX-UHFFFAOYSA-N 0.000 description 1
- YBJHBAHKTGYVGT-UHFFFAOYSA-N 5-(2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl)pentanoic acid Chemical compound N1C(=O)NC2C(CCCCC(=O)O)SCC21 YBJHBAHKTGYVGT-UHFFFAOYSA-N 0.000 description 1
- 101150078509 ADH2 gene Proteins 0.000 description 1
- 101150026777 ADH5 gene Proteins 0.000 description 1
- 101150021974 Adh1 gene Proteins 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 239000004254 Ammonium phosphate Substances 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- 241000512259 Ascophyllum nodosum Species 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- QHHVSXGWLYEAGX-GUBZILKMSA-N Asp-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QHHVSXGWLYEAGX-GUBZILKMSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- 108010011485 Aspartame Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000722885 Brettanomyces Species 0.000 description 1
- 108090000023 Carbon-oxygen lyases Proteins 0.000 description 1
- 102000003732 Carbon-oxygen lyases Human genes 0.000 description 1
- 235000016795 Cola Nutrition 0.000 description 1
- 244000228088 Cola acuminata Species 0.000 description 1
- 235000011824 Cola pachycarpa Nutrition 0.000 description 1
- UDIPTWFVPPPURJ-UHFFFAOYSA-M Cyclamate Chemical compound [Na+].[O-]S(=O)(=O)NC1CCCCC1 UDIPTWFVPPPURJ-UHFFFAOYSA-M 0.000 description 1
- KKZHXOOZHFABQQ-UWJYBYFXSA-N Cys-Ala-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKZHXOOZHFABQQ-UWJYBYFXSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- 101100480530 Danio rerio tal1 gene Proteins 0.000 description 1
- 101100269269 Drosophila mayaguana Adh gene Proteins 0.000 description 1
- CANAPGLEBDTCAF-NTIPNFSCSA-N Dulcoside A Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@H]1[C@H](O[C@]23C(C[C@]4(C2)[C@H]([C@@]2(C)[C@@H]([C@](CCC2)(C)C(=O)O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)CC4)CC3)=C)O[C@H](CO)[C@@H](O)[C@@H]1O CANAPGLEBDTCAF-NTIPNFSCSA-N 0.000 description 1
- CANAPGLEBDTCAF-QHSHOEHESA-N Dulcoside A Natural products C[C@@H]1O[C@H](O[C@@H]2[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]2O[C@]34CC[C@H]5[C@]6(C)CCC[C@](C)([C@H]6CC[C@@]5(CC3=C)C4)C(=O)O[C@@H]7O[C@H](CO)[C@@H](O)[C@H](O)[C@H]7O)[C@H](O)[C@H](O)[C@H]1O CANAPGLEBDTCAF-QHSHOEHESA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 101150015836 ENO1 gene Proteins 0.000 description 1
- 101710114727 Ent-kaur-16-ene synthase, chloroplastic Proteins 0.000 description 1
- 239000004386 Erythritol Substances 0.000 description 1
- UNXHWFMMPAWVPI-UHFFFAOYSA-N Erythritol Natural products OCC(O)C(O)CO UNXHWFMMPAWVPI-UHFFFAOYSA-N 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000051366 Glycosyltransferases Human genes 0.000 description 1
- 108700023372 Glycosyltransferases Proteins 0.000 description 1
- SQUHHTBVTRBESD-UHFFFAOYSA-N Hexa-Ac-myo-Inositol Natural products CC(=O)OC1C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C1OC(C)=O SQUHHTBVTRBESD-UHFFFAOYSA-N 0.000 description 1
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 1
- KQJBFMJFUXAYPK-AVGNSLFASA-N His-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KQJBFMJFUXAYPK-AVGNSLFASA-N 0.000 description 1
- YERBCFWVWITTEJ-NAZCDGGXSA-N His-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N)O YERBCFWVWITTEJ-NAZCDGGXSA-N 0.000 description 1
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 1
- 101001112118 Homo sapiens NADPH-cytochrome P450 reductase Proteins 0.000 description 1
- 101000847024 Homo sapiens Tetratricopeptide repeat protein 1 Proteins 0.000 description 1
- 206010020772 Hypertension Diseases 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- 108090000453 Intramolecular lyases Proteins 0.000 description 1
- 102000034335 Intramolecular lyases Human genes 0.000 description 1
- 108090000769 Isomerases Proteins 0.000 description 1
- 102000004195 Isomerases Human genes 0.000 description 1
- 241000235644 Issatchenkia Species 0.000 description 1
- 101710197581 Ketoisovalerate oxidoreductase subunit VorC Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- 101710147185 Light-dependent protochlorophyllide reductase Proteins 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- NLHSFJQUHGCWSD-PYJNHQTQSA-N Met-Ile-His Chemical compound N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O NLHSFJQUHGCWSD-PYJNHQTQSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- 108010061951 Methemoglobin Proteins 0.000 description 1
- 101100054943 Mus musculus Adh4 gene Proteins 0.000 description 1
- 101100480538 Mus musculus Tal1 gene Proteins 0.000 description 1
- 101100313266 Mus musculus Tead1 gene Proteins 0.000 description 1
- 101100028920 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cfp gene Proteins 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- 101100312945 Pasteurella multocida (strain Pm70) talA gene Proteins 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 1
- 244000046052 Phaseolus vulgaris Species 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 1
- 108010009736 Protein Hydrolysates Proteins 0.000 description 1
- 101710193909 Protochlorophyllide reductase, chloroplastic Proteins 0.000 description 1
- 101710109491 Pyruvate synthase subunit PorA Proteins 0.000 description 1
- 101710109487 Pyruvate synthase subunit PorB Proteins 0.000 description 1
- 101710109489 Pyruvate synthase subunit PorC Proteins 0.000 description 1
- 101710109484 Pyruvate synthase subunit PorD Proteins 0.000 description 1
- 241000235072 Saccharomyces bayanus Species 0.000 description 1
- 235000003534 Saccharomyces carlsbergensis Nutrition 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108010073771 Soybean Proteins Proteins 0.000 description 1
- 239000004376 Sucralose Substances 0.000 description 1
- UCKMPCXJQFINFW-UHFFFAOYSA-N Sulphide Chemical compound [S-2] UCKMPCXJQFINFW-UHFFFAOYSA-N 0.000 description 1
- 101150032817 TPI1 gene Proteins 0.000 description 1
- 101150006914 TRP1 gene Proteins 0.000 description 1
- 102100032841 Tetratricopeptide repeat protein 1 Human genes 0.000 description 1
- 235000009470 Theobroma cacao Nutrition 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 1
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- 241000311098 Yamadazyma Species 0.000 description 1
- 241000235015 Yarrowia lipolytica Species 0.000 description 1
- 244000273928 Zingiber officinale Species 0.000 description 1
- 235000006886 Zingiber officinale Nutrition 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- YGCFIWIQZPHFLU-UHFFFAOYSA-N acesulfame Chemical compound CC1=CC(=O)NS(=O)(=O)O1 YGCFIWIQZPHFLU-UHFFFAOYSA-N 0.000 description 1
- 229960005164 acesulfame Drugs 0.000 description 1
- DHKHKXVYLBGOIT-UHFFFAOYSA-N acetaldehyde Diethyl Acetal Natural products CCOC(C)OCC DHKHKXVYLBGOIT-UHFFFAOYSA-N 0.000 description 1
- 150000001241 acetals Chemical class 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 238000010564 aerobic fermentation Methods 0.000 description 1
- 238000005054 agglomeration Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 235000013334 alcoholic beverage Nutrition 0.000 description 1
- IAJILQKETJEXLJ-RSJOWCBRSA-N aldehydo-D-galacturonic acid Chemical compound O=C[C@H](O)[C@@H](O)[C@@H](O)[C@H](O)C(O)=O IAJILQKETJEXLJ-RSJOWCBRSA-N 0.000 description 1
- IAJILQKETJEXLJ-QTBDOELSSA-N aldehydo-D-glucuronic acid Chemical compound O=C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C(O)=O IAJILQKETJEXLJ-QTBDOELSSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 235000019270 ammonium chloride Nutrition 0.000 description 1
- 229910000148 ammonium phosphate Inorganic materials 0.000 description 1
- 235000019289 ammonium phosphates Nutrition 0.000 description 1
- 150000003863 ammonium salts Chemical class 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- 229940025131 amylases Drugs 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 230000003078 antioxidant effect Effects 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 235000015197 apple juice Nutrition 0.000 description 1
- 239000000605 aspartame Substances 0.000 description 1
- 235000010357 aspartame Nutrition 0.000 description 1
- IAOZJIPTCAWIRG-QWRGUYRKSA-N aspartame Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)OC)CC1=CC=CC=C1 IAOZJIPTCAWIRG-QWRGUYRKSA-N 0.000 description 1
- 229960003438 aspartame Drugs 0.000 description 1
- 235000013405 beer Nutrition 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 235000020279 black tea Nutrition 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 230000036772 blood pressure Effects 0.000 description 1
- 235000008429 bread Nutrition 0.000 description 1
- FAPWYRCQGJNNSJ-UBKPKTQASA-L calcium D-pantothenic acid Chemical compound [Ca+2].OCC(C)(C)[C@@H](O)C(=O)NCCC([O-])=O.OCC(C)(C)[C@@H](O)C(=O)NCCC([O-])=O FAPWYRCQGJNNSJ-UBKPKTQASA-L 0.000 description 1
- 229960002079 calcium pantothenate Drugs 0.000 description 1
- 235000013736 caramel Nutrition 0.000 description 1
- 229940077731 carbohydrate nutrients Drugs 0.000 description 1
- 235000012174 carbonated soft drink Nutrition 0.000 description 1
- 235000021466 carotenoid Nutrition 0.000 description 1
- 150000001747 carotenoids Chemical class 0.000 description 1
- 235000019219 chocolate Nutrition 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000002288 cocrystallisation Methods 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000005056 compaction Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 235000014510 cooky Nutrition 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 235000012495 crackers Nutrition 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 235000021438 curry Nutrition 0.000 description 1
- 229940109275 cyclamate Drugs 0.000 description 1
- KAATUXNTWXVJKI-UHFFFAOYSA-N cypermethrin Chemical compound CC1(C)C(C=C(Cl)Cl)C1C(=O)OC(C#N)C1=CC=CC(OC=2C=CC=CC=2)=C1 KAATUXNTWXVJKI-UHFFFAOYSA-N 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- MNNHAPBLZZVQHP-UHFFFAOYSA-N diammonium hydrogen phosphate Chemical compound [NH4+].[NH4+].OP([O-])([O-])=O MNNHAPBLZZVQHP-UHFFFAOYSA-N 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 239000001177 diphosphate Substances 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 238000004821 distillation Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 235000015071 dressings Nutrition 0.000 description 1
- CANAPGLEBDTCAF-UHFFFAOYSA-N dulcoside a Chemical compound OC1C(O)C(O)C(C)OC1OC1C(OC23C(CC4(C2)C(C2(C)C(C(CCC2)(C)C(=O)OC2C(C(O)C(O)C(CO)O2)O)CC4)CC3)=C)OC(CO)C(O)C1O CANAPGLEBDTCAF-UHFFFAOYSA-N 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 235000015897 energy drink Nutrition 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 101150104041 eno2 gene Proteins 0.000 description 1
- 108010026539 ent-kaurenoic acid 13-hydroxylase Proteins 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 235000019414 erythritol Nutrition 0.000 description 1
- UNXHWFMMPAWVPI-ZXZARUISSA-N erythritol Chemical compound OC[C@H](O)[C@H](O)CO UNXHWFMMPAWVPI-ZXZARUISSA-N 0.000 description 1
- 229940009714 erythritol Drugs 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 239000012526 feed medium Substances 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 235000013332 fish product Nutrition 0.000 description 1
- 235000013611 frozen food Nutrition 0.000 description 1
- 239000008369 fruit flavor Substances 0.000 description 1
- 235000012055 fruits and vegetables Nutrition 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 235000008397 ginger Nutrition 0.000 description 1
- 150000002304 glucoses Chemical class 0.000 description 1
- 229930182478 glucoside Natural products 0.000 description 1
- 150000008131 glucosides Chemical class 0.000 description 1
- 229940097043 glucuronic acid Drugs 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 235000009569 green tea Nutrition 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 230000007407 health benefit Effects 0.000 description 1
- 235000019534 high fructose corn syrup Nutrition 0.000 description 1
- 239000008123 high-intensity sweetener Substances 0.000 description 1
- 239000008240 homogeneous mixture Substances 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 230000001631 hypertensive effect Effects 0.000 description 1
- 235000015243 ice cream Nutrition 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 150000002540 isothiocyanates Chemical class 0.000 description 1
- 235000015110 jellies Nutrition 0.000 description 1
- 239000008274 jelly Substances 0.000 description 1
- 235000008960 ketchup Nutrition 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 235000019223 lemon-lime Nutrition 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 229940061634 magnesium sulfate heptahydrate Drugs 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 235000010746 mayonnaise Nutrition 0.000 description 1
- 239000008268 mayonnaise Substances 0.000 description 1
- 229940126601 medicinal product Drugs 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 210000001589 microsome Anatomy 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 235000010755 mineral Nutrition 0.000 description 1
- 150000007522 mineralic acids Chemical class 0.000 description 1
- 229930189775 mogroside Natural products 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000012452 mother liquor Substances 0.000 description 1
- 235000021096 natural sweeteners Nutrition 0.000 description 1
- 229960003512 nicotinic acid Drugs 0.000 description 1
- 235000001968 nicotinic acid Nutrition 0.000 description 1
- 239000011664 nicotinic acid Substances 0.000 description 1
- 235000020333 oolong tea Nutrition 0.000 description 1
- 235000015205 orange juice Nutrition 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 230000036284 oxygen consumption Effects 0.000 description 1
- 239000005022 packaging material Substances 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 101150079312 pgk1 gene Proteins 0.000 description 1
- 239000003075 phytoestrogen Substances 0.000 description 1
- 235000021110 pickles Nutrition 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 235000013406 prebiotics Nutrition 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000006041 probiotic Substances 0.000 description 1
- 230000000529 probiotic effect Effects 0.000 description 1
- 235000018291 probiotics Nutrition 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 239000003531 protein hydrolysate Substances 0.000 description 1
- 238000000425 proton nuclear magnetic resonance spectrum Methods 0.000 description 1
- 235000011962 puddings Nutrition 0.000 description 1
- ZUFQODAHGAHPFQ-UHFFFAOYSA-N pyridoxine hydrochloride Chemical compound Cl.CC1=NC=C(CO)C(CO)=C1O ZUFQODAHGAHPFQ-UHFFFAOYSA-N 0.000 description 1
- 229960004172 pyridoxine hydrochloride Drugs 0.000 description 1
- 235000019171 pyridoxine hydrochloride Nutrition 0.000 description 1
- 239000011764 pyridoxine hydrochloride Substances 0.000 description 1
- 239000001397 quillaja saponaria molina bark Substances 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- CVHZOJJKTDOEJC-UHFFFAOYSA-N saccharin Chemical class C1=CC=C2C(=O)NS(=O)(=O)C2=C1 CVHZOJJKTDOEJC-UHFFFAOYSA-N 0.000 description 1
- 235000019992 sake Nutrition 0.000 description 1
- 229930182490 saponin Natural products 0.000 description 1
- 150000007949 saponins Chemical class 0.000 description 1
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000000935 solvent evaporation Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 239000011877 solvent mixture Substances 0.000 description 1
- 235000014347 soups Nutrition 0.000 description 1
- 229940001941 soy protein Drugs 0.000 description 1
- 235000015096 spirit Nutrition 0.000 description 1
- 235000011496 sports drink Nutrition 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- BAQAVOSOZGMPRM-QBMZZYIRSA-N sucralose Chemical compound O[C@@H]1[C@@H](O)[C@@H](Cl)[C@@H](CO)O[C@@H]1O[C@@]1(CCl)[C@@H](O)[C@H](O)[C@@H](CCl)O1 BAQAVOSOZGMPRM-QBMZZYIRSA-N 0.000 description 1
- 235000019408 sucralose Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 1
- 229960000344 thiamine hydrochloride Drugs 0.000 description 1
- 235000019190 thiamine hydrochloride Nutrition 0.000 description 1
- 239000011747 thiamine hydrochloride Substances 0.000 description 1
- 150000003573 thiols Chemical class 0.000 description 1
- 239000000606 toothpaste Substances 0.000 description 1
- 229940034610 toothpaste Drugs 0.000 description 1
- 150000003626 triacylglycerols Chemical class 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 238000002495 two-dimensional nuclear magnetic resonance spectrum Methods 0.000 description 1
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 1
- 238000000825 ultraviolet detection Methods 0.000 description 1
- 235000019583 umami taste Nutrition 0.000 description 1
- 235000019607 umami taste sensations Nutrition 0.000 description 1
- 235000015192 vegetable juice Nutrition 0.000 description 1
- 235000013522 vodka Nutrition 0.000 description 1
- 238000005550 wet granulation Methods 0.000 description 1
- 235000014101 wine Nutrition 0.000 description 1
- 229910009112 xH2O Inorganic materials 0.000 description 1
- 125000000969 xylosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)CO1)* 0.000 description 1
Images
Classifications
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23L—FOODS, FOODSTUFFS, OR NON-ALCOHOLIC BEVERAGES, NOT COVERED BY SUBCLASSES A21D OR A23B-A23J; THEIR PREPARATION OR TREATMENT, e.g. COOKING, MODIFICATION OF NUTRITIVE QUALITIES, PHYSICAL TREATMENT; PRESERVATION OF FOODS OR FOODSTUFFS, IN GENERAL
- A23L27/00—Spices; Flavouring agents or condiments; Artificial sweetening agents; Table salts; Dietetic salt substitutes; Preparation or treatment thereof
- A23L27/30—Artificial sweetening agents
- A23L27/33—Artificial sweetening agents containing sugars or derivatives
- A23L27/36—Terpene glycosides
-
- C—CHEMISTRY; METALLURGY
- C08—ORGANIC MACROMOLECULAR COMPOUNDS; THEIR PREPARATION OR CHEMICAL WORKING-UP; COMPOSITIONS BASED THEREON
- C08B—POLYSACCHARIDES; DERIVATIVES THEREOF
- C08B37/00—Preparation of polysaccharides not provided for in groups C08B1/00 - C08B35/00; Derivatives thereof
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23K—FODDER
- A23K20/00—Accessory food factors for animal feeding-stuffs
- A23K20/10—Organic substances
- A23K20/163—Sugars; Polysaccharides
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23L—FOODS, FOODSTUFFS, OR NON-ALCOHOLIC BEVERAGES, NOT COVERED BY SUBCLASSES A21D OR A23B-A23J; THEIR PREPARATION OR TREATMENT, e.g. COOKING, MODIFICATION OF NUTRITIVE QUALITIES, PHYSICAL TREATMENT; PRESERVATION OF FOODS OR FOODSTUFFS, IN GENERAL
- A23L2/00—Non-alcoholic beverages; Dry compositions or concentrates therefor; Their preparation
- A23L2/52—Adding ingredients
- A23L2/60—Sweeteners
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23L—FOODS, FOODSTUFFS, OR NON-ALCOHOLIC BEVERAGES, NOT COVERED BY SUBCLASSES A21D OR A23B-A23J; THEIR PREPARATION OR TREATMENT, e.g. COOKING, MODIFICATION OF NUTRITIVE QUALITIES, PHYSICAL TREATMENT; PRESERVATION OF FOODS OR FOODSTUFFS, IN GENERAL
- A23L27/00—Spices; Flavouring agents or condiments; Artificial sweetening agents; Table salts; Dietetic salt substitutes; Preparation or treatment thereof
- A23L27/30—Artificial sweetening agents
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23L—FOODS, FOODSTUFFS, OR NON-ALCOHOLIC BEVERAGES, NOT COVERED BY SUBCLASSES A21D OR A23B-A23J; THEIR PREPARATION OR TREATMENT, e.g. COOKING, MODIFICATION OF NUTRITIVE QUALITIES, PHYSICAL TREATMENT; PRESERVATION OF FOODS OR FOODSTUFFS, IN GENERAL
- A23L27/00—Spices; Flavouring agents or condiments; Artificial sweetening agents; Table salts; Dietetic salt substitutes; Preparation or treatment thereof
- A23L27/30—Artificial sweetening agents
- A23L27/33—Artificial sweetening agents containing sugars or derivatives
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23L—FOODS, FOODSTUFFS, OR NON-ALCOHOLIC BEVERAGES, NOT COVERED BY SUBCLASSES A21D OR A23B-A23J; THEIR PREPARATION OR TREATMENT, e.g. COOKING, MODIFICATION OF NUTRITIVE QUALITIES, PHYSICAL TREATMENT; PRESERVATION OF FOODS OR FOODSTUFFS, IN GENERAL
- A23L29/00—Foods or foodstuffs containing additives; Preparation or treatment thereof
- A23L29/30—Foods or foodstuffs containing additives; Preparation or treatment thereof containing carbohydrate syrups; containing sugars; containing sugar alcohols, e.g. xylitol; containing starch hydrolysates, e.g. dextrin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/44—Preparation of O-glycosides, e.g. glucosides
- C12P19/56—Preparation of O-glycosides, e.g. glucosides having an oxygen atom of the saccharide radical directly bound to a condensed ring system having three or more carbocyclic rings, e.g. daunomycin, adriamycin
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23V—INDEXING SCHEME RELATING TO FOODS, FOODSTUFFS OR NON-ALCOHOLIC BEVERAGES AND LACTIC OR PROPIONIC ACID BACTERIA USED IN FOODSTUFFS OR FOOD PREPARATION
- A23V2002/00—Food compositions, function of food ingredients or processes for food or foodstuffs
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23V—INDEXING SCHEME RELATING TO FOODS, FOODSTUFFS OR NON-ALCOHOLIC BEVERAGES AND LACTIC OR PROPIONIC ACID BACTERIA USED IN FOODSTUFFS OR FOOD PREPARATION
- A23V2200/00—Function of food ingredients
- A23V2200/15—Flavour affecting agent
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23V—INDEXING SCHEME RELATING TO FOODS, FOODSTUFFS OR NON-ALCOHOLIC BEVERAGES AND LACTIC OR PROPIONIC ACID BACTERIA USED IN FOODSTUFFS OR FOOD PREPARATION
- A23V2250/00—Food ingredients
- A23V2250/24—Non-sugar sweeteners
- A23V2250/258—Rebaudioside
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23V—INDEXING SCHEME RELATING TO FOODS, FOODSTUFFS OR NON-ALCOHOLIC BEVERAGES AND LACTIC OR PROPIONIC ACID BACTERIA USED IN FOODSTUFFS OR FOOD PREPARATION
- A23V2250/00—Food ingredients
- A23V2250/24—Non-sugar sweeteners
- A23V2250/262—Stevioside
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Polymers & Plastics (AREA)
- Zoology (AREA)
- Food Science & Technology (AREA)
- Wood Science & Technology (AREA)
- Nutrition Science (AREA)
- General Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biochemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biotechnology (AREA)
- Genetics & Genomics (AREA)
- Microbiology (AREA)
- General Chemical & Material Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Materials Engineering (AREA)
- Medicinal Chemistry (AREA)
- Animal Husbandry (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Seasonings (AREA)
- Saccharide Compounds (AREA)
Abstract
Description
本发明是申请人于2016年4月4日提交的申请号为201680020186.4、题为“甜菊醇糖苷”的中国专利申请的分案申请。
技术领域
本发明涉及甜菊醇糖苷,制备其的方法,包括甜菊醇糖苷的甜味剂组合物、风味组合物、食品、饲料和饮料以及甜菊醇糖苷在甜味剂组合物、风味组合物、食品、饲料和饮料中的用途。
背景技术
多年生草本植物甜叶菊(Stevia rebaudiana Bert.)的叶子积聚大量被称为甜菊醇糖苷的具有强烈甜味的化合物。虽然这些化合物的生物功能尚不清楚,但它们作为替代性高效甜味剂具有商业意义。
这些甜的甜菊醇糖苷的功能和感官特性表现为优于许多高效甜味剂的功能和感官特性。此外,研究表明甜菊苷能够降低II型糖尿病患者的血糖水平,并且能够降低轻度高血压患者的血压。
甜菊醇糖苷积聚在甜叶菊叶中,其中它们可占叶干重的10%至20%。甜菊苷和莱鲍迪甙A均是热和pH稳定的,并且适用于碳酸饮料和许多其他食物。甜菊苷比蔗糖甜110与270倍之间,莱鲍迪甙A比蔗糖甜150与320倍之间。此外,莱鲍迪甙D也是在甜叶菊叶中积聚的高效二萜糖苷甜味剂。它可比蔗糖甜约200倍。莱鲍迪甙M是另一种高效二萜糖苷甜味剂。它在某些甜叶菊品种叶中以痕量存在,但已表明其具有优异的味道特征。
传统上已从甜叶菊植物中提取了甜菊醇糖苷。在甜叶菊中,(-)-贝壳杉烯酸(赤霉酸(GA)生物合成中的中间体)被转化成四环二萜甜菊醇,其然后通过多步糖基化途径进行以形成各种甜菊醇糖苷。然而,产率可以是可变的,并且受到农业和环境条件的影响。此外,甜叶菊种植需要大量的土地面积、在收获前的很长时间、密集劳动以及用于提取和纯化糖苷的额外成本。
但是,仍需要具有替代和/或改善的味道谱的额外的甜菊醇糖苷,这是因为不同的甜菊醇糖苷可更适合于不同的应用。
发明内容
本发明基于从已进行修饰以制备包括rebA的甜菊醇糖苷的微生物获得的发酵液中鉴定新的甜菊醇糖苷。与已知的甜菊醇糖苷相比,新的甜菊醇糖苷将具有不同的感官特性。其可以单独使用或与其他甜菊醇糖苷组合使用,特别是作为甜味剂或用于甜味剂组合物中。
因此,本发明涉及:
-一种具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少三个糖部分,且其中甜菊醇糖苷包括至少七个糖部分,其全部均直接或间接地通过β键联接到甜菊醇糖苷配基。
-一种具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少四个糖部分且在位置R2上存在至少三个糖部分。
-一种具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少三个糖部分,其中甜菊醇糖苷包括至少七个糖部分,且其中存在于位置R1的糖中的至少一个通过α键被联接到甜菊醇糖苷配基或糖分子。
-一种具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少四个糖部分,其中存在于位置R2上的糖部分中的至少四个为葡萄糖部分。
-一种具有式(II)的甜菊醇糖苷
-一种具有式(III)的甜菊醇糖苷
-一种具有式(IV)的甜菊醇糖苷
-一种发酵制备的具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少三个糖部分,且其中甜菊醇糖苷包括至少七个糖部分;
-一种用于制备根据前述权利要求中任一项所述的甜菊醇糖苷的方法,该方法包括:
提供重组酵母细胞,其包括编码多肽的重组核酸序列,所述多肽包括由下列编码的氨基酸序列:SEQ ID NO:61、SEQ ID NO:65、SEQ ID NO:23、SEQ ID NO:33、SEQ ID NO:77、SEQ ID NO:71、SEQ ID NO:87、SEQ ID NO:73和SEQ ID NO:75;
在合适的发酵培养基中使重组酵母细胞发酵;以及,可选地,
回收根据前述权利要求中任一项所述的甜菊醇糖苷。
-一种组合物,其包括本发明的甜菊醇糖苷以及一种或多种不同的甜菊醇糖苷(其中不同的甜菊醇糖苷可以是或可以不是本发明的甜菊醇糖苷);
-一种甜味剂组合物、风味组合物、食品、饲料或饮料,其包括本发明的甜菊醇糖苷或组合物;
-本发明的甜菊醇糖苷或组合物在甜味剂组合物或风味组合物中的用途;以及
-本发明的甜菊醇糖苷或组合物在食品、饲料或饮料中的用途。
附图说明
图1显示了质粒pUG7-EcoRV的示意图。
图2显示了将ERG20、tHMG1和BTS1过表达盒设计(A)和整合(B)至酵母基因组中的方法的示意图。(C)示出在通过Cre重组酶移除KANMX标记后的最终情况。
图3示出了ERG9敲低构建体的示意性图示。所述构建体由ERG9的500bp长的3'部分、TRP1启动子的98bp、TRP1开放阅读框和终止子、随后ERG9的400bp长的下游序列组成。由于在ERG9开放阅读框末端处引入Xbal位点,所以最后一个氨基酸变成Ser,并且终止密码子变成Arg。新的终止密码子位于TPR1启动子中,从而导致18个氨基酸的延伸。
图4示出了UGT2如何整合到基因组中的示意性图示。A.在转化中使用的不同片段;B.整合后的情况;C.在Cre重组酶表达后的情况)。
图5示出了从GGPP至RebA的途径如何整合到基因组中的示意性图示。A.在转化中使用的不同片段;B.在整合后的情况。
图6a示出使用高分辨率质谱法,在乙醇提取物(用于纯化的起始物料)中的含有7个葡萄糖(7.1、7.2和7.3)的甜菊醇糖苷混合物的m/z 1451.5820的提取的离子色谱图;且图6b为使用LC-MS的含有7个葡萄糖(7.1、7.2和7.3)的纯化甜菊醇糖苷的m/z 1451.5的提取的离子色谱图。
图7示出莱鲍迪甙7.1的结构。
图8示出莱鲍迪甙7.2的结构。
图9示出莱鲍迪甙7.3的结构。
图10示出莱鲍迪甙M的结构。
图11示出(a)甜菊醇的原子编号以及(b)葡萄糖的原子编号。
图12示出a)Reb M(cdcl3/pyr 1∶1,300K的2滴cdood),b)Reb 7.1(cdcl3/pyr 1∶3,320K的2滴cdood),c)Reb 7.2(cdcl3/pyr 1∶1,300K的2滴cdood)和d)Reb 7.3(cdcl3/pyr 1∶2,300K的3滴cdood)的1H NMR谱的选定区域。
序列表的说明
在表15中显示了对序列的描述。本文所述的序列可以参考序列表或参考也显示在表15中的数据库登录号来进行限定。
具体实施方式
在本说明书和所附权利要求书中,词语“包含”、“包括”和“具有”以及变化形式应被解释为包含性的。也就是说,这些词语意图表达在上下文允许的情况下可包含未具体叙述的其他要素或整数。
不使用数量词修饰时在本文中用于指代一个或一个以上(即一个或至少一个)的语法对象。举例来说,“要素”可意指一个要素或多于一个要素。
本发明涉及甜菊醇糖苷。为了本发明的目的,甜菊醇糖苷是甜菊醇的糖苷,特别是其羧基氢原子被葡萄糖分子取代以形成酯以及具有葡萄糖以形成乙缩醛的羟基氢的甜菊醇分子。
可以以分离的形式提供本发明的甜菊醇糖苷。“分离的甜菊醇糖苷”是从可与其天然相关联的其他物料,诸如其他甜菊醇糖苷移出的物质。因此,分离的甜菊醇糖苷可以含有按重量计的至多10%,至多8%,更优选为至多6%,更优选为至多5%,更优选为至多4%,更优选为至多3%,甚至更优选为至多2%,甚至更优选为至多1%,且最优选为至多0.5%的与其天然相关联的其他物料,例如其他甜菊醇糖苷。分离的甜菊醇糖苷可以不含任何其它杂质。本发明的分离的甜菊醇糖苷可以是按重量计的至少50%纯,例如至少60%纯,至少70%纯,至少75%纯,至少80%纯,至少85%纯,至少90%纯,或至少95%、96%、97%、98%、99%、99.5%、99.9%纯。
本发明提供了一种具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少三个糖部分,且其中甜菊醇糖苷包括至少七个糖部分,其全部均直接或间接地通过β键联接到甜菊醇糖苷配基,或
其中在位置R1上存在至少四个糖部分且在位置R2上存在至少三个糖部分,或
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少三个糖部分,其中甜菊醇糖苷包括至少七个糖部分,且其中存在于位置R1的糖中的至少一个通过α键被联接到甜菊醇糖苷配基或糖分子,或
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少四个糖部分,其中存在于位置R2上的糖部分中的至少四个为葡萄糖部分。
本发明还提供了具有式(II)、(III)或(IV)的甜菊醇糖苷:
本发明的甜菊醇糖苷可以从植物物料获得,但更典型地,将通过发酵制备获得,例如,经对重组宿主细胞诸如酵母细胞的发酵获得。
因此,本发明提供了一种发酵制备的具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少三个糖部分,且其中甜菊醇糖苷包括至少七个糖部分。
可以基于糖中的端基异构位置和距离C1最远的立体中心的相对立体化学(R或S)来区分α-和β-糖苷键。通常,当两个碳具有相同的立体化学时,形成α-糖苷键,而当两个碳具有不同的立体化学时,出现β-糖苷键。
这种发酵制备的甜菊醇糖苷可以具有本文所述的甜菊醇糖苷中的任一个的结构。
本发明还涉及一种用于制备甜菊醇糖苷的方法。在这种方法中,在合适的发酵培养基中发酵合适的重组宿主细胞诸如酵母细胞以制备甜菊醇糖苷。可选地,可以回收甜菊醇糖苷。
例如,一种用于制备如本文所述的甜菊醇糖苷的方法可以包括:
提供重组酵母细胞,其包括编码多肽的重组核酸序列,所述多肽包括由下列编码的氨基酸序列:SEQ ID NO:61、SEQ ID NO:65、SEQ ID NO:23、SEQ ID NO:33、SEQ ID NO:77、SEQ ID NO:71、SEQ ID NO:87、SEQ ID NO:73和SEQ ID NO:75;
在合适的发酵培养基中使重组酵母细胞发酵;以及,可选地,
回收如本文所述的甜菊醇糖苷。
在涉及细胞、核酸、蛋白质或载体使用时,术语“重组”指示细胞、核酸、蛋白质或载体已通过引入异源核酸或蛋白质或改变天然核酸或蛋白质来进行修饰,或者指示细胞源自如此修饰的细胞。因此,例如,重组细胞表达在细胞的天然(非重组)形式中未发现的基因或者表达以其他形式异常表达、低表达或完全未表达的天然基因。术语“重组的”与“遗传修饰的”同义。
用于本发明的方法中的重组酵母细胞可以是任何合适的酵母细胞。优选的重组酵母细胞可以选自下列各属:酵母属(Saccharomyces)(例如,酿酒酵母(S.cerevisiae)、贝酵母(S.bayanus)、巴斯德酵母(S.pastorianus)、卡尔斯伯酵母(S.carlsbergensis))、酒香酵母属(Brettanomyces)、克鲁维酵母属(Kluyveromyces)、假丝酵母属(Candida)(例如,克鲁斯假丝酵母(C.krusei)、拉考夫假丝酵母(C.revkaufi)、铁红假丝酵母(C.pulcherrima)、热带假丝酵母(C.tropicalis)、产朊假丝酵母(C.utilis))、伊萨酵母属(Issatchenkia)(例如,东方伊萨酵母(I.orientalis))、毕赤酵母属(Pichia)(例如,巴斯德毕赤酵母(P.pastoris))、裂殖酵母属(Schizosaccharomyces)、汉逊酵母属(Hansenula)、克勒克酵母属(Kloeckera)、管囊酵母属(Pachysolen)、许旺酵母属(Schwanniomyces)、毛孢子菌属(Trichosporon)、耶氏酵母属(Yarrowia)(例如,解脂耶氏酵母(Y.lipolytica)(先前分类为解脂假丝酵母(Candida lipolytica)))、Yamadazyma。优选地,重组酵母细胞是酿酒酵母、解脂耶氏酵母或东方伊萨酵母细胞。
用于根据本发明所述方法中的重组酵母细胞可以包括一个或多个重组核苷酸序列,其对下列中的一个或多个进行编码:
具有对映-柯巴基焦磷酸合酶活性的多肽;
具有对映-贝壳杉烯合酶活性的多肽;
具有对映-贝壳杉烯氧化酶活性的多肽;以及
具有贝壳杉烯酸-13-羟化酶活性的多肽。
出于本发明的目的,具有对映-柯巴基焦磷酸合酶(EC 5.5.1.13)的多肽能够催化化学反应:
所述酶具有一种底物,香叶基香叶基焦磷酸;以及一种产物,对映-柯巴基焦磷酸。所述酶参与赤霉素生物的合成。所述酶属于异构酶家族,特别是分子内裂解酶的类别。所述酶类别的系统名称是对映-柯巴基-二磷酸裂解酶(脱环)。通常使用的其他名称包括具有对映-柯巴基焦磷酸合酶、对映-贝壳杉烯合酶A和对映-贝壳杉烯合成酶A。
编码对映-柯巴基焦磷酸合酶的合适核酸序列可例如包含在SEQ ID.NO:1、3、5、7、17、19、59、61、141、142、151、152、153、154、159、160、182或184中列出的序列。
出于本发明的目的,具有对映-贝壳杉烯合酶活性(EC 4.2.3.19)的多肽是能够催化以下化学反应的多肽:
因此,所述酶具有一种底物,对映-柯巴基二磷酸;以及两种产物,对映-贝壳杉烯和二磷酸。
所述酶属于裂解酶家族,特别是作用于磷酸盐/酯的碳-氧裂解酶。所述酶类别的系统名称是对映-柯巴基二磷酸二磷酸-裂解酶(环化,对映-贝壳杉烯形成)。常用的其它名称包括对映-贝壳杉烯合酶B、对映-贝壳杉烯合成酶B、对映-柯巴基-二磷酸二磷酸-裂解酶和(环化)。所述酶参与双萜类生物合成。
编码对映-贝壳杉烯合酶的合适核酸序列可例如包含在SEQ ID.NO:9、11、13、15、17、19、63、65、143、144、155、156、157、158、159、160、183或184中列出的序列。
对映-柯巴基二磷酸合酶还可具有与相同蛋白质分子相关联的不同对映-贝壳杉烯合酶活性。由对映-贝壳杉烯合酶催化的反应是赤霉素的生物合成途径中的下一步骤。两种类型的酶活性是不同的,并且定点诱变以抑制蛋白质的对映-贝壳杉烯合酶活性导致对映-柯巴基焦磷酸的积累。
因此,在适用于本发明方法的重组酵母中使用的单个核苷酸序列可编码具有对映-柯巴基焦磷酸合酶活性和对映-贝壳杉烯合酶活性的多肽。或者,两种活性可被两个不同的分离的核苷酸序列编码。
出于本发明的目的,具有对映-贝壳杉烯氧化酶活性(EC 1.14.13.78)的多肽是能够催化对映-贝壳杉烯的4-甲基的三次连续氧化以产生贝壳杉烯酸的多肽。这种活性通常需要细胞色素P450的存在。
编码对映-贝壳杉烯氧化酶的合适核酸序列可例如包含在SEQ ID.NO:21、23、25、67、85、145、161、162、163、180或186中列出的序列。
出于本发明的目的,具有贝壳杉烯酸13-羟化酶活性(EC 1.14.13)的多肽是能够催化使用NADPH和O2形成甜菊醇(对映-贝壳杉-16-烯-13-醇-19-酸)的多肽。这种活性也可称为对映-贝壳杉烯酸13-羟化酶活性。
编码贝壳杉烯酸13-羟化酶的合适核酸序列可例如包含在SEQ ID.NO:27、29、31、33、69、89、91、93、95、97、146、164、165、166、167或185中列出的序列。
适用于本发明方法的重组酵母细胞可包含编码具有NADPH-细胞色素p450还原酶活性的多肽的重组核酸序列。也就是说,适用于本发明方法的重组酵母可能够表达编码具有NADPH-细胞色素p450还原酶活性的多肽的核苷酸序列。出于本发明的目的,具有NADPH-细胞色素P450还原酶活性(EC 1.6.2.4;也称为NADPH:高铁血红蛋白氧化还原酶、NADPH:血红素蛋白氧化还原酶、NADPH:P450氧化还原酶、P450还原酶、POR、CPR、CYPOR)的多肽通常是一种这样的多肽,其为膜结合酶,从而允许电子从含有FAD和FMN的酶NADPH:细胞色素P450还原酶(POR;EC 1.6.2.4)转移至真核细胞的微粒体中的细胞色素P450。
编码NADPH-细胞色素p450还原酶的合适的核酸序列可以例如包括在SEQ ID.NO:53、55、57或77中显示的序列。
适合用于本发明的方法中的重组酵母细胞还可以包括一个或多个重组核酸序列,其对下列中的一个或多个进行编码:
(i)具有UGT74G1活性的多肽;
(ii)具有UGT2活性的多肽;
(iii)具有UGT85C2活性的多肽;以及
(iv)具有UGT76G1活性的多肽。
适合用于本发明中的重组酵母可以包括编码能够催化将C-13-葡萄糖添加至甜菊醇的多肽的核苷酸序列。也就是说,适合用于本发明的方法中的重组酵母可以包括UGT,其能够催化其中将甜菊醇转化成甜菊单糖苷的反应。
这种适合用于本发明方法中的重组酵母可包含编码具有由UDP-糖基转移酶(UGT)UGT85C2所示的活性的多肽的核苷酸序列,由此酵母转化后的核苷酸序列赋予所述酵母将甜菊醇转化为甜菊醇单糖苷的能力。
UGT85C2活性是将葡萄糖单元转移至甜菊醇的13-OH。因此,合适的UGT85C2可充当尿苷5'-二磷酸葡糖基∶甜菊醇13-OH转移酶和尿苷5'-二磷酸葡糖基∶甜菊醇-19-O-糖苷13-OH转移酶。功能性UGT85C2多肽还可催化葡糖基转移酶反应,所述反应利用除甜菊醇和甜菊醇-19-O-糖苷以外的甜菊醇糖苷底物。此类序列可在本文中称为UGT1序列。
适合用于本发明中的重组酵母可以包括编码具有UGT2活性的多肽的核苷酸序列。
具有UGT2活性的多肽是用作尿苷5’-二磷酸葡糖基:甜菊醇-13-O-葡萄糖苷转移酶(也称为甜菊醇-13-单葡萄糖苷1,2-转葡糖基酶)的多肽,其将葡萄糖部分转移至受体分子甜菊醇-13-O-葡萄糖苷的13-O-葡萄糖的C-2’。通常,合适的UGT2多肽也用作尿苷5’-二磷酸葡糖基:甜茶苷转移酶,其将葡萄糖部分转移到受体分子甜茶苷的13-O-葡萄糖的C-2’。
具有UGT2活性的多肽还可以催化利用除了甜菊醇-13-O-葡萄糖苷和甜茶苷以外的甜菊醇糖苷底物的反应,例如,功能性UGT2多肽可以利用甜菊苷作为底物,将葡萄糖部分转移到19-O-葡萄糖残基的C-2’以制备莱鲍迪甙E。功能性UGT2多肽还可以利用莱鲍迪甙A作为底物,将葡萄糖部分转移到19-O-葡萄糖残基的C-2’以制备莱鲍迪甙D。然而,功能性UGT2多肽通常不将葡萄糖部分转移到在C-13位置具有1,3-结合葡萄糖的甜菊醇化合物,即通常不发生葡萄糖部分至甜菊醇1,3-双糖苷和1,3-甜菊苷的转移。
具有UGT2活性的多肽也可以将糖部分从除了尿苷二磷酸葡萄糖以外的供体进行转移。例如,具有UGT2活性的多肽充当尿苷5’-二磷酸D-木糖基:甜菊醇-13-O-葡萄糖苷转移酶,其将木糖部分转移到受体分子甜菊醇-13-O-葡萄糖苷的13-O-葡萄糖的C-2’。作为另一个实例,具有UGT2活性的多肽可以充当尿苷5’-二磷酸L-鼠李糖基:甜菊醇-13-O-葡萄糖苷转移酶,其将鼠李糖部分转移到受体分子甜菊醇的13-O-葡萄糖的C-2’。
适合用于本发明的方法中的重组酵母可以包括编码具有UGT活性的核苷酸序列,可以包括编码能够催化将C-19-葡萄糖添加至甜菊双糖苷的多肽的核苷酸序列。也就是说,本发明的重组酵母可以包括UGT,其能够催化其中将甜菊双糖苷转化成甜菊苷的反应。因此,这样的重组酵母可能够将甜菊双糖苷转化成甜菊苷。这种核苷酸序列的表达可以赋予重组酵母制备至少甜菊苷的能力。
适合用于本发明的方法中的重组酵母因此还可以包括编码具有由UDP-糖基转移酶(UGT)UGT74G1所示活性的多肽的核苷酸序列,由此在进行酵母转化后核苷酸序列赋予该细胞将甜菊双糖苷转化成甜菊苷的能力。
合适的UGT74G1多肽可能够将葡萄糖单元分别转移至甜菊醇的13-OH或19-COOH。合适的UGT74G1多肽可充当尿苷5'-二磷酸葡糖基∶甜菊醇19-COOH转移酶和尿苷5'-二磷酸葡糖基∶甜菊醇-13-O-糖苷19-COOH转移酶。功能性UGT74G1多肽还可催化使用除甜菊醇和甜菊醇-13-O-糖苷以外的甜菊醇糖苷底物或者从除尿苷二磷酸葡萄糖以外的供体转移糖部分的糖基转移酶反应。此类序列可在本文中称为UGT3序列。
适合用于本发明的方法中的重组酵母可包含编码能够催化甜菊苷的C-13位置处的葡萄糖的C-3'的葡糖基化的多肽的核苷酸序列。也就是说,适合用于本发明的方法中的重组酵母可包含UGT,所述UGT能够催化甜菊苷至莱鲍迪甙A的反应。因此,这种重组酵母可能够将甜菊苷转化为莱鲍迪甙A。这种核苷酸序列的表达可赋予酵母产生至少莱鲍迪甙A的能力。
因此,适合用于本发明的方法中的重组酵母还可包含编码具有由UDP-糖基转移酶(UGT)UGT76G1所示的活性的多肽的核苷酸序列,由此酵母转化后的核苷酸序列赋予酵母将甜菊苷转化为莱鲍迪甙A的能力。
合适的UGT76G1向受体分子甜菊醇1,2糖苷的C-13-O-葡萄糖的C-3'添加葡萄糖部分。因此,UGT76G1例如充当尿苷5'-二磷酸葡糖基∶甜菊醇13-O-1,2葡糖苷C-3'葡糖基转移酶和尿苷5'-二磷酸葡糖基∶甜菊醇-19-O-葡萄糖、13-O-1,2双糖苷C-3'葡糖基转移酶。功能性UGT76G1多肽还可催化葡糖基转移酶反应,所述反应使用含有除葡萄糖以外的糖的甜菊醇糖苷底物,例如甜菊醇鼠李糖苷和甜菊醇木糖苷。此类序列可在本文中称为UGT4序列。UGT4可以替代地或额外地能够将RebD转化成RebM。
适合用于本发明的方法中的重组酵母通常包括编码至少一种具有UGT1活性的多肽,至少一种具有UGT2活性的多肽,至少一种具有UGT3活性的多肽和至少一种具有UGT4活性的多肽的核苷酸序列。这些核酸序列中的一种或多种可以是重组的。给定的核酸可以编码具有上述活性中的一组或多种的多肽。例如,核酸编码具有上述活性中的两种、三种或四种的多肽。优选地,用于本发明的方法中的重组酵母包括UGT1、UGT2、UGT3和UGT4活性。在本文的表15中描述了合适的UGT1、UGT2、UGT3和UGT4序列。编码UGT1、2、3和4活性的序列的优选组合为SEQ ID NO:71、87、73和75。
在本发明的方法中,重组宿主例如酵母可以能够在本领域中已知的任何合适的碳源上生长,并且将其转化为一种或更多种甜菊醇糖苷。重组宿主可能够直接转化植物生物质、纤维素、半纤维素、果胶、鼠李糖、半乳糖、岩藻糖、麦芽糖、麦芽糖糊精、核糖、核酮糖或淀粉、淀粉衍生物、蔗糖、乳糖和甘油。因此,优选的宿主表达酶如用于将纤维素转化成葡萄糖单体和将半纤维素转化成木糖和阿拉伯糖单体所需的纤维素酶(内切纤维素酶和外切纤维素酶)和半纤维素酶(例如内切和外切木聚糖酶、阿拉伯糖酶),能够将果胶转化成葡萄糖醛酸和半乳糖醛酸的果胶酶或将淀粉转化成葡萄糖单体的淀粉酶。优选地,宿主能够转化选自由以下各项组成的组的碳源:葡萄糖、木糖、阿拉伯糖、蔗糖、乳糖和甘油。宿主细胞可例如是WO03/062430、WO06/009434、EP1499708B1、WO2006096130或WO04/099381中所描述的真核宿主细胞。
在用于产生本发明的甜菊醇糖苷的方法中使用的发酵培养基可以是允许特定真核宿主细胞生长的任何合适的发酵培养基。发酵培养基的基本要素是本领域的技术人员已知的,并且可适用于所选择的宿主细胞。
优选地,发酵培养基包含选自由以下各项组成的组的碳源:植物生物质、纤维素、半纤维素、果胶、鼠李糖、半乳糖、岩藻糖、果糖、麦芽糖、麦芽糖糊精、核糖、核酮糖或淀粉、淀粉衍生物、蔗糖、乳糖、脂肪酸、甘油三酯和甘油。优选地,发酵培养基还包含氮源,如尿素;或铵盐,如硫酸铵、氯化铵、硝酸铵或磷酸铵。
根据本发明的发酵方法可以分批、分批补料或连续模式进行。也可应用单独的水解和发酵(SHF)方法或同时糖化和发酵(SSF)方法。这些发酵方法模式的组合对于最佳生产率来说也可以是可行的。如果在发酵方法中使用淀粉、纤维素、半纤维素或果胶作为碳源,则SSF方法可以是特别有吸引力的,其中可需要添加水解酶如纤维素酶、半纤维素酶或果胶酶以水解底物。
用于产生根据本发明的甜菊醇糖苷的发酵方法可以是需氧或厌氧发酵方法。
厌氧发酵方法可在本文中定义为在不存在氧的情况下运行或者基本上不消耗氧(优选小于5、2.5或1mmol/L/h),并且其中有机分子充当电子供体和电子受体两者的发酵方法。根据本发明的发酵方法也可首先在需氧条件下运行,且随后在厌氧条件下运行。
发酵方法也可在限氧或微需氧条件下进行。或者,发酵方法可首先在需氧条件下运行,且随后在限氧条件下运行。限氧发酵方法是其中氧消耗受到从气体到液体的氧传递的限制的过程。氧限制的程度由进入气流的量和组成以及所用发酵设备的实际混合/传质特性决定。
在根据本发明的方法中产生甜菊醇糖苷可在宿主细胞的生长阶段期间、固定(稳定状态)阶段期间或在两个阶段期间发生。在不同的温度下运行发酵方法可以是可行的。
用于产生甜菊醇糖苷的方法可在对于重组宿主来说最佳的温度下进行。对于每种转化的重组宿主而言,最佳生长温度可不同并且是本领域的技术人员已知的。最佳温度可高于野生型生物的最适温度以在非无菌条件下在最低感染敏感性和最低冷却成本的条件下有效生长生物体。或者,所述方法可在对于重组宿主的生长来说不是最佳的温度下进行。
用于产生根据本发明的甜菊醇糖苷的方法可在任何合适的pH值下进行。如果重组宿主是酵母,则发酵培养基中的pH优选具有低于6、优选低于5.5、优选低于5、优选低于4.5、优选低于4、优选低于pH 3.5或低于pH3.0或低于pH 2.5、优选高于pH 2的值。在这些低pH值下进行发酵的优点是可防止发酵培养基中污染细菌的生长。
这种方法可以以工业规模进行。这种方法的产物是根据本发明所述的一种或多种甜菊醇糖苷。
从发酵培养基回收本发明的甜菊醇糖苷可以通过本领域中已知的方法,例如通过蒸馏、真空提取、溶剂提取或蒸发来执行。
在根据本发明所述的用于制备甜菊醇糖苷的方法中,可以实现高于0.5mg/l,优选为高于约1mg/l的浓度。
在重组宿主中表达本发明的一种或多种甜菊醇糖苷的情况下,这种细胞可需要进行处理以将其释放。
本发明还提供了一种组合物,其包括与一种或多种不同的甜菊醇糖苷相组合的本发明的甜菊醇糖苷。一种或多种不同的甜菊醇糖苷中的一种或多种可以是本发明的甜菊醇糖苷。一种或多种不同的甜菊醇糖苷中的一种或多种可以是糖基化二萜(即,二萜糖苷),诸如甜菊单糖苷、甜菊双糖苷、甜菊苷、莱鲍迪甙A、莱鲍迪甙B、莱鲍迪甙C、莱鲍迪甙D、莱鲍迪甙E、莱鲍迪甙F、莱鲍迪甙M、甜茶苷、杜尔可苷A、甜菊醇-13-单糖苷、甜菊醇-19-单糖苷或13-[(β-D-吡喃葡萄糖基)氧基)贝壳杉-16-烯-18-酸2-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基酯。
本发明的组合物可以包括与更大量的不同的甜菊醇糖苷相组合的相对少量的本发明的甜菊醇糖苷。
例如,本发明的组合物可以包括与本发明的甜菊醇糖苷相组合的至少约80%、至少约90%、至少约95%的莱鲍迪甙A。本发明的组合物可以包括与本发明的甜菊醇糖苷相组合的至少约80%、至少约90%、至少约95%的莱鲍迪甙D。本发明的组合物可以包括与本发明的甜菊醇糖苷相组合的至少约80%、至少约90%、至少约95%的莱鲍迪甙M。本发明的组合物可以包括与本发明的甜菊醇糖苷和莱鲍迪甙D相组合的至少约80%、至少约90%、至少约95%的莱鲍迪甙A。本发明的组合物可以包括与本发明的甜菊醇糖苷和莱鲍迪甙M相组合的至少约80%、至少约90%、至少约95%的莱鲍迪甙A。所提及的百分比是以干重计的。
根据本发明所述的甜菊醇糖苷可以用于已知的用于这种化合物的任何应用中。特别地,其可以例如用作甜味剂或风味剂,例如在食品、饲料或饮料中。例如,甜菊醇糖苷可以被配制在软饮料诸如碳酸饮料、桌面甜味剂、口香糖、乳制品诸如酸奶(例如,原味酸奶)、蛋糕、谷物或基于谷物的食品、营养食品、药物、食用凝胶、糖食、化妆品、牙膏或其他口腔组合物等中。此外,甜菊醇糖苷可以用作甜味剂,其不仅可用于饮料、食品和其他专用于人类消费的制品中,还可用于具有改善的特性的动物饲料和草料中。
因此,本发明尤其提供了一种甜味剂组合物、风味剂组合物、食品、饲料或饮料,其包括根据本发明的方法所制备的甜菊醇糖苷。
本发明的组合物可以包括一种或多种非天然存在的组分。
而且,本发明提供了:
-本发明的甜菊醇糖苷或组合物在甜味剂组合物或风味组合物中的用途;以及
-本发明的甜菊醇糖苷或组合物在食品、饲料或饮料中的用途。
在制造食品、饮料、药物、化妆品、桌面制品、口香糖期间,可以使用传统的方法,诸如混合、捏合、溶解、浸酸、渗透、渗滤、喷洒、雾化、注入以及其他方法。
在本发明中获得的甜菊醇糖苷能够以干或液体形式使用。其能够在对食品进行热处理前或后进行添加。甜味剂的量取决于使用目的。其能够单独地或与其他化合物相组合地进行添加。
根据本发明的方法制备的化合物可以与一种或多种另外的非热量或热量甜味剂相混合。这种混合可以用于改善风味或时间特性或稳定性。本发明的甜菊醇糖苷可以用于改善第二种甜菊醇糖苷,诸如莱鲍迪甙A、D或M的风味或时间特性或稳定性。
大范围的非热量或热量甜味剂可以适合于与本发明的甜菊醇糖苷,包括根据本发明所述的一种或多种其他甜菊醇糖苷或一种或多种其他已知的甜菊醇糖苷,诸如甜菊单糖苷、甜菊双糖苷、甜菊苷、莱鲍迪甙A、莱鲍迪甙B、莱鲍迪甙C、莱鲍迪甙D、莱鲍迪甙E、莱鲍迪甙F、莱鲍迪甙M、甜茶苷、杜尔可苷A、甜菊醇-13-单糖苷、甜菊醇-19-单糖苷或13-[(β-D-吡喃葡萄糖基)氧基)贝壳杉-16-烯-18-酸2-O-β-D-吡喃葡萄糖基-β-D-吡喃葡萄糖基酯相混合。替代地或额外地,非热量甜味剂为诸如罗汉果苷、莫纳甜、阿斯巴甜、安赛蜜盐、甜蜜素、三氯蔗糖、糖精盐或赤藓糖醇。适合与甜菊醇糖苷相混合的热量甜味剂包括糖醇和碳水化合物,诸如蔗糖、葡萄糖、果糖和HFCS。还可以使用甜味氨基酸,诸如甘氨酸、丙氨酸或丝氨酸。
甜菊醇糖苷可与甜味剂抑制剂如天然甜味剂抑制剂组合使用。它可与鲜味增强剂如氨基酸或其盐组合。
甜菊醇糖苷可与多元醇或糖醇、碳水化合物、生理活性物质或功能成分(例如类胡萝卜素、膳食纤维、脂肪酸、皂苷、抗氧化剂、营养食品、类黄酮、异硫氰酸酯、苯酚、植物甾醇或甾烷醇(植物甾醇和植物甾烷醇)、多元醇、益生元、益生菌、植物雌激素、大豆蛋白、硫化物/硫醇、氨基酸、蛋白质、维生素、矿物质和/或基于健康益处如心血管、降胆固醇或抗炎分类的物质组合。
具有甜菊醇糖苷的组合物可包括调味剂、芳香组分、核苷酸、有机酸、有机酸盐、无机酸、苦味化合物、蛋白质或蛋白质水解产物、表面活性剂、类黄酮、收敛剂化合物、维生素、膳食纤维、抗氧化剂、脂肪酸和/或盐。
本发明的甜菊醇糖苷可作为高强度甜味剂应用,以产生具有改进的味道特征的零卡路里、低卡路里或糖尿病人用饮料和食品。它也可用于不能使用糖的饮料、食品、药物和其他产品中。
此外,本发明的甜菊醇糖苷可用作甜味剂,不仅用于饮料、食品和其它专门用于人消费的产品,而且用于具有改进的特性的动物饲料和草料中。
本发明组合物的甜菊醇糖苷可用作甜味化合物的产品的实例可以是酒精饮料,如伏特加酒、葡萄酒、啤酒、烈酒、清酒等;天然果汁、提神饮料、碳酸软饮料、减肥饮料、零卡路里饮料、低卡路里饮料和食物、酸奶饮料、速溶果汁、速溶咖啡、粉末型速溶饮料、罐装产品、糖浆、发酵大豆酱、酱油、醋、调味品、蛋黄酱、番茄酱、咖喱、汤、速食肉汤、酱油粉、醋粉、多种类型的饼干、香米饼、咸饼干、面包、巧克力、焦糖、糖果、口香糖、果冻、布丁、蜜饯和腌菜、鲜奶油、果酱、橘子酱、糖花膏、奶粉、冰淇淋、冰糕、包装在瓶中的蔬菜和水果、罐装和煮熟的豆类、在甜味酱中煮熟的肉和食物、农业蔬菜食品、海鲜、火腿、香肠、鱼火腿、鱼香肠、鱼酱、油炸鱼制品、干制海产品、冷冻食品、腌渍海带、腊肉、烟草、医药产品等。原则上它可具有无限应用。
甜味组合物包含饮料,其非限制性实例包括非碳酸化和碳酸饮料,如可乐、姜汁汽水、根汁汽水、苹果汁、水果味软饮料(例如柑橘味软饮料,如柠檬莱姆或橙汁)、软饮料粉等;来自水果或蔬菜的果汁、包括榨汁等的果汁、含有果粒的果汁、水果饮料、果汁饮料、含果汁的饮料、具有水果调味料的饮料、蔬菜汁、含蔬菜的汁以及含水果和蔬菜的混合果汁;运动饮料、能量饮料、接近水的饮料等(例如具有天然或合成调味剂的水);茶类或喜欢型饮料如咖啡、可可、红茶、绿茶、乌龙茶等;含乳成分饮料如乳饮料、含乳成分咖啡、牛奶咖啡、奶茶、果奶饮料、饮用酸奶、乳酸菌饮料等;以及乳制品。
通常,甜味组合物中存在的甜味剂的量取决于甜味组合物的具体类型及其所需的甜度而广泛变化。本领域的普通技术人员可容易确定加入到甜味组合物中的甜味剂的适当量。
本发明的甜菊醇糖苷可以干或液体形式使用。它可在食品热处理之前或之后加入。甜味剂的量取决于使用目的。它可单独添加或与其它化合物组合添加。
在食品、饮料、药物、化妆品、桌面产品、口香糖的制造过程中,可使用诸如混合、捏合、溶解、酸洗、渗透、渗滤、喷洒、雾化、灌注和其它方法的常规方法。
因此,本发明的组合物可通过本领域的技术人员已知的提供成分的均匀或均质混合物的任何方法来制备。这些方法包括干混、喷雾干燥、团聚、湿法制粒、压实、共结晶等。
呈固体形式时,本发明的甜菊醇糖苷可以适于递送到待甜化的食物中的任何形式提供给消费者,所述形式包括小袋、小包、散装袋或盒、方块、片剂、喷雾或可溶解的条。所述组合物可以单位剂量或散装形式递送。
对于液体甜味剂体系和组合物而言,应开发方便范围的流体、半流体、糊状和膏状形式、使用任何形状或形式的适当包装材料的适当包装,其便于携带或分配或储存或运输含有任何上述甜味剂产品或上述产品的组合的任何组合。
所述组合物可包含多种填充剂、功能成分、着色剂、调味剂。
标准遗传技术,诸如在宿主细胞中的酶的过表达、宿主细胞的遗传修饰或杂交技术是本领域中的已知方法,诸如在Sambrook和Russel(2001)的“分子克隆:实验室手册(第3版)”,冷泉港实验室,冷泉港实验室出版社,或F.Ausubel等人编辑,“最新分子生物学实验方法汇编”,Green Publishing and Wiley Interscience,纽约(1987)中所述的。根据例如EP-A-0635574、WO 98/46772、WO 99/60102和WO 00/37671、WO90/14423、EP-A-0481008、EP-A-0635574和US 6265186,已知用于真菌宿主细胞的转化、遗传修饰等的方法。
本发明的一些实施方案:
1.一种具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少三个糖部分,且其中甜菊醇糖苷包括至少七个糖部分,其全部均直接或间接地通过β键联接到甜菊醇糖苷配基。
2.一种具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少四个糖部分且在位置R2上存在至少三个糖部分。
3.一种具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少三个糖部分,其中甜菊醇糖苷包括至少七个糖部分,且其中存在于位置R1的糖中的至少一个通过α键被联接到甜菊醇糖苷配基或糖分子。
4.一种具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少四个糖部分,其中存在于位置R2上的糖部分中的至少四个为葡萄糖部分。
5.一种具有式(II)的甜菊醇糖苷
6.一种具有式(III)的甜菊醇糖苷
7.一种具有式(IV)的甜菊醇糖苷
8.根据前述实施方案中任一项所述的甜菊醇糖苷,其为发酵制备的。
9.发酵制备的具有式(I)的甜菊醇糖苷,
其中在位置R1上存在至少三个糖部分且在位置R2上存在至少三个糖部分,且其中甜菊醇糖苷包括至少七个糖部分。
10.根据实施方案9所述的甜菊醇糖苷,其具有根据实施方案1至7中的任一项的结构。
11.一种用于制备根据前述实施方案中任一项所述的甜菊醇糖苷的方法,所述方法包括:
提供重组酵母细胞,其包括编码多肽的重组核酸序列,所述多肽包括由下列编码的氨基酸序列:SEQ ID NO:61、SEQ ID NO:65、SEQ ID NO:23、SEQ ID NO:33、SEQ ID NO:59、SEQ ID NO:71、SEQ ID NO:87、SEQ ID NO:73和SEQ ID NO:75;
在合适的发酵培养基中发酵重组酵母细胞;以及,可选地,
回收根据前述实施方案中任一项所述的甜菊醇糖苷。
12.一种组合物,其包括根据实施方案1至11中的任一项所述的甜菊醇糖苷以及一种或多种不同的甜菊醇糖苷。
13.一种食品、饲料或饮料,其包括根据实施方案1至10中的任一项所述的甜菊醇糖苷或根据实施方案12所述的组合物。
14.根据实施方案1至10中的任一项所述的甜菊醇糖苷或根据实施方案12所述的组合物在甜味剂组合物或风味组合物中的用途。
15.根据实施方案1至10中的任一项所述的甜菊醇糖苷或根据实施方案12所述的组合物在食品、饲料或饮料中的用途。
本文中对作为现有技术而给出的专利文件或其他事项的参考不应被视为承认该文件或事项是已知的或其含有的信息是在权利要求中的任一项的优先权日的公知常识的一部分。
本文阐明的每个参考文献的公开内容均通过引用整体并入本文。
本发明还通过下列实施例进一步进行了说明。
实施例
实施例1:STV016的构建
构建酿酒酵母菌株STV016以用于甜菊醇糖苷的发酵制备。
1.1 ERG20、BTS1和tHMG在酿酒酵母中的过表达
对于ERG20、BTS1 tHMG1的过度表达而言,使用W02013/076280中描述的技术将表达盒设计为整合在一个基因座中。为了扩增整合基因座的5'和3'整合侧翼,使用了来自CEN.PK酵母菌株(van Dijken等人.Enzyme and Microbial Technology 26(2000)706-714)的合适的引物和基因组DNA。不同的基因在DNA2.0作为盒(含有同源序列、启动子、基因、终止子、同源序列)订购。这些盒中的基因侧接组成型启动子和终止子。参见表1。将来自DNA2.0的含有ERG20、tHMG1和BTS1盒的质粒DNA溶解至100ng/μl的浓度。在50μl PCR混合物中,20ng模板与20pmol的引物一起使用。将材料溶解至0.5μg/μl的浓度。
表1过表达构建体的组成
为了扩增选择标记,使用了pUG7-EcoRV构建体(图1)和合适的引物。使用Zymoclean凝胶DNA回收试剂盒(ZymoResearch)从凝胶中纯化KanMX片段。将酵母菌株Cen.PK113-3C用表2中所列的片段转化。
表2用于ERG20、tHMG1和BTS1转化的DNA片段
片段 |
5’YPRcTau3 |
ERG20盒 |
tHMG1盒 |
KanMX盒 |
BTS1盒 |
3’YPRcTau3 |
在30℃下在YEPhD(酵母提取物植物蛋白胨葡萄糖;来自BD的BBL植物蛋白胨)中转化和恢复2.5小时后,将细胞与200μg/ml G418(Sigma)一起接种在YEPhD琼脂上。将板在30℃下孵育4天。通过诊断PCR和测序确定正确的整合。用蛋白质上LC/MS证实过度表达。图2中示出了ERG20、tHMG1和BTS1的组装示意图。此菌株被命名为STV002。
此菌株中CRE-重组酶的表达导致KanMX标记的外重组。用诊断PCR确定ERG20、tHMG和BTS1的存在和正确外重组。
1.2 Erg9的敲低
为了降低Erg9的表达,设计并使用了Erg9敲低构建体,所述构建体含有修饰的3'端,其继续进入驱动TRP1表达的TRP1启动子。
将含有Erg9-KD片段的构建体转化到到大肠杆菌TOP10细胞中。将转化体在2PY(2次植物蛋白胨酵母提取物)、sAMP培养基中生长。将质粒DNA用QIAprep旋转小量制备试剂盒(Qiagen)分离并用SalI-HF(New England Biolabs)消化。为了浓缩,将DNA用乙醇沉淀。将所述片段转化到酿酒酵母中,并将菌落接种在无色氨酸的无机培养基(Verduyn等人,1992.Yeast 8:501-517)琼脂板上。通过诊断PCR和测序证实Erg9-KD构建体的正确整合。进行的Erg9-KD构建体的转化的示意图在图3中示出。所述菌株被命名为STV003。
1.3 UGT2_1a的过表达
对于UGT2_1a的过度表达,使用如共同待决专利申请号W02013/076280和WO2013/144257中所描述的技术。将UGT2a在DNA2.0作为盒(含有同源序列、启动子、基因、终止子、同源序列)订购。关于细节,参见表3。为了获得含有标记和Cre-重组酶的片段,使用如共同待决专利申请号WO2013/135728中所描述的技术。使用赋予对诺尔丝菌素的抗性的NAT标记用于选择。
表3过表达构建体的组成
合适的引物用于扩增。为了扩增整合基因座的5'和3'整合侧翼,使用了来自CEN.PK酵母菌株的合适的引物和基因组DNA。
用表4中列出的片段转化酿酒酵母酵母菌株STV003,并将转化混合物接种在含有50μg/ml诺尔丝菌素(来自Jena Bioscience的Lexy NTC)的YEPhD琼脂板上。
表4用于UGT2a转化的DNA片段
CRE重组酶的表达通过半乳糖的存在活化。为了诱导CRE重组酶的表达,将转化体在YEPh半乳糖培养基上重新划线。这导致位于lox位点之间的标记的外重组。通过诊断PCR证实了UGT2a的正确整合和NAT标记的外重组。所得菌株被命名为STV004。进行的UGT2a构建体的转化的示意图在图4中示出。
1.4至RebA的制备路径:CPS、KS、KO、KAH、CPR、UGT1、UGT3和UGT4的过表达
引起RebA制备的所有路径基因被设计成整合在STV004菌株背景中的一个基因座中。为了扩增用于整合基因座(位点3)的5’和3’整合侧翼,使用了合适的引物和源于CEN.PK酵母菌株的基因组DNA。将不同的基因在DNA2.0定制为盒(包含同源序列、启动子、基因、终止子、同源序列)(参见表5以了解概况)。将源于DNA2.0的DNA溶解至100ng/μl。将该储备溶液进一步稀释至5ng/μl,其中的1μl用于50μl-PCR的混合物中。该反应含有25pmol的各引物。在扩增后,用NucleoSpin 96PCR清除试剂盒(Macherey-Nagel)纯化DNA,或替代地使用乙醇沉淀来浓缩DNA。
表5用于CPS、KS、KO、KAH、CPR、UGT1、UGT3和UGT4的过表达构建体的组成
启动子 | 开放阅读框 | 终止子 |
Kl prom 12.pro(SEQ ID NO:205) | CPS(SEQ ID NO:61) | Sc Adh2.ter(SEQ ID NO:213) |
Sc Pgk1.pro(SEQ ID NO:204) | KS(SEQ ID NO:65) | Sc Tal1.ter(SEQ ID NO:215) |
Sc Eno2.pro(SEQ ID NO:201) | KO(SEQ ID NO:23) | Sc Tpi1.ter(SEQ ID NO:216) |
Ag lox_Tef1.pro(SEQ ID NO:206) | KANMX(SEQ ID NO:211) | Ag Tef1_lox.ter(SEQ ID NO:217) |
Sc Tef1.pro(SEQ ID NO:203) | KAH(SEQ ID NO:33) | Sc Gpm1.ter(SEQ ID NO:214) |
Kl prom 6.pro(SEQ ID NO:207) | CPR(SEQ ID NO:77) | Sc Pdc1.ter(SEQ ID NO:218) |
Sc Pma1.pro(SEQ ID NO:208) | UGT1(SEQ ID NO:71) | Sc Tdh1.ter(SEQ ID NO:219) |
Sc Vps68.pro(SEQ ID NO:209) | UGT3(SEQ ID NO:73) | Sc Adh1.ter(SEQ ID NO:212) |
Sc Oye2.pro(SEQ ID NO:210) | UGT4(SEQ ID NO:75) | Sc Eno1.ter(SEQ ID NO:220) |
将到RebA的途径的所有片段(标记和侧翼)(参见表6总述)转化到酿酒酵母菌株STV004中。在20℃下在YEPhD中过夜恢复后,将转化混合物接种在含有200μg/ml G418的YEPhD琼脂上。将这些在30℃下孵育3天。
表6.用于CPS、KS、KO、KanMX、KAH、CPR、UGT1、UGT3和UGT4的转化的DNA片段
片段 |
5’INT1 |
CPS盒 |
KS盒 |
KO盒 |
KanMX盒 |
KAH盒 |
CPR盒 |
UGT1盒 |
UGT3盒 |
UGT4盒 |
3’INT1 |
通过诊断PCR和序列分析(3500基因分析仪,Applied Biosystems)证实了正确的整合。序列反应用BigDye终止子v3.1循环测序试剂盒(Life Technologies)进行。每个反应(10μl)均含有50ng模板和3.2pmol引物。将产物通过乙醇/EDTA沉淀纯化,溶解在10μl HiDi甲酰胺中并施加到装置上。所述菌株被命名为STV016。从GGPP到RebA的途径如何整合到基因组中的示意图在图5中示出。表7列出了本实施例1中使用的菌株。
表7菌株表
1.5STV016的发酵
如上所述构建的酿酒酵母菌株STV016在摇瓶(2升,具有200ml的培养基)中于30℃和220rpm下培养32小时。该培养基是基于Verduyn等人(Verduyn C、Postma E、ScheffersWA、Van Dijken JP.酵母,1992年7月;8(7):501-517),其改变了碳和氮源,如在表8中所示。
表8预培养基组成
原料 | 式 | 浓度(g/kg) |
半乳糖 | C<sub>6</sub>H<sub>12</sub>O<sub>6</sub> | 20.0 |
尿素 | (NH<sub>2</sub>)<sub>2</sub>CO | 2.3 |
磷酸二氢钾 | KH<sub>2</sub>PO<sub>4</sub> | 3.0 |
硫酸镁 | MgSO<sub>4</sub>.7H<sub>2</sub>O | 0.5 |
微量元素溶液 | 1 | |
维生素溶液 | 1 |
a微量元素溶液
b维生素溶液
组分 | 式 | 浓度(g/kg) |
生物素(D-) | C<sub>10</sub>H<sub>16</sub>N<sub>2</sub>O<sub>3</sub>S | 0.05 |
泛酸钙D(+) | C<sub>18</sub>H<sub>32</sub>CaN<sub>2</sub>O<sub>10</sub> | 1.00 |
烟酸 | C<sub>6</sub>H<sub>5</sub>NO<sub>2</sub> | 1.00 |
肌醇 | C<sub>6</sub>H<sub>12</sub>O<sub>6</sub> | 25.00 |
盐酸氯化硫胺素 | C<sub>12</sub>H<sub>18</sub>Cl<sub>2</sub>N<sub>4</sub>OS.xH<sub>2</sub>O | 1.00 |
盐酸吡哆醇 | C<sub>8</sub>H<sub>12</sub>ClNO<sub>3</sub> | 1.00 |
对氨基苯甲酸 | C<sub>7</sub>H<sub>7</sub>NO<sub>2</sub> | 0.20 |
随后,将摇瓶中200ml内容物转移至发酵罐(起始体积5L)中,其含有在表9中显示的培养基。
表9.组合发酵培养基
原料 | 最终浓度(g/kg) | |
一水葡萄糖 | C<sub>6</sub>H<sub>12</sub>O<sub>6</sub>.1H<sub>2</sub>O | 4.4 |
硫酸铵 | (NH<sub>4</sub>)<sub>2</sub>SO<sub>4</sub> | 1 |
磷酸二氢钾 | KH<sub>2</sub>PO<sub>4</sub> | 10 |
硫酸镁 | MgSO<sub>4</sub>.7H<sub>2</sub>O | 5 |
微量元素溶液 | - | 8 |
维生素溶液 | - | 8 |
通过添加氨(25wt%)将pH控制在5.0。将温度控制在27℃。通过调整搅拌器速度来将pO2控制在40%。葡萄糖浓度通过受限于至发酵罐的受控进料保持,如在表10中所显示的。
表10发酵进料培养基的组成
原料 | 式 | 最终浓度(g/kg) |
一水葡萄糖 | C<sub>6</sub>H<sub>12</sub>O<sub>6</sub>.1H<sub>2</sub>O | 550 |
磷酸二氢钾 | KH<sub>2</sub>PO<sub>4</sub> | 15.1 |
七水硫酸镁 | MgSO<sub>4</sub>.7H<sub>2</sub>O | 7.5 |
Verduyn微量元素溶液 | 12 | |
Verduyn维生素溶液 | 12 |
实施例2:使用LC-MS观察7.1、7.2和7.3
在水/乙醇混合物(菌株STV016)中结晶莱鲍迪甙A之后在母液中用下述的LC-MS系统来观察含有七个葡萄糖分子的甜菊醇糖苷(其还被称为7.1、7.2和7.3)。在纯化之前,通过蒸发来浓缩样品。
在被耦合至配备有以负离子模式操作的电喷雾电离源的XEVO-TQ质谱仪(Waters)的Acquity UPLC(Waters)上分析7.1、7.2和7.3,这是在MRM模式下在用于所研究的所有甜菊醇糖苷的去质子化分子上进行的,在这些m/z 1451.5中,表示含有七个葡萄糖分子的甜菊醇糖苷的去质子化分子。
使用具有作为移动相的(A)在LC-MS级水中的50mM乙酸铵和(B)LC-MS级乙腈的梯度洗脱,以2.1×100mm 1.8μm粒度的AcquityT3柱实现了色谱分离。4分钟梯度从30%B开始,在0.5分钟内线性增加到35%B且在35%B保持0.8分钟,随后在0.7分钟内线性增加到95%B并在该处保持0.5分钟,然后用30%B进行1.5分钟的再平衡。使用5μl的注射体积将流速保持在0.6ml/min,并将柱温度设置成50℃。为m/z 1451.5观察的各个化合物7.1、7.2和7.3在0.59、0.71和0.74分钟的保留时间上进行洗脱。
对于7.1、7.2和7.3的元素组成的分析而言,用配备有以负离子模式操作的电喷雾电离源的LTQ-轨道阱傅里叶变换质谱仪(Thermo Electron)执行HRMS(高分辨率质谱)分析,这从m/z 300-2000进行扫描。使用Acella LC系统(Thermo Fisher)实现了色谱分离,该系统具有与上述的相同的柱和梯度系统。
使用该色谱系统,各个化合物在0.84、1.20和1.30分钟的保持时间上进行洗脱,其分别如在图6a中所示,且7.1、7.2和7.3被分别在m/z 1451.5786、1451.5793和1451.5793处进行表征,这与1451.5820的理论m/z值具有良好的一致性(分别为-1.8和-2.3ppm)。这些组分的相应化学式为C62H100O38,其用于不带电物质。
实施例3:使用制备型LC-UV进行的7.1、7.2和7.3的纯化
从含有最少量的感兴趣的化合物的酵母菌属培养液(菌株STV016)的乙醇提取物执行7.1、7.2和7.3的纯化。使用反相色谱(Waters Atlantis T3,30*150mm,5μm)进行制备分离,用LC-MS级水和乙腈作为洗脱液来进行梯度洗脱。使用40ml/min的流速和300μl的注射体积。
执行大约100次的注射,并且通过在210nm处的UV检测来触发感兴趣的化合物。在LC-MS和NMR分析之前,将7.1、7.2和7.3的所有级分合并且冷冻干燥。
使用LC-MS的在制备纯化后进行的用于质量确认和纯度测定的7.1、7.2和7.3的
LC-MS
在被耦合至配备有以负离子模式操作的电喷雾电离源的XEVO-TQ质谱仪(Waters)的Acquity UPLC(Waters)上分析7.1、7.2和7.3的纯度,这是在MRM模式下在用于所研究的所有甜菊醇糖苷的去质子化分子上进行的,在这些m/z 1451.5中,表示含有七个葡萄糖分子的甜菊醇糖苷的去质子化分子。在0.59分钟的保留时间处进行洗脱的7.1可被估计为超过80%纯,而在0.71和0.74分钟的保留时间处进行洗脱的7.2和7.3可被估计为超过90%纯且7.3仍含有约5%的7.2,如在图6b中所示。
用配备有以负离子模式操作的电喷雾电离源的LTQ-轨道阱傅里叶变换质谱仪(Thermo Electron)执行HRMS(高分辨率质谱)分析,检查各个化合物的元素组成且发现其与用于不带电物质的化学式C62H100O38相对应的理论质量具有良好的一致性。
实施例4:莱鲍迪甙7.1的分析
将如在实施例3中所述获得的1.1mg的级分7.1溶解于1.3ml的CDCl3/吡啶-d5 1/3(w/w)和2滴DCOOD中。
一系列具有小混合时间增量的COZY和TOCSY 2D NMR谱为所有三种莱鲍迪甙以及对映贝壳杉烷二萜核心提供(七个糖单位的)每个旋转系统的几乎所有质子的分配。HSQC实验允许分配相应的C-H耦合。
基于其在HMBC中与对映贝壳杉烷二萜核心的质子的长程相关性来识别glcI和glcII的端基异构H。
在相应的ROESY谱中观察到的H2I-H1V和H3I-H1VI以及H2II-H1III和H3II-H1IIII的长程相关性允许进行glcI和glcII的取代位点的分配。还通过在HMBC实验中glcIII至glcVI的端基异构质子与glcI和glcII的13C原子,即H1III-C2II、H1IIII-C3II、H1V-C2I和H1VI-C3I的长程相关性来证实该分配。糖glcIII、glcIV、glcV和glcVI的位置与在莱鲍迪甙M的结构中的相同(图10)。
端基异构H1VII(5.86ppm对4.5至4.6ppm)和小耦合常数(3.8Hz对7.8Hz)的低场位移指示第七个糖残基具有α构型。
莱鲍迪甙7.1中第七个糖的位置可以根据在ROESY实验中的H1VII和C3III的长程HMBC耦合,H1VII-H3III的长程质子耦合以及C3III的低场位移(与约78-79ppm的未取代C3原子相比,其为83.8ppm)来进行识别。在图7中描绘了莱鲍迪甙7.1的结构。用于莱鲍迪甙7.1的所有1H和13C NMR化学位移均列于表11中。为了进行比较,莱鲍迪甙M的数据也包括在内。
表11在320K记录的在CDCl3/吡啶1/3和3滴DCOOD中的莱鲍迪甙7.1以及在300K,
δTMS=0记录的在CDCl3/吡啶1/1和3滴DCOOD中的莱鲍迪甙M的1H和13C NMR化学位移
实施例5:莱鲍迪甙7.2的分析
将2.5mg的样品溶解于1ml的CDCl3/吡啶-d5 1/1(w/w)和2滴DCOOD中。
一系列具有小混合时间增量的COSY和TOCSY 2D NMR为所有三种莱鲍迪甙以及对映贝壳杉烷二萜核心提供(七个糖单位的)每个旋转系统的几乎所有质子的分配。HSQC实验允许分配相应的C-H耦合。
基于其在HMBC中与对映贝壳杉烷二萜核心的质子的长程相关性来识别glcI和glcII的端基异构H。
糖glcIII、glcIV、glcV和glcVI的位置与在莱鲍迪甙M的结构中的相同,且在专门用于莱鲍迪甙7.1的结构的分配的部分中更详细地描述了该分配。
莱鲍迪甙7.2中第七个糖的位置可以根据在ROESY实验中的H6IV和C1VII的长程HMBC耦合,H1VII-H6IV的长程质子耦合以及C6IV的低场位移(与62-64ppm的剩余的C6原子相比,其为69.4ppm)来进行识别。第七个糖经β-糖苷键被附接至GlcIV。在图8中描绘了莱鲍迪甙7.2的结构。莱鲍迪甙7.2的所有1H和13C NMR化学位移均列于表12中。为了进行比较,莱鲍迪甙M的数据也包括在内。
表12在300K,δTMS=0记录的在CDCl3/吡啶1/1和2滴DCOOD中的莱鲍迪甙7.2以及在
CDCl3/吡啶1/1和3滴DCOOD中的莱鲍迪甙M的1H和13C NMR化学位移
实施例6:莱鲍迪甙7.3的分析
将2.3mg的样品溶解于1ml的CDCl3/吡啶-d5 1/2(w/w)和3滴DCOOD中。
一系列具有小混合时间增量的COSY和TOCSY 2D NMR为所有三种莱鲍迪甙以及对映贝壳杉烷二萜核心提供(七个糖单位的)每个旋转系统的几乎所有质子的分配。HSQC实验允许分配相应的C-H耦合。
基于其在HMBC中与对映贝壳杉烷二萜核心的质子的长程相关性来识别glcI和glcII的端基异构H。
糖glcIII、glcIV、glcV和glcVI的位置与在莱鲍迪甙M的结构中的相同,且在专门用于莱鲍迪甙7.1的结构的分配的部分中更详细地描述了该分配。
莱鲍迪甙7.3中第七个糖的位置可以根据在ROESY实验中的H1VII和C6VI的长程HMBC耦合,H1VII-H6VI的长程质子耦合以及C6VI的低场位移(与61-63ppm的剩余的C6原子相比,其为69.5ppm)来进行识别。第七个糖经β-糖苷键被附接至GlcVI。在图9中描绘了莱鲍迪甙7.3的结构。莱鲍迪甙7.3的所有1H和13C NMR化学位移均列于表13中。为了进行比较,莱鲍迪甙M的数据也包括在内。
表13在300K,δTMS=0记录的在CDCl3/吡啶1/2和3滴DCOOD中的莱鲍迪甙7.3以及在
CDCl3/吡啶1/1和3滴DCOOD中的莱鲍迪甙M的1H和13C NMR化学位移
总之,如在表14中所显示的,确定了三种新的莱鲍迪甙。
表14新的莱鲍迪甙的概况
一般物料和方法(NMR分析)
为莱鲍迪甙样品中的每一个优化溶剂混合物以获得端基异构质子的信号的最佳可能分辨率。样品的量和溶剂量对于峰的分辨率来说是至关重要的,这是因为峰,特别是端基异构的峰的位移是依赖于浓度和pH的(图12)。
在300K记录莱鲍迪甙7.2和7.3的谱,而在莱鲍迪甙7.1的情况下,则必须使用更高的温度。在300K时,在莱鲍迪甙7.1谱中的共振相当得宽,这指示溶解性差或构象过程缓慢。因此,在320K的样品温度下实现了所有信号的最终分配。
对于每个样品而言,进行了各种2D NMR实验:在Bruker Avance III600和700MHz谱仪上于320K下记录COSY、TOCSY(具有40、50、60、70、80、90和100ms的混合时间)、HSQC、HMBC和ROESY(225、400ms的混合时间)的谱。在样品部分中指定了用于每个样品的详细分配。
在实施例4、5和6中,分别在图11a和图11b中显示出甜菊醇和葡萄糖的原子编号。
表15:序列表的描述
截去变灰的标识符且因此为所提及的UniProt标识符的片段。
序列表
<110> 帝斯曼知识产权资产管理有限公司
<120> 甜菊醇糖苷
<130> 30983-WO-PCT
<150> US62/142631
<151> 2015-04-03
<160> 223
<170> PatentIn version 3.5
<210> 1
<211> 2397
<212> DNA
<213> Lactuca sativa
<220>
<221> CDS
<222> (1)..(2397)
<400> 1
atg aaa acc atg att tct tct cca atc cca gct ttc cac cca aga ttt 48
Met Lys Thr Met Ile Ser Ser Pro Ile Pro Ala Phe His Pro Arg Phe
1 5 10 15
tct cca gct gct ggt tcc aga aga tta tct cca atc ttg cca tct tcc 96
Ser Pro Ala Ala Gly Ser Arg Arg Leu Ser Pro Ile Leu Pro Ser Ser
20 25 30
ggt tct gtt gtc ttg act ggt tcc aag act caa tgt aag gcc gtt tct 144
Gly Ser Val Val Leu Thr Gly Ser Lys Thr Gln Cys Lys Ala Val Ser
35 40 45
aaa tct cca act caa gaa tac ttt gat gtt ttg caa aag aac ggt ttg 192
Lys Ser Pro Thr Gln Glu Tyr Phe Asp Val Leu Gln Lys Asn Gly Leu
50 55 60
cca ttc atc aac tgg caa aac gat gtc gtt gaa gat gaa ttg gac aag 240
Pro Phe Ile Asn Trp Gln Asn Asp Val Val Glu Asp Glu Leu Asp Lys
65 70 75 80
gaa aag aag atc ttg tac cca aac gac gaa atc aag ggt ttc gtt gaa 288
Glu Lys Lys Ile Leu Tyr Pro Asn Asp Glu Ile Lys Gly Phe Val Glu
85 90 95
aga atc aaa gtt atg tta ggt tcc atg gac gaa ggt gaa atc act gtc 336
Arg Ile Lys Val Met Leu Gly Ser Met Asp Glu Gly Glu Ile Thr Val
100 105 110
tct gct tat gac acc gct tgg gtt gct ttg gtc caa gat atc gat ggt 384
Ser Ala Tyr Asp Thr Ala Trp Val Ala Leu Val Gln Asp Ile Asp Gly
115 120 125
aac ggt aga cca gaa ttt cct tct tct cta gaa tgg atc gtc aag aac 432
Asn Gly Arg Pro Glu Phe Pro Ser Ser Leu Glu Trp Ile Val Lys Asn
130 135 140
caa tta tcc gat ggt tcc tgg ggt gac cat ttg atc ttc tct gct cac 480
Gln Leu Ser Asp Gly Ser Trp Gly Asp His Leu Ile Phe Ser Ala His
145 150 155 160
gac aga atc att aac act ttg gct tgt gtc att gct ttg act tcc tgg 528
Asp Arg Ile Ile Asn Thr Leu Ala Cys Val Ile Ala Leu Thr Ser Trp
165 170 175
aac gtt cat cca ggt aag tgt caa aag ggt tta aag ttc ttg aac gac 576
Asn Val His Pro Gly Lys Cys Gln Lys Gly Leu Lys Phe Leu Asn Asp
180 185 190
aac atc tcc aag ttg gaa gaa gaa aac cca gaa cac atg cca atc ggt 624
Asn Ile Ser Lys Leu Glu Glu Glu Asn Pro Glu His Met Pro Ile Gly
195 200 205
ttt gaa gtt gct ttc cca tct ttg att gat att gcc aga aag ttg gat 672
Phe Glu Val Ala Phe Pro Ser Leu Ile Asp Ile Ala Arg Lys Leu Asp
210 215 220
atc caa gtt cca gaa gat tcc cca gct ttg aag gaa att tac gcc aga 720
Ile Gln Val Pro Glu Asp Ser Pro Ala Leu Lys Glu Ile Tyr Ala Arg
225 230 235 240
aga aac ttg aaa ttg acc aag att cca aag tct ttg atg cac aag gtt 768
Arg Asn Leu Lys Leu Thr Lys Ile Pro Lys Ser Leu Met His Lys Val
245 250 255
cca act act ttg ttg cat tct ttg gaa ggt atg cca gac ttg gaa tgg 816
Pro Thr Thr Leu Leu His Ser Leu Glu Gly Met Pro Asp Leu Glu Trp
260 265 270
gaa aag ttg ttg aag cta caa tgt aag gat ggt tct ttc ttg ttc tct 864
Glu Lys Leu Leu Lys Leu Gln Cys Lys Asp Gly Ser Phe Leu Phe Ser
275 280 285
cca tct tcc act gct ttc gct ttg atg caa acc aag gac caa aag tgt 912
Pro Ser Ser Thr Ala Phe Ala Leu Met Gln Thr Lys Asp Gln Lys Cys
290 295 300
ttg caa tac ttg act gac gct gtc acc aag ttc aac ggt ggt gtt cca 960
Leu Gln Tyr Leu Thr Asp Ala Val Thr Lys Phe Asn Gly Gly Val Pro
305 310 315 320
aac gtc tac cca gtc gac ttg ttc gaa cac atc tgg gtt gtt gac aga 1008
Asn Val Tyr Pro Val Asp Leu Phe Glu His Ile Trp Val Val Asp Arg
325 330 335
ttg caa aga tta ggt atc tcc aga tac ttc gac tct gaa atc aaa gac 1056
Leu Gln Arg Leu Gly Ile Ser Arg Tyr Phe Asp Ser Glu Ile Lys Asp
340 345 350
tgt gtc gac tac atc tac aga tac tgg acc aag gat ggt atc tgt tgg 1104
Cys Val Asp Tyr Ile Tyr Arg Tyr Trp Thr Lys Asp Gly Ile Cys Trp
355 360 365
gct aag aac tcc aac gtt caa gat att gat gac acc gct atg ggt ttc 1152
Ala Lys Asn Ser Asn Val Gln Asp Ile Asp Asp Thr Ala Met Gly Phe
370 375 380
aga gtt ttg aga atg cac ggt tac aag gtc act act gat gtt ttc cgt 1200
Arg Val Leu Arg Met His Gly Tyr Lys Val Thr Thr Asp Val Phe Arg
385 390 395 400
caa ttc gaa aag gac ggt aaa ttc gtt tgt ttc cca ggt caa acc acc 1248
Gln Phe Glu Lys Asp Gly Lys Phe Val Cys Phe Pro Gly Gln Thr Thr
405 410 415
caa gct gtc acc ggt atg ttc aac ttg ttt aga gct tct caa gtc ttg 1296
Gln Ala Val Thr Gly Met Phe Asn Leu Phe Arg Ala Ser Gln Val Leu
420 425 430
ttc cca gac gaa aag att ttg gaa gat gct aag aaa ttc tct tac aac 1344
Phe Pro Asp Glu Lys Ile Leu Glu Asp Ala Lys Lys Phe Ser Tyr Asn
435 440 445
tac ttg aag gaa aag caa tcc acc aac gaa tta ttg gac aaa tgg atc 1392
Tyr Leu Lys Glu Lys Gln Ser Thr Asn Glu Leu Leu Asp Lys Trp Ile
450 455 460
att gcc aag gac ttg cca ggt gaa gtt gaa tac gct tta gat gtt cca 1440
Ile Ala Lys Asp Leu Pro Gly Glu Val Glu Tyr Ala Leu Asp Val Pro
465 470 475 480
tgg tac gct tcc ttg cca cgt ttg gaa acc aga ttc tac ttg gaa caa 1488
Trp Tyr Ala Ser Leu Pro Arg Leu Glu Thr Arg Phe Tyr Leu Glu Gln
485 490 495
tac ggt ggt gaa gat gac gtc tgg atc ggt aag act ttg tac aga atg 1536
Tyr Gly Gly Glu Asp Asp Val Trp Ile Gly Lys Thr Leu Tyr Arg Met
500 505 510
ggt aat gtc tcc aac aac acc tac ttg gaa atg gcc aaa ttg gac tac 1584
Gly Asn Val Ser Asn Asn Thr Tyr Leu Glu Met Ala Lys Leu Asp Tyr
515 520 525
aac aac tgt ttg gcc att cat cac ttg gaa tgg aac acc atg caa caa 1632
Asn Asn Cys Leu Ala Ile His His Leu Glu Trp Asn Thr Met Gln Gln
530 535 540
tgg tac gtt gac ttc ggt atg gaa aga ttc ggt act tct gat atc act 1680
Trp Tyr Val Asp Phe Gly Met Glu Arg Phe Gly Thr Ser Asp Ile Thr
545 550 555 560
tct cta tta gtc tct tac tac ttg gct gct gcc tcc gtc ttt gaa cct 1728
Ser Leu Leu Val Ser Tyr Tyr Leu Ala Ala Ala Ser Val Phe Glu Pro
565 570 575
gaa cgt tct aag gaa aga att gct tgg gct aag acc acc act ttg gtt 1776
Glu Arg Ser Lys Glu Arg Ile Ala Trp Ala Lys Thr Thr Thr Leu Val
580 585 590
gac acc atc tct tct ttc ttc cac tcc ttg aag atc tct aac gaa cac 1824
Asp Thr Ile Ser Ser Phe Phe His Ser Leu Lys Ile Ser Asn Glu His
595 600 605
cgt cgt gaa ttc gtt gaa gaa ttc aga aac atc tcc aac tcc atc cac 1872
Arg Arg Glu Phe Val Glu Glu Phe Arg Asn Ile Ser Asn Ser Ile His
610 615 620
cac gct aag tac ggt aag cca tgg cac ggt ttg atg gtt gct ttg aaa 1920
His Ala Lys Tyr Gly Lys Pro Trp His Gly Leu Met Val Ala Leu Lys
625 630 635 640
ggt act tta cac gaa att gcc ttg gat gtc ttg atg act cac aga aga 1968
Gly Thr Leu His Glu Ile Ala Leu Asp Val Leu Met Thr His Arg Arg
645 650 655
gac att cac cct caa ttg cac cat gct tgg gaa atg tgg ttg atg aga 2016
Asp Ile His Pro Gln Leu His His Ala Trp Glu Met Trp Leu Met Arg
660 665 670
tgg caa caa ggt gtc gat gcc act gaa ggt caa gct gaa tta att gtt 2064
Trp Gln Gln Gly Val Asp Ala Thr Glu Gly Gln Ala Glu Leu Ile Val
675 680 685
caa acc atc aac atg act gct ggt aga tgg gtt tcc aat gaa tta ttg 2112
Gln Thr Ile Asn Met Thr Ala Gly Arg Trp Val Ser Asn Glu Leu Leu
690 695 700
gct cac cca caa tac aga tta ttg tcc tct gtc atc aac aat atc tgt 2160
Ala His Pro Gln Tyr Arg Leu Leu Ser Ser Val Ile Asn Asn Ile Cys
705 710 715 720
cac gaa atc tac cac aac aga acc tgt atg gaa gtc aac tct acc acc 2208
His Glu Ile Tyr His Asn Arg Thr Cys Met Glu Val Asn Ser Thr Thr
725 730 735
att tcc act tct att gac tcc aag atg caa gaa ttg gtt caa tta gtc 2256
Ile Ser Thr Ser Ile Asp Ser Lys Met Gln Glu Leu Val Gln Leu Val
740 745 750
ttg tct gac tct ttg gac gac ttg gac caa gat ttg aag caa acc ttc 2304
Leu Ser Asp Ser Leu Asp Asp Leu Asp Gln Asp Leu Lys Gln Thr Phe
755 760 765
ttg act gtt gcc aag act ttc tac tac aag gct tac tgt gac cca gaa 2352
Leu Thr Val Ala Lys Thr Phe Tyr Tyr Lys Ala Tyr Cys Asp Pro Glu
770 775 780
acc atc aac gtt cac att tct aag gtc atg ttc gaa acc att att 2397
Thr Ile Asn Val His Ile Ser Lys Val Met Phe Glu Thr Ile Ile
785 790 795
<210> 2
<211> 799
<212> PRT
<213> Lactuca sativa
<400> 2
Met Lys Thr Met Ile Ser Ser Pro Ile Pro Ala Phe His Pro Arg Phe
1 5 10 15
Ser Pro Ala Ala Gly Ser Arg Arg Leu Ser Pro Ile Leu Pro Ser Ser
20 25 30
Gly Ser Val Val Leu Thr Gly Ser Lys Thr Gln Cys Lys Ala Val Ser
35 40 45
Lys Ser Pro Thr Gln Glu Tyr Phe Asp Val Leu Gln Lys Asn Gly Leu
50 55 60
Pro Phe Ile Asn Trp Gln Asn Asp Val Val Glu Asp Glu Leu Asp Lys
65 70 75 80
Glu Lys Lys Ile Leu Tyr Pro Asn Asp Glu Ile Lys Gly Phe Val Glu
85 90 95
Arg Ile Lys Val Met Leu Gly Ser Met Asp Glu Gly Glu Ile Thr Val
100 105 110
Ser Ala Tyr Asp Thr Ala Trp Val Ala Leu Val Gln Asp Ile Asp Gly
115 120 125
Asn Gly Arg Pro Glu Phe Pro Ser Ser Leu Glu Trp Ile Val Lys Asn
130 135 140
Gln Leu Ser Asp Gly Ser Trp Gly Asp His Leu Ile Phe Ser Ala His
145 150 155 160
Asp Arg Ile Ile Asn Thr Leu Ala Cys Val Ile Ala Leu Thr Ser Trp
165 170 175
Asn Val His Pro Gly Lys Cys Gln Lys Gly Leu Lys Phe Leu Asn Asp
180 185 190
Asn Ile Ser Lys Leu Glu Glu Glu Asn Pro Glu His Met Pro Ile Gly
195 200 205
Phe Glu Val Ala Phe Pro Ser Leu Ile Asp Ile Ala Arg Lys Leu Asp
210 215 220
Ile Gln Val Pro Glu Asp Ser Pro Ala Leu Lys Glu Ile Tyr Ala Arg
225 230 235 240
Arg Asn Leu Lys Leu Thr Lys Ile Pro Lys Ser Leu Met His Lys Val
245 250 255
Pro Thr Thr Leu Leu His Ser Leu Glu Gly Met Pro Asp Leu Glu Trp
260 265 270
Glu Lys Leu Leu Lys Leu Gln Cys Lys Asp Gly Ser Phe Leu Phe Ser
275 280 285
Pro Ser Ser Thr Ala Phe Ala Leu Met Gln Thr Lys Asp Gln Lys Cys
290 295 300
Leu Gln Tyr Leu Thr Asp Ala Val Thr Lys Phe Asn Gly Gly Val Pro
305 310 315 320
Asn Val Tyr Pro Val Asp Leu Phe Glu His Ile Trp Val Val Asp Arg
325 330 335
Leu Gln Arg Leu Gly Ile Ser Arg Tyr Phe Asp Ser Glu Ile Lys Asp
340 345 350
Cys Val Asp Tyr Ile Tyr Arg Tyr Trp Thr Lys Asp Gly Ile Cys Trp
355 360 365
Ala Lys Asn Ser Asn Val Gln Asp Ile Asp Asp Thr Ala Met Gly Phe
370 375 380
Arg Val Leu Arg Met His Gly Tyr Lys Val Thr Thr Asp Val Phe Arg
385 390 395 400
Gln Phe Glu Lys Asp Gly Lys Phe Val Cys Phe Pro Gly Gln Thr Thr
405 410 415
Gln Ala Val Thr Gly Met Phe Asn Leu Phe Arg Ala Ser Gln Val Leu
420 425 430
Phe Pro Asp Glu Lys Ile Leu Glu Asp Ala Lys Lys Phe Ser Tyr Asn
435 440 445
Tyr Leu Lys Glu Lys Gln Ser Thr Asn Glu Leu Leu Asp Lys Trp Ile
450 455 460
Ile Ala Lys Asp Leu Pro Gly Glu Val Glu Tyr Ala Leu Asp Val Pro
465 470 475 480
Trp Tyr Ala Ser Leu Pro Arg Leu Glu Thr Arg Phe Tyr Leu Glu Gln
485 490 495
Tyr Gly Gly Glu Asp Asp Val Trp Ile Gly Lys Thr Leu Tyr Arg Met
500 505 510
Gly Asn Val Ser Asn Asn Thr Tyr Leu Glu Met Ala Lys Leu Asp Tyr
515 520 525
Asn Asn Cys Leu Ala Ile His His Leu Glu Trp Asn Thr Met Gln Gln
530 535 540
Trp Tyr Val Asp Phe Gly Met Glu Arg Phe Gly Thr Ser Asp Ile Thr
545 550 555 560
Ser Leu Leu Val Ser Tyr Tyr Leu Ala Ala Ala Ser Val Phe Glu Pro
565 570 575
Glu Arg Ser Lys Glu Arg Ile Ala Trp Ala Lys Thr Thr Thr Leu Val
580 585 590
Asp Thr Ile Ser Ser Phe Phe His Ser Leu Lys Ile Ser Asn Glu His
595 600 605
Arg Arg Glu Phe Val Glu Glu Phe Arg Asn Ile Ser Asn Ser Ile His
610 615 620
His Ala Lys Tyr Gly Lys Pro Trp His Gly Leu Met Val Ala Leu Lys
625 630 635 640
Gly Thr Leu His Glu Ile Ala Leu Asp Val Leu Met Thr His Arg Arg
645 650 655
Asp Ile His Pro Gln Leu His His Ala Trp Glu Met Trp Leu Met Arg
660 665 670
Trp Gln Gln Gly Val Asp Ala Thr Glu Gly Gln Ala Glu Leu Ile Val
675 680 685
Gln Thr Ile Asn Met Thr Ala Gly Arg Trp Val Ser Asn Glu Leu Leu
690 695 700
Ala His Pro Gln Tyr Arg Leu Leu Ser Ser Val Ile Asn Asn Ile Cys
705 710 715 720
His Glu Ile Tyr His Asn Arg Thr Cys Met Glu Val Asn Ser Thr Thr
725 730 735
Ile Ser Thr Ser Ile Asp Ser Lys Met Gln Glu Leu Val Gln Leu Val
740 745 750
Leu Ser Asp Ser Leu Asp Asp Leu Asp Gln Asp Leu Lys Gln Thr Phe
755 760 765
Leu Thr Val Ala Lys Thr Phe Tyr Tyr Lys Ala Tyr Cys Asp Pro Glu
770 775 780
Thr Ile Asn Val His Ile Ser Lys Val Met Phe Glu Thr Ile Ile
785 790 795
<210> 3
<211> 2271
<212> DNA
<213> Lactuca sativa
<220>
<221> CDS
<222> (1)..(2271)
<400> 3
atg tgt aag gcc gtt tct aaa tct cca act caa gaa tac ttt gat gtt 48
Met Cys Lys Ala Val Ser Lys Ser Pro Thr Gln Glu Tyr Phe Asp Val
1 5 10 15
ttg caa aag aac ggt ttg cca ttc atc aac tgg caa aac gat gtc gtt 96
Leu Gln Lys Asn Gly Leu Pro Phe Ile Asn Trp Gln Asn Asp Val Val
20 25 30
gaa gat gaa ttg gac aag gaa aag aag atc ttg tac cca aac gac gaa 144
Glu Asp Glu Leu Asp Lys Glu Lys Lys Ile Leu Tyr Pro Asn Asp Glu
35 40 45
atc aag ggt ttc gtt gaa aga atc aaa gtt atg tta ggt tcc atg gac 192
Ile Lys Gly Phe Val Glu Arg Ile Lys Val Met Leu Gly Ser Met Asp
50 55 60
gaa ggt gaa atc act gtc tct gct tat gac acc gct tgg gtt gct ttg 240
Glu Gly Glu Ile Thr Val Ser Ala Tyr Asp Thr Ala Trp Val Ala Leu
65 70 75 80
gtc caa gat atc gat ggt aac ggt aga cca gaa ttt cct tct tct cta 288
Val Gln Asp Ile Asp Gly Asn Gly Arg Pro Glu Phe Pro Ser Ser Leu
85 90 95
gaa tgg atc gtc aag aac caa tta tcc gat ggt tcc tgg ggt gac cat 336
Glu Trp Ile Val Lys Asn Gln Leu Ser Asp Gly Ser Trp Gly Asp His
100 105 110
ttg atc ttc tct gct cac gac aga atc att aac act ttg gct tgt gtc 384
Leu Ile Phe Ser Ala His Asp Arg Ile Ile Asn Thr Leu Ala Cys Val
115 120 125
att gct ttg act tcc tgg aac gtt cat cca ggt aag tgt caa aag ggt 432
Ile Ala Leu Thr Ser Trp Asn Val His Pro Gly Lys Cys Gln Lys Gly
130 135 140
tta aag ttc ttg aac gac aac atc tcc aag ttg gaa gaa gaa aac cca 480
Leu Lys Phe Leu Asn Asp Asn Ile Ser Lys Leu Glu Glu Glu Asn Pro
145 150 155 160
gaa cac atg cca atc ggt ttt gaa gtt gct ttc cca tct ttg att gat 528
Glu His Met Pro Ile Gly Phe Glu Val Ala Phe Pro Ser Leu Ile Asp
165 170 175
att gcc aga aag ttg gat atc caa gtt cca gaa gat tcc cca gct ttg 576
Ile Ala Arg Lys Leu Asp Ile Gln Val Pro Glu Asp Ser Pro Ala Leu
180 185 190
aag gaa att tac gcc aga aga aac ttg aaa ttg acc aag att cca aag 624
Lys Glu Ile Tyr Ala Arg Arg Asn Leu Lys Leu Thr Lys Ile Pro Lys
195 200 205
tct ttg atg cac aag gtt cca act act ttg ttg cat tct ttg gaa ggt 672
Ser Leu Met His Lys Val Pro Thr Thr Leu Leu His Ser Leu Glu Gly
210 215 220
atg cca gac ttg gaa tgg gaa aag ttg ttg aag cta caa tgt aag gat 720
Met Pro Asp Leu Glu Trp Glu Lys Leu Leu Lys Leu Gln Cys Lys Asp
225 230 235 240
ggt tct ttc ttg ttc tct cca tct tcc act gct ttc gct ttg atg caa 768
Gly Ser Phe Leu Phe Ser Pro Ser Ser Thr Ala Phe Ala Leu Met Gln
245 250 255
acc aag gac caa aag tgt ttg caa tac ttg act gac gct gtc acc aag 816
Thr Lys Asp Gln Lys Cys Leu Gln Tyr Leu Thr Asp Ala Val Thr Lys
260 265 270
ttc aac ggt ggt gtt cca aac gtc tac cca gtc gac ttg ttc gaa cac 864
Phe Asn Gly Gly Val Pro Asn Val Tyr Pro Val Asp Leu Phe Glu His
275 280 285
atc tgg gtt gtt gac aga ttg caa aga tta ggt atc tcc aga tac ttc 912
Ile Trp Val Val Asp Arg Leu Gln Arg Leu Gly Ile Ser Arg Tyr Phe
290 295 300
gac tct gaa atc aaa gac tgt gtc gac tac atc tac aga tac tgg acc 960
Asp Ser Glu Ile Lys Asp Cys Val Asp Tyr Ile Tyr Arg Tyr Trp Thr
305 310 315 320
aag gat ggt atc tgt tgg gct aag aac tcc aac gtt caa gat att gat 1008
Lys Asp Gly Ile Cys Trp Ala Lys Asn Ser Asn Val Gln Asp Ile Asp
325 330 335
gac acc gct atg ggt ttc aga gtt ttg aga atg cac ggt tac aag gtc 1056
Asp Thr Ala Met Gly Phe Arg Val Leu Arg Met His Gly Tyr Lys Val
340 345 350
act act gat gtt ttc cgt caa ttc gaa aag gac ggt aaa ttc gtt tgt 1104
Thr Thr Asp Val Phe Arg Gln Phe Glu Lys Asp Gly Lys Phe Val Cys
355 360 365
ttc cca ggt caa acc acc caa gct gtc acc ggt atg ttc aac ttg ttt 1152
Phe Pro Gly Gln Thr Thr Gln Ala Val Thr Gly Met Phe Asn Leu Phe
370 375 380
aga gct tct caa gtc ttg ttc cca gac gaa aag att ttg gaa gat gct 1200
Arg Ala Ser Gln Val Leu Phe Pro Asp Glu Lys Ile Leu Glu Asp Ala
385 390 395 400
aag aaa ttc tct tac aac tac ttg aag gaa aag caa tcc acc aac gaa 1248
Lys Lys Phe Ser Tyr Asn Tyr Leu Lys Glu Lys Gln Ser Thr Asn Glu
405 410 415
tta ttg gac aaa tgg atc att gcc aag gac ttg cca ggt gaa gtt gaa 1296
Leu Leu Asp Lys Trp Ile Ile Ala Lys Asp Leu Pro Gly Glu Val Glu
420 425 430
tac gct tta gat gtt cca tgg tac gct tcc ttg cca cgt ttg gaa acc 1344
Tyr Ala Leu Asp Val Pro Trp Tyr Ala Ser Leu Pro Arg Leu Glu Thr
435 440 445
aga ttc tac ttg gaa caa tac ggt ggt gaa gat gac gtc tgg atc ggt 1392
Arg Phe Tyr Leu Glu Gln Tyr Gly Gly Glu Asp Asp Val Trp Ile Gly
450 455 460
aag act ttg tac aga atg ggt aat gtc tcc aac aac acc tac ttg gaa 1440
Lys Thr Leu Tyr Arg Met Gly Asn Val Ser Asn Asn Thr Tyr Leu Glu
465 470 475 480
atg gcc aaa ttg gac tac aac aac tgt ttg gcc att cat cac ttg gaa 1488
Met Ala Lys Leu Asp Tyr Asn Asn Cys Leu Ala Ile His His Leu Glu
485 490 495
tgg aac acc atg caa caa tgg tac gtt gac ttc ggt atg gaa aga ttc 1536
Trp Asn Thr Met Gln Gln Trp Tyr Val Asp Phe Gly Met Glu Arg Phe
500 505 510
ggt act tct gat atc act tct cta tta gtc tct tac tac ttg gct gct 1584
Gly Thr Ser Asp Ile Thr Ser Leu Leu Val Ser Tyr Tyr Leu Ala Ala
515 520 525
gcc tcc gtc ttt gaa cct gaa cgt tct aag gaa aga att gct tgg gct 1632
Ala Ser Val Phe Glu Pro Glu Arg Ser Lys Glu Arg Ile Ala Trp Ala
530 535 540
aag acc acc act ttg gtt gac acc atc tct tct ttc ttc cac tcc ttg 1680
Lys Thr Thr Thr Leu Val Asp Thr Ile Ser Ser Phe Phe His Ser Leu
545 550 555 560
aag atc tct aac gaa cac cgt cgt gaa ttc gtt gaa gaa ttc aga aac 1728
Lys Ile Ser Asn Glu His Arg Arg Glu Phe Val Glu Glu Phe Arg Asn
565 570 575
atc tcc aac tcc atc cac cac gct aag tac ggt aag cca tgg cac ggt 1776
Ile Ser Asn Ser Ile His His Ala Lys Tyr Gly Lys Pro Trp His Gly
580 585 590
ttg atg gtt gct ttg aaa ggt act tta cac gaa att gcc ttg gat gtc 1824
Leu Met Val Ala Leu Lys Gly Thr Leu His Glu Ile Ala Leu Asp Val
595 600 605
ttg atg act cac aga aga gac att cac cct caa ttg cac cat gct tgg 1872
Leu Met Thr His Arg Arg Asp Ile His Pro Gln Leu His His Ala Trp
610 615 620
gaa atg tgg ttg atg aga tgg caa caa ggt gtc gat gcc act gaa ggt 1920
Glu Met Trp Leu Met Arg Trp Gln Gln Gly Val Asp Ala Thr Glu Gly
625 630 635 640
caa gct gaa tta att gtt caa acc atc aac atg act gct ggt aga tgg 1968
Gln Ala Glu Leu Ile Val Gln Thr Ile Asn Met Thr Ala Gly Arg Trp
645 650 655
gtt tcc aat gaa tta ttg gct cac cca caa tac aga tta ttg tcc tct 2016
Val Ser Asn Glu Leu Leu Ala His Pro Gln Tyr Arg Leu Leu Ser Ser
660 665 670
gtc atc aac aat atc tgt cac gaa atc tac cac aac aga acc tgt atg 2064
Val Ile Asn Asn Ile Cys His Glu Ile Tyr His Asn Arg Thr Cys Met
675 680 685
gaa gtc aac tct acc acc att tcc act tct att gac tcc aag atg caa 2112
Glu Val Asn Ser Thr Thr Ile Ser Thr Ser Ile Asp Ser Lys Met Gln
690 695 700
gaa ttg gtt caa tta gtc ttg tct gac tct ttg gac gac ttg gac caa 2160
Glu Leu Val Gln Leu Val Leu Ser Asp Ser Leu Asp Asp Leu Asp Gln
705 710 715 720
gat ttg aag caa acc ttc ttg act gtt gcc aag act ttc tac tac aag 2208
Asp Leu Lys Gln Thr Phe Leu Thr Val Ala Lys Thr Phe Tyr Tyr Lys
725 730 735
gct tac tgt gac cca gaa acc atc aac gtt cac att tct aag gtc atg 2256
Ala Tyr Cys Asp Pro Glu Thr Ile Asn Val His Ile Ser Lys Val Met
740 745 750
ttc gaa acc att att 2271
Phe Glu Thr Ile Ile
755
<210> 4
<211> 757
<212> PRT
<213> Lactuca sativa
<400> 4
Met Cys Lys Ala Val Ser Lys Ser Pro Thr Gln Glu Tyr Phe Asp Val
1 5 10 15
Leu Gln Lys Asn Gly Leu Pro Phe Ile Asn Trp Gln Asn Asp Val Val
20 25 30
Glu Asp Glu Leu Asp Lys Glu Lys Lys Ile Leu Tyr Pro Asn Asp Glu
35 40 45
Ile Lys Gly Phe Val Glu Arg Ile Lys Val Met Leu Gly Ser Met Asp
50 55 60
Glu Gly Glu Ile Thr Val Ser Ala Tyr Asp Thr Ala Trp Val Ala Leu
65 70 75 80
Val Gln Asp Ile Asp Gly Asn Gly Arg Pro Glu Phe Pro Ser Ser Leu
85 90 95
Glu Trp Ile Val Lys Asn Gln Leu Ser Asp Gly Ser Trp Gly Asp His
100 105 110
Leu Ile Phe Ser Ala His Asp Arg Ile Ile Asn Thr Leu Ala Cys Val
115 120 125
Ile Ala Leu Thr Ser Trp Asn Val His Pro Gly Lys Cys Gln Lys Gly
130 135 140
Leu Lys Phe Leu Asn Asp Asn Ile Ser Lys Leu Glu Glu Glu Asn Pro
145 150 155 160
Glu His Met Pro Ile Gly Phe Glu Val Ala Phe Pro Ser Leu Ile Asp
165 170 175
Ile Ala Arg Lys Leu Asp Ile Gln Val Pro Glu Asp Ser Pro Ala Leu
180 185 190
Lys Glu Ile Tyr Ala Arg Arg Asn Leu Lys Leu Thr Lys Ile Pro Lys
195 200 205
Ser Leu Met His Lys Val Pro Thr Thr Leu Leu His Ser Leu Glu Gly
210 215 220
Met Pro Asp Leu Glu Trp Glu Lys Leu Leu Lys Leu Gln Cys Lys Asp
225 230 235 240
Gly Ser Phe Leu Phe Ser Pro Ser Ser Thr Ala Phe Ala Leu Met Gln
245 250 255
Thr Lys Asp Gln Lys Cys Leu Gln Tyr Leu Thr Asp Ala Val Thr Lys
260 265 270
Phe Asn Gly Gly Val Pro Asn Val Tyr Pro Val Asp Leu Phe Glu His
275 280 285
Ile Trp Val Val Asp Arg Leu Gln Arg Leu Gly Ile Ser Arg Tyr Phe
290 295 300
Asp Ser Glu Ile Lys Asp Cys Val Asp Tyr Ile Tyr Arg Tyr Trp Thr
305 310 315 320
Lys Asp Gly Ile Cys Trp Ala Lys Asn Ser Asn Val Gln Asp Ile Asp
325 330 335
Asp Thr Ala Met Gly Phe Arg Val Leu Arg Met His Gly Tyr Lys Val
340 345 350
Thr Thr Asp Val Phe Arg Gln Phe Glu Lys Asp Gly Lys Phe Val Cys
355 360 365
Phe Pro Gly Gln Thr Thr Gln Ala Val Thr Gly Met Phe Asn Leu Phe
370 375 380
Arg Ala Ser Gln Val Leu Phe Pro Asp Glu Lys Ile Leu Glu Asp Ala
385 390 395 400
Lys Lys Phe Ser Tyr Asn Tyr Leu Lys Glu Lys Gln Ser Thr Asn Glu
405 410 415
Leu Leu Asp Lys Trp Ile Ile Ala Lys Asp Leu Pro Gly Glu Val Glu
420 425 430
Tyr Ala Leu Asp Val Pro Trp Tyr Ala Ser Leu Pro Arg Leu Glu Thr
435 440 445
Arg Phe Tyr Leu Glu Gln Tyr Gly Gly Glu Asp Asp Val Trp Ile Gly
450 455 460
Lys Thr Leu Tyr Arg Met Gly Asn Val Ser Asn Asn Thr Tyr Leu Glu
465 470 475 480
Met Ala Lys Leu Asp Tyr Asn Asn Cys Leu Ala Ile His His Leu Glu
485 490 495
Trp Asn Thr Met Gln Gln Trp Tyr Val Asp Phe Gly Met Glu Arg Phe
500 505 510
Gly Thr Ser Asp Ile Thr Ser Leu Leu Val Ser Tyr Tyr Leu Ala Ala
515 520 525
Ala Ser Val Phe Glu Pro Glu Arg Ser Lys Glu Arg Ile Ala Trp Ala
530 535 540
Lys Thr Thr Thr Leu Val Asp Thr Ile Ser Ser Phe Phe His Ser Leu
545 550 555 560
Lys Ile Ser Asn Glu His Arg Arg Glu Phe Val Glu Glu Phe Arg Asn
565 570 575
Ile Ser Asn Ser Ile His His Ala Lys Tyr Gly Lys Pro Trp His Gly
580 585 590
Leu Met Val Ala Leu Lys Gly Thr Leu His Glu Ile Ala Leu Asp Val
595 600 605
Leu Met Thr His Arg Arg Asp Ile His Pro Gln Leu His His Ala Trp
610 615 620
Glu Met Trp Leu Met Arg Trp Gln Gln Gly Val Asp Ala Thr Glu Gly
625 630 635 640
Gln Ala Glu Leu Ile Val Gln Thr Ile Asn Met Thr Ala Gly Arg Trp
645 650 655
Val Ser Asn Glu Leu Leu Ala His Pro Gln Tyr Arg Leu Leu Ser Ser
660 665 670
Val Ile Asn Asn Ile Cys His Glu Ile Tyr His Asn Arg Thr Cys Met
675 680 685
Glu Val Asn Ser Thr Thr Ile Ser Thr Ser Ile Asp Ser Lys Met Gln
690 695 700
Glu Leu Val Gln Leu Val Leu Ser Asp Ser Leu Asp Asp Leu Asp Gln
705 710 715 720
Asp Leu Lys Gln Thr Phe Leu Thr Val Ala Lys Thr Phe Tyr Tyr Lys
725 730 735
Ala Tyr Cys Asp Pro Glu Thr Ile Asn Val His Ile Ser Lys Val Met
740 745 750
Phe Glu Thr Ile Ile
755
<210> 5
<211> 2283
<212> DNA
<213> Picea glauca
<220>
<221> CDS
<222> (1)..(2283)
<400> 5
atg aag atg tcc aag tct gtt gaa gtc caa cac tgt gct gtc caa ttc 48
Met Lys Met Ser Lys Ser Val Glu Val Gln His Cys Ala Val Gln Phe
1 5 10 15
ttg tct tcc acc acc gat caa atc gaa atc aga gaa aga aac ttg caa 96
Leu Ser Ser Thr Thr Asp Gln Ile Glu Ile Arg Glu Arg Asn Leu Gln
20 25 30
atc tcc act gaa gcc atg aag atg aaa tcc tgg att gaa acc gtc aaa 144
Ile Ser Thr Glu Ala Met Lys Met Lys Ser Trp Ile Glu Thr Val Lys
35 40 45
tac att ttg caa tcc atg gaa gat ggt gaa atc acc atc tcc gct tac 192
Tyr Ile Leu Gln Ser Met Glu Asp Gly Glu Ile Thr Ile Ser Ala Tyr
50 55 60
gac act gct tgg atc gcc ttg gtt cca gct ttg aac ggt tcc tct gaa 240
Asp Thr Ala Trp Ile Ala Leu Val Pro Ala Leu Asn Gly Ser Ser Glu
65 70 75 80
cct caa ttc cca tcc tct tta caa tgg ttg atc aac aac caa ttg caa 288
Pro Gln Phe Pro Ser Ser Leu Gln Trp Leu Ile Asn Asn Gln Leu Gln
85 90 95
gac ggt tcc tgg ggt gac cca ttg atg ttc ttg att aga gat cgt atc 336
Asp Gly Ser Trp Gly Asp Pro Leu Met Phe Leu Ile Arg Asp Arg Ile
100 105 110
atc aac act ttg gct tgt gtt ttg gct ttg aag acc tgg aac att cac 384
Ile Asn Thr Leu Ala Cys Val Leu Ala Leu Lys Thr Trp Asn Ile His
115 120 125
tct ttg ggt gtt aac aag ggt cta tct ttc ttg caa act tac atc cct 432
Ser Leu Gly Val Asn Lys Gly Leu Ser Phe Leu Gln Thr Tyr Ile Pro
130 135 140
aaa atg aac gat gaa cat gat gct cac act cca gtt ggt ttc gaa att 480
Lys Met Asn Asp Glu His Asp Ala His Thr Pro Val Gly Phe Glu Ile
145 150 155 160
gtc ttc cca gct ttg atg gaa gat gcc aag atc atg gaa ttg gac ttg 528
Val Phe Pro Ala Leu Met Glu Asp Ala Lys Ile Met Glu Leu Asp Leu
165 170 175
cca tac gac gct gaa ttc ttg caa aag att tac gat gaa aga gat ttg 576
Pro Tyr Asp Ala Glu Phe Leu Gln Lys Ile Tyr Asp Glu Arg Asp Leu
180 185 190
aag atg aag aga atc cca atg aag gtt ttg cac gaa ttc cca tct act 624
Lys Met Lys Arg Ile Pro Met Lys Val Leu His Glu Phe Pro Ser Thr
195 200 205
tta ttg cac tcc ttg gaa ggt ttg aga gac aag gtc aac tgg gaa gaa 672
Leu Leu His Ser Leu Glu Gly Leu Arg Asp Lys Val Asn Trp Glu Glu
210 215 220
tta ttg aag ttg caa tcc aag aac ggt tct ttc tta ttc tct cca gct 720
Leu Leu Lys Leu Gln Ser Lys Asn Gly Ser Phe Leu Phe Ser Pro Ala
225 230 235 240
tct act gct tgt gct ttg gct caa act tct gac acc aac tgt ttg aga 768
Ser Thr Ala Cys Ala Leu Ala Gln Thr Ser Asp Thr Asn Cys Leu Arg
245 250 255
tac ttg aat gaa atc acc aag aaa tac gac ggt ggt gct cca aat gtt 816
Tyr Leu Asn Glu Ile Thr Lys Lys Tyr Asp Gly Gly Ala Pro Asn Val
260 265 270
tac cca gtc gac ttg ttc gaa aga tta tgg acc gtc gat cgt att gaa 864
Tyr Pro Val Asp Leu Phe Glu Arg Leu Trp Thr Val Asp Arg Ile Glu
275 280 285
aga tta ggt att gct aga tac ttt gaa tct gaa atc act gac tct ttg 912
Arg Leu Gly Ile Ala Arg Tyr Phe Glu Ser Glu Ile Thr Asp Ser Leu
290 295 300
gaa tac gtt tac aga tac tgg act aac caa ggt atc ggt tgg gct aga 960
Glu Tyr Val Tyr Arg Tyr Trp Thr Asn Gln Gly Ile Gly Trp Ala Arg
305 310 315 320
gat tct cca gtt aag gac gtc gat gac act tcc atg gct ttc aga cta 1008
Asp Ser Pro Val Lys Asp Val Asp Asp Thr Ser Met Ala Phe Arg Leu
325 330 335
tta cgt tcc cac ggt ttc gac gtt act gct gaa gct ttc aac cac ttc 1056
Leu Arg Ser His Gly Phe Asp Val Thr Ala Glu Ala Phe Asn His Phe
340 345 350
aag caa gat gac caa ttc ttc tgt ttc ttt ggt caa acc aag caa acc 1104
Lys Gln Asp Asp Gln Phe Phe Cys Phe Phe Gly Gln Thr Lys Gln Thr
355 360 365
gtc act ggt atg tac aac ttg tac aga gct tct caa ttc tcc ttc cca 1152
Val Thr Gly Met Tyr Asn Leu Tyr Arg Ala Ser Gln Phe Ser Phe Pro
370 375 380
ggt gaa tct atc tta gaa gaa gct cgt gtc ttc acc aag aac ttc ttg 1200
Gly Glu Ser Ile Leu Glu Glu Ala Arg Val Phe Thr Lys Asn Phe Leu
385 390 395 400
gaa gaa aag aga gct gaa aag caa ttg aga gac aaa tgg atc att gct 1248
Glu Glu Lys Arg Ala Glu Lys Gln Leu Arg Asp Lys Trp Ile Ile Ala
405 410 415
aaa ggt ttg aag gaa gaa gtc gaa tac gct ttg aag ttc cca tgg tat 1296
Lys Gly Leu Lys Glu Glu Val Glu Tyr Ala Leu Lys Phe Pro Trp Tyr
420 425 430
gcc tcc caa cca aga att gac acc aga atg tac atc aac caa tac aga 1344
Ala Ser Gln Pro Arg Ile Asp Thr Arg Met Tyr Ile Asn Gln Tyr Arg
435 440 445
gtt gat gac gtc tgg atc ggt aag gct cta tac aga atg cca att gtc 1392
Val Asp Asp Val Trp Ile Gly Lys Ala Leu Tyr Arg Met Pro Ile Val
450 455 460
aac aac aag acc tac atc gaa ttg gcc aag gct gac ttc aac att tgt 1440
Asn Asn Lys Thr Tyr Ile Glu Leu Ala Lys Ala Asp Phe Asn Ile Cys
465 470 475 480
caa tct atc cac aga act gaa ttg cac ggt atc atc aga tgg tac aga 1488
Gln Ser Ile His Arg Thr Glu Leu His Gly Ile Ile Arg Trp Tyr Arg
485 490 495
gaa tcc ggt ttg gac gaa ttg ggt ttg aga caa gac caa atc gtc aaa 1536
Glu Ser Gly Leu Asp Glu Leu Gly Leu Arg Gln Asp Gln Ile Val Lys
500 505 510
tct tac ttc ttg gct gcc att gct atc tac gaa cca gac atg gcc tct 1584
Ser Tyr Phe Leu Ala Ala Ile Ala Ile Tyr Glu Pro Asp Met Ala Ser
515 520 525
gcc aga tta gct tgg gct aag tct gct gtc ttg atg gct gct atc aga 1632
Ala Arg Leu Ala Trp Ala Lys Ser Ala Val Leu Met Ala Ala Ile Arg
530 535 540
atc ttc ttt tcc ggt gaa aac tgt ttt gcc cac cat aga aga caa ttc 1680
Ile Phe Phe Ser Gly Glu Asn Cys Phe Ala His His Arg Arg Gln Phe
545 550 555 560
ttg gat gcc ttt acc aga tgg gac ggt aga gcc atg aga gat tct cca 1728
Leu Asp Ala Phe Thr Arg Trp Asp Gly Arg Ala Met Arg Asp Ser Pro
565 570 575
aac tcc gct aag aga tta ttc tct tgt tta ttc aga atg gtt aac ttg 1776
Asn Ser Ala Lys Arg Leu Phe Ser Cys Leu Phe Arg Met Val Asn Leu
580 585 590
ttc tct gtc gac ggt gtt gtt gct caa ggt aga gac atc tcc ggt gac 1824
Phe Ser Val Asp Gly Val Val Ala Gln Gly Arg Asp Ile Ser Gly Asp
595 600 605
ttg aga cac aga tgg gaa cac tgg ttg gcc tct gaa gct gaa gac ttg 1872
Leu Arg His Arg Trp Glu His Trp Leu Ala Ser Glu Ala Glu Asp Leu
610 615 620
acc gat gct caa gac cac gaa aag ttg ggt act gaa gct gaa att gtt 1920
Thr Asp Ala Gln Asp His Glu Lys Leu Gly Thr Glu Ala Glu Ile Val
625 630 635 640
gtt ttg act gct gct ttc ttg ggt aga gaa acc att tct cca gat ttg 1968
Val Leu Thr Ala Ala Phe Leu Gly Arg Glu Thr Ile Ser Pro Asp Leu
645 650 655
att tct cac cca gat ttc tct tct att atg aag gtt acc aac act gtt 2016
Ile Ser His Pro Asp Phe Ser Ser Ile Met Lys Val Thr Asn Thr Val
660 665 670
tgt tct tta ttg aga aga att gcc acc tac aag gaa gaa ggt tgt gac 2064
Cys Ser Leu Leu Arg Arg Ile Ala Thr Tyr Lys Glu Glu Gly Cys Asp
675 680 685
tcc cca tct ggt act gaa gaa gat gac aga ttg aaa cgt cgt gct gaa 2112
Ser Pro Ser Gly Thr Glu Glu Asp Asp Arg Leu Lys Arg Arg Ala Glu
690 695 700
gaa ggt atg ggt cat ttg gtt cgt gcc gtt tac aga cac caa tac tct 2160
Glu Gly Met Gly His Leu Val Arg Ala Val Tyr Arg His Gln Tyr Ser
705 710 715 720
cca gtt cca tct ggt gtc aag aga ttg tgt ttg gtt gtt ggt aag tcc 2208
Pro Val Pro Ser Gly Val Lys Arg Leu Cys Leu Val Val Gly Lys Ser
725 730 735
ttt tac tac gct gct cac tgt aac aat gaa gaa gtt ggt aac cat gtc 2256
Phe Tyr Tyr Ala Ala His Cys Asn Asn Glu Glu Val Gly Asn His Val
740 745 750
gaa acc gtc ttg ttc caa cca gta tac 2283
Glu Thr Val Leu Phe Gln Pro Val Tyr
755 760
<210> 6
<211> 761
<212> PRT
<213> Picea glauca
<400> 6
Met Lys Met Ser Lys Ser Val Glu Val Gln His Cys Ala Val Gln Phe
1 5 10 15
Leu Ser Ser Thr Thr Asp Gln Ile Glu Ile Arg Glu Arg Asn Leu Gln
20 25 30
Ile Ser Thr Glu Ala Met Lys Met Lys Ser Trp Ile Glu Thr Val Lys
35 40 45
Tyr Ile Leu Gln Ser Met Glu Asp Gly Glu Ile Thr Ile Ser Ala Tyr
50 55 60
Asp Thr Ala Trp Ile Ala Leu Val Pro Ala Leu Asn Gly Ser Ser Glu
65 70 75 80
Pro Gln Phe Pro Ser Ser Leu Gln Trp Leu Ile Asn Asn Gln Leu Gln
85 90 95
Asp Gly Ser Trp Gly Asp Pro Leu Met Phe Leu Ile Arg Asp Arg Ile
100 105 110
Ile Asn Thr Leu Ala Cys Val Leu Ala Leu Lys Thr Trp Asn Ile His
115 120 125
Ser Leu Gly Val Asn Lys Gly Leu Ser Phe Leu Gln Thr Tyr Ile Pro
130 135 140
Lys Met Asn Asp Glu His Asp Ala His Thr Pro Val Gly Phe Glu Ile
145 150 155 160
Val Phe Pro Ala Leu Met Glu Asp Ala Lys Ile Met Glu Leu Asp Leu
165 170 175
Pro Tyr Asp Ala Glu Phe Leu Gln Lys Ile Tyr Asp Glu Arg Asp Leu
180 185 190
Lys Met Lys Arg Ile Pro Met Lys Val Leu His Glu Phe Pro Ser Thr
195 200 205
Leu Leu His Ser Leu Glu Gly Leu Arg Asp Lys Val Asn Trp Glu Glu
210 215 220
Leu Leu Lys Leu Gln Ser Lys Asn Gly Ser Phe Leu Phe Ser Pro Ala
225 230 235 240
Ser Thr Ala Cys Ala Leu Ala Gln Thr Ser Asp Thr Asn Cys Leu Arg
245 250 255
Tyr Leu Asn Glu Ile Thr Lys Lys Tyr Asp Gly Gly Ala Pro Asn Val
260 265 270
Tyr Pro Val Asp Leu Phe Glu Arg Leu Trp Thr Val Asp Arg Ile Glu
275 280 285
Arg Leu Gly Ile Ala Arg Tyr Phe Glu Ser Glu Ile Thr Asp Ser Leu
290 295 300
Glu Tyr Val Tyr Arg Tyr Trp Thr Asn Gln Gly Ile Gly Trp Ala Arg
305 310 315 320
Asp Ser Pro Val Lys Asp Val Asp Asp Thr Ser Met Ala Phe Arg Leu
325 330 335
Leu Arg Ser His Gly Phe Asp Val Thr Ala Glu Ala Phe Asn His Phe
340 345 350
Lys Gln Asp Asp Gln Phe Phe Cys Phe Phe Gly Gln Thr Lys Gln Thr
355 360 365
Val Thr Gly Met Tyr Asn Leu Tyr Arg Ala Ser Gln Phe Ser Phe Pro
370 375 380
Gly Glu Ser Ile Leu Glu Glu Ala Arg Val Phe Thr Lys Asn Phe Leu
385 390 395 400
Glu Glu Lys Arg Ala Glu Lys Gln Leu Arg Asp Lys Trp Ile Ile Ala
405 410 415
Lys Gly Leu Lys Glu Glu Val Glu Tyr Ala Leu Lys Phe Pro Trp Tyr
420 425 430
Ala Ser Gln Pro Arg Ile Asp Thr Arg Met Tyr Ile Asn Gln Tyr Arg
435 440 445
Val Asp Asp Val Trp Ile Gly Lys Ala Leu Tyr Arg Met Pro Ile Val
450 455 460
Asn Asn Lys Thr Tyr Ile Glu Leu Ala Lys Ala Asp Phe Asn Ile Cys
465 470 475 480
Gln Ser Ile His Arg Thr Glu Leu His Gly Ile Ile Arg Trp Tyr Arg
485 490 495
Glu Ser Gly Leu Asp Glu Leu Gly Leu Arg Gln Asp Gln Ile Val Lys
500 505 510
Ser Tyr Phe Leu Ala Ala Ile Ala Ile Tyr Glu Pro Asp Met Ala Ser
515 520 525
Ala Arg Leu Ala Trp Ala Lys Ser Ala Val Leu Met Ala Ala Ile Arg
530 535 540
Ile Phe Phe Ser Gly Glu Asn Cys Phe Ala His His Arg Arg Gln Phe
545 550 555 560
Leu Asp Ala Phe Thr Arg Trp Asp Gly Arg Ala Met Arg Asp Ser Pro
565 570 575
Asn Ser Ala Lys Arg Leu Phe Ser Cys Leu Phe Arg Met Val Asn Leu
580 585 590
Phe Ser Val Asp Gly Val Val Ala Gln Gly Arg Asp Ile Ser Gly Asp
595 600 605
Leu Arg His Arg Trp Glu His Trp Leu Ala Ser Glu Ala Glu Asp Leu
610 615 620
Thr Asp Ala Gln Asp His Glu Lys Leu Gly Thr Glu Ala Glu Ile Val
625 630 635 640
Val Leu Thr Ala Ala Phe Leu Gly Arg Glu Thr Ile Ser Pro Asp Leu
645 650 655
Ile Ser His Pro Asp Phe Ser Ser Ile Met Lys Val Thr Asn Thr Val
660 665 670
Cys Ser Leu Leu Arg Arg Ile Ala Thr Tyr Lys Glu Glu Gly Cys Asp
675 680 685
Ser Pro Ser Gly Thr Glu Glu Asp Asp Arg Leu Lys Arg Arg Ala Glu
690 695 700
Glu Gly Met Gly His Leu Val Arg Ala Val Tyr Arg His Gln Tyr Ser
705 710 715 720
Pro Val Pro Ser Gly Val Lys Arg Leu Cys Leu Val Val Gly Lys Ser
725 730 735
Phe Tyr Tyr Ala Ala His Cys Asn Asn Glu Glu Val Gly Asn His Val
740 745 750
Glu Thr Val Leu Phe Gln Pro Val Tyr
755 760
<210> 7
<211> 1548
<212> DNA
<213> Bradyrhizobium japonicum
<220>
<221> CDS
<222> (1)..(1548)
<400> 7
atg aac gct ttg tct gaa cac atc tta tct gaa ttg aga aga tta ttg 48
Met Asn Ala Leu Ser Glu His Ile Leu Ser Glu Leu Arg Arg Leu Leu
1 5 10 15
tcc gaa atg tct gac ggt ggt tcc gtc ggt cct tcc gtt tac gac acc 96
Ser Glu Met Ser Asp Gly Gly Ser Val Gly Pro Ser Val Tyr Asp Thr
20 25 30
gcc caa gcc ttg aga ttc cac ggt aac gtc act ggt aga caa gat gct 144
Ala Gln Ala Leu Arg Phe His Gly Asn Val Thr Gly Arg Gln Asp Ala
35 40 45
tac gct tgg ttg att gct caa caa caa gct gac ggt ggt tgg ggt tct 192
Tyr Ala Trp Leu Ile Ala Gln Gln Gln Ala Asp Gly Gly Trp Gly Ser
50 55 60
gct gat ttc cca tta ttc aga cat gcc cca acc tgg gct gct cta ttg 240
Ala Asp Phe Pro Leu Phe Arg His Ala Pro Thr Trp Ala Ala Leu Leu
65 70 75 80
gct ttg caa aga gct gac cca ttg cca ggt gct gct gat gcc gtc caa 288
Ala Leu Gln Arg Ala Asp Pro Leu Pro Gly Ala Ala Asp Ala Val Gln
85 90 95
acc gcc acc aga ttc ttg caa aga caa cca gac cca tac gct cat gcc 336
Thr Ala Thr Arg Phe Leu Gln Arg Gln Pro Asp Pro Tyr Ala His Ala
100 105 110
gtt cca gaa gat gct cca atc ggt gct gaa ttg atc ttg cct caa ttc 384
Val Pro Glu Asp Ala Pro Ile Gly Ala Glu Leu Ile Leu Pro Gln Phe
115 120 125
tgt ggt gaa gct gct tct cta tta ggt ggt gtt gcc ttc cca aga cac 432
Cys Gly Glu Ala Ala Ser Leu Leu Gly Gly Val Ala Phe Pro Arg His
130 135 140
cca gct ttg ttg cca ttg aga caa gct tgt ttg gtc aaa ttg ggt gcc 480
Pro Ala Leu Leu Pro Leu Arg Gln Ala Cys Leu Val Lys Leu Gly Ala
145 150 155 160
gtt gcc atg ttg cca tct ggt cac cca ttg ttg cac tcc tgg gaa gcc 528
Val Ala Met Leu Pro Ser Gly His Pro Leu Leu His Ser Trp Glu Ala
165 170 175
tgg ggt act tct cca act act gct tgt cca gac gat gac ggt tcc atc 576
Trp Gly Thr Ser Pro Thr Thr Ala Cys Pro Asp Asp Asp Gly Ser Ile
180 185 190
ggt atc tct cca gct gct act gct gct tgg aga gcc caa gcc gtt acc 624
Gly Ile Ser Pro Ala Ala Thr Ala Ala Trp Arg Ala Gln Ala Val Thr
195 200 205
aga ggt tcc act cca caa gtc ggt aga gct gat gct tac ttg caa atg 672
Arg Gly Ser Thr Pro Gln Val Gly Arg Ala Asp Ala Tyr Leu Gln Met
210 215 220
gct tcc aga gcc acc aga tct ggt att gaa ggt gtt ttc cca aat gtc 720
Ala Ser Arg Ala Thr Arg Ser Gly Ile Glu Gly Val Phe Pro Asn Val
225 230 235 240
tgg cca atc aac gtc ttt gaa cca tgt tgg tct ttg tac act ttg cac 768
Trp Pro Ile Asn Val Phe Glu Pro Cys Trp Ser Leu Tyr Thr Leu His
245 250 255
ttg gct ggt ttg ttt gct cat cca gct ttg gct gaa gcc gtc aga gtc 816
Leu Ala Gly Leu Phe Ala His Pro Ala Leu Ala Glu Ala Val Arg Val
260 265 270
att gtt gct caa ttg gat gct aga tta ggt gtc cac ggt tta ggt cca 864
Ile Val Ala Gln Leu Asp Ala Arg Leu Gly Val His Gly Leu Gly Pro
275 280 285
gct ttg cac ttc gct gct gac gct gat gac acc gcc gtt gct cta tgt 912
Ala Leu His Phe Ala Ala Asp Ala Asp Asp Thr Ala Val Ala Leu Cys
290 295 300
gtt ttg cac ttg gct ggt cgt gac cca gct gtt gac gct ttg aga cac 960
Val Leu His Leu Ala Gly Arg Asp Pro Ala Val Asp Ala Leu Arg His
305 310 315 320
ttc gaa att ggt gaa ttg ttc gtc act ttc cca ggt gaa aga aac gct 1008
Phe Glu Ile Gly Glu Leu Phe Val Thr Phe Pro Gly Glu Arg Asn Ala
325 330 335
tct gtt tcc acc aac atc cac gct ttg cac gct ttg aga ttg ttg ggt 1056
Ser Val Ser Thr Asn Ile His Ala Leu His Ala Leu Arg Leu Leu Gly
340 345 350
aag cca gct gct ggt gct tct gct tac gtt gaa gcc aac aga aac cca 1104
Lys Pro Ala Ala Gly Ala Ser Ala Tyr Val Glu Ala Asn Arg Asn Pro
355 360 365
cac ggt tta tgg gac aac gaa aag tgg cac gtt tcc tgg tta tac cca 1152
His Gly Leu Trp Asp Asn Glu Lys Trp His Val Ser Trp Leu Tyr Pro
370 375 380
act gct cac gcc gtt gct gct ttg gcc caa ggt aag cct caa tgg aga 1200
Thr Ala His Ala Val Ala Ala Leu Ala Gln Gly Lys Pro Gln Trp Arg
385 390 395 400
gat gaa aga gct ttg gcc gct ttg ttg caa gct caa aga gat gac ggt 1248
Asp Glu Arg Ala Leu Ala Ala Leu Leu Gln Ala Gln Arg Asp Asp Gly
405 410 415
ggt tgg ggt gct ggt cgt ggt tct act ttc gaa gaa acc gcc tac gct 1296
Gly Trp Gly Ala Gly Arg Gly Ser Thr Phe Glu Glu Thr Ala Tyr Ala
420 425 430
ttg ttt gct ttg cat gtc atg gac ggt tct gaa gaa gct acc ggt aga 1344
Leu Phe Ala Leu His Val Met Asp Gly Ser Glu Glu Ala Thr Gly Arg
435 440 445
aga aga att gct caa gtt gtt gcc aga gct ttg gaa tgg atg ttg gct 1392
Arg Arg Ile Ala Gln Val Val Ala Arg Ala Leu Glu Trp Met Leu Ala
450 455 460
cgt cac gct gcc cac ggt ttg cca caa act cca tta tgg atc ggt aag 1440
Arg His Ala Ala His Gly Leu Pro Gln Thr Pro Leu Trp Ile Gly Lys
465 470 475 480
gaa ttg tac tgt cca acc aga gtt gtc aga gtt gct gaa ttg gct ggt 1488
Glu Leu Tyr Cys Pro Thr Arg Val Val Arg Val Ala Glu Leu Ala Gly
485 490 495
tta tgg ttg gct ttg aga tgg ggt cgt cgt gtc ttg gct gaa ggt gct 1536
Leu Trp Leu Ala Leu Arg Trp Gly Arg Arg Val Leu Ala Glu Gly Ala
500 505 510
ggt gct gca ccc 1548
Gly Ala Ala Pro
515
<210> 8
<211> 516
<212> PRT
<213> Bradyrhizobium japonicum
<400> 8
Met Asn Ala Leu Ser Glu His Ile Leu Ser Glu Leu Arg Arg Leu Leu
1 5 10 15
Ser Glu Met Ser Asp Gly Gly Ser Val Gly Pro Ser Val Tyr Asp Thr
20 25 30
Ala Gln Ala Leu Arg Phe His Gly Asn Val Thr Gly Arg Gln Asp Ala
35 40 45
Tyr Ala Trp Leu Ile Ala Gln Gln Gln Ala Asp Gly Gly Trp Gly Ser
50 55 60
Ala Asp Phe Pro Leu Phe Arg His Ala Pro Thr Trp Ala Ala Leu Leu
65 70 75 80
Ala Leu Gln Arg Ala Asp Pro Leu Pro Gly Ala Ala Asp Ala Val Gln
85 90 95
Thr Ala Thr Arg Phe Leu Gln Arg Gln Pro Asp Pro Tyr Ala His Ala
100 105 110
Val Pro Glu Asp Ala Pro Ile Gly Ala Glu Leu Ile Leu Pro Gln Phe
115 120 125
Cys Gly Glu Ala Ala Ser Leu Leu Gly Gly Val Ala Phe Pro Arg His
130 135 140
Pro Ala Leu Leu Pro Leu Arg Gln Ala Cys Leu Val Lys Leu Gly Ala
145 150 155 160
Val Ala Met Leu Pro Ser Gly His Pro Leu Leu His Ser Trp Glu Ala
165 170 175
Trp Gly Thr Ser Pro Thr Thr Ala Cys Pro Asp Asp Asp Gly Ser Ile
180 185 190
Gly Ile Ser Pro Ala Ala Thr Ala Ala Trp Arg Ala Gln Ala Val Thr
195 200 205
Arg Gly Ser Thr Pro Gln Val Gly Arg Ala Asp Ala Tyr Leu Gln Met
210 215 220
Ala Ser Arg Ala Thr Arg Ser Gly Ile Glu Gly Val Phe Pro Asn Val
225 230 235 240
Trp Pro Ile Asn Val Phe Glu Pro Cys Trp Ser Leu Tyr Thr Leu His
245 250 255
Leu Ala Gly Leu Phe Ala His Pro Ala Leu Ala Glu Ala Val Arg Val
260 265 270
Ile Val Ala Gln Leu Asp Ala Arg Leu Gly Val His Gly Leu Gly Pro
275 280 285
Ala Leu His Phe Ala Ala Asp Ala Asp Asp Thr Ala Val Ala Leu Cys
290 295 300
Val Leu His Leu Ala Gly Arg Asp Pro Ala Val Asp Ala Leu Arg His
305 310 315 320
Phe Glu Ile Gly Glu Leu Phe Val Thr Phe Pro Gly Glu Arg Asn Ala
325 330 335
Ser Val Ser Thr Asn Ile His Ala Leu His Ala Leu Arg Leu Leu Gly
340 345 350
Lys Pro Ala Ala Gly Ala Ser Ala Tyr Val Glu Ala Asn Arg Asn Pro
355 360 365
His Gly Leu Trp Asp Asn Glu Lys Trp His Val Ser Trp Leu Tyr Pro
370 375 380
Thr Ala His Ala Val Ala Ala Leu Ala Gln Gly Lys Pro Gln Trp Arg
385 390 395 400
Asp Glu Arg Ala Leu Ala Ala Leu Leu Gln Ala Gln Arg Asp Asp Gly
405 410 415
Gly Trp Gly Ala Gly Arg Gly Ser Thr Phe Glu Glu Thr Ala Tyr Ala
420 425 430
Leu Phe Ala Leu His Val Met Asp Gly Ser Glu Glu Ala Thr Gly Arg
435 440 445
Arg Arg Ile Ala Gln Val Val Ala Arg Ala Leu Glu Trp Met Leu Ala
450 455 460
Arg His Ala Ala His Gly Leu Pro Gln Thr Pro Leu Trp Ile Gly Lys
465 470 475 480
Glu Leu Tyr Cys Pro Thr Arg Val Val Arg Val Ala Glu Leu Ala Gly
485 490 495
Leu Trp Leu Ala Leu Arg Trp Gly Arg Arg Val Leu Ala Glu Gly Ala
500 505 510
Gly Ala Ala Pro
515
<210> 9
<211> 2364
<212> DNA
<213> Lactuca sativa
<220>
<221> CDS
<222> (1)..(2364)
<400> 9
atg aac att gct caa atc acc tct tct gct atg ttg gtt cca tct tct 48
Met Asn Ile Ala Gln Ile Thr Ser Ser Ala Met Leu Val Pro Ser Ser
1 5 10 15
cac atc cca cac cgt tcc tgg gtt gtc aac tgt tgt atg gtt caa tac 96
His Ile Pro His Arg Ser Trp Val Val Asn Cys Cys Met Val Gln Tyr
20 25 30
aac cca tcc ggt cta aga act gct tcc tct caa gcc ggt caa gtc aac 144
Asn Pro Ser Gly Leu Arg Thr Ala Ser Ser Gln Ala Gly Gln Val Asn
35 40 45
cca act gtt atg act ttg gat gtc act aag gaa aga atc aga aag ttg 192
Pro Thr Val Met Thr Leu Asp Val Thr Lys Glu Arg Ile Arg Lys Leu
50 55 60
ttc aac aac gtc gaa gtt tct gtt tcc tct tac gac acc gct tgg gtt 240
Phe Asn Asn Val Glu Val Ser Val Ser Ser Tyr Asp Thr Ala Trp Val
65 70 75 80
gcc atg gtt cca tct cca aac tct cca aag tcc cca tgt ttc cca gac 288
Ala Met Val Pro Ser Pro Asn Ser Pro Lys Ser Pro Cys Phe Pro Asp
85 90 95
tgt ttg aac tgg ttg ttg gac aac caa ttg gat gat ggt tcc tgg ggt 336
Cys Leu Asn Trp Leu Leu Asp Asn Gln Leu Asp Asp Gly Ser Trp Gly
100 105 110
ttg ttg cca cac caa tct cca ttg atc aaa gac act tta tct tct act 384
Leu Leu Pro His Gln Ser Pro Leu Ile Lys Asp Thr Leu Ser Ser Thr
115 120 125
ttg gct tgt gtt ttg gct tta aag aga tgg aac gtc ggt aag gat caa 432
Leu Ala Cys Val Leu Ala Leu Lys Arg Trp Asn Val Gly Lys Asp Gln
130 135 140
atc aac aag ggt tta cat tac atc gaa tcc aac ttt gct tcc gtc act 480
Ile Asn Lys Gly Leu His Tyr Ile Glu Ser Asn Phe Ala Ser Val Thr
145 150 155 160
gat aag aac caa gcc tct cca ttc ggt ttt gac atc att ttc cca ggt 528
Asp Lys Asn Gln Ala Ser Pro Phe Gly Phe Asp Ile Ile Phe Pro Gly
165 170 175
atg ttg gaa tac gcc aag gat ttg gac att aaa ttg cct ttg aac caa 576
Met Leu Glu Tyr Ala Lys Asp Leu Asp Ile Lys Leu Pro Leu Asn Gln
180 185 190
act cac ttg tcc gtc atg ttg cac gaa aga gaa ttg gaa ttg aga aga 624
Thr His Leu Ser Val Met Leu His Glu Arg Glu Leu Glu Leu Arg Arg
195 200 205
tgt cac tct aac ggt cgt gaa gcc tac tta gct tac att tcc gaa ggt 672
Cys His Ser Asn Gly Arg Glu Ala Tyr Leu Ala Tyr Ile Ser Glu Gly
210 215 220
ttg ggt aac ttg aac gac tgg aac atg gtt atg aag tac caa atg aag 720
Leu Gly Asn Leu Asn Asp Trp Asn Met Val Met Lys Tyr Gln Met Lys
225 230 235 240
aac ggt tct ttg ttc aac tct cct tct gct act gct tct gtt ttg att 768
Asn Gly Ser Leu Phe Asn Ser Pro Ser Ala Thr Ala Ser Val Leu Ile
245 250 255
cac cac caa aat gct ggt tgt ttg cac tac tta acc tcc cta ttg gac 816
His His Gln Asn Ala Gly Cys Leu His Tyr Leu Thr Ser Leu Leu Asp
260 265 270
aag ttt ggt aat gct gtc cca act gtt tac cca atc gac ttg tac gtt 864
Lys Phe Gly Asn Ala Val Pro Thr Val Tyr Pro Ile Asp Leu Tyr Val
275 280 285
aga ttg tcc atg gtt gac act ttg gaa aga ttg ggt atc aag aga cat 912
Arg Leu Ser Met Val Asp Thr Leu Glu Arg Leu Gly Ile Lys Arg His
290 295 300
ttc atg gtt gaa att caa aat gtc ttg gac gaa acc tac aga tgt tgg 960
Phe Met Val Glu Ile Gln Asn Val Leu Asp Glu Thr Tyr Arg Cys Trp
305 310 315 320
gtt caa ggt gat gtt caa atc ttc atg gat gtc gtt acc tgt gct ttg 1008
Val Gln Gly Asp Val Gln Ile Phe Met Asp Val Val Thr Cys Ala Leu
325 330 335
gct ttc aga gtt cta cgt tct aac ggt tac gaa gtc tct tct gac cca 1056
Ala Phe Arg Val Leu Arg Ser Asn Gly Tyr Glu Val Ser Ser Asp Pro
340 345 350
ttg gct aag att acc aaa gaa ggt gac tac atg aac tct cca gaa aag 1104
Leu Ala Lys Ile Thr Lys Glu Gly Asp Tyr Met Asn Ser Pro Glu Lys
355 360 365
cct ttc aag gac gtc tac acc tct ttg gaa gtc tac aag gct tct caa 1152
Pro Phe Lys Asp Val Tyr Thr Ser Leu Glu Val Tyr Lys Ala Ser Gln
370 375 380
atc atc tac caa gaa gaa ttg gcc ttc aga gaa caa aac ttg acc tct 1200
Ile Ile Tyr Gln Glu Glu Leu Ala Phe Arg Glu Gln Asn Leu Thr Ser
385 390 395 400
tat ttg cca tct tct aac aaa ttg tcc aac tac att ttg aag gaa gtt 1248
Tyr Leu Pro Ser Ser Asn Lys Leu Ser Asn Tyr Ile Leu Lys Glu Val
405 410 415
gat gac gct ttg aag ttc cca ttc aat ggt tct cta gaa aga atg tcc 1296
Asp Asp Ala Leu Lys Phe Pro Phe Asn Gly Ser Leu Glu Arg Met Ser
420 425 430
act aga aga aac att gaa cac tac aac ttg aac cac acc aga att ttg 1344
Thr Arg Arg Asn Ile Glu His Tyr Asn Leu Asn His Thr Arg Ile Leu
435 440 445
aag acc acc tac tcc tct tcc aac att tcc aac aag gac tac ttg aaa 1392
Lys Thr Thr Tyr Ser Ser Ser Asn Ile Ser Asn Lys Asp Tyr Leu Lys
450 455 460
tta gcc gtc caa gac ttc aac gaa tgt caa tcc atc tac tgt gaa gaa 1440
Leu Ala Val Gln Asp Phe Asn Glu Cys Gln Ser Ile Tyr Cys Glu Glu
465 470 475 480
tta aag gac ttg gaa aga tgg gtt gtc gaa aac aga tta gac aag ttg 1488
Leu Lys Asp Leu Glu Arg Trp Val Val Glu Asn Arg Leu Asp Lys Leu
485 490 495
aaa ttt gct cgt caa aag acc gct tac tgt tac ttc tct gct gct tct 1536
Lys Phe Ala Arg Gln Lys Thr Ala Tyr Cys Tyr Phe Ser Ala Ala Ser
500 505 510
ttc tta tct tct cca gat ttg tcc gat gct aga atc tcc tgg gct aaa 1584
Phe Leu Ser Ser Pro Asp Leu Ser Asp Ala Arg Ile Ser Trp Ala Lys
515 520 525
tct tcc att ttg act acc gtc att gat gat ttc ttt gac gtc ggt ggt 1632
Ser Ser Ile Leu Thr Thr Val Ile Asp Asp Phe Phe Asp Val Gly Gly
530 535 540
tcc atg gat gaa ttg gtt aac ttc gtc cac atc atc gaa aag tgg aac 1680
Ser Met Asp Glu Leu Val Asn Phe Val His Ile Ile Glu Lys Trp Asn
545 550 555 560
gtc aac gtt gaa aac gat tgt tgt tct gaa gaa gtc ggt gtt ttg ttc 1728
Val Asn Val Glu Asn Asp Cys Cys Ser Glu Glu Val Gly Val Leu Phe
565 570 575
ttg gct tta aag gat gct gtc tgt tgg att ggt gac aag gct ttc aag 1776
Leu Ala Leu Lys Asp Ala Val Cys Trp Ile Gly Asp Lys Ala Phe Lys
580 585 590
atc caa gaa aga aac atc act tcc cac gtc att gaa atc tgg ttg gac 1824
Ile Gln Glu Arg Asn Ile Thr Ser His Val Ile Glu Ile Trp Leu Asp
595 600 605
ttg gtt aag tcc atg ttg aga gaa gct att tgg gct aag gac ggt tcc 1872
Leu Val Lys Ser Met Leu Arg Glu Ala Ile Trp Ala Lys Asp Gly Ser
610 615 620
atc cca acc atc aac gaa tac atg gaa aac ggt tac gtt tct ttc gcc 1920
Ile Pro Thr Ile Asn Glu Tyr Met Glu Asn Gly Tyr Val Ser Phe Ala
625 630 635 640
ttg ggt cca atc gtt ttg cca act ttg tac ttc tta ggt gtc aag ttg 1968
Leu Gly Pro Ile Val Leu Pro Thr Leu Tyr Phe Leu Gly Val Lys Leu
645 650 655
tct gaa gaa gtc gtc caa tct tcc gaa tac cac aag ttg tac gaa gtt 2016
Ser Glu Glu Val Val Gln Ser Ser Glu Tyr His Lys Leu Tyr Glu Val
660 665 670
atg tcc acc caa ggt aga ttg atg aac gac att cac tct ttc aaa cgt 2064
Met Ser Thr Gln Gly Arg Leu Met Asn Asp Ile His Ser Phe Lys Arg
675 680 685
gaa aag aag gcc ggt aag ttg aat gct gtt gct tta tac atg tcc gac 2112
Glu Lys Lys Ala Gly Lys Leu Asn Ala Val Ala Leu Tyr Met Ser Asp
690 695 700
ggt aaa tct ggt tct gtt gaa gaa gaa gtt gtc gaa gaa atg aag atc 2160
Gly Lys Ser Gly Ser Val Glu Glu Glu Val Val Glu Glu Met Lys Ile
705 710 715 720
ttg act aaa tct caa aga aag gaa atg atg aag ttg gtt ttg gaa acc 2208
Leu Thr Lys Ser Gln Arg Lys Glu Met Met Lys Leu Val Leu Glu Thr
725 730 735
aag ggt tcc gtt gtc cca aga gtt tgt aag gac gtt ttc tgg aac atg 2256
Lys Gly Ser Val Val Pro Arg Val Cys Lys Asp Val Phe Trp Asn Met
740 745 750
tgt aac gtc ttg aac ttg ttc tac gct acc gat gac ggt ttc act ggt 2304
Cys Asn Val Leu Asn Leu Phe Tyr Ala Thr Asp Asp Gly Phe Thr Gly
755 760 765
aac gcc atc tta gat gtt gtc aag gaa atc atc tac gaa cca gtc tct 2352
Asn Ala Ile Leu Asp Val Val Lys Glu Ile Ile Tyr Glu Pro Val Ser
770 775 780
cat gaa ttg ata 2364
His Glu Leu Ile
785
<210> 10
<211> 788
<212> PRT
<213> Lactuca sativa
<400> 10
Met Asn Ile Ala Gln Ile Thr Ser Ser Ala Met Leu Val Pro Ser Ser
1 5 10 15
His Ile Pro His Arg Ser Trp Val Val Asn Cys Cys Met Val Gln Tyr
20 25 30
Asn Pro Ser Gly Leu Arg Thr Ala Ser Ser Gln Ala Gly Gln Val Asn
35 40 45
Pro Thr Val Met Thr Leu Asp Val Thr Lys Glu Arg Ile Arg Lys Leu
50 55 60
Phe Asn Asn Val Glu Val Ser Val Ser Ser Tyr Asp Thr Ala Trp Val
65 70 75 80
Ala Met Val Pro Ser Pro Asn Ser Pro Lys Ser Pro Cys Phe Pro Asp
85 90 95
Cys Leu Asn Trp Leu Leu Asp Asn Gln Leu Asp Asp Gly Ser Trp Gly
100 105 110
Leu Leu Pro His Gln Ser Pro Leu Ile Lys Asp Thr Leu Ser Ser Thr
115 120 125
Leu Ala Cys Val Leu Ala Leu Lys Arg Trp Asn Val Gly Lys Asp Gln
130 135 140
Ile Asn Lys Gly Leu His Tyr Ile Glu Ser Asn Phe Ala Ser Val Thr
145 150 155 160
Asp Lys Asn Gln Ala Ser Pro Phe Gly Phe Asp Ile Ile Phe Pro Gly
165 170 175
Met Leu Glu Tyr Ala Lys Asp Leu Asp Ile Lys Leu Pro Leu Asn Gln
180 185 190
Thr His Leu Ser Val Met Leu His Glu Arg Glu Leu Glu Leu Arg Arg
195 200 205
Cys His Ser Asn Gly Arg Glu Ala Tyr Leu Ala Tyr Ile Ser Glu Gly
210 215 220
Leu Gly Asn Leu Asn Asp Trp Asn Met Val Met Lys Tyr Gln Met Lys
225 230 235 240
Asn Gly Ser Leu Phe Asn Ser Pro Ser Ala Thr Ala Ser Val Leu Ile
245 250 255
His His Gln Asn Ala Gly Cys Leu His Tyr Leu Thr Ser Leu Leu Asp
260 265 270
Lys Phe Gly Asn Ala Val Pro Thr Val Tyr Pro Ile Asp Leu Tyr Val
275 280 285
Arg Leu Ser Met Val Asp Thr Leu Glu Arg Leu Gly Ile Lys Arg His
290 295 300
Phe Met Val Glu Ile Gln Asn Val Leu Asp Glu Thr Tyr Arg Cys Trp
305 310 315 320
Val Gln Gly Asp Val Gln Ile Phe Met Asp Val Val Thr Cys Ala Leu
325 330 335
Ala Phe Arg Val Leu Arg Ser Asn Gly Tyr Glu Val Ser Ser Asp Pro
340 345 350
Leu Ala Lys Ile Thr Lys Glu Gly Asp Tyr Met Asn Ser Pro Glu Lys
355 360 365
Pro Phe Lys Asp Val Tyr Thr Ser Leu Glu Val Tyr Lys Ala Ser Gln
370 375 380
Ile Ile Tyr Gln Glu Glu Leu Ala Phe Arg Glu Gln Asn Leu Thr Ser
385 390 395 400
Tyr Leu Pro Ser Ser Asn Lys Leu Ser Asn Tyr Ile Leu Lys Glu Val
405 410 415
Asp Asp Ala Leu Lys Phe Pro Phe Asn Gly Ser Leu Glu Arg Met Ser
420 425 430
Thr Arg Arg Asn Ile Glu His Tyr Asn Leu Asn His Thr Arg Ile Leu
435 440 445
Lys Thr Thr Tyr Ser Ser Ser Asn Ile Ser Asn Lys Asp Tyr Leu Lys
450 455 460
Leu Ala Val Gln Asp Phe Asn Glu Cys Gln Ser Ile Tyr Cys Glu Glu
465 470 475 480
Leu Lys Asp Leu Glu Arg Trp Val Val Glu Asn Arg Leu Asp Lys Leu
485 490 495
Lys Phe Ala Arg Gln Lys Thr Ala Tyr Cys Tyr Phe Ser Ala Ala Ser
500 505 510
Phe Leu Ser Ser Pro Asp Leu Ser Asp Ala Arg Ile Ser Trp Ala Lys
515 520 525
Ser Ser Ile Leu Thr Thr Val Ile Asp Asp Phe Phe Asp Val Gly Gly
530 535 540
Ser Met Asp Glu Leu Val Asn Phe Val His Ile Ile Glu Lys Trp Asn
545 550 555 560
Val Asn Val Glu Asn Asp Cys Cys Ser Glu Glu Val Gly Val Leu Phe
565 570 575
Leu Ala Leu Lys Asp Ala Val Cys Trp Ile Gly Asp Lys Ala Phe Lys
580 585 590
Ile Gln Glu Arg Asn Ile Thr Ser His Val Ile Glu Ile Trp Leu Asp
595 600 605
Leu Val Lys Ser Met Leu Arg Glu Ala Ile Trp Ala Lys Asp Gly Ser
610 615 620
Ile Pro Thr Ile Asn Glu Tyr Met Glu Asn Gly Tyr Val Ser Phe Ala
625 630 635 640
Leu Gly Pro Ile Val Leu Pro Thr Leu Tyr Phe Leu Gly Val Lys Leu
645 650 655
Ser Glu Glu Val Val Gln Ser Ser Glu Tyr His Lys Leu Tyr Glu Val
660 665 670
Met Ser Thr Gln Gly Arg Leu Met Asn Asp Ile His Ser Phe Lys Arg
675 680 685
Glu Lys Lys Ala Gly Lys Leu Asn Ala Val Ala Leu Tyr Met Ser Asp
690 695 700
Gly Lys Ser Gly Ser Val Glu Glu Glu Val Val Glu Glu Met Lys Ile
705 710 715 720
Leu Thr Lys Ser Gln Arg Lys Glu Met Met Lys Leu Val Leu Glu Thr
725 730 735
Lys Gly Ser Val Val Pro Arg Val Cys Lys Asp Val Phe Trp Asn Met
740 745 750
Cys Asn Val Leu Asn Leu Phe Tyr Ala Thr Asp Asp Gly Phe Thr Gly
755 760 765
Asn Ala Ile Leu Asp Val Val Lys Glu Ile Ile Tyr Glu Pro Val Ser
770 775 780
His Glu Leu Ile
785
<210> 11
<211> 2250
<212> DNA
<213> Lactuca sativa
<220>
<221> CDS
<222> (1)..(2250)
<400> 11
atg gct tcc tct caa gcc ggt caa gtc aac cca act gtt atg act ttg 48
Met Ala Ser Ser Gln Ala Gly Gln Val Asn Pro Thr Val Met Thr Leu
1 5 10 15
gat gtc act aag gaa aga atc aga aag ttg ttc aac aac gtc gaa gtt 96
Asp Val Thr Lys Glu Arg Ile Arg Lys Leu Phe Asn Asn Val Glu Val
20 25 30
tct gtt tcc tct tac gac acc gct tgg gtt gcc atg gtt cca tct cca 144
Ser Val Ser Ser Tyr Asp Thr Ala Trp Val Ala Met Val Pro Ser Pro
35 40 45
aac tct cca aag tcc cca tgt ttc cca gac tgt ttg aac tgg ttg ttg 192
Asn Ser Pro Lys Ser Pro Cys Phe Pro Asp Cys Leu Asn Trp Leu Leu
50 55 60
gac aac caa ttg gat gat ggt tcc tgg ggt ttg ttg cca cac caa tct 240
Asp Asn Gln Leu Asp Asp Gly Ser Trp Gly Leu Leu Pro His Gln Ser
65 70 75 80
cca ttg atc aaa gac act tta tct tct act ttg gct tgt gtt ttg gct 288
Pro Leu Ile Lys Asp Thr Leu Ser Ser Thr Leu Ala Cys Val Leu Ala
85 90 95
tta aag aga tgg aac gtc ggt aag gat caa atc aac aag ggt tta cat 336
Leu Lys Arg Trp Asn Val Gly Lys Asp Gln Ile Asn Lys Gly Leu His
100 105 110
tac atc gaa tcc aac ttt gct tcc gtc act gat aag aac caa gcc tct 384
Tyr Ile Glu Ser Asn Phe Ala Ser Val Thr Asp Lys Asn Gln Ala Ser
115 120 125
cca ttc ggt ttt gac atc att ttc cca ggt atg ttg gaa tac gcc aag 432
Pro Phe Gly Phe Asp Ile Ile Phe Pro Gly Met Leu Glu Tyr Ala Lys
130 135 140
gat ttg gac att aaa ttg cct ttg aac caa act cac ttg tcc gtc atg 480
Asp Leu Asp Ile Lys Leu Pro Leu Asn Gln Thr His Leu Ser Val Met
145 150 155 160
ttg cac gaa aga gaa ttg gaa ttg aga aga tgt cac tct aac ggt cgt 528
Leu His Glu Arg Glu Leu Glu Leu Arg Arg Cys His Ser Asn Gly Arg
165 170 175
gaa gcc tac tta gct tac att tcc gaa ggt ttg ggt aac ttg aac gac 576
Glu Ala Tyr Leu Ala Tyr Ile Ser Glu Gly Leu Gly Asn Leu Asn Asp
180 185 190
tgg aac atg gtt atg aag tac caa atg aag aac ggt tct ttg ttc aac 624
Trp Asn Met Val Met Lys Tyr Gln Met Lys Asn Gly Ser Leu Phe Asn
195 200 205
tct cct tct gct act gct tct gtt ttg att cac cac caa aat gct ggt 672
Ser Pro Ser Ala Thr Ala Ser Val Leu Ile His His Gln Asn Ala Gly
210 215 220
tgt ttg cac tac tta acc tcc cta ttg gac aag ttt ggt aat gct gtc 720
Cys Leu His Tyr Leu Thr Ser Leu Leu Asp Lys Phe Gly Asn Ala Val
225 230 235 240
cca act gtt tac cca atc gac ttg tac gtt aga ttg tcc atg gtt gac 768
Pro Thr Val Tyr Pro Ile Asp Leu Tyr Val Arg Leu Ser Met Val Asp
245 250 255
act ttg gaa aga ttg ggt atc aag aga cat ttc atg gtt gaa att caa 816
Thr Leu Glu Arg Leu Gly Ile Lys Arg His Phe Met Val Glu Ile Gln
260 265 270
aat gtc ttg gac gaa acc tac aga tgt tgg gtt caa ggt gat gtt caa 864
Asn Val Leu Asp Glu Thr Tyr Arg Cys Trp Val Gln Gly Asp Val Gln
275 280 285
atc ttc atg gat gtc gtt acc tgt gct ttg gct ttc aga gtt cta cgt 912
Ile Phe Met Asp Val Val Thr Cys Ala Leu Ala Phe Arg Val Leu Arg
290 295 300
tct aac ggt tac gaa gtc tct tct gac cca ttg gct aag att acc aaa 960
Ser Asn Gly Tyr Glu Val Ser Ser Asp Pro Leu Ala Lys Ile Thr Lys
305 310 315 320
gaa ggt gac tac atg aac tct cca gaa aag cct ttc aag gac gtc tac 1008
Glu Gly Asp Tyr Met Asn Ser Pro Glu Lys Pro Phe Lys Asp Val Tyr
325 330 335
acc tct ttg gaa gtc tac aag gct tct caa atc atc tac caa gaa gaa 1056
Thr Ser Leu Glu Val Tyr Lys Ala Ser Gln Ile Ile Tyr Gln Glu Glu
340 345 350
ttg gcc ttc aga gaa caa aac ttg acc tct tat ttg cca tct tct aac 1104
Leu Ala Phe Arg Glu Gln Asn Leu Thr Ser Tyr Leu Pro Ser Ser Asn
355 360 365
aaa ttg tcc aac tac att ttg aag gaa gtt gat gac gct ttg aag ttc 1152
Lys Leu Ser Asn Tyr Ile Leu Lys Glu Val Asp Asp Ala Leu Lys Phe
370 375 380
cca ttc aat ggt tct cta gaa aga atg tcc act aga aga aac att gaa 1200
Pro Phe Asn Gly Ser Leu Glu Arg Met Ser Thr Arg Arg Asn Ile Glu
385 390 395 400
cac tac aac ttg aac cac acc aga att ttg aag acc acc tac tcc tct 1248
His Tyr Asn Leu Asn His Thr Arg Ile Leu Lys Thr Thr Tyr Ser Ser
405 410 415
tcc aac att tcc aac aag gac tac ttg aaa tta gcc gtc caa gac ttc 1296
Ser Asn Ile Ser Asn Lys Asp Tyr Leu Lys Leu Ala Val Gln Asp Phe
420 425 430
aac gaa tgt caa tcc atc tac tgt gaa gaa tta aag gac ttg gaa aga 1344
Asn Glu Cys Gln Ser Ile Tyr Cys Glu Glu Leu Lys Asp Leu Glu Arg
435 440 445
tgg gtt gtc gaa aac aga tta gac aag ttg aaa ttt gct cgt caa aag 1392
Trp Val Val Glu Asn Arg Leu Asp Lys Leu Lys Phe Ala Arg Gln Lys
450 455 460
acc gct tac tgt tac ttc tct gct gct tct ttc tta tct tct cca gat 1440
Thr Ala Tyr Cys Tyr Phe Ser Ala Ala Ser Phe Leu Ser Ser Pro Asp
465 470 475 480
ttg tcc gat gct aga atc tcc tgg gct aaa tct tcc att ttg act acc 1488
Leu Ser Asp Ala Arg Ile Ser Trp Ala Lys Ser Ser Ile Leu Thr Thr
485 490 495
gtc att gat gat ttc ttt gac gtc ggt ggt tcc atg gat gaa ttg gtt 1536
Val Ile Asp Asp Phe Phe Asp Val Gly Gly Ser Met Asp Glu Leu Val
500 505 510
aac ttc gtc cac atc atc gaa aag tgg aac gtc aac gtt gaa aac gat 1584
Asn Phe Val His Ile Ile Glu Lys Trp Asn Val Asn Val Glu Asn Asp
515 520 525
tgt tgt tct gaa gaa gtc ggt gtt ttg ttc ttg gct tta aag gat gct 1632
Cys Cys Ser Glu Glu Val Gly Val Leu Phe Leu Ala Leu Lys Asp Ala
530 535 540
gtc tgt tgg att ggt gac aag gct ttc aag atc caa gaa aga aac atc 1680
Val Cys Trp Ile Gly Asp Lys Ala Phe Lys Ile Gln Glu Arg Asn Ile
545 550 555 560
act tcc cac gtc att gaa atc tgg ttg gac ttg gtt aag tcc atg ttg 1728
Thr Ser His Val Ile Glu Ile Trp Leu Asp Leu Val Lys Ser Met Leu
565 570 575
aga gaa gct att tgg gct aag gac ggt tcc atc cca acc atc aac gaa 1776
Arg Glu Ala Ile Trp Ala Lys Asp Gly Ser Ile Pro Thr Ile Asn Glu
580 585 590
tac atg gaa aac ggt tac gtt tct ttc gcc ttg ggt cca atc gtt ttg 1824
Tyr Met Glu Asn Gly Tyr Val Ser Phe Ala Leu Gly Pro Ile Val Leu
595 600 605
cca act ttg tac ttc tta ggt gtc aag ttg tct gaa gaa gtc gtc caa 1872
Pro Thr Leu Tyr Phe Leu Gly Val Lys Leu Ser Glu Glu Val Val Gln
610 615 620
tct tcc gaa tac cac aag ttg tac gaa gtt atg tcc acc caa ggt aga 1920
Ser Ser Glu Tyr His Lys Leu Tyr Glu Val Met Ser Thr Gln Gly Arg
625 630 635 640
ttg atg aac gac att cac tct ttc aaa cgt gaa aag aag gcc ggt aag 1968
Leu Met Asn Asp Ile His Ser Phe Lys Arg Glu Lys Lys Ala Gly Lys
645 650 655
ttg aat gct gtt gct tta tac atg tcc gac ggt aaa tct ggt tct gtt 2016
Leu Asn Ala Val Ala Leu Tyr Met Ser Asp Gly Lys Ser Gly Ser Val
660 665 670
gaa gaa gaa gtt gtc gaa gaa atg aag atc ttg act aaa tct caa aga 2064
Glu Glu Glu Val Val Glu Glu Met Lys Ile Leu Thr Lys Ser Gln Arg
675 680 685
aag gaa atg atg aag ttg gtt ttg gaa acc aag ggt tcc gtt gtc cca 2112
Lys Glu Met Met Lys Leu Val Leu Glu Thr Lys Gly Ser Val Val Pro
690 695 700
aga gtt tgt aag gac gtt ttc tgg aac atg tgt aac gtc ttg aac ttg 2160
Arg Val Cys Lys Asp Val Phe Trp Asn Met Cys Asn Val Leu Asn Leu
705 710 715 720
ttc tac gct acc gat gac ggt ttc act ggt aac gcc atc tta gat gtt 2208
Phe Tyr Ala Thr Asp Asp Gly Phe Thr Gly Asn Ala Ile Leu Asp Val
725 730 735
gtc aag gaa atc atc tac gaa cca gtc tct cat gaa ttg ata 2250
Val Lys Glu Ile Ile Tyr Glu Pro Val Ser His Glu Leu Ile
740 745 750
<210> 12
<211> 750
<212> PRT
<213> Lactuca sativa
<400> 12
Met Ala Ser Ser Gln Ala Gly Gln Val Asn Pro Thr Val Met Thr Leu
1 5 10 15
Asp Val Thr Lys Glu Arg Ile Arg Lys Leu Phe Asn Asn Val Glu Val
20 25 30
Ser Val Ser Ser Tyr Asp Thr Ala Trp Val Ala Met Val Pro Ser Pro
35 40 45
Asn Ser Pro Lys Ser Pro Cys Phe Pro Asp Cys Leu Asn Trp Leu Leu
50 55 60
Asp Asn Gln Leu Asp Asp Gly Ser Trp Gly Leu Leu Pro His Gln Ser
65 70 75 80
Pro Leu Ile Lys Asp Thr Leu Ser Ser Thr Leu Ala Cys Val Leu Ala
85 90 95
Leu Lys Arg Trp Asn Val Gly Lys Asp Gln Ile Asn Lys Gly Leu His
100 105 110
Tyr Ile Glu Ser Asn Phe Ala Ser Val Thr Asp Lys Asn Gln Ala Ser
115 120 125
Pro Phe Gly Phe Asp Ile Ile Phe Pro Gly Met Leu Glu Tyr Ala Lys
130 135 140
Asp Leu Asp Ile Lys Leu Pro Leu Asn Gln Thr His Leu Ser Val Met
145 150 155 160
Leu His Glu Arg Glu Leu Glu Leu Arg Arg Cys His Ser Asn Gly Arg
165 170 175
Glu Ala Tyr Leu Ala Tyr Ile Ser Glu Gly Leu Gly Asn Leu Asn Asp
180 185 190
Trp Asn Met Val Met Lys Tyr Gln Met Lys Asn Gly Ser Leu Phe Asn
195 200 205
Ser Pro Ser Ala Thr Ala Ser Val Leu Ile His His Gln Asn Ala Gly
210 215 220
Cys Leu His Tyr Leu Thr Ser Leu Leu Asp Lys Phe Gly Asn Ala Val
225 230 235 240
Pro Thr Val Tyr Pro Ile Asp Leu Tyr Val Arg Leu Ser Met Val Asp
245 250 255
Thr Leu Glu Arg Leu Gly Ile Lys Arg His Phe Met Val Glu Ile Gln
260 265 270
Asn Val Leu Asp Glu Thr Tyr Arg Cys Trp Val Gln Gly Asp Val Gln
275 280 285
Ile Phe Met Asp Val Val Thr Cys Ala Leu Ala Phe Arg Val Leu Arg
290 295 300
Ser Asn Gly Tyr Glu Val Ser Ser Asp Pro Leu Ala Lys Ile Thr Lys
305 310 315 320
Glu Gly Asp Tyr Met Asn Ser Pro Glu Lys Pro Phe Lys Asp Val Tyr
325 330 335
Thr Ser Leu Glu Val Tyr Lys Ala Ser Gln Ile Ile Tyr Gln Glu Glu
340 345 350
Leu Ala Phe Arg Glu Gln Asn Leu Thr Ser Tyr Leu Pro Ser Ser Asn
355 360 365
Lys Leu Ser Asn Tyr Ile Leu Lys Glu Val Asp Asp Ala Leu Lys Phe
370 375 380
Pro Phe Asn Gly Ser Leu Glu Arg Met Ser Thr Arg Arg Asn Ile Glu
385 390 395 400
His Tyr Asn Leu Asn His Thr Arg Ile Leu Lys Thr Thr Tyr Ser Ser
405 410 415
Ser Asn Ile Ser Asn Lys Asp Tyr Leu Lys Leu Ala Val Gln Asp Phe
420 425 430
Asn Glu Cys Gln Ser Ile Tyr Cys Glu Glu Leu Lys Asp Leu Glu Arg
435 440 445
Trp Val Val Glu Asn Arg Leu Asp Lys Leu Lys Phe Ala Arg Gln Lys
450 455 460
Thr Ala Tyr Cys Tyr Phe Ser Ala Ala Ser Phe Leu Ser Ser Pro Asp
465 470 475 480
Leu Ser Asp Ala Arg Ile Ser Trp Ala Lys Ser Ser Ile Leu Thr Thr
485 490 495
Val Ile Asp Asp Phe Phe Asp Val Gly Gly Ser Met Asp Glu Leu Val
500 505 510
Asn Phe Val His Ile Ile Glu Lys Trp Asn Val Asn Val Glu Asn Asp
515 520 525
Cys Cys Ser Glu Glu Val Gly Val Leu Phe Leu Ala Leu Lys Asp Ala
530 535 540
Val Cys Trp Ile Gly Asp Lys Ala Phe Lys Ile Gln Glu Arg Asn Ile
545 550 555 560
Thr Ser His Val Ile Glu Ile Trp Leu Asp Leu Val Lys Ser Met Leu
565 570 575
Arg Glu Ala Ile Trp Ala Lys Asp Gly Ser Ile Pro Thr Ile Asn Glu
580 585 590
Tyr Met Glu Asn Gly Tyr Val Ser Phe Ala Leu Gly Pro Ile Val Leu
595 600 605
Pro Thr Leu Tyr Phe Leu Gly Val Lys Leu Ser Glu Glu Val Val Gln
610 615 620
Ser Ser Glu Tyr His Lys Leu Tyr Glu Val Met Ser Thr Gln Gly Arg
625 630 635 640
Leu Met Asn Asp Ile His Ser Phe Lys Arg Glu Lys Lys Ala Gly Lys
645 650 655
Leu Asn Ala Val Ala Leu Tyr Met Ser Asp Gly Lys Ser Gly Ser Val
660 665 670
Glu Glu Glu Val Val Glu Glu Met Lys Ile Leu Thr Lys Ser Gln Arg
675 680 685
Lys Glu Met Met Lys Leu Val Leu Glu Thr Lys Gly Ser Val Val Pro
690 695 700
Arg Val Cys Lys Asp Val Phe Trp Asn Met Cys Asn Val Leu Asn Leu
705 710 715 720
Phe Tyr Ala Thr Asp Asp Gly Phe Thr Gly Asn Ala Ile Leu Asp Val
725 730 735
Val Lys Glu Ile Ile Tyr Glu Pro Val Ser His Glu Leu Ile
740 745 750
<210> 13
<211> 2271
<212> DNA
<213> Picea glauca
<220>
<221> CDS
<222> (1)..(2271)
<400> 13
atg aag aga gaa caa tac act atc ttg aac gaa aag gaa tcc atg gct 48
Met Lys Arg Glu Gln Tyr Thr Ile Leu Asn Glu Lys Glu Ser Met Ala
1 5 10 15
gaa gaa tta atc ttg aga atc aag aga atg ttt tct gaa atc gaa aac 96
Glu Glu Leu Ile Leu Arg Ile Lys Arg Met Phe Ser Glu Ile Glu Asn
20 25 30
act caa acc tct gct tct gct tac gac acc gct tgg gtt gcc atg gtt 144
Thr Gln Thr Ser Ala Ser Ala Tyr Asp Thr Ala Trp Val Ala Met Val
35 40 45
cca tct ttg gac tcc tct caa caa cca caa ttc cct caa tgt ttg tcc 192
Pro Ser Leu Asp Ser Ser Gln Gln Pro Gln Phe Pro Gln Cys Leu Ser
50 55 60
tgg atc att gac aac caa ttg ttg gat ggt tcc tgg ggt atc cca tac 240
Trp Ile Ile Asp Asn Gln Leu Leu Asp Gly Ser Trp Gly Ile Pro Tyr
65 70 75 80
ttg atc atc aag gac aga cta tgt cac act tta gct tgt gtc att gct 288
Leu Ile Ile Lys Asp Arg Leu Cys His Thr Leu Ala Cys Val Ile Ala
85 90 95
ttg aga aaa tgg aac gct ggt aac caa aat gtc gaa acc ggt cta cgt 336
Leu Arg Lys Trp Asn Ala Gly Asn Gln Asn Val Glu Thr Gly Leu Arg
100 105 110
ttc ttg aga gaa aac att gaa ggt atc gtt cac gaa gat gaa tac act 384
Phe Leu Arg Glu Asn Ile Glu Gly Ile Val His Glu Asp Glu Tyr Thr
115 120 125
cca att ggt ttc caa atc att ttc cca gct atg ttg gaa gaa gct cgt 432
Pro Ile Gly Phe Gln Ile Ile Phe Pro Ala Met Leu Glu Glu Ala Arg
130 135 140
ggt tta ggt cta gaa ttg cca tac gac ttg act cca atc aag ttg atg 480
Gly Leu Gly Leu Glu Leu Pro Tyr Asp Leu Thr Pro Ile Lys Leu Met
145 150 155 160
ttg acc cac aga gaa aag atc atg aag ggt aag gcc att gac cac atg 528
Leu Thr His Arg Glu Lys Ile Met Lys Gly Lys Ala Ile Asp His Met
165 170 175
cac gaa tac gac tct tct ttg atc tac act gtt gaa ggt atc cac aag 576
His Glu Tyr Asp Ser Ser Leu Ile Tyr Thr Val Glu Gly Ile His Lys
180 185 190
atc gtt gac tgg aac aag gtt ttg aaa cat caa aac aag gat ggt tct 624
Ile Val Asp Trp Asn Lys Val Leu Lys His Gln Asn Lys Asp Gly Ser
195 200 205
ttg ttc aac tct cca tct gct act gct tgt gct ttg atg cac act aga 672
Leu Phe Asn Ser Pro Ser Ala Thr Ala Cys Ala Leu Met His Thr Arg
210 215 220
aag tcc aac tgt ttg gaa tac ttg tcc tcc atg ttg caa aag tta ggt 720
Lys Ser Asn Cys Leu Glu Tyr Leu Ser Ser Met Leu Gln Lys Leu Gly
225 230 235 240
aac ggt gtt cca tcc gtc tac cca atc aac tta tat gct cgt atc tcc 768
Asn Gly Val Pro Ser Val Tyr Pro Ile Asn Leu Tyr Ala Arg Ile Ser
245 250 255
atg att gac aga tta caa aga tta ggt ttg gct aga cat ttc aga aac 816
Met Ile Asp Arg Leu Gln Arg Leu Gly Leu Ala Arg His Phe Arg Asn
260 265 270
gaa atc att cac gct ttg gac gac atc tac aga tac tgg atg caa aag 864
Glu Ile Ile His Ala Leu Asp Asp Ile Tyr Arg Tyr Trp Met Gln Lys
275 280 285
gaa acc tcc aga gaa ggt aag tct ttg acc cca gat att gtc tct act 912
Glu Thr Ser Arg Glu Gly Lys Ser Leu Thr Pro Asp Ile Val Ser Thr
290 295 300
tcc atc gct ttc atg ttg ttg aga ttg cac ggt tac gat gtt cca gct 960
Ser Ile Ala Phe Met Leu Leu Arg Leu His Gly Tyr Asp Val Pro Ala
305 310 315 320
gat gtt ttc tgt tgt tac cat ttg cac tcc att gaa caa tct ggt gaa 1008
Asp Val Phe Cys Cys Tyr His Leu His Ser Ile Glu Gln Ser Gly Glu
325 330 335
gcc gtt act gct atg ttg tct ttg tac aga gcc tct caa att atg ttc 1056
Ala Val Thr Ala Met Leu Ser Leu Tyr Arg Ala Ser Gln Ile Met Phe
340 345 350
cca ggt gaa acc att ttg gaa gaa atc aag acc gtt tct aga aag tac 1104
Pro Gly Glu Thr Ile Leu Glu Glu Ile Lys Thr Val Ser Arg Lys Tyr
355 360 365
ttg gac aag aga aag gaa aac ggt cgt atc tac tac cac aac att gtt 1152
Leu Asp Lys Arg Lys Glu Asn Gly Arg Ile Tyr Tyr His Asn Ile Val
370 375 380
atg aag gac ttg aga ggt gaa gtc gaa tac gcc ttg tct gtt cca tgg 1200
Met Lys Asp Leu Arg Gly Glu Val Glu Tyr Ala Leu Ser Val Pro Trp
385 390 395 400
tac gct tcc ttg gaa aga att gaa aac aga aga tac att gac caa tac 1248
Tyr Ala Ser Leu Glu Arg Ile Glu Asn Arg Arg Tyr Ile Asp Gln Tyr
405 410 415
ggt gtc aac gat acc tgg atc gct aag acc tct tac aaa atc cca tgt 1296
Gly Val Asn Asp Thr Trp Ile Ala Lys Thr Ser Tyr Lys Ile Pro Cys
420 425 430
atc tcc aat gac tta ttc ttg gct ttg gcc aag caa gac tac aac atc 1344
Ile Ser Asn Asp Leu Phe Leu Ala Leu Ala Lys Gln Asp Tyr Asn Ile
435 440 445
tgt caa gcc att caa caa aag gaa ttg aga gaa ttg gaa aga tgg ttt 1392
Cys Gln Ala Ile Gln Gln Lys Glu Leu Arg Glu Leu Glu Arg Trp Phe
450 455 460
gct gat aac aaa ttc tct cac ttg aac ttc gct cgt caa aag ttg atc 1440
Ala Asp Asn Lys Phe Ser His Leu Asn Phe Ala Arg Gln Lys Leu Ile
465 470 475 480
tac tgt tac ttc tct gcc gct gct act ttg ttc tct cca gaa ttg tct 1488
Tyr Cys Tyr Phe Ser Ala Ala Ala Thr Leu Phe Ser Pro Glu Leu Ser
485 490 495
gct gcc aga gtt gtc tgg gct aag aac ggt gtt atc acc acc gtc gtt 1536
Ala Ala Arg Val Val Trp Ala Lys Asn Gly Val Ile Thr Thr Val Val
500 505 510
gat gac ttc ttc gat gtc ggt ggt tct tct gaa gaa att cac tct ttc 1584
Asp Asp Phe Phe Asp Val Gly Gly Ser Ser Glu Glu Ile His Ser Phe
515 520 525
gtc gaa gct gtt aga gtc tgg gac gaa gct gct acc gat ggt ttg tct 1632
Val Glu Ala Val Arg Val Trp Asp Glu Ala Ala Thr Asp Gly Leu Ser
530 535 540
gaa aac gtt caa atc tta ttc tcc gct tta tac aac acc gtc gac gaa 1680
Glu Asn Val Gln Ile Leu Phe Ser Ala Leu Tyr Asn Thr Val Asp Glu
545 550 555 560
att gtc caa caa gcc ttc gtt ttc caa ggt aga gac atc tcc att cac 1728
Ile Val Gln Gln Ala Phe Val Phe Gln Gly Arg Asp Ile Ser Ile His
565 570 575
ttg aga gaa atc tgg tac aga ttg gtc aac tcc atg atg act gaa gct 1776
Leu Arg Glu Ile Trp Tyr Arg Leu Val Asn Ser Met Met Thr Glu Ala
580 585 590
caa tgg gct aga act cac tgt att cca tcc atg cac gaa tac atg gaa 1824
Gln Trp Ala Arg Thr His Cys Ile Pro Ser Met His Glu Tyr Met Glu
595 600 605
aac gct gaa cct tct att gct ttg gaa cca atc gtc ttg tcc tct tta 1872
Asn Ala Glu Pro Ser Ile Ala Leu Glu Pro Ile Val Leu Ser Ser Leu
610 615 620
tac ttt gtt ggt cca aag ttg tct gaa gaa atc att tgt cac cca gaa 1920
Tyr Phe Val Gly Pro Lys Leu Ser Glu Glu Ile Ile Cys His Pro Glu
625 630 635 640
tac tac aac ttg atg cat ttg ttg aat atc tgt ggt aga ttg ttg aac 1968
Tyr Tyr Asn Leu Met His Leu Leu Asn Ile Cys Gly Arg Leu Leu Asn
645 650 655
gat atc caa ggt tgt aaa cgt gaa gcc cac caa ggt aag ttg aac tcc 2016
Asp Ile Gln Gly Cys Lys Arg Glu Ala His Gln Gly Lys Leu Asn Ser
660 665 670
gtc act ttg tac atg gaa gaa aac tct ggt act acc atg gaa gat gct 2064
Val Thr Leu Tyr Met Glu Glu Asn Ser Gly Thr Thr Met Glu Asp Ala
675 680 685
atc gtt tac ttg aga aag acc atc gac gaa tcc aga caa ttg cta ttg 2112
Ile Val Tyr Leu Arg Lys Thr Ile Asp Glu Ser Arg Gln Leu Leu Leu
690 695 700
aaa gaa gtt tta aga cca tct att gtt cca aga gaa tgt aaa caa ttg 2160
Lys Glu Val Leu Arg Pro Ser Ile Val Pro Arg Glu Cys Lys Gln Leu
705 710 715 720
cac tgg aac atg atg aga atc ttg caa ttg ttc tac ttg aag aac gat 2208
His Trp Asn Met Met Arg Ile Leu Gln Leu Phe Tyr Leu Lys Asn Asp
725 730 735
ggt ttt acc tct cca act gaa atg ttg ggt tac gtc aac gcc gtc att 2256
Gly Phe Thr Ser Pro Thr Glu Met Leu Gly Tyr Val Asn Ala Val Ile
740 745 750
gtc gac cca att ttg 2271
Val Asp Pro Ile Leu
755
<210> 14
<211> 757
<212> PRT
<213> Picea glauca
<400> 14
Met Lys Arg Glu Gln Tyr Thr Ile Leu Asn Glu Lys Glu Ser Met Ala
1 5 10 15
Glu Glu Leu Ile Leu Arg Ile Lys Arg Met Phe Ser Glu Ile Glu Asn
20 25 30
Thr Gln Thr Ser Ala Ser Ala Tyr Asp Thr Ala Trp Val Ala Met Val
35 40 45
Pro Ser Leu Asp Ser Ser Gln Gln Pro Gln Phe Pro Gln Cys Leu Ser
50 55 60
Trp Ile Ile Asp Asn Gln Leu Leu Asp Gly Ser Trp Gly Ile Pro Tyr
65 70 75 80
Leu Ile Ile Lys Asp Arg Leu Cys His Thr Leu Ala Cys Val Ile Ala
85 90 95
Leu Arg Lys Trp Asn Ala Gly Asn Gln Asn Val Glu Thr Gly Leu Arg
100 105 110
Phe Leu Arg Glu Asn Ile Glu Gly Ile Val His Glu Asp Glu Tyr Thr
115 120 125
Pro Ile Gly Phe Gln Ile Ile Phe Pro Ala Met Leu Glu Glu Ala Arg
130 135 140
Gly Leu Gly Leu Glu Leu Pro Tyr Asp Leu Thr Pro Ile Lys Leu Met
145 150 155 160
Leu Thr His Arg Glu Lys Ile Met Lys Gly Lys Ala Ile Asp His Met
165 170 175
His Glu Tyr Asp Ser Ser Leu Ile Tyr Thr Val Glu Gly Ile His Lys
180 185 190
Ile Val Asp Trp Asn Lys Val Leu Lys His Gln Asn Lys Asp Gly Ser
195 200 205
Leu Phe Asn Ser Pro Ser Ala Thr Ala Cys Ala Leu Met His Thr Arg
210 215 220
Lys Ser Asn Cys Leu Glu Tyr Leu Ser Ser Met Leu Gln Lys Leu Gly
225 230 235 240
Asn Gly Val Pro Ser Val Tyr Pro Ile Asn Leu Tyr Ala Arg Ile Ser
245 250 255
Met Ile Asp Arg Leu Gln Arg Leu Gly Leu Ala Arg His Phe Arg Asn
260 265 270
Glu Ile Ile His Ala Leu Asp Asp Ile Tyr Arg Tyr Trp Met Gln Lys
275 280 285
Glu Thr Ser Arg Glu Gly Lys Ser Leu Thr Pro Asp Ile Val Ser Thr
290 295 300
Ser Ile Ala Phe Met Leu Leu Arg Leu His Gly Tyr Asp Val Pro Ala
305 310 315 320
Asp Val Phe Cys Cys Tyr His Leu His Ser Ile Glu Gln Ser Gly Glu
325 330 335
Ala Val Thr Ala Met Leu Ser Leu Tyr Arg Ala Ser Gln Ile Met Phe
340 345 350
Pro Gly Glu Thr Ile Leu Glu Glu Ile Lys Thr Val Ser Arg Lys Tyr
355 360 365
Leu Asp Lys Arg Lys Glu Asn Gly Arg Ile Tyr Tyr His Asn Ile Val
370 375 380
Met Lys Asp Leu Arg Gly Glu Val Glu Tyr Ala Leu Ser Val Pro Trp
385 390 395 400
Tyr Ala Ser Leu Glu Arg Ile Glu Asn Arg Arg Tyr Ile Asp Gln Tyr
405 410 415
Gly Val Asn Asp Thr Trp Ile Ala Lys Thr Ser Tyr Lys Ile Pro Cys
420 425 430
Ile Ser Asn Asp Leu Phe Leu Ala Leu Ala Lys Gln Asp Tyr Asn Ile
435 440 445
Cys Gln Ala Ile Gln Gln Lys Glu Leu Arg Glu Leu Glu Arg Trp Phe
450 455 460
Ala Asp Asn Lys Phe Ser His Leu Asn Phe Ala Arg Gln Lys Leu Ile
465 470 475 480
Tyr Cys Tyr Phe Ser Ala Ala Ala Thr Leu Phe Ser Pro Glu Leu Ser
485 490 495
Ala Ala Arg Val Val Trp Ala Lys Asn Gly Val Ile Thr Thr Val Val
500 505 510
Asp Asp Phe Phe Asp Val Gly Gly Ser Ser Glu Glu Ile His Ser Phe
515 520 525
Val Glu Ala Val Arg Val Trp Asp Glu Ala Ala Thr Asp Gly Leu Ser
530 535 540
Glu Asn Val Gln Ile Leu Phe Ser Ala Leu Tyr Asn Thr Val Asp Glu
545 550 555 560
Ile Val Gln Gln Ala Phe Val Phe Gln Gly Arg Asp Ile Ser Ile His
565 570 575
Leu Arg Glu Ile Trp Tyr Arg Leu Val Asn Ser Met Met Thr Glu Ala
580 585 590
Gln Trp Ala Arg Thr His Cys Ile Pro Ser Met His Glu Tyr Met Glu
595 600 605
Asn Ala Glu Pro Ser Ile Ala Leu Glu Pro Ile Val Leu Ser Ser Leu
610 615 620
Tyr Phe Val Gly Pro Lys Leu Ser Glu Glu Ile Ile Cys His Pro Glu
625 630 635 640
Tyr Tyr Asn Leu Met His Leu Leu Asn Ile Cys Gly Arg Leu Leu Asn
645 650 655
Asp Ile Gln Gly Cys Lys Arg Glu Ala His Gln Gly Lys Leu Asn Ser
660 665 670
Val Thr Leu Tyr Met Glu Glu Asn Ser Gly Thr Thr Met Glu Asp Ala
675 680 685
Ile Val Tyr Leu Arg Lys Thr Ile Asp Glu Ser Arg Gln Leu Leu Leu
690 695 700
Lys Glu Val Leu Arg Pro Ser Ile Val Pro Arg Glu Cys Lys Gln Leu
705 710 715 720
His Trp Asn Met Met Arg Ile Leu Gln Leu Phe Tyr Leu Lys Asn Asp
725 730 735
Gly Phe Thr Ser Pro Thr Glu Met Leu Gly Tyr Val Asn Ala Val Ile
740 745 750
Val Asp Pro Ile Leu
755
<210> 15
<211> 900
<212> DNA
<213> Bradyrhizobium japonicum
<220>
<221> CDS
<222> (1)..(900)
<400> 15
atg atc caa act gaa aga gct gtc caa caa gtt ttg gaa tgg ggt cgt 48
Met Ile Gln Thr Glu Arg Ala Val Gln Gln Val Leu Glu Trp Gly Arg
1 5 10 15
tct ttg acc ggt ttc gct gac gaa cac gct gtc gaa gct gtt cgt ggt 96
Ser Leu Thr Gly Phe Ala Asp Glu His Ala Val Glu Ala Val Arg Gly
20 25 30
ggt caa tac atc tta caa aga atc cac cca tct ttg aga ggt act tct 144
Gly Gln Tyr Ile Leu Gln Arg Ile His Pro Ser Leu Arg Gly Thr Ser
35 40 45
gcc aga act ggt aga gat cca caa gat gaa act ttg att gtc acc ttt 192
Ala Arg Thr Gly Arg Asp Pro Gln Asp Glu Thr Leu Ile Val Thr Phe
50 55 60
tac aga gaa ttg gct ttg ttg ttc tgg tta gat gac tgt aac gat ttg 240
Tyr Arg Glu Leu Ala Leu Leu Phe Trp Leu Asp Asp Cys Asn Asp Leu
65 70 75 80
ggt ttg att tcc cca gaa caa ttg gct gct gtc gaa caa gct ttg ggt 288
Gly Leu Ile Ser Pro Glu Gln Leu Ala Ala Val Glu Gln Ala Leu Gly
85 90 95
caa ggt gtc cca tgt gct ttg cca ggt ttc gaa ggt tgt gcc gtt ttg 336
Gln Gly Val Pro Cys Ala Leu Pro Gly Phe Glu Gly Cys Ala Val Leu
100 105 110
aga gct tct ttg gcc act ttg gcc tac gac aga aga gac tac gct caa 384
Arg Ala Ser Leu Ala Thr Leu Ala Tyr Asp Arg Arg Asp Tyr Ala Gln
115 120 125
ttg ttg gat gac acc aga tgt tac tcc gct gct ttg aga gct ggt cat 432
Leu Leu Asp Asp Thr Arg Cys Tyr Ser Ala Ala Leu Arg Ala Gly His
130 135 140
gcc caa gct gtt gct gct gaa aga tgg tcc tac gct gaa tac ttg cac 480
Ala Gln Ala Val Ala Ala Glu Arg Trp Ser Tyr Ala Glu Tyr Leu His
145 150 155 160
aac ggt att gac tcc att gct tac gct aac gtt ttc tgt tgt ttg tcc 528
Asn Gly Ile Asp Ser Ile Ala Tyr Ala Asn Val Phe Cys Cys Leu Ser
165 170 175
cta tta tgg ggt ttg gac atg gct act ttg aga gcc aga cca gct ttc 576
Leu Leu Trp Gly Leu Asp Met Ala Thr Leu Arg Ala Arg Pro Ala Phe
180 185 190
aga caa gtc ttg aga tta atc tct gcc att ggt aga tta caa aac gat 624
Arg Gln Val Leu Arg Leu Ile Ser Ala Ile Gly Arg Leu Gln Asn Asp
195 200 205
ttg cac ggt tgt gac aag gac cgt tct gct ggt gaa gct gac aac gct 672
Leu His Gly Cys Asp Lys Asp Arg Ser Ala Gly Glu Ala Asp Asn Ala
210 215 220
gtc atc tta ttg ttg caa aga tac cca gcc atg cca gtt gtt gaa ttc 720
Val Ile Leu Leu Leu Gln Arg Tyr Pro Ala Met Pro Val Val Glu Phe
225 230 235 240
ttg aac gat gaa ttg gct ggt cac acc aga atg ttg cac cgt gtc atg 768
Leu Asn Asp Glu Leu Ala Gly His Thr Arg Met Leu His Arg Val Met
245 250 255
gct gaa gaa aga ttc cca gct cca tgg ggt cca tta atc gaa gct atg 816
Ala Glu Glu Arg Phe Pro Ala Pro Trp Gly Pro Leu Ile Glu Ala Met
260 265 270
gct gcc atc aga gtt caa tac tac aga acc tcc acc tct cgt tac aga 864
Ala Ala Ile Arg Val Gln Tyr Tyr Arg Thr Ser Thr Ser Arg Tyr Arg
275 280 285
tct gac gct gtt aga ggt ggt caa aga gcc cct gcc 900
Ser Asp Ala Val Arg Gly Gly Gln Arg Ala Pro Ala
290 295 300
<210> 16
<211> 300
<212> PRT
<213> Bradyrhizobium japonicum
<400> 16
Met Ile Gln Thr Glu Arg Ala Val Gln Gln Val Leu Glu Trp Gly Arg
1 5 10 15
Ser Leu Thr Gly Phe Ala Asp Glu His Ala Val Glu Ala Val Arg Gly
20 25 30
Gly Gln Tyr Ile Leu Gln Arg Ile His Pro Ser Leu Arg Gly Thr Ser
35 40 45
Ala Arg Thr Gly Arg Asp Pro Gln Asp Glu Thr Leu Ile Val Thr Phe
50 55 60
Tyr Arg Glu Leu Ala Leu Leu Phe Trp Leu Asp Asp Cys Asn Asp Leu
65 70 75 80
Gly Leu Ile Ser Pro Glu Gln Leu Ala Ala Val Glu Gln Ala Leu Gly
85 90 95
Gln Gly Val Pro Cys Ala Leu Pro Gly Phe Glu Gly Cys Ala Val Leu
100 105 110
Arg Ala Ser Leu Ala Thr Leu Ala Tyr Asp Arg Arg Asp Tyr Ala Gln
115 120 125
Leu Leu Asp Asp Thr Arg Cys Tyr Ser Ala Ala Leu Arg Ala Gly His
130 135 140
Ala Gln Ala Val Ala Ala Glu Arg Trp Ser Tyr Ala Glu Tyr Leu His
145 150 155 160
Asn Gly Ile Asp Ser Ile Ala Tyr Ala Asn Val Phe Cys Cys Leu Ser
165 170 175
Leu Leu Trp Gly Leu Asp Met Ala Thr Leu Arg Ala Arg Pro Ala Phe
180 185 190
Arg Gln Val Leu Arg Leu Ile Ser Ala Ile Gly Arg Leu Gln Asn Asp
195 200 205
Leu His Gly Cys Asp Lys Asp Arg Ser Ala Gly Glu Ala Asp Asn Ala
210 215 220
Val Ile Leu Leu Leu Gln Arg Tyr Pro Ala Met Pro Val Val Glu Phe
225 230 235 240
Leu Asn Asp Glu Leu Ala Gly His Thr Arg Met Leu His Arg Val Met
245 250 255
Ala Glu Glu Arg Phe Pro Ala Pro Trp Gly Pro Leu Ile Glu Ala Met
260 265 270
Ala Ala Ile Arg Val Gln Tyr Tyr Arg Thr Ser Thr Ser Arg Tyr Arg
275 280 285
Ser Asp Ala Val Arg Gly Gly Gln Arg Ala Pro Ala
290 295 300
<210> 17
<211> 2838
<212> DNA
<213> Phaeosphaeria sp.
<220>
<221> CDS
<222> (1)..(2838)
<400> 17
atg ttt gct aag ttt gat atg ttg gaa gaa gaa gct aga gct ttg gtc 48
Met Phe Ala Lys Phe Asp Met Leu Glu Glu Glu Ala Arg Ala Leu Val
1 5 10 15
aga aag gtc ggt aac gct gtt gac cca atc tac ggt ttc tct acc act 96
Arg Lys Val Gly Asn Ala Val Asp Pro Ile Tyr Gly Phe Ser Thr Thr
20 25 30
tcc tgt caa atc tac gac acc gct tgg gct gct atg atc tcc aag gaa 144
Ser Cys Gln Ile Tyr Asp Thr Ala Trp Ala Ala Met Ile Ser Lys Glu
35 40 45
gaa cac ggt gac aag gtc tgg ttg ttc cca gaa tct ttc aaa tac ttg 192
Glu His Gly Asp Lys Val Trp Leu Phe Pro Glu Ser Phe Lys Tyr Leu
50 55 60
ttg gaa aag caa ggt gaa gat ggt tcc tgg gaa aga cat cca aga tcc 240
Leu Glu Lys Gln Gly Glu Asp Gly Ser Trp Glu Arg His Pro Arg Ser
65 70 75 80
aag act gtc ggt gtt ttg aac act gct gcc gct tgt ttg gcc ttg ttg 288
Lys Thr Val Gly Val Leu Asn Thr Ala Ala Ala Cys Leu Ala Leu Leu
85 90 95
aga cac gtc aag aac cct ttg caa ttg caa gac att gct gcc caa gac 336
Arg His Val Lys Asn Pro Leu Gln Leu Gln Asp Ile Ala Ala Gln Asp
100 105 110
atc gaa ttg aga atc caa aga ggt ttg cgt tct ttg gaa gaa caa ttg 384
Ile Glu Leu Arg Ile Gln Arg Gly Leu Arg Ser Leu Glu Glu Gln Leu
115 120 125
att gct tgg gat gat gtc ttg gac acc aac cac atc ggt gtc gaa atg 432
Ile Ala Trp Asp Asp Val Leu Asp Thr Asn His Ile Gly Val Glu Met
130 135 140
att gtt cca gct ttg ttg gac tac ttg caa gcc gaa gat gaa aac gtt 480
Ile Val Pro Ala Leu Leu Asp Tyr Leu Gln Ala Glu Asp Glu Asn Val
145 150 155 160
gac ttt gaa ttc gaa tcc cac tct cta ttg atg caa atg tac aaa gaa 528
Asp Phe Glu Phe Glu Ser His Ser Leu Leu Met Gln Met Tyr Lys Glu
165 170 175
aag atg gcc aga ttc tcc cca gaa tcc tta tac aga gct aga cca tct 576
Lys Met Ala Arg Phe Ser Pro Glu Ser Leu Tyr Arg Ala Arg Pro Ser
180 185 190
tct gcc tta cat aat ttg gaa gct ttg atc ggt aaa ttg gac ttt gac 624
Ser Ala Leu His Asn Leu Glu Ala Leu Ile Gly Lys Leu Asp Phe Asp
195 200 205
aag gtt ggt cac cac ttg tac aac ggt tcc atg atg gct tcc cca tct 672
Lys Val Gly His His Leu Tyr Asn Gly Ser Met Met Ala Ser Pro Ser
210 215 220
tcc act gct gcc ttt cta atg cat gcc tct cca tgg tcc cac gaa gct 720
Ser Thr Ala Ala Phe Leu Met His Ala Ser Pro Trp Ser His Glu Ala
225 230 235 240
gaa gct tac cta cgt cac gtt ttc gaa gct ggt act ggt aag ggt tcc 768
Glu Ala Tyr Leu Arg His Val Phe Glu Ala Gly Thr Gly Lys Gly Ser
245 250 255
ggt ggt ttc cca ggt act tac cca acc act tac ttc gaa ttg aac tgg 816
Gly Gly Phe Pro Gly Thr Tyr Pro Thr Thr Tyr Phe Glu Leu Asn Trp
260 265 270
gtt ttg tcc act ttg atg aaa tct ggt ttc act tta tct gat ttg gaa 864
Val Leu Ser Thr Leu Met Lys Ser Gly Phe Thr Leu Ser Asp Leu Glu
275 280 285
tgt gat gaa tta tct tcc att gcc aac acc att gct gaa ggt ttc gaa 912
Cys Asp Glu Leu Ser Ser Ile Ala Asn Thr Ile Ala Glu Gly Phe Glu
290 295 300
tgt gac cac ggt gtc atc ggt ttt gct cca aga gct gtc gat gtc gac 960
Cys Asp His Gly Val Ile Gly Phe Ala Pro Arg Ala Val Asp Val Asp
305 310 315 320
gac acc gct aag ggt ttg ttg act tta act tta tta ggt atg gat gaa 1008
Asp Thr Ala Lys Gly Leu Leu Thr Leu Thr Leu Leu Gly Met Asp Glu
325 330 335
ggt gtt tct cca gct cca atg att gct atg ttc gaa gcc aag gac cac 1056
Gly Val Ser Pro Ala Pro Met Ile Ala Met Phe Glu Ala Lys Asp His
340 345 350
ttc ttg act ttc ttg ggt gaa cgt gac cca tct ttc act tct aac tgt 1104
Phe Leu Thr Phe Leu Gly Glu Arg Asp Pro Ser Phe Thr Ser Asn Cys
355 360 365
cac gtt ttg tta tct ttg ttg cac aga acc gat ttg ttg caa tat ttg 1152
His Val Leu Leu Ser Leu Leu His Arg Thr Asp Leu Leu Gln Tyr Leu
370 375 380
cct caa atc aga aag acc acc acc ttc ttg tgt gaa gcc tgg tgg gct 1200
Pro Gln Ile Arg Lys Thr Thr Thr Phe Leu Cys Glu Ala Trp Trp Ala
385 390 395 400
tgt gac ggt caa atc aag gac aaa tgg cac tta tct cac ttg tac cca 1248
Cys Asp Gly Gln Ile Lys Asp Lys Trp His Leu Ser His Leu Tyr Pro
405 410 415
acc atg ttg atg gtc caa gct ttc gct gaa atc tta ttg aag tct gct 1296
Thr Met Leu Met Val Gln Ala Phe Ala Glu Ile Leu Leu Lys Ser Ala
420 425 430
gaa ggt gaa cca ttg cac gat gct ttc gat gcc gct act ttg tcc aga 1344
Glu Gly Glu Pro Leu His Asp Ala Phe Asp Ala Ala Thr Leu Ser Arg
435 440 445
gtc tcc att tgt gtt ttc caa gct tgt ttg aga act cta ttg gcc caa 1392
Val Ser Ile Cys Val Phe Gln Ala Cys Leu Arg Thr Leu Leu Ala Gln
450 455 460
tct caa gat ggt tct tgg cac ggt caa cca gaa gct tct tgt tac gct 1440
Ser Gln Asp Gly Ser Trp His Gly Gln Pro Glu Ala Ser Cys Tyr Ala
465 470 475 480
gtc ttg act ttg gct gaa tct ggt cgt ttg gtt ttg ttg caa gct ttg 1488
Val Leu Thr Leu Ala Glu Ser Gly Arg Leu Val Leu Leu Gln Ala Leu
485 490 495
caa cca caa atc gct gct gcc atg gaa aag gcc gct gat gtt atg caa 1536
Gln Pro Gln Ile Ala Ala Ala Met Glu Lys Ala Ala Asp Val Met Gln
500 505 510
gct ggt aga tgg tcc tgt tct gat cat gac tgt gac tgg acc tcc aag 1584
Ala Gly Arg Trp Ser Cys Ser Asp His Asp Cys Asp Trp Thr Ser Lys
515 520 525
acc gct tac aga gtt gac tta gtt gct gct gct tac aga tta gct gct 1632
Thr Ala Tyr Arg Val Asp Leu Val Ala Ala Ala Tyr Arg Leu Ala Ala
530 535 540
atg aaa gct tct tcc aac ttg act ttc acc gtc gat gac aac gtt tcc 1680
Met Lys Ala Ser Ser Asn Leu Thr Phe Thr Val Asp Asp Asn Val Ser
545 550 555 560
aag aga tcc aac ggt ttc caa caa ttg gtc ggt aga act gac ttg ttc 1728
Lys Arg Ser Asn Gly Phe Gln Gln Leu Val Gly Arg Thr Asp Leu Phe
565 570 575
tct ggt gtc cca gcc tgg gaa tta caa gct tct ttc ttg gaa tct gct 1776
Ser Gly Val Pro Ala Trp Glu Leu Gln Ala Ser Phe Leu Glu Ser Ala
580 585 590
ttg ttc gtt cct ttg ttg aga aac cat aga ttg gat gtt ttc gac aga 1824
Leu Phe Val Pro Leu Leu Arg Asn His Arg Leu Asp Val Phe Asp Arg
595 600 605
gat gac atc aag gtt tct aag gac cac tac ttg gac atg atc cca ttc 1872
Asp Asp Ile Lys Val Ser Lys Asp His Tyr Leu Asp Met Ile Pro Phe
610 615 620
acc tgg gtt ggt tgt aac aac aga tct aga acc tac gtt tct acc tct 1920
Thr Trp Val Gly Cys Asn Asn Arg Ser Arg Thr Tyr Val Ser Thr Ser
625 630 635 640
ttc ttg ttc gac atg atg atc atc tcc atg ttg ggt tac caa atc gat 1968
Phe Leu Phe Asp Met Met Ile Ile Ser Met Leu Gly Tyr Gln Ile Asp
645 650 655
gaa ttc ttc gaa gct gaa gct gct cca gct ttc gct caa tgt att ggt 2016
Glu Phe Phe Glu Ala Glu Ala Ala Pro Ala Phe Ala Gln Cys Ile Gly
660 665 670
caa ttg cac caa gtc gtc gac aag gtt gtt gat gaa gtc att gac gaa 2064
Gln Leu His Gln Val Val Asp Lys Val Val Asp Glu Val Ile Asp Glu
675 680 685
gtc gtc gat aag gtt gtc ggt aag gtc gtt ggt aag gtt gtt ggt aag 2112
Val Val Asp Lys Val Val Gly Lys Val Val Gly Lys Val Val Gly Lys
690 695 700
gtt gtc gac gaa cgt gtt gac tct cca acc cac gaa gcc att gcc atc 2160
Val Val Asp Glu Arg Val Asp Ser Pro Thr His Glu Ala Ile Ala Ile
705 710 715 720
tgt aac atc gaa gct tct tta aga aga ttc gtc gat cac gtt ttg cac 2208
Cys Asn Ile Glu Ala Ser Leu Arg Arg Phe Val Asp His Val Leu His
725 730 735
cac caa cat gtc tta cac gct tct caa caa gaa caa gac att ttg tgg 2256
His Gln His Val Leu His Ala Ser Gln Gln Glu Gln Asp Ile Leu Trp
740 745 750
aga gaa ttg aga gcc ttc ttg cac gct cac gtt gtc caa atg gcc gac 2304
Arg Glu Leu Arg Ala Phe Leu His Ala His Val Val Gln Met Ala Asp
755 760 765
aac tcc act ttg gct cca cca ggt aga act ttc ttc gac tgg gtt aga 2352
Asn Ser Thr Leu Ala Pro Pro Gly Arg Thr Phe Phe Asp Trp Val Arg
770 775 780
acc act gct gct gac cac gtt gcc tgt gct tac tct ttc gct ttc gct 2400
Thr Thr Ala Ala Asp His Val Ala Cys Ala Tyr Ser Phe Ala Phe Ala
785 790 795 800
tgt tgt atc acc tct gct act atc ggt caa ggt caa tct atg ttc gct 2448
Cys Cys Ile Thr Ser Ala Thr Ile Gly Gln Gly Gln Ser Met Phe Ala
805 810 815
acc gtc aac gaa ttg tac ttg gtc caa gct gct gct cgt cac atg acc 2496
Thr Val Asn Glu Leu Tyr Leu Val Gln Ala Ala Ala Arg His Met Thr
820 825 830
acc atg tgt aga atg tgt aac gac att ggt tcc gtc gac aga gat ttc 2544
Thr Met Cys Arg Met Cys Asn Asp Ile Gly Ser Val Asp Arg Asp Phe
835 840 845
atc gaa gct aac att aac tcc gtt cac ttc cca gaa ttc tcc act ttg 2592
Ile Glu Ala Asn Ile Asn Ser Val His Phe Pro Glu Phe Ser Thr Leu
850 855 860
tct ttg gtt gct gac aag aag aaa gct ttg gcc aga ttg gct gcc tac 2640
Ser Leu Val Ala Asp Lys Lys Lys Ala Leu Ala Arg Leu Ala Ala Tyr
865 870 875 880
gaa aag tct tgt ttg act cac act ttg gac caa ttt gaa aat gaa gtt 2688
Glu Lys Ser Cys Leu Thr His Thr Leu Asp Gln Phe Glu Asn Glu Val
885 890 895
tta caa tct cca cgt gtt tct tct gct gcc tcc ggt gat ttc aga acc 2736
Leu Gln Ser Pro Arg Val Ser Ser Ala Ala Ser Gly Asp Phe Arg Thr
900 905 910
aga aag gtt gcc gtt gtt aga ttc ttt gct gat gtc act gac ttc tac 2784
Arg Lys Val Ala Val Val Arg Phe Phe Ala Asp Val Thr Asp Phe Tyr
915 920 925
gat caa ttg tac att ttg aga gat ttg tcc tct tct ttg aag cat gtc 2832
Asp Gln Leu Tyr Ile Leu Arg Asp Leu Ser Ser Ser Leu Lys His Val
930 935 940
gga acc 2838
Gly Thr
945
<210> 18
<211> 946
<212> PRT
<213> Phaeosphaeria sp.
<400> 18
Met Phe Ala Lys Phe Asp Met Leu Glu Glu Glu Ala Arg Ala Leu Val
1 5 10 15
Arg Lys Val Gly Asn Ala Val Asp Pro Ile Tyr Gly Phe Ser Thr Thr
20 25 30
Ser Cys Gln Ile Tyr Asp Thr Ala Trp Ala Ala Met Ile Ser Lys Glu
35 40 45
Glu His Gly Asp Lys Val Trp Leu Phe Pro Glu Ser Phe Lys Tyr Leu
50 55 60
Leu Glu Lys Gln Gly Glu Asp Gly Ser Trp Glu Arg His Pro Arg Ser
65 70 75 80
Lys Thr Val Gly Val Leu Asn Thr Ala Ala Ala Cys Leu Ala Leu Leu
85 90 95
Arg His Val Lys Asn Pro Leu Gln Leu Gln Asp Ile Ala Ala Gln Asp
100 105 110
Ile Glu Leu Arg Ile Gln Arg Gly Leu Arg Ser Leu Glu Glu Gln Leu
115 120 125
Ile Ala Trp Asp Asp Val Leu Asp Thr Asn His Ile Gly Val Glu Met
130 135 140
Ile Val Pro Ala Leu Leu Asp Tyr Leu Gln Ala Glu Asp Glu Asn Val
145 150 155 160
Asp Phe Glu Phe Glu Ser His Ser Leu Leu Met Gln Met Tyr Lys Glu
165 170 175
Lys Met Ala Arg Phe Ser Pro Glu Ser Leu Tyr Arg Ala Arg Pro Ser
180 185 190
Ser Ala Leu His Asn Leu Glu Ala Leu Ile Gly Lys Leu Asp Phe Asp
195 200 205
Lys Val Gly His His Leu Tyr Asn Gly Ser Met Met Ala Ser Pro Ser
210 215 220
Ser Thr Ala Ala Phe Leu Met His Ala Ser Pro Trp Ser His Glu Ala
225 230 235 240
Glu Ala Tyr Leu Arg His Val Phe Glu Ala Gly Thr Gly Lys Gly Ser
245 250 255
Gly Gly Phe Pro Gly Thr Tyr Pro Thr Thr Tyr Phe Glu Leu Asn Trp
260 265 270
Val Leu Ser Thr Leu Met Lys Ser Gly Phe Thr Leu Ser Asp Leu Glu
275 280 285
Cys Asp Glu Leu Ser Ser Ile Ala Asn Thr Ile Ala Glu Gly Phe Glu
290 295 300
Cys Asp His Gly Val Ile Gly Phe Ala Pro Arg Ala Val Asp Val Asp
305 310 315 320
Asp Thr Ala Lys Gly Leu Leu Thr Leu Thr Leu Leu Gly Met Asp Glu
325 330 335
Gly Val Ser Pro Ala Pro Met Ile Ala Met Phe Glu Ala Lys Asp His
340 345 350
Phe Leu Thr Phe Leu Gly Glu Arg Asp Pro Ser Phe Thr Ser Asn Cys
355 360 365
His Val Leu Leu Ser Leu Leu His Arg Thr Asp Leu Leu Gln Tyr Leu
370 375 380
Pro Gln Ile Arg Lys Thr Thr Thr Phe Leu Cys Glu Ala Trp Trp Ala
385 390 395 400
Cys Asp Gly Gln Ile Lys Asp Lys Trp His Leu Ser His Leu Tyr Pro
405 410 415
Thr Met Leu Met Val Gln Ala Phe Ala Glu Ile Leu Leu Lys Ser Ala
420 425 430
Glu Gly Glu Pro Leu His Asp Ala Phe Asp Ala Ala Thr Leu Ser Arg
435 440 445
Val Ser Ile Cys Val Phe Gln Ala Cys Leu Arg Thr Leu Leu Ala Gln
450 455 460
Ser Gln Asp Gly Ser Trp His Gly Gln Pro Glu Ala Ser Cys Tyr Ala
465 470 475 480
Val Leu Thr Leu Ala Glu Ser Gly Arg Leu Val Leu Leu Gln Ala Leu
485 490 495
Gln Pro Gln Ile Ala Ala Ala Met Glu Lys Ala Ala Asp Val Met Gln
500 505 510
Ala Gly Arg Trp Ser Cys Ser Asp His Asp Cys Asp Trp Thr Ser Lys
515 520 525
Thr Ala Tyr Arg Val Asp Leu Val Ala Ala Ala Tyr Arg Leu Ala Ala
530 535 540
Met Lys Ala Ser Ser Asn Leu Thr Phe Thr Val Asp Asp Asn Val Ser
545 550 555 560
Lys Arg Ser Asn Gly Phe Gln Gln Leu Val Gly Arg Thr Asp Leu Phe
565 570 575
Ser Gly Val Pro Ala Trp Glu Leu Gln Ala Ser Phe Leu Glu Ser Ala
580 585 590
Leu Phe Val Pro Leu Leu Arg Asn His Arg Leu Asp Val Phe Asp Arg
595 600 605
Asp Asp Ile Lys Val Ser Lys Asp His Tyr Leu Asp Met Ile Pro Phe
610 615 620
Thr Trp Val Gly Cys Asn Asn Arg Ser Arg Thr Tyr Val Ser Thr Ser
625 630 635 640
Phe Leu Phe Asp Met Met Ile Ile Ser Met Leu Gly Tyr Gln Ile Asp
645 650 655
Glu Phe Phe Glu Ala Glu Ala Ala Pro Ala Phe Ala Gln Cys Ile Gly
660 665 670
Gln Leu His Gln Val Val Asp Lys Val Val Asp Glu Val Ile Asp Glu
675 680 685
Val Val Asp Lys Val Val Gly Lys Val Val Gly Lys Val Val Gly Lys
690 695 700
Val Val Asp Glu Arg Val Asp Ser Pro Thr His Glu Ala Ile Ala Ile
705 710 715 720
Cys Asn Ile Glu Ala Ser Leu Arg Arg Phe Val Asp His Val Leu His
725 730 735
His Gln His Val Leu His Ala Ser Gln Gln Glu Gln Asp Ile Leu Trp
740 745 750
Arg Glu Leu Arg Ala Phe Leu His Ala His Val Val Gln Met Ala Asp
755 760 765
Asn Ser Thr Leu Ala Pro Pro Gly Arg Thr Phe Phe Asp Trp Val Arg
770 775 780
Thr Thr Ala Ala Asp His Val Ala Cys Ala Tyr Ser Phe Ala Phe Ala
785 790 795 800
Cys Cys Ile Thr Ser Ala Thr Ile Gly Gln Gly Gln Ser Met Phe Ala
805 810 815
Thr Val Asn Glu Leu Tyr Leu Val Gln Ala Ala Ala Arg His Met Thr
820 825 830
Thr Met Cys Arg Met Cys Asn Asp Ile Gly Ser Val Asp Arg Asp Phe
835 840 845
Ile Glu Ala Asn Ile Asn Ser Val His Phe Pro Glu Phe Ser Thr Leu
850 855 860
Ser Leu Val Ala Asp Lys Lys Lys Ala Leu Ala Arg Leu Ala Ala Tyr
865 870 875 880
Glu Lys Ser Cys Leu Thr His Thr Leu Asp Gln Phe Glu Asn Glu Val
885 890 895
Leu Gln Ser Pro Arg Val Ser Ser Ala Ala Ser Gly Asp Phe Arg Thr
900 905 910
Arg Lys Val Ala Val Val Arg Phe Phe Ala Asp Val Thr Asp Phe Tyr
915 920 925
Asp Gln Leu Tyr Ile Leu Arg Asp Leu Ser Ser Ser Leu Lys His Val
930 935 940
Gly Thr
945
<210> 19
<211> 2856
<212> DNA
<213> Gibberella fujikuroi
<220>
<221> CDS
<222> (1)..(2856)
<400> 19
atg cca ggt aag atc gaa aac ggt act cca aag gat tta aag act ggt 48
Met Pro Gly Lys Ile Glu Asn Gly Thr Pro Lys Asp Leu Lys Thr Gly
1 5 10 15
aac gat ttc gtt tct gct gcc aag tct ttg ttg gac aga gct ttc aaa 96
Asn Asp Phe Val Ser Ala Ala Lys Ser Leu Leu Asp Arg Ala Phe Lys
20 25 30
tct cac cac tct tac tac ggt tta tgt tcc act tct tgt caa gtt tac 144
Ser His His Ser Tyr Tyr Gly Leu Cys Ser Thr Ser Cys Gln Val Tyr
35 40 45
gac act gct tgg gtt gcc atg atc cca aag acc aga gat aat gtc aaa 192
Asp Thr Ala Trp Val Ala Met Ile Pro Lys Thr Arg Asp Asn Val Lys
50 55 60
caa tgg ttg ttc cct gaa tgt ttc cac tac ttg ttg aaa acc caa gct 240
Gln Trp Leu Phe Pro Glu Cys Phe His Tyr Leu Leu Lys Thr Gln Ala
65 70 75 80
gct gat ggt tcc tgg ggt tct ttg cca acc act caa acc gct ggt atc 288
Ala Asp Gly Ser Trp Gly Ser Leu Pro Thr Thr Gln Thr Ala Gly Ile
85 90 95
ttg gat acc gct tcc gct gtc ttg gct ttg cta tgt cac gct caa gaa 336
Leu Asp Thr Ala Ser Ala Val Leu Ala Leu Leu Cys His Ala Gln Glu
100 105 110
cca ttg caa atc ttg gat gtt tct cca gat gaa atg ggt ttg aga atc 384
Pro Leu Gln Ile Leu Asp Val Ser Pro Asp Glu Met Gly Leu Arg Ile
115 120 125
gaa cac ggt gtt act tct ttg aag aga caa ttg gct gtc tgg aac gac 432
Glu His Gly Val Thr Ser Leu Lys Arg Gln Leu Ala Val Trp Asn Asp
130 135 140
gtc gag gac act aac cac att ggt gtc gaa ttt att att cca gct tta 480
Val Glu Asp Thr Asn His Ile Gly Val Glu Phe Ile Ile Pro Ala Leu
145 150 155 160
ttg tcc atg ttg gaa aag gaa ttg gat gtt cca tct ttc gaa ttc cca 528
Leu Ser Met Leu Glu Lys Glu Leu Asp Val Pro Ser Phe Glu Phe Pro
165 170 175
tgt aga tcc atc ttg gaa aga atg cac ggt gaa aag ttg ggt cac ttc 576
Cys Arg Ser Ile Leu Glu Arg Met His Gly Glu Lys Leu Gly His Phe
180 185 190
gat ttg gaa caa gtc tac ggt aag cca tct tct tta cta cat tct ttg 624
Asp Leu Glu Gln Val Tyr Gly Lys Pro Ser Ser Leu Leu His Ser Leu
195 200 205
gaa gct ttc tta ggt aag tta gac ttc gac aga ttg tcc cac cac ttg 672
Glu Ala Phe Leu Gly Lys Leu Asp Phe Asp Arg Leu Ser His His Leu
210 215 220
tac cac ggt tcc atg atg gct tct cca tcc tcc act gct gcc tac ttg 720
Tyr His Gly Ser Met Met Ala Ser Pro Ser Ser Thr Ala Ala Tyr Leu
225 230 235 240
att ggt gct acc aaa tgg gat gac gaa gct gag gac tac ttg aga cac 768
Ile Gly Ala Thr Lys Trp Asp Asp Glu Ala Glu Asp Tyr Leu Arg His
245 250 255
gtt atg aga aac ggt gct ggt cat ggt aac ggt ggt atc tcc ggt act 816
Val Met Arg Asn Gly Ala Gly His Gly Asn Gly Gly Ile Ser Gly Thr
260 265 270
ttc cca acc acc cat ttc gaa tgt tcc tgg atc att gcc act ttg ttg 864
Phe Pro Thr Thr His Phe Glu Cys Ser Trp Ile Ile Ala Thr Leu Leu
275 280 285
aag gtt ggt ttc acc ttg aaa caa att gac ggt gac ggt ttg aga ggt 912
Lys Val Gly Phe Thr Leu Lys Gln Ile Asp Gly Asp Gly Leu Arg Gly
290 295 300
cta tcc act atc tta ttg gaa gct tta cgt gac gaa aac ggt gtc atc 960
Leu Ser Thr Ile Leu Leu Glu Ala Leu Arg Asp Glu Asn Gly Val Ile
305 310 315 320
ggt ttc gct cca aga acc gct gat gtc gac gac acc gcc aag gct ttg 1008
Gly Phe Ala Pro Arg Thr Ala Asp Val Asp Asp Thr Ala Lys Ala Leu
325 330 335
ttg gct ttg tct tta gtt aac caa cct gtc tct cca gat atc atg att 1056
Leu Ala Leu Ser Leu Val Asn Gln Pro Val Ser Pro Asp Ile Met Ile
340 345 350
aag gtt ttc gaa ggt aag gac cat ttt acc act ttc ggt tct gaa cgt 1104
Lys Val Phe Glu Gly Lys Asp His Phe Thr Thr Phe Gly Ser Glu Arg
355 360 365
gat cca tct ttg acc tct aac ttg cac gtc ttg ttg tct ttg ttg aag 1152
Asp Pro Ser Leu Thr Ser Asn Leu His Val Leu Leu Ser Leu Leu Lys
370 375 380
caa tct aac ttg tct caa tac cat cca caa atc tta aag act act ttg 1200
Gln Ser Asn Leu Ser Gln Tyr His Pro Gln Ile Leu Lys Thr Thr Leu
385 390 395 400
ttt acc tgt aga tgg tgg tgg ggt tct gac cat tgt gtc aag gac aag 1248
Phe Thr Cys Arg Trp Trp Trp Gly Ser Asp His Cys Val Lys Asp Lys
405 410 415
tgg aac ttg tct cat ttg tac cca acc atg ttg ttg gtc gaa gcc ttc 1296
Trp Asn Leu Ser His Leu Tyr Pro Thr Met Leu Leu Val Glu Ala Phe
420 425 430
act gaa gtc ttg cac ttg att gac ggt ggt gaa ttg tcc tct ttg ttc 1344
Thr Glu Val Leu His Leu Ile Asp Gly Gly Glu Leu Ser Ser Leu Phe
435 440 445
gat gaa tct ttc aaa tgt aag atc ggt ttg tcc att ttc caa gct gtt 1392
Asp Glu Ser Phe Lys Cys Lys Ile Gly Leu Ser Ile Phe Gln Ala Val
450 455 460
ttg aga atc atc ttg act caa gac aac gat ggt tcc tgg aga ggt tac 1440
Leu Arg Ile Ile Leu Thr Gln Asp Asn Asp Gly Ser Trp Arg Gly Tyr
465 470 475 480
aga gaa caa acc tgt tac gcc att ttg gct ttg gtc caa gct cgt cac 1488
Arg Glu Gln Thr Cys Tyr Ala Ile Leu Ala Leu Val Gln Ala Arg His
485 490 495
gtt tgt ttc ttc act cac atg gtc gat aga tta caa tct tgt gtc gac 1536
Val Cys Phe Phe Thr His Met Val Asp Arg Leu Gln Ser Cys Val Asp
500 505 510
aga ggt ttc tcc tgg tta aag tct tgt tcc ttc cac tct caa gac ttg 1584
Arg Gly Phe Ser Trp Leu Lys Ser Cys Ser Phe His Ser Gln Asp Leu
515 520 525
act tgg acc tcc aag act gct tac gaa gtt ggt ttc gtt gct gaa gcc 1632
Thr Trp Thr Ser Lys Thr Ala Tyr Glu Val Gly Phe Val Ala Glu Ala
530 535 540
tac aaa ttg gct gct ttg caa tct gct tct ttg gaa gtt cca gct gct 1680
Tyr Lys Leu Ala Ala Leu Gln Ser Ala Ser Leu Glu Val Pro Ala Ala
545 550 555 560
acc atc ggt cac tcc gtt acc tct gct gtc cca tct tct gac ttg gaa 1728
Thr Ile Gly His Ser Val Thr Ser Ala Val Pro Ser Ser Asp Leu Glu
565 570 575
aag tac atg aga ttg gtc cgt aag act gct ttg ttc tct cca ttg gac 1776
Lys Tyr Met Arg Leu Val Arg Lys Thr Ala Leu Phe Ser Pro Leu Asp
580 585 590
gaa tgg ggt ttg atg gct tcc att att gaa tct tct ttc ttt gtc cca 1824
Glu Trp Gly Leu Met Ala Ser Ile Ile Glu Ser Ser Phe Phe Val Pro
595 600 605
ttg ttg caa gct caa aga gtc gaa atc tac cca aga gat aat atc aag 1872
Leu Leu Gln Ala Gln Arg Val Glu Ile Tyr Pro Arg Asp Asn Ile Lys
610 615 620
gtt gac gag gac aag tac ttg tcc atc att cca ttt acc tgg gtt ggt 1920
Val Asp Glu Asp Lys Tyr Leu Ser Ile Ile Pro Phe Thr Trp Val Gly
625 630 635 640
tgt aat aac aga tct aga act ttc gct tct aac aga tgg tta tat gac 1968
Cys Asn Asn Arg Ser Arg Thr Phe Ala Ser Asn Arg Trp Leu Tyr Asp
645 650 655
atg atg tac tta tct ttg ttg ggt tac caa act gac gaa tac atg gaa 2016
Met Met Tyr Leu Ser Leu Leu Gly Tyr Gln Thr Asp Glu Tyr Met Glu
660 665 670
gct gtt gcc ggt cca gtt ttc ggt gat gtt tct cta ttg cac caa acc 2064
Ala Val Ala Gly Pro Val Phe Gly Asp Val Ser Leu Leu His Gln Thr
675 680 685
atc gac aag gtt atc gac aac acc atg ggt aac ttg gcc aga gcc aac 2112
Ile Asp Lys Val Ile Asp Asn Thr Met Gly Asn Leu Ala Arg Ala Asn
690 695 700
ggt act gtt cac tct ggt aac ggt cac caa cac gaa tct cca aac att 2160
Gly Thr Val His Ser Gly Asn Gly His Gln His Glu Ser Pro Asn Ile
705 710 715 720
ggt caa gtt gag gac act ttg acc aga ttc acc aac tcc gtt ttg aac 2208
Gly Gln Val Glu Asp Thr Leu Thr Arg Phe Thr Asn Ser Val Leu Asn
725 730 735
cac aag gac gtc ttg aac tct tct tct tct gac caa gat acc tta cgt 2256
His Lys Asp Val Leu Asn Ser Ser Ser Ser Asp Gln Asp Thr Leu Arg
740 745 750
cgt gaa ttc aga act ttc atg cac gct cac atc acc caa atc gaa gat 2304
Arg Glu Phe Arg Thr Phe Met His Ala His Ile Thr Gln Ile Glu Asp
755 760 765
aac tcc aga ttc tcc aag caa gcc tcc tct gat gcc ttt tct tct cca 2352
Asn Ser Arg Phe Ser Lys Gln Ala Ser Ser Asp Ala Phe Ser Ser Pro
770 775 780
gaa caa tct tac ttc caa tgg gtt aac tcc act ggt ggt tct cac gtt 2400
Glu Gln Ser Tyr Phe Gln Trp Val Asn Ser Thr Gly Gly Ser His Val
785 790 795 800
gct tgt gcc tac tcc ttc gcc ttt tcc aac tgt ttg atg tcc gct aac 2448
Ala Cys Ala Tyr Ser Phe Ala Phe Ser Asn Cys Leu Met Ser Ala Asn
805 810 815
tta ttg caa ggt aag gat gct ttc cca tct ggt act caa aag tac ttg 2496
Leu Leu Gln Gly Lys Asp Ala Phe Pro Ser Gly Thr Gln Lys Tyr Leu
820 825 830
atc tcc tcc gtc atg aga cac gct acc aac atg tgt cgt atg tac aac 2544
Ile Ser Ser Val Met Arg His Ala Thr Asn Met Cys Arg Met Tyr Asn
835 840 845
gat ttc ggt tct att gcc aga gac aac gct gaa aga aac gtt aac tcc 2592
Asp Phe Gly Ser Ile Ala Arg Asp Asn Ala Glu Arg Asn Val Asn Ser
850 855 860
att cac ttc cct gaa ttc act cta tgt aac ggt act tct caa aac ttg 2640
Ile His Phe Pro Glu Phe Thr Leu Cys Asn Gly Thr Ser Gln Asn Leu
865 870 875 880
gat gaa aga aag gaa aga tta ttg aag atc gct acc tac gaa caa ggt 2688
Asp Glu Arg Lys Glu Arg Leu Leu Lys Ile Ala Thr Tyr Glu Gln Gly
885 890 895
tac ttg gac aga gct ttg gaa gct ttg gaa aga caa tcc aga gat gac 2736
Tyr Leu Asp Arg Ala Leu Glu Ala Leu Glu Arg Gln Ser Arg Asp Asp
900 905 910
gct ggt gac aga gct ggt tcc aag gac atg aga aaa ttg aag att gtc 2784
Ala Gly Asp Arg Ala Gly Ser Lys Asp Met Arg Lys Leu Lys Ile Val
915 920 925
aaa ttg ttc tgt gat gtt acc gac ttg tac gac caa ttg tac gtc atc 2832
Lys Leu Phe Cys Asp Val Thr Asp Leu Tyr Asp Gln Leu Tyr Val Ile
930 935 940
aag gac ttg tcc tct tcc atg aag 2856
Lys Asp Leu Ser Ser Ser Met Lys
945 950
<210> 20
<211> 952
<212> PRT
<213> Gibberella fujikuroi
<400> 20
Met Pro Gly Lys Ile Glu Asn Gly Thr Pro Lys Asp Leu Lys Thr Gly
1 5 10 15
Asn Asp Phe Val Ser Ala Ala Lys Ser Leu Leu Asp Arg Ala Phe Lys
20 25 30
Ser His His Ser Tyr Tyr Gly Leu Cys Ser Thr Ser Cys Gln Val Tyr
35 40 45
Asp Thr Ala Trp Val Ala Met Ile Pro Lys Thr Arg Asp Asn Val Lys
50 55 60
Gln Trp Leu Phe Pro Glu Cys Phe His Tyr Leu Leu Lys Thr Gln Ala
65 70 75 80
Ala Asp Gly Ser Trp Gly Ser Leu Pro Thr Thr Gln Thr Ala Gly Ile
85 90 95
Leu Asp Thr Ala Ser Ala Val Leu Ala Leu Leu Cys His Ala Gln Glu
100 105 110
Pro Leu Gln Ile Leu Asp Val Ser Pro Asp Glu Met Gly Leu Arg Ile
115 120 125
Glu His Gly Val Thr Ser Leu Lys Arg Gln Leu Ala Val Trp Asn Asp
130 135 140
Val Glu Asp Thr Asn His Ile Gly Val Glu Phe Ile Ile Pro Ala Leu
145 150 155 160
Leu Ser Met Leu Glu Lys Glu Leu Asp Val Pro Ser Phe Glu Phe Pro
165 170 175
Cys Arg Ser Ile Leu Glu Arg Met His Gly Glu Lys Leu Gly His Phe
180 185 190
Asp Leu Glu Gln Val Tyr Gly Lys Pro Ser Ser Leu Leu His Ser Leu
195 200 205
Glu Ala Phe Leu Gly Lys Leu Asp Phe Asp Arg Leu Ser His His Leu
210 215 220
Tyr His Gly Ser Met Met Ala Ser Pro Ser Ser Thr Ala Ala Tyr Leu
225 230 235 240
Ile Gly Ala Thr Lys Trp Asp Asp Glu Ala Glu Asp Tyr Leu Arg His
245 250 255
Val Met Arg Asn Gly Ala Gly His Gly Asn Gly Gly Ile Ser Gly Thr
260 265 270
Phe Pro Thr Thr His Phe Glu Cys Ser Trp Ile Ile Ala Thr Leu Leu
275 280 285
Lys Val Gly Phe Thr Leu Lys Gln Ile Asp Gly Asp Gly Leu Arg Gly
290 295 300
Leu Ser Thr Ile Leu Leu Glu Ala Leu Arg Asp Glu Asn Gly Val Ile
305 310 315 320
Gly Phe Ala Pro Arg Thr Ala Asp Val Asp Asp Thr Ala Lys Ala Leu
325 330 335
Leu Ala Leu Ser Leu Val Asn Gln Pro Val Ser Pro Asp Ile Met Ile
340 345 350
Lys Val Phe Glu Gly Lys Asp His Phe Thr Thr Phe Gly Ser Glu Arg
355 360 365
Asp Pro Ser Leu Thr Ser Asn Leu His Val Leu Leu Ser Leu Leu Lys
370 375 380
Gln Ser Asn Leu Ser Gln Tyr His Pro Gln Ile Leu Lys Thr Thr Leu
385 390 395 400
Phe Thr Cys Arg Trp Trp Trp Gly Ser Asp His Cys Val Lys Asp Lys
405 410 415
Trp Asn Leu Ser His Leu Tyr Pro Thr Met Leu Leu Val Glu Ala Phe
420 425 430
Thr Glu Val Leu His Leu Ile Asp Gly Gly Glu Leu Ser Ser Leu Phe
435 440 445
Asp Glu Ser Phe Lys Cys Lys Ile Gly Leu Ser Ile Phe Gln Ala Val
450 455 460
Leu Arg Ile Ile Leu Thr Gln Asp Asn Asp Gly Ser Trp Arg Gly Tyr
465 470 475 480
Arg Glu Gln Thr Cys Tyr Ala Ile Leu Ala Leu Val Gln Ala Arg His
485 490 495
Val Cys Phe Phe Thr His Met Val Asp Arg Leu Gln Ser Cys Val Asp
500 505 510
Arg Gly Phe Ser Trp Leu Lys Ser Cys Ser Phe His Ser Gln Asp Leu
515 520 525
Thr Trp Thr Ser Lys Thr Ala Tyr Glu Val Gly Phe Val Ala Glu Ala
530 535 540
Tyr Lys Leu Ala Ala Leu Gln Ser Ala Ser Leu Glu Val Pro Ala Ala
545 550 555 560
Thr Ile Gly His Ser Val Thr Ser Ala Val Pro Ser Ser Asp Leu Glu
565 570 575
Lys Tyr Met Arg Leu Val Arg Lys Thr Ala Leu Phe Ser Pro Leu Asp
580 585 590
Glu Trp Gly Leu Met Ala Ser Ile Ile Glu Ser Ser Phe Phe Val Pro
595 600 605
Leu Leu Gln Ala Gln Arg Val Glu Ile Tyr Pro Arg Asp Asn Ile Lys
610 615 620
Val Asp Glu Asp Lys Tyr Leu Ser Ile Ile Pro Phe Thr Trp Val Gly
625 630 635 640
Cys Asn Asn Arg Ser Arg Thr Phe Ala Ser Asn Arg Trp Leu Tyr Asp
645 650 655
Met Met Tyr Leu Ser Leu Leu Gly Tyr Gln Thr Asp Glu Tyr Met Glu
660 665 670
Ala Val Ala Gly Pro Val Phe Gly Asp Val Ser Leu Leu His Gln Thr
675 680 685
Ile Asp Lys Val Ile Asp Asn Thr Met Gly Asn Leu Ala Arg Ala Asn
690 695 700
Gly Thr Val His Ser Gly Asn Gly His Gln His Glu Ser Pro Asn Ile
705 710 715 720
Gly Gln Val Glu Asp Thr Leu Thr Arg Phe Thr Asn Ser Val Leu Asn
725 730 735
His Lys Asp Val Leu Asn Ser Ser Ser Ser Asp Gln Asp Thr Leu Arg
740 745 750
Arg Glu Phe Arg Thr Phe Met His Ala His Ile Thr Gln Ile Glu Asp
755 760 765
Asn Ser Arg Phe Ser Lys Gln Ala Ser Ser Asp Ala Phe Ser Ser Pro
770 775 780
Glu Gln Ser Tyr Phe Gln Trp Val Asn Ser Thr Gly Gly Ser His Val
785 790 795 800
Ala Cys Ala Tyr Ser Phe Ala Phe Ser Asn Cys Leu Met Ser Ala Asn
805 810 815
Leu Leu Gln Gly Lys Asp Ala Phe Pro Ser Gly Thr Gln Lys Tyr Leu
820 825 830
Ile Ser Ser Val Met Arg His Ala Thr Asn Met Cys Arg Met Tyr Asn
835 840 845
Asp Phe Gly Ser Ile Ala Arg Asp Asn Ala Glu Arg Asn Val Asn Ser
850 855 860
Ile His Phe Pro Glu Phe Thr Leu Cys Asn Gly Thr Ser Gln Asn Leu
865 870 875 880
Asp Glu Arg Lys Glu Arg Leu Leu Lys Ile Ala Thr Tyr Glu Gln Gly
885 890 895
Tyr Leu Asp Arg Ala Leu Glu Ala Leu Glu Arg Gln Ser Arg Asp Asp
900 905 910
Ala Gly Asp Arg Ala Gly Ser Lys Asp Met Arg Lys Leu Lys Ile Val
915 920 925
Lys Leu Phe Cys Asp Val Thr Asp Leu Tyr Asp Gln Leu Tyr Val Ile
930 935 940
Lys Asp Leu Ser Ser Ser Met Lys
945 950
<210> 21
<211> 1533
<212> DNA
<213> Lactuca sativa
<220>
<221> CDS
<222> (1)..(1533)
<400> 21
atg gac ttg caa acc atg gcc cca atg ggt tct gct gct atc gcc atc 48
Met Asp Leu Gln Thr Met Ala Pro Met Gly Ser Ala Ala Ile Ala Ile
1 5 10 15
ggt ggt cca gct gtc gcc gtt gct ggt ggt atc tcc ttg ttg ttc ttg 96
Gly Gly Pro Ala Val Ala Val Ala Gly Gly Ile Ser Leu Leu Phe Leu
20 25 30
aaa tcc ttc ttg tct caa caa cca ggt aac cca aac cac ttg cca tct 144
Lys Ser Phe Leu Ser Gln Gln Pro Gly Asn Pro Asn His Leu Pro Ser
35 40 45
gtc cca gct gtc cca ggt gtt cct cta tta ggt aac ttg ttg gaa ttg 192
Val Pro Ala Val Pro Gly Val Pro Leu Leu Gly Asn Leu Leu Glu Leu
50 55 60
aag gaa aag aag cca tac aag acc ttt acc aaa tgg gct gaa act tac 240
Lys Glu Lys Lys Pro Tyr Lys Thr Phe Thr Lys Trp Ala Glu Thr Tyr
65 70 75 80
ggt cca atc tac tcc atc aag acc ggt gct acc tcc atg gtt gtc gtt 288
Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr Ser Met Val Val Val
85 90 95
aac tcc aac caa ttg gcc aag gaa gct atg gtc acc aga ttc gac tcc 336
Asn Ser Asn Gln Leu Ala Lys Glu Ala Met Val Thr Arg Phe Asp Ser
100 105 110
atc tct acc aga aag ttg tcc aag gct ttg caa att ttg act gct gac 384
Ile Ser Thr Arg Lys Leu Ser Lys Ala Leu Gln Ile Leu Thr Ala Asp
115 120 125
aag acc atg gtt gcc atg tct gac tac gat gac tac cat aaa act gtc 432
Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp Tyr His Lys Thr Val
130 135 140
aag aga aac ttg ttg act tct att ttg ggt cca gct gct caa aag aga 480
Lys Arg Asn Leu Leu Thr Ser Ile Leu Gly Pro Ala Ala Gln Lys Arg
145 150 155 160
cac aga gcc cac cgt gac gct atg ggt gac aac ttg tcc aga caa ttg 528
His Arg Ala His Arg Asp Ala Met Gly Asp Asn Leu Ser Arg Gln Leu
165 170 175
cac gct tta gct ttg aac tct cct caa gaa gcc att aac ttc aga caa 576
His Ala Leu Ala Leu Asn Ser Pro Gln Glu Ala Ile Asn Phe Arg Gln
180 185 190
atc ttc caa tct gaa tta ttc act tta gct ttc aag caa act ttc ggt 624
Ile Phe Gln Ser Glu Leu Phe Thr Leu Ala Phe Lys Gln Thr Phe Gly
195 200 205
aga gat atc gaa tct atc ttc gtc ggt gat ttg ggt act acc atg acc 672
Arg Asp Ile Glu Ser Ile Phe Val Gly Asp Leu Gly Thr Thr Met Thr
210 215 220
aga gaa gaa atg ttc caa att ttg gtt gtt gac cca atg atg ggt gct 720
Arg Glu Glu Met Phe Gln Ile Leu Val Val Asp Pro Met Met Gly Ala
225 230 235 240
att gac gtt gac tgg aga gac ttc ttc cca tac ttg aaa tgg att cca 768
Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp Ile Pro
245 250 255
aac gcc aaa ttg gaa gaa aag atc gaa caa atg tac atc aga aga aag 816
Asn Ala Lys Leu Glu Glu Lys Ile Glu Gln Met Tyr Ile Arg Arg Lys
260 265 270
gct gtt atg aag gct gtt atc caa gaa cac aga aag aga att gat tct 864
Ala Val Met Lys Ala Val Ile Gln Glu His Arg Lys Arg Ile Asp Ser
275 280 285
ggt gaa aac ttg gat tct tac att gat ttc ttg ttg gct gaa gct caa 912
Gly Glu Asn Leu Asp Ser Tyr Ile Asp Phe Leu Leu Ala Glu Ala Gln
290 295 300
cca ttg act gaa aag caa tta ttg atg tcc ttg tgg gaa cca att att 960
Pro Leu Thr Glu Lys Gln Leu Leu Met Ser Leu Trp Glu Pro Ile Ile
305 310 315 320
gaa act tct gac acc acc atg gtt acc act gaa tgg gct atg tac gaa 1008
Glu Thr Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met Tyr Glu
325 330 335
ttg tct aag cac cca aac aag caa caa aga tta tac aat gaa atc aga 1056
Leu Ser Lys His Pro Asn Lys Gln Gln Arg Leu Tyr Asn Glu Ile Arg
340 345 350
aac atc tgt ggt tct gaa aag atc act gaa gaa aag ttg tgt aag atg 1104
Asn Ile Cys Gly Ser Glu Lys Ile Thr Glu Glu Lys Leu Cys Lys Met
355 360 365
cca tac tta tct gct gtc ttt cac gaa act tta cgt gtt cac tct cca 1152
Pro Tyr Leu Ser Ala Val Phe His Glu Thr Leu Arg Val His Ser Pro
370 375 380
gtt tcc atc att cca tta cgt tac gtt cac gaa aac act gaa ttg ggt 1200
Val Ser Ile Ile Pro Leu Arg Tyr Val His Glu Asn Thr Glu Leu Gly
385 390 395 400
ggt tac cat gtc cca gct ggt act gaa ttg gct gtc aac atc tac ggt 1248
Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Val Asn Ile Tyr Gly
405 410 415
tgt aac atg gaa aga gaa atc tgg gaa aac cca gaa gaa tgg tcc cca 1296
Cys Asn Met Glu Arg Glu Ile Trp Glu Asn Pro Glu Glu Trp Ser Pro
420 425 430
gaa aga ttc ttg gct gaa aac gaa cca gtt aac ttg caa aag acc atg 1344
Glu Arg Phe Leu Ala Glu Asn Glu Pro Val Asn Leu Gln Lys Thr Met
435 440 445
gcc ttt ggt gct ggt aag aga gtc tgt gcc ggt gct atg caa gct atg 1392
Ala Phe Gly Ala Gly Lys Arg Val Cys Ala Gly Ala Met Gln Ala Met
450 455 460
ttg ttg gct tgt gtc ggt att ggt aga atg gtt caa gaa ttc gaa tgg 1440
Leu Leu Ala Cys Val Gly Ile Gly Arg Met Val Gln Glu Phe Glu Trp
465 470 475 480
aga ttg aaa gat gac gtc gaa gaa gat gtc aac act tta ggt ttg acc 1488
Arg Leu Lys Asp Asp Val Glu Glu Asp Val Asn Thr Leu Gly Leu Thr
485 490 495
act caa aga ttg aac cca atg ttg gct gtt atc aag cct agg aat 1533
Thr Gln Arg Leu Asn Pro Met Leu Ala Val Ile Lys Pro Arg Asn
500 505 510
<210> 22
<211> 511
<212> PRT
<213> Lactuca sativa
<400> 22
Met Asp Leu Gln Thr Met Ala Pro Met Gly Ser Ala Ala Ile Ala Ile
1 5 10 15
Gly Gly Pro Ala Val Ala Val Ala Gly Gly Ile Ser Leu Leu Phe Leu
20 25 30
Lys Ser Phe Leu Ser Gln Gln Pro Gly Asn Pro Asn His Leu Pro Ser
35 40 45
Val Pro Ala Val Pro Gly Val Pro Leu Leu Gly Asn Leu Leu Glu Leu
50 55 60
Lys Glu Lys Lys Pro Tyr Lys Thr Phe Thr Lys Trp Ala Glu Thr Tyr
65 70 75 80
Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr Ser Met Val Val Val
85 90 95
Asn Ser Asn Gln Leu Ala Lys Glu Ala Met Val Thr Arg Phe Asp Ser
100 105 110
Ile Ser Thr Arg Lys Leu Ser Lys Ala Leu Gln Ile Leu Thr Ala Asp
115 120 125
Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp Tyr His Lys Thr Val
130 135 140
Lys Arg Asn Leu Leu Thr Ser Ile Leu Gly Pro Ala Ala Gln Lys Arg
145 150 155 160
His Arg Ala His Arg Asp Ala Met Gly Asp Asn Leu Ser Arg Gln Leu
165 170 175
His Ala Leu Ala Leu Asn Ser Pro Gln Glu Ala Ile Asn Phe Arg Gln
180 185 190
Ile Phe Gln Ser Glu Leu Phe Thr Leu Ala Phe Lys Gln Thr Phe Gly
195 200 205
Arg Asp Ile Glu Ser Ile Phe Val Gly Asp Leu Gly Thr Thr Met Thr
210 215 220
Arg Glu Glu Met Phe Gln Ile Leu Val Val Asp Pro Met Met Gly Ala
225 230 235 240
Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp Ile Pro
245 250 255
Asn Ala Lys Leu Glu Glu Lys Ile Glu Gln Met Tyr Ile Arg Arg Lys
260 265 270
Ala Val Met Lys Ala Val Ile Gln Glu His Arg Lys Arg Ile Asp Ser
275 280 285
Gly Glu Asn Leu Asp Ser Tyr Ile Asp Phe Leu Leu Ala Glu Ala Gln
290 295 300
Pro Leu Thr Glu Lys Gln Leu Leu Met Ser Leu Trp Glu Pro Ile Ile
305 310 315 320
Glu Thr Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met Tyr Glu
325 330 335
Leu Ser Lys His Pro Asn Lys Gln Gln Arg Leu Tyr Asn Glu Ile Arg
340 345 350
Asn Ile Cys Gly Ser Glu Lys Ile Thr Glu Glu Lys Leu Cys Lys Met
355 360 365
Pro Tyr Leu Ser Ala Val Phe His Glu Thr Leu Arg Val His Ser Pro
370 375 380
Val Ser Ile Ile Pro Leu Arg Tyr Val His Glu Asn Thr Glu Leu Gly
385 390 395 400
Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Val Asn Ile Tyr Gly
405 410 415
Cys Asn Met Glu Arg Glu Ile Trp Glu Asn Pro Glu Glu Trp Ser Pro
420 425 430
Glu Arg Phe Leu Ala Glu Asn Glu Pro Val Asn Leu Gln Lys Thr Met
435 440 445
Ala Phe Gly Ala Gly Lys Arg Val Cys Ala Gly Ala Met Gln Ala Met
450 455 460
Leu Leu Ala Cys Val Gly Ile Gly Arg Met Val Gln Glu Phe Glu Trp
465 470 475 480
Arg Leu Lys Asp Asp Val Glu Glu Asp Val Asn Thr Leu Gly Leu Thr
485 490 495
Thr Gln Arg Leu Asn Pro Met Leu Ala Val Ile Lys Pro Arg Asn
500 505 510
<210> 23
<211> 1536
<212> DNA
<213> Lactuca sativa
<220>
<221> CDS
<222> (1)..(1536)
<400> 23
atg gat ggt gtc att gac atg caa acc att cca ttg aga acc gcc att 48
Met Asp Gly Val Ile Asp Met Gln Thr Ile Pro Leu Arg Thr Ala Ile
1 5 10 15
gcc att ggt ggt act gct gtt gct ttg gtt gtt gct cta tac ttc tgg 96
Ala Ile Gly Gly Thr Ala Val Ala Leu Val Val Ala Leu Tyr Phe Trp
20 25 30
ttc ttg aga tct tac gct tct cca tct cac cac tct aac cat ttg cca 144
Phe Leu Arg Ser Tyr Ala Ser Pro Ser His His Ser Asn His Leu Pro
35 40 45
cct gtt cca gaa gtt cca ggt gtc cca gtc ttg ggt aac ttg ttg caa 192
Pro Val Pro Glu Val Pro Gly Val Pro Val Leu Gly Asn Leu Leu Gln
50 55 60
ttg aaa gaa aag aag cca tac atg act ttc acc aaa tgg gct gaa atg 240
Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Lys Trp Ala Glu Met
65 70 75 80
tac ggt cca atc tac tct atc aga act ggt gct acc tcc atg gtt gtt 288
Tyr Gly Pro Ile Tyr Ser Ile Arg Thr Gly Ala Thr Ser Met Val Val
85 90 95
gtt tcc tct aac gaa att gcc aag gaa gtt gtt gtc act aga ttc cca 336
Val Ser Ser Asn Glu Ile Ala Lys Glu Val Val Val Thr Arg Phe Pro
100 105 110
tcc atc tcc acc aga aag ttg tct tac gct ttg aag gtc ttg act gaa 384
Ser Ile Ser Thr Arg Lys Leu Ser Tyr Ala Leu Lys Val Leu Thr Glu
115 120 125
gat aag tcc atg gtt gct atg tct gat tac cat gac tac cac aag acc 432
Asp Lys Ser Met Val Ala Met Ser Asp Tyr His Asp Tyr His Lys Thr
130 135 140
gtc aaa aga cac att ttg act gct gtc tta ggt cca aac gcc caa aag 480
Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro Asn Ala Gln Lys
145 150 155 160
aag ttc cgt gct cac aga gac acc atg atg gaa aac gtt tcc aat gaa 528
Lys Phe Arg Ala His Arg Asp Thr Met Met Glu Asn Val Ser Asn Glu
165 170 175
ttg cat gcc ttc ttt gaa aag aac cca aac caa gaa gtc aac ttg aga 576
Leu His Ala Phe Phe Glu Lys Asn Pro Asn Gln Glu Val Asn Leu Arg
180 185 190
aag atc ttc caa tct caa ttg ttc ggt ttg gcc atg aag caa gct ttg 624
Lys Ile Phe Gln Ser Gln Leu Phe Gly Leu Ala Met Lys Gln Ala Leu
195 200 205
ggt aag gat gtc gaa tct atc tac gtc aag gac ttg gaa act acc atg 672
Gly Lys Asp Val Glu Ser Ile Tyr Val Lys Asp Leu Glu Thr Thr Met
210 215 220
aag aga gaa gaa atc ttt gaa gtc ttg gtt gtt gac cca atg atg ggt 720
Lys Arg Glu Glu Ile Phe Glu Val Leu Val Val Asp Pro Met Met Gly
225 230 235 240
gcc att gaa gtc gat tgg aga gac ttc ttc cca tac ttg aaa tgg gtt 768
Ala Ile Glu Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp Val
245 250 255
cca aac aaa tct ttc gaa aac atc att cac aga atg tac acc cgt cgt 816
Pro Asn Lys Ser Phe Glu Asn Ile Ile His Arg Met Tyr Thr Arg Arg
260 265 270
gaa gct gtc atg aag gct ttg atc caa gaa cac aag aag aga att gct 864
Glu Ala Val Met Lys Ala Leu Ile Gln Glu His Lys Lys Arg Ile Ala
275 280 285
tct ggt gaa aac tta aac tcc tac att gac tac ttg ttg tct gaa gct 912
Ser Gly Glu Asn Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu Ala
290 295 300
caa act ttg act gac aag caa ttg ttg atg tcc cta tgg gaa cca atc 960
Gln Thr Leu Thr Asp Lys Gln Leu Leu Met Ser Leu Trp Glu Pro Ile
305 310 315 320
att gaa tct tcc gac acc acc atg gtc acc act gaa tgg gct atg tac 1008
Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met Tyr
325 330 335
gaa ttg gct aag aat cca aac atg caa gac aga ttg tac gaa gaa atc 1056
Glu Leu Ala Lys Asn Pro Asn Met Gln Asp Arg Leu Tyr Glu Glu Ile
340 345 350
caa tct gtt tgt ggt tcc gaa aag atc act gaa gaa aac ttg tct caa 1104
Gln Ser Val Cys Gly Ser Glu Lys Ile Thr Glu Glu Asn Leu Ser Gln
355 360 365
tta cca tac ttg tac gct gtt ttc caa gaa act ttg aga aag cac tgt 1152
Leu Pro Tyr Leu Tyr Ala Val Phe Gln Glu Thr Leu Arg Lys His Cys
370 375 380
cca gtt cca atc atg cca ttg aga tac gtc cac gaa aac acc gtt ttg 1200
Pro Val Pro Ile Met Pro Leu Arg Tyr Val His Glu Asn Thr Val Leu
385 390 395 400
ggt ggt tac cac gtt cca gct ggt act gaa gtt gct atc aac atc tat 1248
Gly Gly Tyr His Val Pro Ala Gly Thr Glu Val Ala Ile Asn Ile Tyr
405 410 415
ggt tgt aac atg gac aag aag gtc tgg gaa aac cca gaa gaa tgg aac 1296
Gly Cys Asn Met Asp Lys Lys Val Trp Glu Asn Pro Glu Glu Trp Asn
420 425 430
cca gaa aga ttc tta tcc gaa aag gaa tcc atg gac ttg tac aag acc 1344
Pro Glu Arg Phe Leu Ser Glu Lys Glu Ser Met Asp Leu Tyr Lys Thr
435 440 445
atg gcc ttc ggt ggt ggt aag aga gtt tgt gct ggt tct ttg caa gct 1392
Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln Ala
450 455 460
atg gtc atc tct tgt atc ggt att ggt aga tta gtc caa gat ttt gaa 1440
Met Val Ile Ser Cys Ile Gly Ile Gly Arg Leu Val Gln Asp Phe Glu
465 470 475 480
tgg aaa ttg aaa gat gac gct gaa gaa gat gtc aac act tta ggt tta 1488
Trp Lys Leu Lys Asp Asp Ala Glu Glu Asp Val Asn Thr Leu Gly Leu
485 490 495
acc act caa aag ttg cac cca tta ttg gct ttg atc aac cct cga aag 1536
Thr Thr Gln Lys Leu His Pro Leu Leu Ala Leu Ile Asn Pro Arg Lys
500 505 510
<210> 24
<211> 512
<212> PRT
<213> Lactuca sativa
<400> 24
Met Asp Gly Val Ile Asp Met Gln Thr Ile Pro Leu Arg Thr Ala Ile
1 5 10 15
Ala Ile Gly Gly Thr Ala Val Ala Leu Val Val Ala Leu Tyr Phe Trp
20 25 30
Phe Leu Arg Ser Tyr Ala Ser Pro Ser His His Ser Asn His Leu Pro
35 40 45
Pro Val Pro Glu Val Pro Gly Val Pro Val Leu Gly Asn Leu Leu Gln
50 55 60
Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Lys Trp Ala Glu Met
65 70 75 80
Tyr Gly Pro Ile Tyr Ser Ile Arg Thr Gly Ala Thr Ser Met Val Val
85 90 95
Val Ser Ser Asn Glu Ile Ala Lys Glu Val Val Val Thr Arg Phe Pro
100 105 110
Ser Ile Ser Thr Arg Lys Leu Ser Tyr Ala Leu Lys Val Leu Thr Glu
115 120 125
Asp Lys Ser Met Val Ala Met Ser Asp Tyr His Asp Tyr His Lys Thr
130 135 140
Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro Asn Ala Gln Lys
145 150 155 160
Lys Phe Arg Ala His Arg Asp Thr Met Met Glu Asn Val Ser Asn Glu
165 170 175
Leu His Ala Phe Phe Glu Lys Asn Pro Asn Gln Glu Val Asn Leu Arg
180 185 190
Lys Ile Phe Gln Ser Gln Leu Phe Gly Leu Ala Met Lys Gln Ala Leu
195 200 205
Gly Lys Asp Val Glu Ser Ile Tyr Val Lys Asp Leu Glu Thr Thr Met
210 215 220
Lys Arg Glu Glu Ile Phe Glu Val Leu Val Val Asp Pro Met Met Gly
225 230 235 240
Ala Ile Glu Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp Val
245 250 255
Pro Asn Lys Ser Phe Glu Asn Ile Ile His Arg Met Tyr Thr Arg Arg
260 265 270
Glu Ala Val Met Lys Ala Leu Ile Gln Glu His Lys Lys Arg Ile Ala
275 280 285
Ser Gly Glu Asn Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu Ala
290 295 300
Gln Thr Leu Thr Asp Lys Gln Leu Leu Met Ser Leu Trp Glu Pro Ile
305 310 315 320
Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met Tyr
325 330 335
Glu Leu Ala Lys Asn Pro Asn Met Gln Asp Arg Leu Tyr Glu Glu Ile
340 345 350
Gln Ser Val Cys Gly Ser Glu Lys Ile Thr Glu Glu Asn Leu Ser Gln
355 360 365
Leu Pro Tyr Leu Tyr Ala Val Phe Gln Glu Thr Leu Arg Lys His Cys
370 375 380
Pro Val Pro Ile Met Pro Leu Arg Tyr Val His Glu Asn Thr Val Leu
385 390 395 400
Gly Gly Tyr His Val Pro Ala Gly Thr Glu Val Ala Ile Asn Ile Tyr
405 410 415
Gly Cys Asn Met Asp Lys Lys Val Trp Glu Asn Pro Glu Glu Trp Asn
420 425 430
Pro Glu Arg Phe Leu Ser Glu Lys Glu Ser Met Asp Leu Tyr Lys Thr
435 440 445
Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln Ala
450 455 460
Met Val Ile Ser Cys Ile Gly Ile Gly Arg Leu Val Gln Asp Phe Glu
465 470 475 480
Trp Lys Leu Lys Asp Asp Ala Glu Glu Asp Val Asn Thr Leu Gly Leu
485 490 495
Thr Thr Gln Lys Leu His Pro Leu Leu Ala Leu Ile Asn Pro Arg Lys
500 505 510
<210> 25
<211> 1575
<212> DNA
<213> Sphaceloma manihoticola
<220>
<221> CDS
<222> (1)..(1575)
<400> 25
atg atg gac gac acc act tct cca tac tcc act tac cac tct gtc aga 48
Met Met Asp Asp Thr Thr Ser Pro Tyr Ser Thr Tyr His Ser Val Arg
1 5 10 15
tcc att aga aac caa tct gct tgg gct ttg gct cca att gct gtc ttt 96
Ser Ile Arg Asn Gln Ser Ala Trp Ala Leu Ala Pro Ile Ala Val Phe
20 25 30
att tgt tac gtt gtc ttg aga cac aac aga aag tct gtc cca gct gcc 144
Ile Cys Tyr Val Val Leu Arg His Asn Arg Lys Ser Val Pro Ala Ala
35 40 45
tcc gct ggt tct cac tcc atc ttg gaa cca tta tgg tta gct aga tta 192
Ser Ala Gly Ser His Ser Ile Leu Glu Pro Leu Trp Leu Ala Arg Leu
50 55 60
cgt ttc atc aga gac tcc aga ttc atc atc ggt caa ggt tac tcc aag 240
Arg Phe Ile Arg Asp Ser Arg Phe Ile Ile Gly Gln Gly Tyr Ser Lys
65 70 75 80
ttc aag gat act atc ttc aag gtt acc aag gtc ggt gct gac atc atc 288
Phe Lys Asp Thr Ile Phe Lys Val Thr Lys Val Gly Ala Asp Ile Ile
85 90 95
gtt gtt gct cca aag tac gtc gaa gaa atc aga aga ttg tct cgt gac 336
Val Val Ala Pro Lys Tyr Val Glu Glu Ile Arg Arg Leu Ser Arg Asp
100 105 110
acc ggt aga tct gtt gaa cca ttc atc cac gat ttc gct ggt gaa ttg 384
Thr Gly Arg Ser Val Glu Pro Phe Ile His Asp Phe Ala Gly Glu Leu
115 120 125
ttg ggt ggt ttg aat ttc ttg gaa tct gat ttg caa acc aga gtt gtc 432
Leu Gly Gly Leu Asn Phe Leu Glu Ser Asp Leu Gln Thr Arg Val Val
130 135 140
caa caa aag ttg acc cca aac ttg aaa act att gtc cca gtc atg gaa 480
Gln Gln Lys Leu Thr Pro Asn Leu Lys Thr Ile Val Pro Val Met Glu
145 150 155 160
gat gaa atg cat tac gct ttg gtt tct gaa ttg gat tct tgt ttg gac 528
Asp Glu Met His Tyr Ala Leu Val Ser Glu Leu Asp Ser Cys Leu Asp
165 170 175
ggt tcc gaa cac tgg acc aga gtt gac atg att cac atg ttg tcc aga 576
Gly Ser Glu His Trp Thr Arg Val Asp Met Ile His Met Leu Ser Arg
180 185 190
atc gtt tcc aga att tct gcc aga atc ttc ttg ggt cca aag tac tgt 624
Ile Val Ser Arg Ile Ser Ala Arg Ile Phe Leu Gly Pro Lys Tyr Cys
195 200 205
aga aac gac tta tgg tta aag acc acc gct gaa tac act gaa aac ttg 672
Arg Asn Asp Leu Trp Leu Lys Thr Thr Ala Glu Tyr Thr Glu Asn Leu
210 215 220
ttc ttg acc ggt act cta ttg aga ttt gtc cca aga atg ttg caa aag 720
Phe Leu Thr Gly Thr Leu Leu Arg Phe Val Pro Arg Met Leu Gln Lys
225 230 235 240
tgg atc gct cca tta cta cca tct ttc cgt caa ttg caa gaa aac aga 768
Trp Ile Ala Pro Leu Leu Pro Ser Phe Arg Gln Leu Gln Glu Asn Arg
245 250 255
caa gct gcc aga aag atc att tct gaa att ttg act gac cat caa cca 816
Gln Ala Ala Arg Lys Ile Ile Ser Glu Ile Leu Thr Asp His Gln Pro
260 265 270
gaa aag cac gat gaa acc tct gac aac ggt gac cca tac cca gat atc 864
Glu Lys His Asp Glu Thr Ser Asp Asn Gly Asp Pro Tyr Pro Asp Ile
275 280 285
ttg act ttg atg ttc caa gct gcc cgt ggt aag gaa aag gac atc gaa 912
Leu Thr Leu Met Phe Gln Ala Ala Arg Gly Lys Glu Lys Asp Ile Glu
290 295 300
gat atc gct caa cac act ttg ttg cta tct ttg tcc tcc att cac act 960
Asp Ile Ala Gln His Thr Leu Leu Leu Ser Leu Ser Ser Ile His Thr
305 310 315 320
acc gct ttg acc atg act caa gct ttg tac gac ttg tgt gcc tac cca 1008
Thr Ala Leu Thr Met Thr Gln Ala Leu Tyr Asp Leu Cys Ala Tyr Pro
325 330 335
caa tac ttg gac cct gtt aag cac gaa att gct gac acc ttg caa tct 1056
Gln Tyr Leu Asp Pro Val Lys His Glu Ile Ala Asp Thr Leu Gln Ser
340 345 350
gaa ggt tcc tgg tct aag gct atg ttg gac aaa ttg cac atg atg gac 1104
Glu Gly Ser Trp Ser Lys Ala Met Leu Asp Lys Leu His Met Met Asp
355 360 365
tct ttg ttg aga gaa tct caa aga tta tct cct gtt ttc tta ttg act 1152
Ser Leu Leu Arg Glu Ser Gln Arg Leu Ser Pro Val Phe Leu Leu Thr
370 375 380
ttc aac aga att ttg cac acc cca ttg act ttg tcc aac ggt atc cat 1200
Phe Asn Arg Ile Leu His Thr Pro Leu Thr Leu Ser Asn Gly Ile His
385 390 395 400
ttg cca aag ggt act aga att gct gct cct tct gat gcc att ttg aac 1248
Leu Pro Lys Gly Thr Arg Ile Ala Ala Pro Ser Asp Ala Ile Leu Asn
405 410 415
gat cca tct tta gtc cca ggt cca caa cca gct gac act ttc gac cca 1296
Asp Pro Ser Leu Val Pro Gly Pro Gln Pro Ala Asp Thr Phe Asp Pro
420 425 430
ttt aga tac atc aac cac tcc act ggt gat gct aag aaa acc aag acc 1344
Phe Arg Tyr Ile Asn His Ser Thr Gly Asp Ala Lys Lys Thr Lys Thr
435 440 445
aac ttc caa acc act tct ttg caa aac atg gct ttc ggt tac ggt aaa 1392
Asn Phe Gln Thr Thr Ser Leu Gln Asn Met Ala Phe Gly Tyr Gly Lys
450 455 460
tac gct tgt cca ggt cgt ttc tat gtt gcc aac gaa atc aaa ttg gtt 1440
Tyr Ala Cys Pro Gly Arg Phe Tyr Val Ala Asn Glu Ile Lys Leu Val
465 470 475 480
tta ggt cac ttg ttg atg cac tac gaa ttc aag ttc cca cca ggt atg 1488
Leu Gly His Leu Leu Met His Tyr Glu Phe Lys Phe Pro Pro Gly Met
485 490 495
ggt aga cca gtt aac tcc acc gtc gat act gac atg tac cca gat ttg 1536
Gly Arg Pro Val Asn Ser Thr Val Asp Thr Asp Met Tyr Pro Asp Leu
500 505 510
ggt gcc aga ttg ttg gtt aga aag aga aag atg gaa gaa 1575
Gly Ala Arg Leu Leu Val Arg Lys Arg Lys Met Glu Glu
515 520 525
<210> 26
<211> 525
<212> PRT
<213> Sphaceloma manihoticola
<400> 26
Met Met Asp Asp Thr Thr Ser Pro Tyr Ser Thr Tyr His Ser Val Arg
1 5 10 15
Ser Ile Arg Asn Gln Ser Ala Trp Ala Leu Ala Pro Ile Ala Val Phe
20 25 30
Ile Cys Tyr Val Val Leu Arg His Asn Arg Lys Ser Val Pro Ala Ala
35 40 45
Ser Ala Gly Ser His Ser Ile Leu Glu Pro Leu Trp Leu Ala Arg Leu
50 55 60
Arg Phe Ile Arg Asp Ser Arg Phe Ile Ile Gly Gln Gly Tyr Ser Lys
65 70 75 80
Phe Lys Asp Thr Ile Phe Lys Val Thr Lys Val Gly Ala Asp Ile Ile
85 90 95
Val Val Ala Pro Lys Tyr Val Glu Glu Ile Arg Arg Leu Ser Arg Asp
100 105 110
Thr Gly Arg Ser Val Glu Pro Phe Ile His Asp Phe Ala Gly Glu Leu
115 120 125
Leu Gly Gly Leu Asn Phe Leu Glu Ser Asp Leu Gln Thr Arg Val Val
130 135 140
Gln Gln Lys Leu Thr Pro Asn Leu Lys Thr Ile Val Pro Val Met Glu
145 150 155 160
Asp Glu Met His Tyr Ala Leu Val Ser Glu Leu Asp Ser Cys Leu Asp
165 170 175
Gly Ser Glu His Trp Thr Arg Val Asp Met Ile His Met Leu Ser Arg
180 185 190
Ile Val Ser Arg Ile Ser Ala Arg Ile Phe Leu Gly Pro Lys Tyr Cys
195 200 205
Arg Asn Asp Leu Trp Leu Lys Thr Thr Ala Glu Tyr Thr Glu Asn Leu
210 215 220
Phe Leu Thr Gly Thr Leu Leu Arg Phe Val Pro Arg Met Leu Gln Lys
225 230 235 240
Trp Ile Ala Pro Leu Leu Pro Ser Phe Arg Gln Leu Gln Glu Asn Arg
245 250 255
Gln Ala Ala Arg Lys Ile Ile Ser Glu Ile Leu Thr Asp His Gln Pro
260 265 270
Glu Lys His Asp Glu Thr Ser Asp Asn Gly Asp Pro Tyr Pro Asp Ile
275 280 285
Leu Thr Leu Met Phe Gln Ala Ala Arg Gly Lys Glu Lys Asp Ile Glu
290 295 300
Asp Ile Ala Gln His Thr Leu Leu Leu Ser Leu Ser Ser Ile His Thr
305 310 315 320
Thr Ala Leu Thr Met Thr Gln Ala Leu Tyr Asp Leu Cys Ala Tyr Pro
325 330 335
Gln Tyr Leu Asp Pro Val Lys His Glu Ile Ala Asp Thr Leu Gln Ser
340 345 350
Glu Gly Ser Trp Ser Lys Ala Met Leu Asp Lys Leu His Met Met Asp
355 360 365
Ser Leu Leu Arg Glu Ser Gln Arg Leu Ser Pro Val Phe Leu Leu Thr
370 375 380
Phe Asn Arg Ile Leu His Thr Pro Leu Thr Leu Ser Asn Gly Ile His
385 390 395 400
Leu Pro Lys Gly Thr Arg Ile Ala Ala Pro Ser Asp Ala Ile Leu Asn
405 410 415
Asp Pro Ser Leu Val Pro Gly Pro Gln Pro Ala Asp Thr Phe Asp Pro
420 425 430
Phe Arg Tyr Ile Asn His Ser Thr Gly Asp Ala Lys Lys Thr Lys Thr
435 440 445
Asn Phe Gln Thr Thr Ser Leu Gln Asn Met Ala Phe Gly Tyr Gly Lys
450 455 460
Tyr Ala Cys Pro Gly Arg Phe Tyr Val Ala Asn Glu Ile Lys Leu Val
465 470 475 480
Leu Gly His Leu Leu Met His Tyr Glu Phe Lys Phe Pro Pro Gly Met
485 490 495
Gly Arg Pro Val Asn Ser Thr Val Asp Thr Asp Met Tyr Pro Asp Leu
500 505 510
Gly Ala Arg Leu Leu Val Arg Lys Arg Lys Met Glu Glu
515 520 525
<210> 27
<211> 1440
<212> DNA
<213> Artemisia annua
<220>
<221> CDS
<222> (1)..(1440)
<400> 27
atg cca atg acc gtc atg ttg ttg ttc gtt ttc tta tta ttc att gcc 48
Met Pro Met Thr Val Met Leu Leu Phe Val Phe Leu Leu Phe Ile Ala
1 5 10 15
atc tgt ttc ttc ttg gtc cac aga cac aac tcc acc acc acc aag aac 96
Ile Cys Phe Phe Leu Val His Arg His Asn Ser Thr Thr Thr Lys Asn
20 25 30
ttg cca cca ggt tct ttc ggt tgg cca ttc atc ggt gaa act ttg gct 144
Leu Pro Pro Gly Ser Phe Gly Trp Pro Phe Ile Gly Glu Thr Leu Ala
35 40 45
tac atc aga tcc aag aga ggt ggt gac cca gaa aga ttc acc aag gaa 192
Tyr Ile Arg Ser Lys Arg Gly Gly Asp Pro Glu Arg Phe Thr Lys Glu
50 55 60
aga att gaa aag tac ggt tcc act tta gtc ttt aag acc tct gtt gct 240
Arg Ile Glu Lys Tyr Gly Ser Thr Leu Val Phe Lys Thr Ser Val Ala
65 70 75 80
ggt gaa aga atg gct gtc ttt tgt ggt cca gaa ggt aac aag ttc ttg 288
Gly Glu Arg Met Ala Val Phe Cys Gly Pro Glu Gly Asn Lys Phe Leu
85 90 95
ttt ggt aac gaa aac aaa ttg gtt gct tcc tgg tgg cca aac tct gtt 336
Phe Gly Asn Glu Asn Lys Leu Val Ala Ser Trp Trp Pro Asn Ser Val
100 105 110
aga att ttg ttc gaa aag tgt ttg att acc atc aga ggt gac gaa gct 384
Arg Ile Leu Phe Glu Lys Cys Leu Ile Thr Ile Arg Gly Asp Glu Ala
115 120 125
aaa tgg tta cgt aaa atg atg ttc gct tac ttg ggt cca gac gct cta 432
Lys Trp Leu Arg Lys Met Met Phe Ala Tyr Leu Gly Pro Asp Ala Leu
130 135 140
tcc aac aga tac act ggt act atg gaa gtt gtt acc aga ttg cac atc 480
Ser Asn Arg Tyr Thr Gly Thr Met Glu Val Val Thr Arg Leu His Ile
145 150 155 160
caa aac cac tgg caa ggt aag tct gaa ttg aag gtc ttt gaa acc gtt 528
Gln Asn His Trp Gln Gly Lys Ser Glu Leu Lys Val Phe Glu Thr Val
165 170 175
aga cca tac ttg ttc gaa ttg gcc tgt aga tta ttc ttg tct ttg gat 576
Arg Pro Tyr Leu Phe Glu Leu Ala Cys Arg Leu Phe Leu Ser Leu Asp
180 185 190
gac cca aag cac gtt gct gaa ttg ggt act tta ttc aac act ttc ttg 624
Asp Pro Lys His Val Ala Glu Leu Gly Thr Leu Phe Asn Thr Phe Leu
195 200 205
aag ggt ttg act gaa ttg cca atc aac att cca ggt act aga ttc tac 672
Lys Gly Leu Thr Glu Leu Pro Ile Asn Ile Pro Gly Thr Arg Phe Tyr
210 215 220
aga gcc aag aga gct gcc aac gcc atc aag aaa caa ttg att gtc atc 720
Arg Ala Lys Arg Ala Ala Asn Ala Ile Lys Lys Gln Leu Ile Val Ile
225 230 235 240
atc aag caa cgt cgt caa gct ttg aag caa gaa gat caa tct tct tct 768
Ile Lys Gln Arg Arg Gln Ala Leu Lys Gln Glu Asp Gln Ser Ser Ser
245 250 255
ttc gaa gat ttg cta tct cat ttg tta gtc tcc tct gat gaa aac ggt 816
Phe Glu Asp Leu Leu Ser His Leu Leu Val Ser Ser Asp Glu Asn Gly
260 265 270
aga ttc ttg tcc gaa gct gaa att gcc aat aat gtc ttg ttg ttg ttg 864
Arg Phe Leu Ser Glu Ala Glu Ile Ala Asn Asn Val Leu Leu Leu Leu
275 280 285
ttt gct ggt cac gac act tct gct gtt tcc atc act ttg ttg atg aag 912
Phe Ala Gly His Asp Thr Ser Ala Val Ser Ile Thr Leu Leu Met Lys
290 295 300
tct ttg gct gaa cac cct caa gtc tac gac aac gtt ttg aag gaa caa 960
Ser Leu Ala Glu His Pro Gln Val Tyr Asp Asn Val Leu Lys Glu Gln
305 310 315 320
tta ggt atc ttg gaa gcc aag gct cca ggt gaa atg ttg aac tgg gaa 1008
Leu Gly Ile Leu Glu Ala Lys Ala Pro Gly Glu Met Leu Asn Trp Glu
325 330 335
gat atc caa aag atg aga tac tcc tgg tac gtt gtt tgt gaa gtc atg 1056
Asp Ile Gln Lys Met Arg Tyr Ser Trp Tyr Val Val Cys Glu Val Met
340 345 350
aga ttg att cca cct gtt gtt ggt tct ttc aga gaa gct ttg gtt gat 1104
Arg Leu Ile Pro Pro Val Val Gly Ser Phe Arg Glu Ala Leu Val Asp
355 360 365
ttc gaa tac gct ggt tac act att cca aag ggt tgg aag atc atc tgg 1152
Phe Glu Tyr Ala Gly Tyr Thr Ile Pro Lys Gly Trp Lys Ile Ile Trp
370 375 380
tct gct gtc atg act cac aag gaa gaa aac aac ttc cca aac gct act 1200
Ser Ala Val Met Thr His Lys Glu Glu Asn Asn Phe Pro Asn Ala Thr
385 390 395 400
aag ttc gac cca tcc aga ttc gaa ggt gct ggt cca acc cca ttc acc 1248
Lys Phe Asp Pro Ser Arg Phe Glu Gly Ala Gly Pro Thr Pro Phe Thr
405 410 415
tac gtt cca ttc ggt ggt ggt cca aga atg tgt ttg ggt aag gaa ttg 1296
Tyr Val Pro Phe Gly Gly Gly Pro Arg Met Cys Leu Gly Lys Glu Leu
420 425 430
gct cgt gtc aga att ttg gtt ttc tta cat aac atc atg acc aaa ttc 1344
Ala Arg Val Arg Ile Leu Val Phe Leu His Asn Ile Met Thr Lys Phe
435 440 445
aaa tgg gac ttg ttg att cca gac gaa aag atc ggt tac gac cca ttg 1392
Lys Trp Asp Leu Leu Ile Pro Asp Glu Lys Ile Gly Tyr Asp Pro Leu
450 455 460
gct acc cca gtc aag ggt ttg cca gtc aga ttg cac cct cac caa gtg 1440
Ala Thr Pro Val Lys Gly Leu Pro Val Arg Leu His Pro His Gln Val
465 470 475 480
<210> 28
<211> 480
<212> PRT
<213> Artemisia annua
<400> 28
Met Pro Met Thr Val Met Leu Leu Phe Val Phe Leu Leu Phe Ile Ala
1 5 10 15
Ile Cys Phe Phe Leu Val His Arg His Asn Ser Thr Thr Thr Lys Asn
20 25 30
Leu Pro Pro Gly Ser Phe Gly Trp Pro Phe Ile Gly Glu Thr Leu Ala
35 40 45
Tyr Ile Arg Ser Lys Arg Gly Gly Asp Pro Glu Arg Phe Thr Lys Glu
50 55 60
Arg Ile Glu Lys Tyr Gly Ser Thr Leu Val Phe Lys Thr Ser Val Ala
65 70 75 80
Gly Glu Arg Met Ala Val Phe Cys Gly Pro Glu Gly Asn Lys Phe Leu
85 90 95
Phe Gly Asn Glu Asn Lys Leu Val Ala Ser Trp Trp Pro Asn Ser Val
100 105 110
Arg Ile Leu Phe Glu Lys Cys Leu Ile Thr Ile Arg Gly Asp Glu Ala
115 120 125
Lys Trp Leu Arg Lys Met Met Phe Ala Tyr Leu Gly Pro Asp Ala Leu
130 135 140
Ser Asn Arg Tyr Thr Gly Thr Met Glu Val Val Thr Arg Leu His Ile
145 150 155 160
Gln Asn His Trp Gln Gly Lys Ser Glu Leu Lys Val Phe Glu Thr Val
165 170 175
Arg Pro Tyr Leu Phe Glu Leu Ala Cys Arg Leu Phe Leu Ser Leu Asp
180 185 190
Asp Pro Lys His Val Ala Glu Leu Gly Thr Leu Phe Asn Thr Phe Leu
195 200 205
Lys Gly Leu Thr Glu Leu Pro Ile Asn Ile Pro Gly Thr Arg Phe Tyr
210 215 220
Arg Ala Lys Arg Ala Ala Asn Ala Ile Lys Lys Gln Leu Ile Val Ile
225 230 235 240
Ile Lys Gln Arg Arg Gln Ala Leu Lys Gln Glu Asp Gln Ser Ser Ser
245 250 255
Phe Glu Asp Leu Leu Ser His Leu Leu Val Ser Ser Asp Glu Asn Gly
260 265 270
Arg Phe Leu Ser Glu Ala Glu Ile Ala Asn Asn Val Leu Leu Leu Leu
275 280 285
Phe Ala Gly His Asp Thr Ser Ala Val Ser Ile Thr Leu Leu Met Lys
290 295 300
Ser Leu Ala Glu His Pro Gln Val Tyr Asp Asn Val Leu Lys Glu Gln
305 310 315 320
Leu Gly Ile Leu Glu Ala Lys Ala Pro Gly Glu Met Leu Asn Trp Glu
325 330 335
Asp Ile Gln Lys Met Arg Tyr Ser Trp Tyr Val Val Cys Glu Val Met
340 345 350
Arg Leu Ile Pro Pro Val Val Gly Ser Phe Arg Glu Ala Leu Val Asp
355 360 365
Phe Glu Tyr Ala Gly Tyr Thr Ile Pro Lys Gly Trp Lys Ile Ile Trp
370 375 380
Ser Ala Val Met Thr His Lys Glu Glu Asn Asn Phe Pro Asn Ala Thr
385 390 395 400
Lys Phe Asp Pro Ser Arg Phe Glu Gly Ala Gly Pro Thr Pro Phe Thr
405 410 415
Tyr Val Pro Phe Gly Gly Gly Pro Arg Met Cys Leu Gly Lys Glu Leu
420 425 430
Ala Arg Val Arg Ile Leu Val Phe Leu His Asn Ile Met Thr Lys Phe
435 440 445
Lys Trp Asp Leu Leu Ile Pro Asp Glu Lys Ile Gly Tyr Asp Pro Leu
450 455 460
Ala Thr Pro Val Lys Gly Leu Pro Val Arg Leu His Pro His Gln Val
465 470 475 480
<210> 29
<211> 1365
<212> DNA
<213> Ricinus communis
<220>
<221> CDS
<222> (1)..(1365)
<400> 29
atg gaa ttg gtc atg ttc cca gtc ttg gct ttg gtt tcc act ttg ttc 48
Met Glu Leu Val Met Phe Pro Val Leu Ala Leu Val Ser Thr Leu Phe
1 5 10 15
ttg ttg gct ttg cac ttc atc atc aga act ttg aag gaa aga tta ttc 96
Leu Leu Ala Leu His Phe Ile Ile Arg Thr Leu Lys Glu Arg Leu Phe
20 25 30
ggt tct cca aat ttg cca cca ggt aga tta ggt tgg cca ttg att ggt 144
Gly Ser Pro Asn Leu Pro Pro Gly Arg Leu Gly Trp Pro Leu Ile Gly
35 40 45
gaa acc cca gct ttc ttc aga gct ggt ttc gaa gcc aag cca gaa aag 192
Glu Thr Pro Ala Phe Phe Arg Ala Gly Phe Glu Ala Lys Pro Glu Lys
50 55 60
ttc atc ggt gaa aga atg gaa aag tac gac tct cgt gtt ttc aag acc 240
Phe Ile Gly Glu Arg Met Glu Lys Tyr Asp Ser Arg Val Phe Lys Thr
65 70 75 80
tct ttg ttg ggt aag cct ttc gct gtc att tct ggt act gct ggt cac 288
Ser Leu Leu Gly Lys Pro Phe Ala Val Ile Ser Gly Thr Ala Gly His
85 90 95
aag ttc ttg ttt tcc aac gaa aac aaa ttg gtt aac ttg tgg tgg cca 336
Lys Phe Leu Phe Ser Asn Glu Asn Lys Leu Val Asn Leu Trp Trp Pro
100 105 110
gaa tcc gtc aga atg ttg ttc aag tct gct ttg gtt tcc gtt gtt ggt 384
Glu Ser Val Arg Met Leu Phe Lys Ser Ala Leu Val Ser Val Val Gly
115 120 125
gac gaa gcc aag aga atc aga aag atg ttg atg act ttc tta ggt ttg 432
Asp Glu Ala Lys Arg Ile Arg Lys Met Leu Met Thr Phe Leu Gly Leu
130 135 140
gat gct ttg aag aac tac act gaa aga att gac atg gtc acc caa caa 480
Asp Ala Leu Lys Asn Tyr Thr Glu Arg Ile Asp Met Val Thr Gln Gln
145 150 155 160
cac atc aga acc tac tgg gaa ggt aag gaa gaa gtt acc gtt tac tcc 528
His Ile Arg Thr Tyr Trp Glu Gly Lys Glu Glu Val Thr Val Tyr Ser
165 170 175
act ttg aaa ttg tac act ttc acc ttg gct tgt aac ttg ttt gcc tcc 576
Thr Leu Lys Leu Tyr Thr Phe Thr Leu Ala Cys Asn Leu Phe Ala Ser
180 185 190
atc aac gac cct gaa aga tta tcc aag ttg ggt gct cac ttc gat gtt 624
Ile Asn Asp Pro Glu Arg Leu Ser Lys Leu Gly Ala His Phe Asp Val
195 200 205
ttc gtt aag ggt gtt atc tct ttg cca att tcc att cca ggt act aga 672
Phe Val Lys Gly Val Ile Ser Leu Pro Ile Ser Ile Pro Gly Thr Arg
210 215 220
tta tac aaa tcc atg aag gct gcc aac gcc att aga gaa gaa ttg aaa 720
Leu Tyr Lys Ser Met Lys Ala Ala Asn Ala Ile Arg Glu Glu Leu Lys
225 230 235 240
ttg att gtc cgt gac aga aag gaa gct ttg gaa aga aag atg gct tct 768
Leu Ile Val Arg Asp Arg Lys Glu Ala Leu Glu Arg Lys Met Ala Ser
245 250 255
cca act caa gat ttg ttg tct tat ttg ttg gtc gac tct gac acc aac 816
Pro Thr Gln Asp Leu Leu Ser Tyr Leu Leu Val Asp Ser Asp Thr Asn
260 265 270
ggt cgt ttc ttg tct gaa atg gaa atc ttg gac aac atc atg ttg cta 864
Gly Arg Phe Leu Ser Glu Met Glu Ile Leu Asp Asn Ile Met Leu Leu
275 280 285
ttg tac gct gaa caa ttg gaa att gct aac tcc aag aag cca ggt gaa 912
Leu Tyr Ala Glu Gln Leu Glu Ile Ala Asn Ser Lys Lys Pro Gly Glu
290 295 300
tta ttg caa tgg gaa gat gtc caa aag atg aga tac tcc tgg aac gtc 960
Leu Leu Gln Trp Glu Asp Val Gln Lys Met Arg Tyr Ser Trp Asn Val
305 310 315 320
atc tct gaa gtc ttg aga ttg tct cca cca gtt tct tct gct tac aga 1008
Ile Ser Glu Val Leu Arg Leu Ser Pro Pro Val Ser Ser Ala Tyr Arg
325 330 335
cac gcc att gtt gac ttt acc tac gaa ggt tac acc att cca aag ggt 1056
His Ala Ile Val Asp Phe Thr Tyr Glu Gly Tyr Thr Ile Pro Lys Gly
340 345 350
tgg caa ttg ttc acc tct ttc ggt act acc cac cgt gac cca gct tta 1104
Trp Gln Leu Phe Thr Ser Phe Gly Thr Thr His Arg Asp Pro Ala Leu
355 360 365
ttc cca aac cca gaa aga ttc gat gct tcc aga ttt gaa ggt aac ggt 1152
Phe Pro Asn Pro Glu Arg Phe Asp Ala Ser Arg Phe Glu Gly Asn Gly
370 375 380
cca cct tct tac tct tac atc cca ttc ggt ggt ggt cca aga atg tgt 1200
Pro Pro Ser Tyr Ser Tyr Ile Pro Phe Gly Gly Gly Pro Arg Met Cys
385 390 395 400
att ggt tac gaa ttt gcc aga ttg gaa atg ttg atc ttc tta cat aac 1248
Ile Gly Tyr Glu Phe Ala Arg Leu Glu Met Leu Ile Phe Leu His Asn
405 410 415
atc atc aag aga ttc aaa tgg gat atc ttg atc cca gat gaa caa ttc 1296
Ile Ile Lys Arg Phe Lys Trp Asp Ile Leu Ile Pro Asp Glu Gln Phe
420 425 430
ggt tac aac cca tta ttg gct cca tct caa ggt ttc cca gtc aga tta 1344
Gly Tyr Asn Pro Leu Leu Ala Pro Ser Gln Gly Phe Pro Val Arg Leu
435 440 445
aga cca cac cac tct cat cta 1365
Arg Pro His His Ser His Leu
450 455
<210> 30
<211> 455
<212> PRT
<213> Ricinus communis
<400> 30
Met Glu Leu Val Met Phe Pro Val Leu Ala Leu Val Ser Thr Leu Phe
1 5 10 15
Leu Leu Ala Leu His Phe Ile Ile Arg Thr Leu Lys Glu Arg Leu Phe
20 25 30
Gly Ser Pro Asn Leu Pro Pro Gly Arg Leu Gly Trp Pro Leu Ile Gly
35 40 45
Glu Thr Pro Ala Phe Phe Arg Ala Gly Phe Glu Ala Lys Pro Glu Lys
50 55 60
Phe Ile Gly Glu Arg Met Glu Lys Tyr Asp Ser Arg Val Phe Lys Thr
65 70 75 80
Ser Leu Leu Gly Lys Pro Phe Ala Val Ile Ser Gly Thr Ala Gly His
85 90 95
Lys Phe Leu Phe Ser Asn Glu Asn Lys Leu Val Asn Leu Trp Trp Pro
100 105 110
Glu Ser Val Arg Met Leu Phe Lys Ser Ala Leu Val Ser Val Val Gly
115 120 125
Asp Glu Ala Lys Arg Ile Arg Lys Met Leu Met Thr Phe Leu Gly Leu
130 135 140
Asp Ala Leu Lys Asn Tyr Thr Glu Arg Ile Asp Met Val Thr Gln Gln
145 150 155 160
His Ile Arg Thr Tyr Trp Glu Gly Lys Glu Glu Val Thr Val Tyr Ser
165 170 175
Thr Leu Lys Leu Tyr Thr Phe Thr Leu Ala Cys Asn Leu Phe Ala Ser
180 185 190
Ile Asn Asp Pro Glu Arg Leu Ser Lys Leu Gly Ala His Phe Asp Val
195 200 205
Phe Val Lys Gly Val Ile Ser Leu Pro Ile Ser Ile Pro Gly Thr Arg
210 215 220
Leu Tyr Lys Ser Met Lys Ala Ala Asn Ala Ile Arg Glu Glu Leu Lys
225 230 235 240
Leu Ile Val Arg Asp Arg Lys Glu Ala Leu Glu Arg Lys Met Ala Ser
245 250 255
Pro Thr Gln Asp Leu Leu Ser Tyr Leu Leu Val Asp Ser Asp Thr Asn
260 265 270
Gly Arg Phe Leu Ser Glu Met Glu Ile Leu Asp Asn Ile Met Leu Leu
275 280 285
Leu Tyr Ala Glu Gln Leu Glu Ile Ala Asn Ser Lys Lys Pro Gly Glu
290 295 300
Leu Leu Gln Trp Glu Asp Val Gln Lys Met Arg Tyr Ser Trp Asn Val
305 310 315 320
Ile Ser Glu Val Leu Arg Leu Ser Pro Pro Val Ser Ser Ala Tyr Arg
325 330 335
His Ala Ile Val Asp Phe Thr Tyr Glu Gly Tyr Thr Ile Pro Lys Gly
340 345 350
Trp Gln Leu Phe Thr Ser Phe Gly Thr Thr His Arg Asp Pro Ala Leu
355 360 365
Phe Pro Asn Pro Glu Arg Phe Asp Ala Ser Arg Phe Glu Gly Asn Gly
370 375 380
Pro Pro Ser Tyr Ser Tyr Ile Pro Phe Gly Gly Gly Pro Arg Met Cys
385 390 395 400
Ile Gly Tyr Glu Phe Ala Arg Leu Glu Met Leu Ile Phe Leu His Asn
405 410 415
Ile Ile Lys Arg Phe Lys Trp Asp Ile Leu Ile Pro Asp Glu Gln Phe
420 425 430
Gly Tyr Asn Pro Leu Leu Ala Pro Ser Gln Gly Phe Pro Val Arg Leu
435 440 445
Arg Pro His His Ser His Leu
450 455
<210> 31
<211> 1428
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1428)
<400> 31
atg att caa gtt ttg acc cca atc ttg ttg ttc ttg atc ttc ttc gtc 48
Met Ile Gln Val Leu Thr Pro Ile Leu Leu Phe Leu Ile Phe Phe Val
1 5 10 15
ttt tgg aag gtc tac aag cac caa aag acc aag atc aac ttg cct cca 96
Phe Trp Lys Val Tyr Lys His Gln Lys Thr Lys Ile Asn Leu Pro Pro
20 25 30
ggt tct ttc ggt tgg cca ttc ttg ggt gaa act tta gct ttg ttg aga 144
Gly Ser Phe Gly Trp Pro Phe Leu Gly Glu Thr Leu Ala Leu Leu Arg
35 40 45
gct ggt tgg gac tct gaa cca gaa aga ttc gtt aga gaa aga atc aag 192
Ala Gly Trp Asp Ser Glu Pro Glu Arg Phe Val Arg Glu Arg Ile Lys
50 55 60
aag cat ggt tct cca ttg gtt ttc aag acc tct ttg ttc ggt gac aga 240
Lys His Gly Ser Pro Leu Val Phe Lys Thr Ser Leu Phe Gly Asp Arg
65 70 75 80
ttc gct gtc ttg tgt ggt cca gct ggt aac aaa ttc ttg ttc tgt aac 288
Phe Ala Val Leu Cys Gly Pro Ala Gly Asn Lys Phe Leu Phe Cys Asn
85 90 95
gaa aac aaa ttg gtt gct tcc tgg tgg cca gtt cca gtt aga aag ttg 336
Glu Asn Lys Leu Val Ala Ser Trp Trp Pro Val Pro Val Arg Lys Leu
100 105 110
ttc ggt aag tcc ttg ttg act atc aga ggt gac gaa gcc aag tgg atg 384
Phe Gly Lys Ser Leu Leu Thr Ile Arg Gly Asp Glu Ala Lys Trp Met
115 120 125
aga aag atg tta ttg tct tac ttg ggt cca gat gcc ttt gct acc cac 432
Arg Lys Met Leu Leu Ser Tyr Leu Gly Pro Asp Ala Phe Ala Thr His
130 135 140
tac gct gtc acc atg gac gtt gtt acc cgt cgt cac att gat gtc cac 480
Tyr Ala Val Thr Met Asp Val Val Thr Arg Arg His Ile Asp Val His
145 150 155 160
tgg aga ggt aag gaa gaa gtt aac gtt ttc caa acc gtc aaa ttg tac 528
Trp Arg Gly Lys Glu Glu Val Asn Val Phe Gln Thr Val Lys Leu Tyr
165 170 175
gct ttc gaa ttg gct tgt aga tta ttc atg aac ttg gat gac cca aac 576
Ala Phe Glu Leu Ala Cys Arg Leu Phe Met Asn Leu Asp Asp Pro Asn
180 185 190
cat att gcc aaa tta ggt tct tta ttc aac atc ttc ttg aag ggt atc 624
His Ile Ala Lys Leu Gly Ser Leu Phe Asn Ile Phe Leu Lys Gly Ile
195 200 205
atc gaa ttg cca att gat gtt cca ggt act aga ttc tac tcc tcc aag 672
Ile Glu Leu Pro Ile Asp Val Pro Gly Thr Arg Phe Tyr Ser Ser Lys
210 215 220
aag gct gct gct gcc atc aga att gaa ttg aag aaa ttg atc aag gcc 720
Lys Ala Ala Ala Ala Ile Arg Ile Glu Leu Lys Lys Leu Ile Lys Ala
225 230 235 240
aga aag ttg gaa ttg aag gaa ggt aag gct tct tcc tct caa gat ttg 768
Arg Lys Leu Glu Leu Lys Glu Gly Lys Ala Ser Ser Ser Gln Asp Leu
245 250 255
cta tcc cac tta ttg acc tct cca gac gaa aat ggt atg ttc ttg act 816
Leu Ser His Leu Leu Thr Ser Pro Asp Glu Asn Gly Met Phe Leu Thr
260 265 270
gaa gaa gaa att gtc gac aac atc tta ttg cta ttg ttc gct ggt cac 864
Glu Glu Glu Ile Val Asp Asn Ile Leu Leu Leu Leu Phe Ala Gly His
275 280 285
gac acc tct gct cta tcc att act ttg ttg atg aaa act ttg ggt gaa 912
Asp Thr Ser Ala Leu Ser Ile Thr Leu Leu Met Lys Thr Leu Gly Glu
290 295 300
cac tct gat gtt tac gac aag gtc tta aag gaa caa ttg gaa atc tcc 960
His Ser Asp Val Tyr Asp Lys Val Leu Lys Glu Gln Leu Glu Ile Ser
305 310 315 320
aag act aag gaa gct tgg gaa tct ttg aaa tgg gaa gat atc caa aag 1008
Lys Thr Lys Glu Ala Trp Glu Ser Leu Lys Trp Glu Asp Ile Gln Lys
325 330 335
atg aag tac tcc tgg tct gtt atc tgt gaa gtc atg aga ttg aac cct 1056
Met Lys Tyr Ser Trp Ser Val Ile Cys Glu Val Met Arg Leu Asn Pro
340 345 350
cca gtc att ggt act tac aga gaa gct ttg gtc gat atc gac tac gct 1104
Pro Val Ile Gly Thr Tyr Arg Glu Ala Leu Val Asp Ile Asp Tyr Ala
355 360 365
ggt tac acc att cca aag ggt tgg aag ttg cac tgg tct gcc gtt tcc 1152
Gly Tyr Thr Ile Pro Lys Gly Trp Lys Leu His Trp Ser Ala Val Ser
370 375 380
act caa aga gat gaa gct aac ttc gaa gat gtc acc aga ttt gac cca 1200
Thr Gln Arg Asp Glu Ala Asn Phe Glu Asp Val Thr Arg Phe Asp Pro
385 390 395 400
tct cgt ttc gaa ggt gct ggt cca act cca ttc act ttt gtt cca ttc 1248
Ser Arg Phe Glu Gly Ala Gly Pro Thr Pro Phe Thr Phe Val Pro Phe
405 410 415
ggt ggt ggt cca aga atg tgt ttg ggt aag gaa ttc gct aga tta gaa 1296
Gly Gly Gly Pro Arg Met Cys Leu Gly Lys Glu Phe Ala Arg Leu Glu
420 425 430
gtt ttg gct ttc ttg cac aac att gtc acc aac ttt aaa tgg gac ttg 1344
Val Leu Ala Phe Leu His Asn Ile Val Thr Asn Phe Lys Trp Asp Leu
435 440 445
ttg att cca gac gaa aag atc gaa tac gac cca atg gct act cca gcc 1392
Leu Ile Pro Asp Glu Lys Ile Glu Tyr Asp Pro Met Ala Thr Pro Ala
450 455 460
aag ggt ttg cca atc aga ttg cac cct cac caa gtc 1428
Lys Gly Leu Pro Ile Arg Leu His Pro His Gln Val
465 470 475
<210> 32
<211> 476
<212> PRT
<213> Stevia rebaudiana
<400> 32
Met Ile Gln Val Leu Thr Pro Ile Leu Leu Phe Leu Ile Phe Phe Val
1 5 10 15
Phe Trp Lys Val Tyr Lys His Gln Lys Thr Lys Ile Asn Leu Pro Pro
20 25 30
Gly Ser Phe Gly Trp Pro Phe Leu Gly Glu Thr Leu Ala Leu Leu Arg
35 40 45
Ala Gly Trp Asp Ser Glu Pro Glu Arg Phe Val Arg Glu Arg Ile Lys
50 55 60
Lys His Gly Ser Pro Leu Val Phe Lys Thr Ser Leu Phe Gly Asp Arg
65 70 75 80
Phe Ala Val Leu Cys Gly Pro Ala Gly Asn Lys Phe Leu Phe Cys Asn
85 90 95
Glu Asn Lys Leu Val Ala Ser Trp Trp Pro Val Pro Val Arg Lys Leu
100 105 110
Phe Gly Lys Ser Leu Leu Thr Ile Arg Gly Asp Glu Ala Lys Trp Met
115 120 125
Arg Lys Met Leu Leu Ser Tyr Leu Gly Pro Asp Ala Phe Ala Thr His
130 135 140
Tyr Ala Val Thr Met Asp Val Val Thr Arg Arg His Ile Asp Val His
145 150 155 160
Trp Arg Gly Lys Glu Glu Val Asn Val Phe Gln Thr Val Lys Leu Tyr
165 170 175
Ala Phe Glu Leu Ala Cys Arg Leu Phe Met Asn Leu Asp Asp Pro Asn
180 185 190
His Ile Ala Lys Leu Gly Ser Leu Phe Asn Ile Phe Leu Lys Gly Ile
195 200 205
Ile Glu Leu Pro Ile Asp Val Pro Gly Thr Arg Phe Tyr Ser Ser Lys
210 215 220
Lys Ala Ala Ala Ala Ile Arg Ile Glu Leu Lys Lys Leu Ile Lys Ala
225 230 235 240
Arg Lys Leu Glu Leu Lys Glu Gly Lys Ala Ser Ser Ser Gln Asp Leu
245 250 255
Leu Ser His Leu Leu Thr Ser Pro Asp Glu Asn Gly Met Phe Leu Thr
260 265 270
Glu Glu Glu Ile Val Asp Asn Ile Leu Leu Leu Leu Phe Ala Gly His
275 280 285
Asp Thr Ser Ala Leu Ser Ile Thr Leu Leu Met Lys Thr Leu Gly Glu
290 295 300
His Ser Asp Val Tyr Asp Lys Val Leu Lys Glu Gln Leu Glu Ile Ser
305 310 315 320
Lys Thr Lys Glu Ala Trp Glu Ser Leu Lys Trp Glu Asp Ile Gln Lys
325 330 335
Met Lys Tyr Ser Trp Ser Val Ile Cys Glu Val Met Arg Leu Asn Pro
340 345 350
Pro Val Ile Gly Thr Tyr Arg Glu Ala Leu Val Asp Ile Asp Tyr Ala
355 360 365
Gly Tyr Thr Ile Pro Lys Gly Trp Lys Leu His Trp Ser Ala Val Ser
370 375 380
Thr Gln Arg Asp Glu Ala Asn Phe Glu Asp Val Thr Arg Phe Asp Pro
385 390 395 400
Ser Arg Phe Glu Gly Ala Gly Pro Thr Pro Phe Thr Phe Val Pro Phe
405 410 415
Gly Gly Gly Pro Arg Met Cys Leu Gly Lys Glu Phe Ala Arg Leu Glu
420 425 430
Val Leu Ala Phe Leu His Asn Ile Val Thr Asn Phe Lys Trp Asp Leu
435 440 445
Leu Ile Pro Asp Glu Lys Ile Glu Tyr Asp Pro Met Ala Thr Pro Ala
450 455 460
Lys Gly Leu Pro Ile Arg Leu His Pro His Gln Val
465 470 475
<210> 33
<211> 1575
<212> DNA
<213> Arabidopsis thaliana
<220>
<221> CDS
<222> (1)..(1575)
<400> 33
atg gaa tct tta gtc gtt cac acc gtc aat gcc atc tgg tgt att gtc 48
Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val
1 5 10 15
att gtt ggt att ttc tct gtt ggt tac cac gtt tac ggt cgt gcc gtt 96
Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val
20 25 30
gtt gaa caa tgg aga atg aga aga tct ttg aaa ttg caa ggt gtc aag 144
Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys
35 40 45
ggt cca cca cca tcc att ttc aac ggt aat gtc tct gaa atg caa aga 192
Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg
50 55 60
atc caa tct gaa gct aag cac tgt tcc ggt gac aac atc att tct cac 240
Ile Gln Ser Glu Ala Lys His Cys Ser Gly Asp Asn Ile Ile Ser His
65 70 75 80
gat tac tcc tcc tct ttg ttc cct cac ttt gac cac tgg aga aag caa 288
Asp Tyr Ser Ser Ser Leu Phe Pro His Phe Asp His Trp Arg Lys Gln
85 90 95
tac ggt aga atc tac acc tac tcc act ggt ttg aaa caa cat ttg tac 336
Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Lys Gln His Leu Tyr
100 105 110
atc aac cat cca gaa atg gtc aag gaa tta tct caa acc aac act ttg 384
Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Thr Leu
115 120 125
aac tta ggt cgt atc act cac atc acc aag aga ttg aac cca atc tta 432
Asn Leu Gly Arg Ile Thr His Ile Thr Lys Arg Leu Asn Pro Ile Leu
130 135 140
ggt aac ggt atc atc act tcc aac ggt cca cac tgg gct cat caa aga 480
Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg
145 150 155 160
aga att att gct tac gaa ttc acc cac gac aaa atc aag ggt atg gtc 528
Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Ile Lys Gly Met Val
165 170 175
ggt ttg atg gtc gaa tct gcc atg cca atg ttg aac aaa tgg gaa gaa 576
Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu
180 185 190
atg gtt aag aga ggt ggt gaa atg ggt tgt gac atc cgt gtt gac gaa 624
Met Val Lys Arg Gly Gly Glu Met Gly Cys Asp Ile Arg Val Asp Glu
195 200 205
gat ttg aag gat gtt tct gct gat gtc att gct aag gct tgt ttc ggt 672
Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly
210 215 220
tcc tct ttc tcc aag ggt aag gct atc ttc tcc atg atc aga gac ttg 720
Ser Ser Phe Ser Lys Gly Lys Ala Ile Phe Ser Met Ile Arg Asp Leu
225 230 235 240
ttg act gcc atc act aag aga tct gtt ttg ttc aga ttc aac ggt ttc 768
Leu Thr Ala Ile Thr Lys Arg Ser Val Leu Phe Arg Phe Asn Gly Phe
245 250 255
acc gac atg gtt ttc ggt tcc aag aag cat ggt gat gtc gat atc gat 816
Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp
260 265 270
gct ttg gaa atg gaa ttg gaa tct tct atc tgg gaa acc gtt aag gaa 864
Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu
275 280 285
aga gaa att gaa tgt aag gac act cac aag aag gat ttg atg caa tta 912
Arg Glu Ile Glu Cys Lys Asp Thr His Lys Lys Asp Leu Met Gln Leu
290 295 300
atc ttg gaa ggt gcc atg aga tct tgt gac ggt aac ttg tgg gac aag 960
Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys
305 310 315 320
tct gct tac aga aga ttt gtt gtc gac aac tgt aaa tcc atc tac ttt 1008
Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Phe
325 330 335
gcc ggt cac gac tct act gct gtc tcc gtt tcc tgg tgt ttg atg ttg 1056
Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu
340 345 350
cta gct ttg aac cca tcc tgg caa gtc aag atc aga gat gaa atc tta 1104
Leu Ala Leu Asn Pro Ser Trp Gln Val Lys Ile Arg Asp Glu Ile Leu
355 360 365
tct tct tgt aag aac ggt att cca gat gct gaa tcc att cca aac ttg 1152
Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu
370 375 380
aag acc gtt acc atg gtc att caa gaa act atg aga ttg tac cca cca 1200
Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro
385 390 395 400
gct cca att gtc ggt aga gaa gct tcc aag gac atc aga tta ggt gac 1248
Ala Pro Ile Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp
405 410 415
ttg gtt gtt cca aag ggt gtt tgt atc tgg act ttg att cca gct ttg 1296
Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu
420 425 430
cac cgt gac cca gaa atc tgg ggt cca gat gct aac gac ttc aag cca 1344
His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro
435 440 445
gaa aga ttc tct gaa ggt att tcc aag gct tgt aaa tac cca caa tct 1392
Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ser
450 455 460
tac atc cca ttc ggt ttg ggt cca aga acc tgt gtc ggt aag aac ttc 1440
Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe
465 470 475 480
ggt atg atg gaa gtc aaa gtt ttg gtt tct ttg att gtt tcc aag ttc 1488
Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe
485 490 495
tct ttc acc ttg tct cca act tac caa cac tct cca tct cac aag ttg 1536
Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu
500 505 510
ttg gtt gaa cct caa cac ggt gtt gtc att aga gtc gtt 1575
Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val
515 520 525
<210> 34
<211> 525
<212> PRT
<213> Arabidopsis thaliana
<400> 34
Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val
1 5 10 15
Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val
20 25 30
Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys
35 40 45
Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg
50 55 60
Ile Gln Ser Glu Ala Lys His Cys Ser Gly Asp Asn Ile Ile Ser His
65 70 75 80
Asp Tyr Ser Ser Ser Leu Phe Pro His Phe Asp His Trp Arg Lys Gln
85 90 95
Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Lys Gln His Leu Tyr
100 105 110
Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Thr Leu
115 120 125
Asn Leu Gly Arg Ile Thr His Ile Thr Lys Arg Leu Asn Pro Ile Leu
130 135 140
Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg
145 150 155 160
Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Ile Lys Gly Met Val
165 170 175
Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu
180 185 190
Met Val Lys Arg Gly Gly Glu Met Gly Cys Asp Ile Arg Val Asp Glu
195 200 205
Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly
210 215 220
Ser Ser Phe Ser Lys Gly Lys Ala Ile Phe Ser Met Ile Arg Asp Leu
225 230 235 240
Leu Thr Ala Ile Thr Lys Arg Ser Val Leu Phe Arg Phe Asn Gly Phe
245 250 255
Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp
260 265 270
Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu
275 280 285
Arg Glu Ile Glu Cys Lys Asp Thr His Lys Lys Asp Leu Met Gln Leu
290 295 300
Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys
305 310 315 320
Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Phe
325 330 335
Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu
340 345 350
Leu Ala Leu Asn Pro Ser Trp Gln Val Lys Ile Arg Asp Glu Ile Leu
355 360 365
Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu
370 375 380
Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro
385 390 395 400
Ala Pro Ile Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp
405 410 415
Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu
420 425 430
His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro
435 440 445
Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ser
450 455 460
Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe
465 470 475 480
Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe
485 490 495
Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu
500 505 510
Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val
515 520 525
<210> 35
<211> 1437
<212> DNA
<213> Ixeris dentata var. albiflora
<220>
<221> CDS
<222> (1)..(1437)
<400> 35
atg gat gcc gtt gcc gtc aac tct gaa acc atg tcc cac gtt gtt ttc 48
Met Asp Ala Val Ala Val Asn Ser Glu Thr Met Ser His Val Val Phe
1 5 10 15
att cca ttc cca gct caa tct cac atc aag tgt atg ttg aag ttg gcc 96
Ile Pro Phe Pro Ala Gln Ser His Ile Lys Cys Met Leu Lys Leu Ala
20 25 30
cgt cta tta cac cac aag ggt tta cac atc act ttc gtc aac act gaa 144
Arg Leu Leu His His Lys Gly Leu His Ile Thr Phe Val Asn Thr Glu
35 40 45
ttg aac cac aac caa ttg tta tct tct ggt ggt cca aac tct ttg gac 192
Leu Asn His Asn Gln Leu Leu Ser Ser Gly Gly Pro Asn Ser Leu Asp
50 55 60
ggt gaa cca ggt ttc aga ttc aag acc att cca gat ggt gtt cca gaa 240
Gly Glu Pro Gly Phe Arg Phe Lys Thr Ile Pro Asp Gly Val Pro Glu
65 70 75 80
ggt gct cca gac ttc atg tac gct tta tgt gac tct gtc ttg aac aaa 288
Gly Ala Pro Asp Phe Met Tyr Ala Leu Cys Asp Ser Val Leu Asn Lys
85 90 95
atg ttg gac cca ttt gtc gat ttg att ggt aga ttg gaa tct cca gcc 336
Met Leu Asp Pro Phe Val Asp Leu Ile Gly Arg Leu Glu Ser Pro Ala
100 105 110
acc tgt atc atc ggt gac ggt atg atg cct ttc act gtt gct gct gct 384
Thr Cys Ile Ile Gly Asp Gly Met Met Pro Phe Thr Val Ala Ala Ala
115 120 125
gaa aag ttg aaa ttg cca att atg cat ttc tgg act ttc cca gct gct 432
Glu Lys Leu Lys Leu Pro Ile Met His Phe Trp Thr Phe Pro Ala Ala
130 135 140
gct ttc ttg ggt tac tac caa gct cca aac ttg att gaa aag ggt ttc 480
Ala Phe Leu Gly Tyr Tyr Gln Ala Pro Asn Leu Ile Glu Lys Gly Phe
145 150 155 160
atc cca cct aag gac gaa tcc tgg tcc acc aac ggt tac ttg gaa acc 528
Ile Pro Pro Lys Asp Glu Ser Trp Ser Thr Asn Gly Tyr Leu Glu Thr
165 170 175
gtt gtt gac tcc att tct ggt ttg gaa ggt ttc aga atc aga gat atc 576
Val Val Asp Ser Ile Ser Gly Leu Glu Gly Phe Arg Ile Arg Asp Ile
180 185 190
cca gct tac ttc aga acc act gac cca aac gat tct gat ttc aac tac 624
Pro Ala Tyr Phe Arg Thr Thr Asp Pro Asn Asp Ser Asp Phe Asn Tyr
195 200 205
atc atc gaa tgt gtt aag gcc atc aga aag gtt tcc aac att gtc ttg 672
Ile Ile Glu Cys Val Lys Ala Ile Arg Lys Val Ser Asn Ile Val Leu
210 215 220
cac act ttc gaa gaa ttg gaa tct acc att atc aag gcc ttg caa cca 720
His Thr Phe Glu Glu Leu Glu Ser Thr Ile Ile Lys Ala Leu Gln Pro
225 230 235 240
atg atc cca cac gtc tac acc att ggt cca ttg gaa ttg ttg ttg aac 768
Met Ile Pro His Val Tyr Thr Ile Gly Pro Leu Glu Leu Leu Leu Asn
245 250 255
cca atc aag tta gaa gaa gaa act gaa aag ttg gac att aaa ggt tac 816
Pro Ile Lys Leu Glu Glu Glu Thr Glu Lys Leu Asp Ile Lys Gly Tyr
260 265 270
tct tta tgg aag gaa gat gac gaa tgt ttg aaa tgg ttg gac tcc aag 864
Ser Leu Trp Lys Glu Asp Asp Glu Cys Leu Lys Trp Leu Asp Ser Lys
275 280 285
gaa cca aac tcc gtc att tac gtc aac ttt ggt tct ttg atc tcc atg 912
Glu Pro Asn Ser Val Ile Tyr Val Asn Phe Gly Ser Leu Ile Ser Met
290 295 300
tcc aag gaa caa tta gct gaa ttt ggt tgg ggt ttg gtt aac tct aac 960
Ser Lys Glu Gln Leu Ala Glu Phe Gly Trp Gly Leu Val Asn Ser Asn
305 310 315 320
cac tgt ttc ttg tgg gtt atc aga aga gac ttg gtt gtc ggt gat tct 1008
His Cys Phe Leu Trp Val Ile Arg Arg Asp Leu Val Val Gly Asp Ser
325 330 335
gct cca ttg cct cca gaa ttg aaa gaa cgt atc aac gaa aga ggt ttc 1056
Ala Pro Leu Pro Pro Glu Leu Lys Glu Arg Ile Asn Glu Arg Gly Phe
340 345 350
atc gct tcc tgg tgt cca caa gaa aag gtc ttg aaa cat tcc tct gtt 1104
Ile Ala Ser Trp Cys Pro Gln Glu Lys Val Leu Lys His Ser Ser Val
355 360 365
ggt ggt ttc ttg act cac tgt ggt tgg ggt tcc atc atc gaa tct cta 1152
Gly Gly Phe Leu Thr His Cys Gly Trp Gly Ser Ile Ile Glu Ser Leu
370 375 380
tct gct ggt gtc cca atg ttg tgt tgg cca tac ttg tgg gac caa cca 1200
Ser Ala Gly Val Pro Met Leu Cys Trp Pro Tyr Leu Trp Asp Gln Pro
385 390 395 400
acc aac tgt cgt caa gct tgt aag gaa tgg gaa gtt ggt ttg gaa att 1248
Thr Asn Cys Arg Gln Ala Cys Lys Glu Trp Glu Val Gly Leu Glu Ile
405 410 415
gaa ggt aac gtt aac aag gat gaa gtc gaa aga ttg acc aga gaa ttg 1296
Glu Gly Asn Val Asn Lys Asp Glu Val Glu Arg Leu Thr Arg Glu Leu
420 425 430
atc ggt ggt gaa aag ggt aag caa atg aga tcc aag gct ttg gaa tgg 1344
Ile Gly Gly Glu Lys Gly Lys Gln Met Arg Ser Lys Ala Leu Glu Trp
435 440 445
aag aag aag atc gaa att gct act ggt cca aag ggt tct tct tct tta 1392
Lys Lys Lys Ile Glu Ile Ala Thr Gly Pro Lys Gly Ser Ser Ser Leu
450 455 460
aat gtc gaa aga ttg gct aac gac att aac atg ttc tcc aga aat 1437
Asn Val Glu Arg Leu Ala Asn Asp Ile Asn Met Phe Ser Arg Asn
465 470 475
<210> 36
<211> 479
<212> PRT
<213> Ixeris dentata var. albiflora
<400> 36
Met Asp Ala Val Ala Val Asn Ser Glu Thr Met Ser His Val Val Phe
1 5 10 15
Ile Pro Phe Pro Ala Gln Ser His Ile Lys Cys Met Leu Lys Leu Ala
20 25 30
Arg Leu Leu His His Lys Gly Leu His Ile Thr Phe Val Asn Thr Glu
35 40 45
Leu Asn His Asn Gln Leu Leu Ser Ser Gly Gly Pro Asn Ser Leu Asp
50 55 60
Gly Glu Pro Gly Phe Arg Phe Lys Thr Ile Pro Asp Gly Val Pro Glu
65 70 75 80
Gly Ala Pro Asp Phe Met Tyr Ala Leu Cys Asp Ser Val Leu Asn Lys
85 90 95
Met Leu Asp Pro Phe Val Asp Leu Ile Gly Arg Leu Glu Ser Pro Ala
100 105 110
Thr Cys Ile Ile Gly Asp Gly Met Met Pro Phe Thr Val Ala Ala Ala
115 120 125
Glu Lys Leu Lys Leu Pro Ile Met His Phe Trp Thr Phe Pro Ala Ala
130 135 140
Ala Phe Leu Gly Tyr Tyr Gln Ala Pro Asn Leu Ile Glu Lys Gly Phe
145 150 155 160
Ile Pro Pro Lys Asp Glu Ser Trp Ser Thr Asn Gly Tyr Leu Glu Thr
165 170 175
Val Val Asp Ser Ile Ser Gly Leu Glu Gly Phe Arg Ile Arg Asp Ile
180 185 190
Pro Ala Tyr Phe Arg Thr Thr Asp Pro Asn Asp Ser Asp Phe Asn Tyr
195 200 205
Ile Ile Glu Cys Val Lys Ala Ile Arg Lys Val Ser Asn Ile Val Leu
210 215 220
His Thr Phe Glu Glu Leu Glu Ser Thr Ile Ile Lys Ala Leu Gln Pro
225 230 235 240
Met Ile Pro His Val Tyr Thr Ile Gly Pro Leu Glu Leu Leu Leu Asn
245 250 255
Pro Ile Lys Leu Glu Glu Glu Thr Glu Lys Leu Asp Ile Lys Gly Tyr
260 265 270
Ser Leu Trp Lys Glu Asp Asp Glu Cys Leu Lys Trp Leu Asp Ser Lys
275 280 285
Glu Pro Asn Ser Val Ile Tyr Val Asn Phe Gly Ser Leu Ile Ser Met
290 295 300
Ser Lys Glu Gln Leu Ala Glu Phe Gly Trp Gly Leu Val Asn Ser Asn
305 310 315 320
His Cys Phe Leu Trp Val Ile Arg Arg Asp Leu Val Val Gly Asp Ser
325 330 335
Ala Pro Leu Pro Pro Glu Leu Lys Glu Arg Ile Asn Glu Arg Gly Phe
340 345 350
Ile Ala Ser Trp Cys Pro Gln Glu Lys Val Leu Lys His Ser Ser Val
355 360 365
Gly Gly Phe Leu Thr His Cys Gly Trp Gly Ser Ile Ile Glu Ser Leu
370 375 380
Ser Ala Gly Val Pro Met Leu Cys Trp Pro Tyr Leu Trp Asp Gln Pro
385 390 395 400
Thr Asn Cys Arg Gln Ala Cys Lys Glu Trp Glu Val Gly Leu Glu Ile
405 410 415
Glu Gly Asn Val Asn Lys Asp Glu Val Glu Arg Leu Thr Arg Glu Leu
420 425 430
Ile Gly Gly Glu Lys Gly Lys Gln Met Arg Ser Lys Ala Leu Glu Trp
435 440 445
Lys Lys Lys Ile Glu Ile Ala Thr Gly Pro Lys Gly Ser Ser Ser Leu
450 455 460
Asn Val Glu Arg Leu Ala Asn Asp Ile Asn Met Phe Ser Arg Asn
465 470 475
<210> 37
<211> 1446
<212> DNA
<213> Ricinus communis
<220>
<221> CDS
<222> (1)..(1446)
<400> 37
atg ggt tcc att gtc cgt gac cac gac aag cca cac gtt gtt tgt gtt 48
Met Gly Ser Ile Val Arg Asp His Asp Lys Pro His Val Val Cys Val
1 5 10 15
cca tac cca gct caa ggt cac gtt aac cca atg gtc aaa ttg gcc aag 96
Pro Tyr Pro Ala Gln Gly His Val Asn Pro Met Val Lys Leu Ala Lys
20 25 30
ttg ttg cac tac aac gat ttc cac gtc act ttc gtc aac act gaa tac 144
Leu Leu His Tyr Asn Asp Phe His Val Thr Phe Val Asn Thr Glu Tyr
35 40 45
aac cac aga aga tta ttg aac tcc aga ggt cct tct tct ttg gac ggt 192
Asn His Arg Arg Leu Leu Asn Ser Arg Gly Pro Ser Ser Leu Asp Gly
50 55 60
ttg cca gat ttc aga ttc gaa gcc atc tct gac ggt ttg cca cca tct 240
Leu Pro Asp Phe Arg Phe Glu Ala Ile Ser Asp Gly Leu Pro Pro Ser
65 70 75 80
gat gct aac gct acc caa gat atc cca tct cta tgt gac tct acc tcc 288
Asp Ala Asn Ala Thr Gln Asp Ile Pro Ser Leu Cys Asp Ser Thr Ser
85 90 95
aag aac tct ttg gct cca ttc aga aac ttg ttg ttg aag ttg aaa tcc 336
Lys Asn Ser Leu Ala Pro Phe Arg Asn Leu Leu Leu Lys Leu Lys Ser
100 105 110
tct gac tct ttg cca cca gtt acc tgt atc att tct gat gct tgt atg 384
Ser Asp Ser Leu Pro Pro Val Thr Cys Ile Ile Ser Asp Ala Cys Met
115 120 125
tcc ttc act ttg gat gct gct gaa gaa ttt ggt att cca gaa atc tta 432
Ser Phe Thr Leu Asp Ala Ala Glu Glu Phe Gly Ile Pro Glu Ile Leu
130 135 140
ttc tgg acc cca tct tct tgt ggt gtt ttg ggt tac tct caa tac cac 480
Phe Trp Thr Pro Ser Ser Cys Gly Val Leu Gly Tyr Ser Gln Tyr His
145 150 155 160
act ttg att gaa aag ggt ttg act cca tta aag gac gcc tct tac ttg 528
Thr Leu Ile Glu Lys Gly Leu Thr Pro Leu Lys Asp Ala Ser Tyr Leu
165 170 175
acc aac ggt tac ttg gaa acc act ttg gac tgg att cca ggt atg aag 576
Thr Asn Gly Tyr Leu Glu Thr Thr Leu Asp Trp Ile Pro Gly Met Lys
180 185 190
gat atc aga ttc aga gat ttg cca tct ttc atc aga acc act gac aga 624
Asp Ile Arg Phe Arg Asp Leu Pro Ser Phe Ile Arg Thr Thr Asp Arg
195 200 205
aac gat atc atg ttg aac ttt gtt gtc cgt gaa ttg gaa aga act tcc 672
Asn Asp Ile Met Leu Asn Phe Val Val Arg Glu Leu Glu Arg Thr Ser
210 215 220
aga gct tct gct gtt gtt ttc aac act ttc tac gcc ttc gaa aag gac 720
Arg Ala Ser Ala Val Val Phe Asn Thr Phe Tyr Ala Phe Glu Lys Asp
225 230 235 240
gtc tta gat gtc tta tcc acc atg ttc cca cca atc tac tcc atc ggt 768
Val Leu Asp Val Leu Ser Thr Met Phe Pro Pro Ile Tyr Ser Ile Gly
245 250 255
cca ttg caa ttg ttg gtt gac caa atc cca att gac aga aac ttg ggt 816
Pro Leu Gln Leu Leu Val Asp Gln Ile Pro Ile Asp Arg Asn Leu Gly
260 265 270
aac att ggt tcc aac tta tgg aag gaa caa cca gaa tgt att gac tgg 864
Asn Ile Gly Ser Asn Leu Trp Lys Glu Gln Pro Glu Cys Ile Asp Trp
275 280 285
ttg gac acc aag gaa cca aac tct gtt gtc tac gtc aac ttc ggt tcc 912
Leu Asp Thr Lys Glu Pro Asn Ser Val Val Tyr Val Asn Phe Gly Ser
290 295 300
atc act gtt atc act cct caa caa atg att gaa ttc gct tgg ggt cta 960
Ile Thr Val Ile Thr Pro Gln Gln Met Ile Glu Phe Ala Trp Gly Leu
305 310 315 320
gct tct tct aag aaa cca ttc tta tgg atc atc aga cca gac ttg gtt 1008
Ala Ser Ser Lys Lys Pro Phe Leu Trp Ile Ile Arg Pro Asp Leu Val
325 330 335
atc ggt gaa aac gct atg ttg cca gct gaa ttc gtt tct gaa acc aag 1056
Ile Gly Glu Asn Ala Met Leu Pro Ala Glu Phe Val Ser Glu Thr Lys
340 345 350
gat cgt ggt atg ttg gct tct tgg ggt cct caa gaa caa att ttg aaa 1104
Asp Arg Gly Met Leu Ala Ser Trp Gly Pro Gln Glu Gln Ile Leu Lys
355 360 365
cat cca gct gtc ggt ggt ttc ttg tct cac atg ggt tgg aac tcc act 1152
His Pro Ala Val Gly Gly Phe Leu Ser His Met Gly Trp Asn Ser Thr
370 375 380
ttg gac tcc atg tcc ggt ggt gtc cca atg gtt tgt tgg cca ttc ttt 1200
Leu Asp Ser Met Ser Gly Gly Val Pro Met Val Cys Trp Pro Phe Phe
385 390 395 400
gct gaa caa caa acc aac tgt aga ttt gct tgt acc gaa tgg ggt gtt 1248
Ala Glu Gln Gln Thr Asn Cys Arg Phe Ala Cys Thr Glu Trp Gly Val
405 410 415
ggt atg gaa att gac aac aat gtc aag aga gat gaa gtc aag aag ttg 1296
Gly Met Glu Ile Asp Asn Asn Val Lys Arg Asp Glu Val Lys Lys Leu
420 425 430
gtt gaa gtt ttg atg gac ggt aag aaa ggt aag gaa atg aag tcc aag 1344
Val Glu Val Leu Met Asp Gly Lys Lys Gly Lys Glu Met Lys Ser Lys
435 440 445
gcc atg gaa tgg aaa acc aag gct gaa gaa gct gcc aag cca ggt ggt 1392
Ala Met Glu Trp Lys Thr Lys Ala Glu Glu Ala Ala Lys Pro Gly Gly
450 455 460
tcc tct cat aac aac ttg gac aga tta gtc aag ttc atc aag ggt caa 1440
Ser Ser His Asn Asn Leu Asp Arg Leu Val Lys Phe Ile Lys Gly Gln
465 470 475 480
aag aat 1446
Lys Asn
<210> 38
<211> 482
<212> PRT
<213> Ricinus communis
<400> 38
Met Gly Ser Ile Val Arg Asp His Asp Lys Pro His Val Val Cys Val
1 5 10 15
Pro Tyr Pro Ala Gln Gly His Val Asn Pro Met Val Lys Leu Ala Lys
20 25 30
Leu Leu His Tyr Asn Asp Phe His Val Thr Phe Val Asn Thr Glu Tyr
35 40 45
Asn His Arg Arg Leu Leu Asn Ser Arg Gly Pro Ser Ser Leu Asp Gly
50 55 60
Leu Pro Asp Phe Arg Phe Glu Ala Ile Ser Asp Gly Leu Pro Pro Ser
65 70 75 80
Asp Ala Asn Ala Thr Gln Asp Ile Pro Ser Leu Cys Asp Ser Thr Ser
85 90 95
Lys Asn Ser Leu Ala Pro Phe Arg Asn Leu Leu Leu Lys Leu Lys Ser
100 105 110
Ser Asp Ser Leu Pro Pro Val Thr Cys Ile Ile Ser Asp Ala Cys Met
115 120 125
Ser Phe Thr Leu Asp Ala Ala Glu Glu Phe Gly Ile Pro Glu Ile Leu
130 135 140
Phe Trp Thr Pro Ser Ser Cys Gly Val Leu Gly Tyr Ser Gln Tyr His
145 150 155 160
Thr Leu Ile Glu Lys Gly Leu Thr Pro Leu Lys Asp Ala Ser Tyr Leu
165 170 175
Thr Asn Gly Tyr Leu Glu Thr Thr Leu Asp Trp Ile Pro Gly Met Lys
180 185 190
Asp Ile Arg Phe Arg Asp Leu Pro Ser Phe Ile Arg Thr Thr Asp Arg
195 200 205
Asn Asp Ile Met Leu Asn Phe Val Val Arg Glu Leu Glu Arg Thr Ser
210 215 220
Arg Ala Ser Ala Val Val Phe Asn Thr Phe Tyr Ala Phe Glu Lys Asp
225 230 235 240
Val Leu Asp Val Leu Ser Thr Met Phe Pro Pro Ile Tyr Ser Ile Gly
245 250 255
Pro Leu Gln Leu Leu Val Asp Gln Ile Pro Ile Asp Arg Asn Leu Gly
260 265 270
Asn Ile Gly Ser Asn Leu Trp Lys Glu Gln Pro Glu Cys Ile Asp Trp
275 280 285
Leu Asp Thr Lys Glu Pro Asn Ser Val Val Tyr Val Asn Phe Gly Ser
290 295 300
Ile Thr Val Ile Thr Pro Gln Gln Met Ile Glu Phe Ala Trp Gly Leu
305 310 315 320
Ala Ser Ser Lys Lys Pro Phe Leu Trp Ile Ile Arg Pro Asp Leu Val
325 330 335
Ile Gly Glu Asn Ala Met Leu Pro Ala Glu Phe Val Ser Glu Thr Lys
340 345 350
Asp Arg Gly Met Leu Ala Ser Trp Gly Pro Gln Glu Gln Ile Leu Lys
355 360 365
His Pro Ala Val Gly Gly Phe Leu Ser His Met Gly Trp Asn Ser Thr
370 375 380
Leu Asp Ser Met Ser Gly Gly Val Pro Met Val Cys Trp Pro Phe Phe
385 390 395 400
Ala Glu Gln Gln Thr Asn Cys Arg Phe Ala Cys Thr Glu Trp Gly Val
405 410 415
Gly Met Glu Ile Asp Asn Asn Val Lys Arg Asp Glu Val Lys Lys Leu
420 425 430
Val Glu Val Leu Met Asp Gly Lys Lys Gly Lys Glu Met Lys Ser Lys
435 440 445
Ala Met Glu Trp Lys Thr Lys Ala Glu Glu Ala Ala Lys Pro Gly Gly
450 455 460
Ser Ser His Asn Asn Leu Asp Arg Leu Val Lys Phe Ile Lys Gly Gln
465 470 475 480
Lys Asn
<210> 39
<211> 1374
<212> DNA
<213> Ixeris dentata var. albiflora
<220>
<221> CDS
<222> (1)..(1374)
<400> 39
atg gct gaa gaa cac aac aag acc aac aac tct tct cct cac gtt gtc 48
Met Ala Glu Glu His Asn Lys Thr Asn Asn Ser Ser Pro His Val Val
1 5 10 15
atc ttc cca ttc cca tct caa ggt cac att aac cca ttg att caa ttt 96
Ile Phe Pro Phe Pro Ser Gln Gly His Ile Asn Pro Leu Ile Gln Phe
20 25 30
gcc aag aga ttg tcc tcc aag ggt gtc aag cca act ttg atc act acc 144
Ala Lys Arg Leu Ser Ser Lys Gly Val Lys Pro Thr Leu Ile Thr Thr
35 40 45
atc tac att gct aag acc tct cca tac cca aac tct tcc att gtt gtt 192
Ile Tyr Ile Ala Lys Thr Ser Pro Tyr Pro Asn Ser Ser Ile Val Val
50 55 60
gaa cca att tct gat ggt ttc gac gac ggt ggt ttc aag tct gcc act 240
Glu Pro Ile Ser Asp Gly Phe Asp Asp Gly Gly Phe Lys Ser Ala Thr
65 70 75 80
tct gct gaa tct tac att gac act ttc cac caa gtt ggt tcc aaa tct 288
Ser Ala Glu Ser Tyr Ile Asp Thr Phe His Gln Val Gly Ser Lys Ser
85 90 95
cta gct aac ttg atc aga aag ttg gtc aac gaa ggt aac cat gtc gat 336
Leu Ala Asn Leu Ile Arg Lys Leu Val Asn Glu Gly Asn His Val Asp
100 105 110
gct atc atc tac gac tct ttc gtc acc tgg gct ttg gat gtt gct atg 384
Ala Ile Ile Tyr Asp Ser Phe Val Thr Trp Ala Leu Asp Val Ala Met
115 120 125
gaa tac ggt att gac ggt ggt tgt ttc ttc acc caa gct tgt gct gtc 432
Glu Tyr Gly Ile Asp Gly Gly Cys Phe Phe Thr Gln Ala Cys Ala Val
130 135 140
aac aac atc tac tac cat gtt tac aag ggt gtc ttg gaa att cca ttg 480
Asn Asn Ile Tyr Tyr His Val Tyr Lys Gly Val Leu Glu Ile Pro Leu
145 150 155 160
caa gct gct gct cca cca acc gtc act atc tta ttg cct gaa ttg cct 528
Gln Ala Ala Ala Pro Pro Thr Val Thr Ile Leu Leu Pro Glu Leu Pro
165 170 175
caa tta caa tta tgg gaa act cca tcc ttt gtc cac aac cca ggt cca 576
Gln Leu Gln Leu Trp Glu Thr Pro Ser Phe Val His Asn Pro Gly Pro
180 185 190
tac cca ggt tgg gct cac att gtt ttc aac caa ttc cca aac atc cac 624
Tyr Pro Gly Trp Ala His Ile Val Phe Asn Gln Phe Pro Asn Ile His
195 200 205
aac gcc aga tgg gtt ttc tcc aac act ttc ttc aaa ttg gaa gaa caa 672
Asn Ala Arg Trp Val Phe Ser Asn Thr Phe Phe Lys Leu Glu Glu Gln
210 215 220
gtt atc aaa tgg atg aga ttg atg tgg cca ttg atg gtt gtc ggt cca 720
Val Ile Lys Trp Met Arg Leu Met Trp Pro Leu Met Val Val Gly Pro
225 230 235 240
act gtt cca tcc atg tac ttg gac aag aga tta gaa gat gac gat gac 768
Thr Val Pro Ser Met Tyr Leu Asp Lys Arg Leu Glu Asp Asp Asp Asp
245 250 255
tac ggt atg tct ttg ttg aag cca aac cac att gaa tgt atg ggt tgg 816
Tyr Gly Met Ser Leu Leu Lys Pro Asn His Ile Glu Cys Met Gly Trp
260 265 270
ttg aat aac aaa cca aag ggt tcc gtt gtc tac gtt tcc ttc ggt tcc 864
Leu Asn Asn Lys Pro Lys Gly Ser Val Val Tyr Val Ser Phe Gly Ser
275 280 285
tac ggt gaa tta ggt gtt gct caa atg gaa gaa att gct tgg ggt ttg 912
Tyr Gly Glu Leu Gly Val Ala Gln Met Glu Glu Ile Ala Trp Gly Leu
290 295 300
aac gaa tct tcc gtc aac tat tta tgg gtt gtc aga gaa act gaa aag 960
Asn Glu Ser Ser Val Asn Tyr Leu Trp Val Val Arg Glu Thr Glu Lys
305 310 315 320
gaa aag ttg cca aag tct ttc ttg gct aat ggt ttg att gtt gaa tgg 1008
Glu Lys Leu Pro Lys Ser Phe Leu Ala Asn Gly Leu Ile Val Glu Trp
325 330 335
tgt cgt caa ttg gaa gtt ttg gct cac gaa gct gtc ggt tgt ttc gtc 1056
Cys Arg Gln Leu Glu Val Leu Ala His Glu Ala Val Gly Cys Phe Val
340 345 350
act cac tgt ggt ttc aac tcc tct ttg gaa acc atc tct ttg ggt gtc 1104
Thr His Cys Gly Phe Asn Ser Ser Leu Glu Thr Ile Ser Leu Gly Val
355 360 365
cca gtt gtt gcc atc cca caa tgg acc gat caa acc acc aac gct aag 1152
Pro Val Val Ala Ile Pro Gln Trp Thr Asp Gln Thr Thr Asn Ala Lys
370 375 380
tgt ttg gaa gat atc tgg ggt gtt ggt atc aga gcc aag act cca gtc 1200
Cys Leu Glu Asp Ile Trp Gly Val Gly Ile Arg Ala Lys Thr Pro Val
385 390 395 400
acc aga acc aac ttg gtc tgg tgt atc aag gaa atc atg gaa ggt gaa 1248
Thr Arg Thr Asn Leu Val Trp Cys Ile Lys Glu Ile Met Glu Gly Glu
405 410 415
cgt ggt gct gtt gct aga aag aac gcc atc aag tgg aag gac ttg gcc 1296
Arg Gly Ala Val Ala Arg Lys Asn Ala Ile Lys Trp Lys Asp Leu Ala
420 425 430
att gaa gct gtc tct cca ggt ggt tcc tct gac aag gac atc aac gaa 1344
Ile Glu Ala Val Ser Pro Gly Gly Ser Ser Asp Lys Asp Ile Asn Glu
435 440 445
ttt gtt tct caa ttg tct cca atc aaa tgt 1374
Phe Val Ser Gln Leu Ser Pro Ile Lys Cys
450 455
<210> 40
<211> 458
<212> PRT
<213> Ixeris dentata var. albiflora
<400> 40
Met Ala Glu Glu His Asn Lys Thr Asn Asn Ser Ser Pro His Val Val
1 5 10 15
Ile Phe Pro Phe Pro Ser Gln Gly His Ile Asn Pro Leu Ile Gln Phe
20 25 30
Ala Lys Arg Leu Ser Ser Lys Gly Val Lys Pro Thr Leu Ile Thr Thr
35 40 45
Ile Tyr Ile Ala Lys Thr Ser Pro Tyr Pro Asn Ser Ser Ile Val Val
50 55 60
Glu Pro Ile Ser Asp Gly Phe Asp Asp Gly Gly Phe Lys Ser Ala Thr
65 70 75 80
Ser Ala Glu Ser Tyr Ile Asp Thr Phe His Gln Val Gly Ser Lys Ser
85 90 95
Leu Ala Asn Leu Ile Arg Lys Leu Val Asn Glu Gly Asn His Val Asp
100 105 110
Ala Ile Ile Tyr Asp Ser Phe Val Thr Trp Ala Leu Asp Val Ala Met
115 120 125
Glu Tyr Gly Ile Asp Gly Gly Cys Phe Phe Thr Gln Ala Cys Ala Val
130 135 140
Asn Asn Ile Tyr Tyr His Val Tyr Lys Gly Val Leu Glu Ile Pro Leu
145 150 155 160
Gln Ala Ala Ala Pro Pro Thr Val Thr Ile Leu Leu Pro Glu Leu Pro
165 170 175
Gln Leu Gln Leu Trp Glu Thr Pro Ser Phe Val His Asn Pro Gly Pro
180 185 190
Tyr Pro Gly Trp Ala His Ile Val Phe Asn Gln Phe Pro Asn Ile His
195 200 205
Asn Ala Arg Trp Val Phe Ser Asn Thr Phe Phe Lys Leu Glu Glu Gln
210 215 220
Val Ile Lys Trp Met Arg Leu Met Trp Pro Leu Met Val Val Gly Pro
225 230 235 240
Thr Val Pro Ser Met Tyr Leu Asp Lys Arg Leu Glu Asp Asp Asp Asp
245 250 255
Tyr Gly Met Ser Leu Leu Lys Pro Asn His Ile Glu Cys Met Gly Trp
260 265 270
Leu Asn Asn Lys Pro Lys Gly Ser Val Val Tyr Val Ser Phe Gly Ser
275 280 285
Tyr Gly Glu Leu Gly Val Ala Gln Met Glu Glu Ile Ala Trp Gly Leu
290 295 300
Asn Glu Ser Ser Val Asn Tyr Leu Trp Val Val Arg Glu Thr Glu Lys
305 310 315 320
Glu Lys Leu Pro Lys Ser Phe Leu Ala Asn Gly Leu Ile Val Glu Trp
325 330 335
Cys Arg Gln Leu Glu Val Leu Ala His Glu Ala Val Gly Cys Phe Val
340 345 350
Thr His Cys Gly Phe Asn Ser Ser Leu Glu Thr Ile Ser Leu Gly Val
355 360 365
Pro Val Val Ala Ile Pro Gln Trp Thr Asp Gln Thr Thr Asn Ala Lys
370 375 380
Cys Leu Glu Asp Ile Trp Gly Val Gly Ile Arg Ala Lys Thr Pro Val
385 390 395 400
Thr Arg Thr Asn Leu Val Trp Cys Ile Lys Glu Ile Met Glu Gly Glu
405 410 415
Arg Gly Ala Val Ala Arg Lys Asn Ala Ile Lys Trp Lys Asp Leu Ala
420 425 430
Ile Glu Ala Val Ser Pro Gly Gly Ser Ser Asp Lys Asp Ile Asn Glu
435 440 445
Phe Val Ser Gln Leu Ser Pro Ile Lys Cys
450 455
<210> 41
<211> 1362
<212> DNA
<213> Populus trichocarpa
<220>
<221> CDS
<222> (1)..(1362)
<400> 41
atg gac aac aag aag tct cat gtc att gtc ttg acc tac cca gct caa 48
Met Asp Asn Lys Lys Ser His Val Ile Val Leu Thr Tyr Pro Ala Gln
1 5 10 15
ggt cac atc aac cca ttg ttg caa ttc gcc aag aga tta gct tcc aag 96
Gly His Ile Asn Pro Leu Leu Gln Phe Ala Lys Arg Leu Ala Ser Lys
20 25 30
ggt ttg aag gcc act ttg gct acc acc tac tac act gtt aac tcc att 144
Gly Leu Lys Ala Thr Leu Ala Thr Thr Tyr Tyr Thr Val Asn Ser Ile
35 40 45
gac gct cca act gtc ggt gtt gaa cca atc tct gat ggt ttc gat gaa 192
Asp Ala Pro Thr Val Gly Val Glu Pro Ile Ser Asp Gly Phe Asp Glu
50 55 60
ggt ggt ttc aag caa gct tcc tct cta gat gtc tac ttg gaa tct ttc 240
Gly Gly Phe Lys Gln Ala Ser Ser Leu Asp Val Tyr Leu Glu Ser Phe
65 70 75 80
aag acc gtc ggt tcc aga acc ttg act gaa tta gtc ttt aaa ttc aag 288
Lys Thr Val Gly Ser Arg Thr Leu Thr Glu Leu Val Phe Lys Phe Lys
85 90 95
gct tct ggt tct cct gtt aac tgt gtt gtt tac gac tcc atg ttg cca 336
Ala Ser Gly Ser Pro Val Asn Cys Val Val Tyr Asp Ser Met Leu Pro
100 105 110
tgg gct ttg gac gtt gcc aga gat ttg ggt atc tac gct gct gcc ttc 384
Trp Ala Leu Asp Val Ala Arg Asp Leu Gly Ile Tyr Ala Ala Ala Phe
115 120 125
atg acc acc tct gct tcc gtc tgt tcc atg tac tgg aga att gac tta 432
Met Thr Thr Ser Ala Ser Val Cys Ser Met Tyr Trp Arg Ile Asp Leu
130 135 140
ggt tta ttg tct ttg cca ttg aag caa caa acc gct acc gtt tct ttg 480
Gly Leu Leu Ser Leu Pro Leu Lys Gln Gln Thr Ala Thr Val Ser Leu
145 150 155 160
cca ggt ttg cca cca ttg ggt tgt tgt gat ttg cca tct ttc tta gct 528
Pro Gly Leu Pro Pro Leu Gly Cys Cys Asp Leu Pro Ser Phe Leu Ala
165 170 175
gaa cca act tct caa act gct tac ttg gaa gtt atc atg gaa aag ttc 576
Glu Pro Thr Ser Gln Thr Ala Tyr Leu Glu Val Ile Met Glu Lys Phe
180 185 190
cac tcc ttg aac gaa gat gac tgg gtt ttc tgt aac tct ttc gaa gat 624
His Ser Leu Asn Glu Asp Asp Trp Val Phe Cys Asn Ser Phe Glu Asp
195 200 205
ttg gaa att gaa ttg gtc aag gct atg aga ggt aaa tgg cca ttg gtt 672
Leu Glu Ile Glu Leu Val Lys Ala Met Arg Gly Lys Trp Pro Leu Val
210 215 220
atg gtt ggt cca atg gtt cca tct gcc tac ttg gac caa caa att gat 720
Met Val Gly Pro Met Val Pro Ser Ala Tyr Leu Asp Gln Gln Ile Asp
225 230 235 240
ggt gac cgt gcc tac ggt gct tct ttg tgg aaa cca act tct tct caa 768
Gly Asp Arg Ala Tyr Gly Ala Ser Leu Trp Lys Pro Thr Ser Ser Gln
245 250 255
tgt ttc acc tgg ttg gac act aag cct cca aga tct gtc atc tac gtt 816
Cys Phe Thr Trp Leu Asp Thr Lys Pro Pro Arg Ser Val Ile Tyr Val
260 265 270
tcc ttt ggt tcc atg ggt aac atc tcc gct gaa caa gtt gaa gaa att 864
Ser Phe Gly Ser Met Gly Asn Ile Ser Ala Glu Gln Val Glu Glu Ile
275 280 285
gct tgg ggt ttg aag gct tcc aac aga cca ttc tta tgg gtt atg aag 912
Ala Trp Gly Leu Lys Ala Ser Asn Arg Pro Phe Leu Trp Val Met Lys
290 295 300
gaa tct gaa aag aaa ttg cca act ggt ttc ttg aac tct gtc ggt gaa 960
Glu Ser Glu Lys Lys Leu Pro Thr Gly Phe Leu Asn Ser Val Gly Glu
305 310 315 320
acc ggt atg gtt gtt tcc tgg tgt aac caa ttg gaa gtc ttg gct cac 1008
Thr Gly Met Val Val Ser Trp Cys Asn Gln Leu Glu Val Leu Ala His
325 330 335
caa gct atc ggt tgt ttc gtc acc cac tgt ggt tgg aac tcc act ttg 1056
Gln Ala Ile Gly Cys Phe Val Thr His Cys Gly Trp Asn Ser Thr Leu
340 345 350
gaa ggt ttg ggt tta ggt gtt cca atg gtt tgt gtt act gaa aga tct 1104
Glu Gly Leu Gly Leu Gly Val Pro Met Val Cys Val Thr Glu Arg Ser
355 360 365
gac caa cca atg aac gct aag ttc gtt gaa gat gtc tgg aag gtc ggt 1152
Asp Gln Pro Met Asn Ala Lys Phe Val Glu Asp Val Trp Lys Val Gly
370 375 380
gtc aga gct aag aag gac gaa gtt ggt att gtc act aga gaa gaa ttg 1200
Val Arg Ala Lys Lys Asp Glu Val Gly Ile Val Thr Arg Glu Glu Leu
385 390 395 400
gaa aag tgt atc aga ggt gtc atg gac ggt gaa aac ggt gaa gaa atc 1248
Glu Lys Cys Ile Arg Gly Val Met Asp Gly Glu Asn Gly Glu Glu Ile
405 410 415
aag aga aac gcc aac aaa tgg aga gaa tta gct cgt tct gcc gtt tcc 1296
Lys Arg Asn Ala Asn Lys Trp Arg Glu Leu Ala Arg Ser Ala Val Ser
420 425 430
gtc ggt ggt tct tct gac atg aac atc aat gaa ttt gtt gtc aaa ttg 1344
Val Gly Gly Ser Ser Asp Met Asn Ile Asn Glu Phe Val Val Lys Leu
435 440 445
ttg gaa ggt aag aag ggg 1362
Leu Glu Gly Lys Lys Gly
450
<210> 42
<211> 454
<212> PRT
<213> Populus trichocarpa
<400> 42
Met Asp Asn Lys Lys Ser His Val Ile Val Leu Thr Tyr Pro Ala Gln
1 5 10 15
Gly His Ile Asn Pro Leu Leu Gln Phe Ala Lys Arg Leu Ala Ser Lys
20 25 30
Gly Leu Lys Ala Thr Leu Ala Thr Thr Tyr Tyr Thr Val Asn Ser Ile
35 40 45
Asp Ala Pro Thr Val Gly Val Glu Pro Ile Ser Asp Gly Phe Asp Glu
50 55 60
Gly Gly Phe Lys Gln Ala Ser Ser Leu Asp Val Tyr Leu Glu Ser Phe
65 70 75 80
Lys Thr Val Gly Ser Arg Thr Leu Thr Glu Leu Val Phe Lys Phe Lys
85 90 95
Ala Ser Gly Ser Pro Val Asn Cys Val Val Tyr Asp Ser Met Leu Pro
100 105 110
Trp Ala Leu Asp Val Ala Arg Asp Leu Gly Ile Tyr Ala Ala Ala Phe
115 120 125
Met Thr Thr Ser Ala Ser Val Cys Ser Met Tyr Trp Arg Ile Asp Leu
130 135 140
Gly Leu Leu Ser Leu Pro Leu Lys Gln Gln Thr Ala Thr Val Ser Leu
145 150 155 160
Pro Gly Leu Pro Pro Leu Gly Cys Cys Asp Leu Pro Ser Phe Leu Ala
165 170 175
Glu Pro Thr Ser Gln Thr Ala Tyr Leu Glu Val Ile Met Glu Lys Phe
180 185 190
His Ser Leu Asn Glu Asp Asp Trp Val Phe Cys Asn Ser Phe Glu Asp
195 200 205
Leu Glu Ile Glu Leu Val Lys Ala Met Arg Gly Lys Trp Pro Leu Val
210 215 220
Met Val Gly Pro Met Val Pro Ser Ala Tyr Leu Asp Gln Gln Ile Asp
225 230 235 240
Gly Asp Arg Ala Tyr Gly Ala Ser Leu Trp Lys Pro Thr Ser Ser Gln
245 250 255
Cys Phe Thr Trp Leu Asp Thr Lys Pro Pro Arg Ser Val Ile Tyr Val
260 265 270
Ser Phe Gly Ser Met Gly Asn Ile Ser Ala Glu Gln Val Glu Glu Ile
275 280 285
Ala Trp Gly Leu Lys Ala Ser Asn Arg Pro Phe Leu Trp Val Met Lys
290 295 300
Glu Ser Glu Lys Lys Leu Pro Thr Gly Phe Leu Asn Ser Val Gly Glu
305 310 315 320
Thr Gly Met Val Val Ser Trp Cys Asn Gln Leu Glu Val Leu Ala His
325 330 335
Gln Ala Ile Gly Cys Phe Val Thr His Cys Gly Trp Asn Ser Thr Leu
340 345 350
Glu Gly Leu Gly Leu Gly Val Pro Met Val Cys Val Thr Glu Arg Ser
355 360 365
Asp Gln Pro Met Asn Ala Lys Phe Val Glu Asp Val Trp Lys Val Gly
370 375 380
Val Arg Ala Lys Lys Asp Glu Val Gly Ile Val Thr Arg Glu Glu Leu
385 390 395 400
Glu Lys Cys Ile Arg Gly Val Met Asp Gly Glu Asn Gly Glu Glu Ile
405 410 415
Lys Arg Asn Ala Asn Lys Trp Arg Glu Leu Ala Arg Ser Ala Val Ser
420 425 430
Val Gly Gly Ser Ser Asp Met Asn Ile Asn Glu Phe Val Val Lys Leu
435 440 445
Leu Glu Gly Lys Lys Gly
450
<210> 43
<211> 1377
<212> DNA
<213> Nicotiana tabacum
<220>
<221> CDS
<222> (1)..(1377)
<400> 43
atg acc act caa aag gct cac tgt ttg atc tta cca tac cca gct caa 48
Met Thr Thr Gln Lys Ala His Cys Leu Ile Leu Pro Tyr Pro Ala Gln
1 5 10 15
ggt cac atc aac cca atg ttg caa ttt tcc aag aga tta caa tct aag 96
Gly His Ile Asn Pro Met Leu Gln Phe Ser Lys Arg Leu Gln Ser Lys
20 25 30
ggt gtc aag atc acc att gct gct acc aag tct ttc tta aag acc atg 144
Gly Val Lys Ile Thr Ile Ala Ala Thr Lys Ser Phe Leu Lys Thr Met
35 40 45
caa gaa ttg tcc act tct gtc tct gtt gaa gct atc tct gat ggt tac 192
Gln Glu Leu Ser Thr Ser Val Ser Val Glu Ala Ile Ser Asp Gly Tyr
50 55 60
gat gac ggt ggt aga gaa caa gct ggt act ttc gtt gct tac atc acc 240
Asp Asp Gly Gly Arg Glu Gln Ala Gly Thr Phe Val Ala Tyr Ile Thr
65 70 75 80
aga ttc aag gaa gtt ggt tct gac act cta tcc caa tta atc ggt aag 288
Arg Phe Lys Glu Val Gly Ser Asp Thr Leu Ser Gln Leu Ile Gly Lys
85 90 95
ttg acc aac tgt ggt tgt cca gtt tct tgt atc gtc tac gat cct ttc 336
Leu Thr Asn Cys Gly Cys Pro Val Ser Cys Ile Val Tyr Asp Pro Phe
100 105 110
ttg cca tgg gct gtc gaa gtc ggt aac aac ttt ggt gtt gcc act gct 384
Leu Pro Trp Ala Val Glu Val Gly Asn Asn Phe Gly Val Ala Thr Ala
115 120 125
gct ttc ttc act caa tct tgt gct gtt gac aac atc tac tac cat gtc 432
Ala Phe Phe Thr Gln Ser Cys Ala Val Asp Asn Ile Tyr Tyr His Val
130 135 140
cac aaa ggt gtt ttg aaa ttg cca cca act gat gtt gac aag gaa att 480
His Lys Gly Val Leu Lys Leu Pro Pro Thr Asp Val Asp Lys Glu Ile
145 150 155 160
tcc att cca ggt ttg ttg acc att gaa gct tct gat gtt cca tct ttc 528
Ser Ile Pro Gly Leu Leu Thr Ile Glu Ala Ser Asp Val Pro Ser Phe
165 170 175
gtt tcc aac cca gaa tct tcc aga atc ttg gaa atg ttg gtc aac caa 576
Val Ser Asn Pro Glu Ser Ser Arg Ile Leu Glu Met Leu Val Asn Gln
180 185 190
ttc tcc aat ttg gaa aac act gac tgg gtt ttg atc aac tct ttc tac 624
Phe Ser Asn Leu Glu Asn Thr Asp Trp Val Leu Ile Asn Ser Phe Tyr
195 200 205
gaa ttg gaa aag gaa gtc att gac tgg atg gcc aag atc tac cca atc 672
Glu Leu Glu Lys Glu Val Ile Asp Trp Met Ala Lys Ile Tyr Pro Ile
210 215 220
aag acc att ggt cca acc att cca tcc atg tac ttg gac aaa aga tta 720
Lys Thr Ile Gly Pro Thr Ile Pro Ser Met Tyr Leu Asp Lys Arg Leu
225 230 235 240
cca gat gac aag gaa tac ggt ttg tcc gtc ttt aaa cca atg acc aat 768
Pro Asp Asp Lys Glu Tyr Gly Leu Ser Val Phe Lys Pro Met Thr Asn
245 250 255
gct tgt ttg aac tgg ttg aac cac caa cca gtt tct tcc gtt gtc tac 816
Ala Cys Leu Asn Trp Leu Asn His Gln Pro Val Ser Ser Val Val Tyr
260 265 270
gtt tct ttc ggt tct ttg gct aag ttg gaa gct gaa caa atg gaa gaa 864
Val Ser Phe Gly Ser Leu Ala Lys Leu Glu Ala Glu Gln Met Glu Glu
275 280 285
ttg gct tgg ggt ttg tcc aac tcc aac aag aac ttc tta tgg gtt gtc 912
Leu Ala Trp Gly Leu Ser Asn Ser Asn Lys Asn Phe Leu Trp Val Val
290 295 300
aga tct act gaa gaa tcc aaa ttg cca aac aac ttc ttg gaa gaa ttg 960
Arg Ser Thr Glu Glu Ser Lys Leu Pro Asn Asn Phe Leu Glu Glu Leu
305 310 315 320
gct tct gaa aag ggt ttg gtt gtt tcc tgg tgt cca caa tta caa gtt 1008
Ala Ser Glu Lys Gly Leu Val Val Ser Trp Cys Pro Gln Leu Gln Val
325 330 335
ttg gaa cac aag tcc atc ggt tgt ttc ttg act cac tgt ggt tgg aac 1056
Leu Glu His Lys Ser Ile Gly Cys Phe Leu Thr His Cys Gly Trp Asn
340 345 350
tct act ttg gaa gcc atc tct ttg ggt gtt cca atg att gcc atg cct 1104
Ser Thr Leu Glu Ala Ile Ser Leu Gly Val Pro Met Ile Ala Met Pro
355 360 365
cac tgg tct gac caa cca acc aac gcc aag ttg gtc gaa gat gtc tgg 1152
His Trp Ser Asp Gln Pro Thr Asn Ala Lys Leu Val Glu Asp Val Trp
370 375 380
gaa atg ggt atc aga cca aag caa gat gaa aag ggt tta gtc cgt cgt 1200
Glu Met Gly Ile Arg Pro Lys Gln Asp Glu Lys Gly Leu Val Arg Arg
385 390 395 400
gaa gtc att gaa gaa tgt atc aag att gtc atg gaa gaa aag aag ggt 1248
Glu Val Ile Glu Glu Cys Ile Lys Ile Val Met Glu Glu Lys Lys Gly
405 410 415
aag aag atc aga gaa aac gcc aag aaa tgg aag gaa ttg gcc aga aag 1296
Lys Lys Ile Arg Glu Asn Ala Lys Lys Trp Lys Glu Leu Ala Arg Lys
420 425 430
gct gtt gac gaa ggt ggt tct tct gac aga aac att gaa gaa ttc gtt 1344
Ala Val Asp Glu Gly Gly Ser Ser Asp Arg Asn Ile Glu Glu Phe Val
435 440 445
tcc aaa ttg gtc acc att gct tct gtc gag agt 1377
Ser Lys Leu Val Thr Ile Ala Ser Val Glu Ser
450 455
<210> 44
<211> 459
<212> PRT
<213> Nicotiana tabacum
<400> 44
Met Thr Thr Gln Lys Ala His Cys Leu Ile Leu Pro Tyr Pro Ala Gln
1 5 10 15
Gly His Ile Asn Pro Met Leu Gln Phe Ser Lys Arg Leu Gln Ser Lys
20 25 30
Gly Val Lys Ile Thr Ile Ala Ala Thr Lys Ser Phe Leu Lys Thr Met
35 40 45
Gln Glu Leu Ser Thr Ser Val Ser Val Glu Ala Ile Ser Asp Gly Tyr
50 55 60
Asp Asp Gly Gly Arg Glu Gln Ala Gly Thr Phe Val Ala Tyr Ile Thr
65 70 75 80
Arg Phe Lys Glu Val Gly Ser Asp Thr Leu Ser Gln Leu Ile Gly Lys
85 90 95
Leu Thr Asn Cys Gly Cys Pro Val Ser Cys Ile Val Tyr Asp Pro Phe
100 105 110
Leu Pro Trp Ala Val Glu Val Gly Asn Asn Phe Gly Val Ala Thr Ala
115 120 125
Ala Phe Phe Thr Gln Ser Cys Ala Val Asp Asn Ile Tyr Tyr His Val
130 135 140
His Lys Gly Val Leu Lys Leu Pro Pro Thr Asp Val Asp Lys Glu Ile
145 150 155 160
Ser Ile Pro Gly Leu Leu Thr Ile Glu Ala Ser Asp Val Pro Ser Phe
165 170 175
Val Ser Asn Pro Glu Ser Ser Arg Ile Leu Glu Met Leu Val Asn Gln
180 185 190
Phe Ser Asn Leu Glu Asn Thr Asp Trp Val Leu Ile Asn Ser Phe Tyr
195 200 205
Glu Leu Glu Lys Glu Val Ile Asp Trp Met Ala Lys Ile Tyr Pro Ile
210 215 220
Lys Thr Ile Gly Pro Thr Ile Pro Ser Met Tyr Leu Asp Lys Arg Leu
225 230 235 240
Pro Asp Asp Lys Glu Tyr Gly Leu Ser Val Phe Lys Pro Met Thr Asn
245 250 255
Ala Cys Leu Asn Trp Leu Asn His Gln Pro Val Ser Ser Val Val Tyr
260 265 270
Val Ser Phe Gly Ser Leu Ala Lys Leu Glu Ala Glu Gln Met Glu Glu
275 280 285
Leu Ala Trp Gly Leu Ser Asn Ser Asn Lys Asn Phe Leu Trp Val Val
290 295 300
Arg Ser Thr Glu Glu Ser Lys Leu Pro Asn Asn Phe Leu Glu Glu Leu
305 310 315 320
Ala Ser Glu Lys Gly Leu Val Val Ser Trp Cys Pro Gln Leu Gln Val
325 330 335
Leu Glu His Lys Ser Ile Gly Cys Phe Leu Thr His Cys Gly Trp Asn
340 345 350
Ser Thr Leu Glu Ala Ile Ser Leu Gly Val Pro Met Ile Ala Met Pro
355 360 365
His Trp Ser Asp Gln Pro Thr Asn Ala Lys Leu Val Glu Asp Val Trp
370 375 380
Glu Met Gly Ile Arg Pro Lys Gln Asp Glu Lys Gly Leu Val Arg Arg
385 390 395 400
Glu Val Ile Glu Glu Cys Ile Lys Ile Val Met Glu Glu Lys Lys Gly
405 410 415
Lys Lys Ile Arg Glu Asn Ala Lys Lys Trp Lys Glu Leu Ala Arg Lys
420 425 430
Ala Val Asp Glu Gly Gly Ser Ser Asp Arg Asn Ile Glu Glu Phe Val
435 440 445
Ser Lys Leu Val Thr Ile Ala Ser Val Glu Ser
450 455
<210> 45
<211> 1434
<212> DNA
<213> Vaccaria hispanica
<220>
<221> CDS
<222> (1)..(1434)
<400> 45
atg tcc aac aat gaa aac aat gcc act caa gtt atc gtt ttg cca tac 48
Met Ser Asn Asn Glu Asn Asn Ala Thr Gln Val Ile Val Leu Pro Tyr
1 5 10 15
cac ggt caa ggt cac atg aac acc atg gtt caa ttt gcc aag aga tta 96
His Gly Gln Gly His Met Asn Thr Met Val Gln Phe Ala Lys Arg Leu
20 25 30
gct tgg aag ggt gtc cac gtc acc att gct acc act ttc aac act atc 144
Ala Trp Lys Gly Val His Val Thr Ile Ala Thr Thr Phe Asn Thr Ile
35 40 45
caa caa atg aaa ttg aac att tct tct tac aac tcc atc act ttg gaa 192
Gln Gln Met Lys Leu Asn Ile Ser Ser Tyr Asn Ser Ile Thr Leu Glu
50 55 60
cca atc tac gat gac act gat gac tcc act ttg cac atc aag gac aga 240
Pro Ile Tyr Asp Asp Thr Asp Asp Ser Thr Leu His Ile Lys Asp Arg
65 70 75 80
atg gcc aga ttt gaa gct gaa gcc gct tcc aat ttg acc aga gtt ttg 288
Met Ala Arg Phe Glu Ala Glu Ala Ala Ser Asn Leu Thr Arg Val Leu
85 90 95
gaa gcc aag aaa caa caa caa gct tta aac aag aag tgt ttg ttg gtt 336
Glu Ala Lys Lys Gln Gln Gln Ala Leu Asn Lys Lys Cys Leu Leu Val
100 105 110
tac cac ggt tct ttg aac tgg gct tta gtc gtt gct cac caa caa aac 384
Tyr His Gly Ser Leu Asn Trp Ala Leu Val Val Ala His Gln Gln Asn
115 120 125
gtt gct ggt gct gct ttc ttc acc gct gct tct gct tct ttc gct tgt 432
Val Ala Gly Ala Ala Phe Phe Thr Ala Ala Ser Ala Ser Phe Ala Cys
130 135 140
tac tac tac ttg cat ttg gaa tct caa ggt aag ggt gtc gac ttg gaa 480
Tyr Tyr Tyr Leu His Leu Glu Ser Gln Gly Lys Gly Val Asp Leu Glu
145 150 155 160
gaa tta cca tcc atc tta cca cca cca aag gtc att gtt caa aag ttg 528
Glu Leu Pro Ser Ile Leu Pro Pro Pro Lys Val Ile Val Gln Lys Leu
165 170 175
cca aag tct ttc tta gct tac ggt gac aac aac tct cac aac aac aac 576
Pro Lys Ser Phe Leu Ala Tyr Gly Asp Asn Asn Ser His Asn Asn Asn
180 185 190
aac aac aac aac aac aac aac aac aac aac aac atg ggt ttg cac cca 624
Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Met Gly Leu His Pro
195 200 205
ttg gtc ttg tgg ttg ttg aaa gat tac ggt aac tcc gtc aag gct gac 672
Leu Val Leu Trp Leu Leu Lys Asp Tyr Gly Asn Ser Val Lys Ala Asp
210 215 220
ttc gtt ttg ttg aac tct ttc gac aaa ttg gaa gaa gaa gct atc aaa 720
Phe Val Leu Leu Asn Ser Phe Asp Lys Leu Glu Glu Glu Ala Ile Lys
225 230 235 240
tgg atc tct aac atc tgt tcc gtc aag act atc ggt cca acc att cca 768
Trp Ile Ser Asn Ile Cys Ser Val Lys Thr Ile Gly Pro Thr Ile Pro
245 250 255
tct acc tac ttg gac aag caa att gaa aac gat gtt gac tac ggt ttc 816
Ser Thr Tyr Leu Asp Lys Gln Ile Glu Asn Asp Val Asp Tyr Gly Phe
260 265 270
aac caa tac aag cca acc aat gaa gat tgt atg aaa tgg ttg gac acc 864
Asn Gln Tyr Lys Pro Thr Asn Glu Asp Cys Met Lys Trp Leu Asp Thr
275 280 285
aag gaa gcc aac tcc gtt gtc tac atc gcc ttc ggt tct gtt gct cgt 912
Lys Glu Ala Asn Ser Val Val Tyr Ile Ala Phe Gly Ser Val Ala Arg
290 295 300
ttg tct gtc gaa caa atg gct gaa att gct aag gct ttg gac cat tct 960
Leu Ser Val Glu Gln Met Ala Glu Ile Ala Lys Ala Leu Asp His Ser
305 310 315 320
tcc aag tct ttc atc tgg gtt gtc aga gaa act gaa aag gaa aag ttg 1008
Ser Lys Ser Phe Ile Trp Val Val Arg Glu Thr Glu Lys Glu Lys Leu
325 330 335
cca gtc gat ttg gtt gaa aag atc tct ggt caa ggt atg gtt gtc cca 1056
Pro Val Asp Leu Val Glu Lys Ile Ser Gly Gln Gly Met Val Val Pro
340 345 350
tgg gct cct caa ttg gaa gtt ttg gct cac gat gct gtc ggt tgt ttc 1104
Trp Ala Pro Gln Leu Glu Val Leu Ala His Asp Ala Val Gly Cys Phe
355 360 365
gtt tct cac tgt ggt tgg aac tcc acc att gaa gct ttg tct ttc ggt 1152
Val Ser His Cys Gly Trp Asn Ser Thr Ile Glu Ala Leu Ser Phe Gly
370 375 380
gtt cca att ttg gcc atg cct caa ttc tta gat caa ttg gtc gat gct 1200
Val Pro Ile Leu Ala Met Pro Gln Phe Leu Asp Gln Leu Val Asp Ala
385 390 395 400
cat ttc gtt gac aga gtc tgg ggt gtc ggt att gct cca act gtc gat 1248
His Phe Val Asp Arg Val Trp Gly Val Gly Ile Ala Pro Thr Val Asp
405 410 415
gaa aac gat ttg gtt act caa gaa gaa atc tcc aga tgt cta gac gaa 1296
Glu Asn Asp Leu Val Thr Gln Glu Glu Ile Ser Arg Cys Leu Asp Glu
420 425 430
atg atg ggt ggt ggt cca gaa ggt gaa aag atc aag aag aac gtt gcc 1344
Met Met Gly Gly Gly Pro Glu Gly Glu Lys Ile Lys Lys Asn Val Ala
435 440 445
atg tgg aag gaa ttg acc aag gaa gct ttg gac aag ggt ggt tcc tct 1392
Met Trp Lys Glu Leu Thr Lys Glu Ala Leu Asp Lys Gly Gly Ser Ser
450 455 460
gac aag cac att gac gaa atc att gaa tgg tta tct tcc tcc 1434
Asp Lys His Ile Asp Glu Ile Ile Glu Trp Leu Ser Ser Ser
465 470 475
<210> 46
<211> 478
<212> PRT
<213> Vaccaria hispanica
<400> 46
Met Ser Asn Asn Glu Asn Asn Ala Thr Gln Val Ile Val Leu Pro Tyr
1 5 10 15
His Gly Gln Gly His Met Asn Thr Met Val Gln Phe Ala Lys Arg Leu
20 25 30
Ala Trp Lys Gly Val His Val Thr Ile Ala Thr Thr Phe Asn Thr Ile
35 40 45
Gln Gln Met Lys Leu Asn Ile Ser Ser Tyr Asn Ser Ile Thr Leu Glu
50 55 60
Pro Ile Tyr Asp Asp Thr Asp Asp Ser Thr Leu His Ile Lys Asp Arg
65 70 75 80
Met Ala Arg Phe Glu Ala Glu Ala Ala Ser Asn Leu Thr Arg Val Leu
85 90 95
Glu Ala Lys Lys Gln Gln Gln Ala Leu Asn Lys Lys Cys Leu Leu Val
100 105 110
Tyr His Gly Ser Leu Asn Trp Ala Leu Val Val Ala His Gln Gln Asn
115 120 125
Val Ala Gly Ala Ala Phe Phe Thr Ala Ala Ser Ala Ser Phe Ala Cys
130 135 140
Tyr Tyr Tyr Leu His Leu Glu Ser Gln Gly Lys Gly Val Asp Leu Glu
145 150 155 160
Glu Leu Pro Ser Ile Leu Pro Pro Pro Lys Val Ile Val Gln Lys Leu
165 170 175
Pro Lys Ser Phe Leu Ala Tyr Gly Asp Asn Asn Ser His Asn Asn Asn
180 185 190
Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Met Gly Leu His Pro
195 200 205
Leu Val Leu Trp Leu Leu Lys Asp Tyr Gly Asn Ser Val Lys Ala Asp
210 215 220
Phe Val Leu Leu Asn Ser Phe Asp Lys Leu Glu Glu Glu Ala Ile Lys
225 230 235 240
Trp Ile Ser Asn Ile Cys Ser Val Lys Thr Ile Gly Pro Thr Ile Pro
245 250 255
Ser Thr Tyr Leu Asp Lys Gln Ile Glu Asn Asp Val Asp Tyr Gly Phe
260 265 270
Asn Gln Tyr Lys Pro Thr Asn Glu Asp Cys Met Lys Trp Leu Asp Thr
275 280 285
Lys Glu Ala Asn Ser Val Val Tyr Ile Ala Phe Gly Ser Val Ala Arg
290 295 300
Leu Ser Val Glu Gln Met Ala Glu Ile Ala Lys Ala Leu Asp His Ser
305 310 315 320
Ser Lys Ser Phe Ile Trp Val Val Arg Glu Thr Glu Lys Glu Lys Leu
325 330 335
Pro Val Asp Leu Val Glu Lys Ile Ser Gly Gln Gly Met Val Val Pro
340 345 350
Trp Ala Pro Gln Leu Glu Val Leu Ala His Asp Ala Val Gly Cys Phe
355 360 365
Val Ser His Cys Gly Trp Asn Ser Thr Ile Glu Ala Leu Ser Phe Gly
370 375 380
Val Pro Ile Leu Ala Met Pro Gln Phe Leu Asp Gln Leu Val Asp Ala
385 390 395 400
His Phe Val Asp Arg Val Trp Gly Val Gly Ile Ala Pro Thr Val Asp
405 410 415
Glu Asn Asp Leu Val Thr Gln Glu Glu Ile Ser Arg Cys Leu Asp Glu
420 425 430
Met Met Gly Gly Gly Pro Glu Gly Glu Lys Ile Lys Lys Asn Val Ala
435 440 445
Met Trp Lys Glu Leu Thr Lys Glu Ala Leu Asp Lys Gly Gly Ser Ser
450 455 460
Asp Lys His Ile Asp Glu Ile Ile Glu Trp Leu Ser Ser Ser
465 470 475
<210> 47
<211> 1443
<212> DNA
<213> Streptococcus mutans
<220>
<221> CDS
<222> (1)..(1443)
<400> 47
atg cca atc atc aac aag acc atg ttg atc act tac gct gac tct cta 48
Met Pro Ile Ile Asn Lys Thr Met Leu Ile Thr Tyr Ala Asp Ser Leu
1 5 10 15
ggt aag aac ttg aaa gaa ttg aat gaa aac att gaa aac tac ttc ggt 96
Gly Lys Asn Leu Lys Glu Leu Asn Glu Asn Ile Glu Asn Tyr Phe Gly
20 25 30
gat gct gtc ggt ggt gtt cac ttg ttg cct ttc ttc cca tcc acc ggt 144
Asp Ala Val Gly Gly Val His Leu Leu Pro Phe Phe Pro Ser Thr Gly
35 40 45
gac aga ggt ttc gct cca att gac tac cat gaa gtc gac tcc gcc ttc 192
Asp Arg Gly Phe Ala Pro Ile Asp Tyr His Glu Val Asp Ser Ala Phe
50 55 60
ggt gac tgg gat gac gtt aag tgt ttg ggt gaa aag tac tac ttg atg 240
Gly Asp Trp Asp Asp Val Lys Cys Leu Gly Glu Lys Tyr Tyr Leu Met
65 70 75 80
ttt gac ttc atg atc aac cac atc tcc aga caa tct aag tac tac aag 288
Phe Asp Phe Met Ile Asn His Ile Ser Arg Gln Ser Lys Tyr Tyr Lys
85 90 95
gac tac caa gaa aag cac gaa gct tct gct tac aag gat ttg ttc ttg 336
Asp Tyr Gln Glu Lys His Glu Ala Ser Ala Tyr Lys Asp Leu Phe Leu
100 105 110
aac tgg gac aaa ttc tgg cca aag aac aga cca act caa gaa gat gtc 384
Asn Trp Asp Lys Phe Trp Pro Lys Asn Arg Pro Thr Gln Glu Asp Val
115 120 125
gac ttg atc tac aag aga aag gac cgt gcc cca aag caa gaa atc caa 432
Asp Leu Ile Tyr Lys Arg Lys Asp Arg Ala Pro Lys Gln Glu Ile Gln
130 135 140
ttt gcc gat ggt tct gtt gaa cac tta tgg aac act ttc ggt gaa gaa 480
Phe Ala Asp Gly Ser Val Glu His Leu Trp Asn Thr Phe Gly Glu Glu
145 150 155 160
caa atc gat ttg gat gtc acc aag gaa gtt acc atg gac ttc atc aga 528
Gln Ile Asp Leu Asp Val Thr Lys Glu Val Thr Met Asp Phe Ile Arg
165 170 175
tcc acc att gaa aac ttg gct gct aac ggt tgt gac ttg atc aga ttg 576
Ser Thr Ile Glu Asn Leu Ala Ala Asn Gly Cys Asp Leu Ile Arg Leu
180 185 190
gat gct ttc gct tac gct gtt aag aag cta gac acc aac gat ttc ttc 624
Asp Ala Phe Ala Tyr Ala Val Lys Lys Leu Asp Thr Asn Asp Phe Phe
195 200 205
gtc gaa cca gaa atc tgg act ttg ttg gac aag gtc cgt gac att gct 672
Val Glu Pro Glu Ile Trp Thr Leu Leu Asp Lys Val Arg Asp Ile Ala
210 215 220
gct gtt tct ggt gct gaa att ttg cca gaa att cac gaa cat tac act 720
Ala Val Ser Gly Ala Glu Ile Leu Pro Glu Ile His Glu His Tyr Thr
225 230 235 240
atc caa ttc aag att gct gac cac gac tac tac gtt tac gac ttt gct 768
Ile Gln Phe Lys Ile Ala Asp His Asp Tyr Tyr Val Tyr Asp Phe Ala
245 250 255
ttg cca atg gtt act tta tac tct ttg tac tcc tcc aag gtc gac aga 816
Leu Pro Met Val Thr Leu Tyr Ser Leu Tyr Ser Ser Lys Val Asp Arg
260 265 270
tta gct aaa tgg ttg aaa atg tct cca atg aag caa ttc acc act ttg 864
Leu Ala Lys Trp Leu Lys Met Ser Pro Met Lys Gln Phe Thr Thr Leu
275 280 285
gac acc cac gac ggt att ggt gtt gtc gat gtc aag gat atc ttg acc 912
Asp Thr His Asp Gly Ile Gly Val Val Asp Val Lys Asp Ile Leu Thr
290 295 300
gat gaa gaa atc act tac act tcc aat gaa tta tac aag gtc ggt gcc 960
Asp Glu Glu Ile Thr Tyr Thr Ser Asn Glu Leu Tyr Lys Val Gly Ala
305 310 315 320
aac gtt aac aga aaa tac tct act gct gaa tac aac aac ttg gat atc 1008
Asn Val Asn Arg Lys Tyr Ser Thr Ala Glu Tyr Asn Asn Leu Asp Ile
325 330 335
tac caa atc aac tct acc tac tac tct gct ttg ggt gac gat gac caa 1056
Tyr Gln Ile Asn Ser Thr Tyr Tyr Ser Ala Leu Gly Asp Asp Asp Gln
340 345 350
aag tat ttc ttg gcc aga tta atc caa gct ttc gct cca ggt atc cca 1104
Lys Tyr Phe Leu Ala Arg Leu Ile Gln Ala Phe Ala Pro Gly Ile Pro
355 360 365
caa gtc tac tac gtt ggt ttc ttg gcc ggt aag aac gat ttg gaa ttg 1152
Gln Val Tyr Tyr Val Gly Phe Leu Ala Gly Lys Asn Asp Leu Glu Leu
370 375 380
ttg gaa tcc acc aag gaa ggt aga aac atc aac aga cac tac tac tct 1200
Leu Glu Ser Thr Lys Glu Gly Arg Asn Ile Asn Arg His Tyr Tyr Ser
385 390 395 400
tct gaa gaa att gct aag gaa gtc aaa aga cca gtt gtt aag gct ttg 1248
Ser Glu Glu Ile Ala Lys Glu Val Lys Arg Pro Val Val Lys Ala Leu
405 410 415
ttg aac tta ttc act tac aga aac caa tct gct gcc ttt gac ttg gat 1296
Leu Asn Leu Phe Thr Tyr Arg Asn Gln Ser Ala Ala Phe Asp Leu Asp
420 425 430
ggt aga att gaa gtt gaa acc cca aac gaa gct acc att gtt att gaa 1344
Gly Arg Ile Glu Val Glu Thr Pro Asn Glu Ala Thr Ile Val Ile Glu
435 440 445
aga caa aac aaa gat ggt tcc cac att gcc aag gct gaa atc aac ttg 1392
Arg Gln Asn Lys Asp Gly Ser His Ile Ala Lys Ala Glu Ile Asn Leu
450 455 460
caa gac atg act tac cgt gtc act gaa aac gac caa acc att tct ttc 1440
Gln Asp Met Thr Tyr Arg Val Thr Glu Asn Asp Gln Thr Ile Ser Phe
465 470 475 480
gaa 1443
Glu
<210> 48
<211> 481
<212> PRT
<213> Streptococcus mutans
<400> 48
Met Pro Ile Ile Asn Lys Thr Met Leu Ile Thr Tyr Ala Asp Ser Leu
1 5 10 15
Gly Lys Asn Leu Lys Glu Leu Asn Glu Asn Ile Glu Asn Tyr Phe Gly
20 25 30
Asp Ala Val Gly Gly Val His Leu Leu Pro Phe Phe Pro Ser Thr Gly
35 40 45
Asp Arg Gly Phe Ala Pro Ile Asp Tyr His Glu Val Asp Ser Ala Phe
50 55 60
Gly Asp Trp Asp Asp Val Lys Cys Leu Gly Glu Lys Tyr Tyr Leu Met
65 70 75 80
Phe Asp Phe Met Ile Asn His Ile Ser Arg Gln Ser Lys Tyr Tyr Lys
85 90 95
Asp Tyr Gln Glu Lys His Glu Ala Ser Ala Tyr Lys Asp Leu Phe Leu
100 105 110
Asn Trp Asp Lys Phe Trp Pro Lys Asn Arg Pro Thr Gln Glu Asp Val
115 120 125
Asp Leu Ile Tyr Lys Arg Lys Asp Arg Ala Pro Lys Gln Glu Ile Gln
130 135 140
Phe Ala Asp Gly Ser Val Glu His Leu Trp Asn Thr Phe Gly Glu Glu
145 150 155 160
Gln Ile Asp Leu Asp Val Thr Lys Glu Val Thr Met Asp Phe Ile Arg
165 170 175
Ser Thr Ile Glu Asn Leu Ala Ala Asn Gly Cys Asp Leu Ile Arg Leu
180 185 190
Asp Ala Phe Ala Tyr Ala Val Lys Lys Leu Asp Thr Asn Asp Phe Phe
195 200 205
Val Glu Pro Glu Ile Trp Thr Leu Leu Asp Lys Val Arg Asp Ile Ala
210 215 220
Ala Val Ser Gly Ala Glu Ile Leu Pro Glu Ile His Glu His Tyr Thr
225 230 235 240
Ile Gln Phe Lys Ile Ala Asp His Asp Tyr Tyr Val Tyr Asp Phe Ala
245 250 255
Leu Pro Met Val Thr Leu Tyr Ser Leu Tyr Ser Ser Lys Val Asp Arg
260 265 270
Leu Ala Lys Trp Leu Lys Met Ser Pro Met Lys Gln Phe Thr Thr Leu
275 280 285
Asp Thr His Asp Gly Ile Gly Val Val Asp Val Lys Asp Ile Leu Thr
290 295 300
Asp Glu Glu Ile Thr Tyr Thr Ser Asn Glu Leu Tyr Lys Val Gly Ala
305 310 315 320
Asn Val Asn Arg Lys Tyr Ser Thr Ala Glu Tyr Asn Asn Leu Asp Ile
325 330 335
Tyr Gln Ile Asn Ser Thr Tyr Tyr Ser Ala Leu Gly Asp Asp Asp Gln
340 345 350
Lys Tyr Phe Leu Ala Arg Leu Ile Gln Ala Phe Ala Pro Gly Ile Pro
355 360 365
Gln Val Tyr Tyr Val Gly Phe Leu Ala Gly Lys Asn Asp Leu Glu Leu
370 375 380
Leu Glu Ser Thr Lys Glu Gly Arg Asn Ile Asn Arg His Tyr Tyr Ser
385 390 395 400
Ser Glu Glu Ile Ala Lys Glu Val Lys Arg Pro Val Val Lys Ala Leu
405 410 415
Leu Asn Leu Phe Thr Tyr Arg Asn Gln Ser Ala Ala Phe Asp Leu Asp
420 425 430
Gly Arg Ile Glu Val Glu Thr Pro Asn Glu Ala Thr Ile Val Ile Glu
435 440 445
Arg Gln Asn Lys Asp Gly Ser His Ile Ala Lys Ala Glu Ile Asn Leu
450 455 460
Gln Asp Met Thr Tyr Arg Val Thr Glu Asn Asp Gln Thr Ile Ser Phe
465 470 475 480
Glu
<210> 49
<211> 1392
<212> DNA
<213> Lobelia erinus
<220>
<221> CDS
<222> (1)..(1392)
<400> 49
atg gac aac aac cat ttg ggt gaa act ttg ttg cca ttg gct cca aag 48
Met Asp Asn Asn His Leu Gly Glu Thr Leu Leu Pro Leu Ala Pro Lys
1 5 10 15
aac ggt aga aga gtc ttg ttc ttc cca tac cca tta caa ggt cac att 96
Asn Gly Arg Arg Val Leu Phe Phe Pro Tyr Pro Leu Gln Gly His Ile
20 25 30
tct cca atg ttg aac ttg gcc aac ttg ttg cac tcc aag ggt ttc acc 144
Ser Pro Met Leu Asn Leu Ala Asn Leu Leu His Ser Lys Gly Phe Thr
35 40 45
atc acc atc atc cac acc aat ttg aac tct cca aac caa tct gac tac 192
Ile Thr Ile Ile His Thr Asn Leu Asn Ser Pro Asn Gln Ser Asp Tyr
50 55 60
cca cac ttc act ttc aga cca ttt gat gac ggt ttc cca cca tac tcc 240
Pro His Phe Thr Phe Arg Pro Phe Asp Asp Gly Phe Pro Pro Tyr Ser
65 70 75 80
aag ggt tgg caa ttg gct acc ttg tgt tcc aga tgt gtt gaa cca ttc 288
Lys Gly Trp Gln Leu Ala Thr Leu Cys Ser Arg Cys Val Glu Pro Phe
85 90 95
aga gaa tgt ttg gct caa atc ttc ttg tct gac cac act gct cca gaa 336
Arg Glu Cys Leu Ala Gln Ile Phe Leu Ser Asp His Thr Ala Pro Glu
100 105 110
ggt gaa aga gaa tct att gct tgt ttg att gct gat ggt tta tgg aac 384
Gly Glu Arg Glu Ser Ile Ala Cys Leu Ile Ala Asp Gly Leu Trp Asn
115 120 125
ttc ttg ggt gct gct gtc tac aac ttt aaa ttg cca atg att gtt ttg 432
Phe Leu Gly Ala Ala Val Tyr Asn Phe Lys Leu Pro Met Ile Val Leu
130 135 140
aga act ggt aac atg tct aac att gtt gcc aac gtt aag ttg cca tgt 480
Arg Thr Gly Asn Met Ser Asn Ile Val Ala Asn Val Lys Leu Pro Cys
145 150 155 160
ttc atc gaa aag ggt tac ttc gac cat acc aag gaa ggt tcc aag ttg 528
Phe Ile Glu Lys Gly Tyr Phe Asp His Thr Lys Glu Gly Ser Lys Leu
165 170 175
gaa gct gct gtc cca gaa ttc cca acc atc aag ttc aaa gat atc ttg 576
Glu Ala Ala Val Pro Glu Phe Pro Thr Ile Lys Phe Lys Asp Ile Leu
180 185 190
aaa acc tac ggt tct aac cca aag gcc atc tgt gaa act ttg act gct 624
Lys Thr Tyr Gly Ser Asn Pro Lys Ala Ile Cys Glu Thr Leu Thr Ala
195 200 205
ttg ttg aag gaa atg aga gct tct tct ggt gtc atc tgg aac tct tgt 672
Leu Leu Lys Glu Met Arg Ala Ser Ser Gly Val Ile Trp Asn Ser Cys
210 215 220
aag gaa tta gaa caa tct gaa tta caa atg atc tgt aag gaa ttc cca 720
Lys Glu Leu Glu Gln Ser Glu Leu Gln Met Ile Cys Lys Glu Phe Pro
225 230 235 240
gtt cct cat ttc ttg att ggt cct ttg cac aaa tac ttc cca gct tct 768
Val Pro His Phe Leu Ile Gly Pro Leu His Lys Tyr Phe Pro Ala Ser
245 250 255
tct tct tcc ttg gtt gcc cac gac cca tct tcc att tcc tgg ttg aac 816
Ser Ser Ser Leu Val Ala His Asp Pro Ser Ser Ile Ser Trp Leu Asn
260 265 270
tcc aag gct cca aac tct gtt ttg tac gtt tct ttc ggt tcc atc tct 864
Ser Lys Ala Pro Asn Ser Val Leu Tyr Val Ser Phe Gly Ser Ile Ser
275 280 285
tcc atg gac gaa gct gaa ttt cta gaa act gct tgg ggt ttg gcc aac 912
Ser Met Asp Glu Ala Glu Phe Leu Glu Thr Ala Trp Gly Leu Ala Asn
290 295 300
tcc atg caa caa ttc tta tgg gtt gtc aga cca ggt tct gtc aga ggt 960
Ser Met Gln Gln Phe Leu Trp Val Val Arg Pro Gly Ser Val Arg Gly
305 310 315 320
tct caa tgg ttg gaa tct tta cca gat ggt ttc att gac aag ttg gat 1008
Ser Gln Trp Leu Glu Ser Leu Pro Asp Gly Phe Ile Asp Lys Leu Asp
325 330 335
ggt aga ggt cac att gtc aaa tgg gct cct caa caa gaa gtc tta gct 1056
Gly Arg Gly His Ile Val Lys Trp Ala Pro Gln Gln Glu Val Leu Ala
340 345 350
cac caa gct acc ggt ggt ttc tgg act cac tgt ggt tgg aac tcc act 1104
His Gln Ala Thr Gly Gly Phe Trp Thr His Cys Gly Trp Asn Ser Thr
355 360 365
tta gaa tcc atg tgt gaa ggt gtc cca atg att tgt tcc cac ggt atc 1152
Leu Glu Ser Met Cys Glu Gly Val Pro Met Ile Cys Ser His Gly Ile
370 375 380
atg gac caa cca atc aat gct cgt tac gtt acc gat gtc tgg aag gtt 1200
Met Asp Gln Pro Ile Asn Ala Arg Tyr Val Thr Asp Val Trp Lys Val
385 390 395 400
ggt att gaa ttg gaa aag ggt ttt gac tct gaa gaa atc aag atg gcc 1248
Gly Ile Glu Leu Glu Lys Gly Phe Asp Ser Glu Glu Ile Lys Met Ala
405 410 415
atc cgt cgt ttg atg gtt gac aag gaa ggt caa gaa atc aga gaa aga 1296
Ile Arg Arg Leu Met Val Asp Lys Glu Gly Gln Glu Ile Arg Glu Arg
420 425 430
tct tcc aga ttg aag gaa tct ttg tcc aac tgt ttg aag caa ggt ggt 1344
Ser Ser Arg Leu Lys Glu Ser Leu Ser Asn Cys Leu Lys Gln Gly Gly
435 440 445
tct tcc cac gat tct gtc gaa tct ttg gtt gac cac atc cta tcc ttc 1392
Ser Ser His Asp Ser Val Glu Ser Leu Val Asp His Ile Leu Ser Phe
450 455 460
<210> 50
<211> 464
<212> PRT
<213> Lobelia erinus
<400> 50
Met Asp Asn Asn His Leu Gly Glu Thr Leu Leu Pro Leu Ala Pro Lys
1 5 10 15
Asn Gly Arg Arg Val Leu Phe Phe Pro Tyr Pro Leu Gln Gly His Ile
20 25 30
Ser Pro Met Leu Asn Leu Ala Asn Leu Leu His Ser Lys Gly Phe Thr
35 40 45
Ile Thr Ile Ile His Thr Asn Leu Asn Ser Pro Asn Gln Ser Asp Tyr
50 55 60
Pro His Phe Thr Phe Arg Pro Phe Asp Asp Gly Phe Pro Pro Tyr Ser
65 70 75 80
Lys Gly Trp Gln Leu Ala Thr Leu Cys Ser Arg Cys Val Glu Pro Phe
85 90 95
Arg Glu Cys Leu Ala Gln Ile Phe Leu Ser Asp His Thr Ala Pro Glu
100 105 110
Gly Glu Arg Glu Ser Ile Ala Cys Leu Ile Ala Asp Gly Leu Trp Asn
115 120 125
Phe Leu Gly Ala Ala Val Tyr Asn Phe Lys Leu Pro Met Ile Val Leu
130 135 140
Arg Thr Gly Asn Met Ser Asn Ile Val Ala Asn Val Lys Leu Pro Cys
145 150 155 160
Phe Ile Glu Lys Gly Tyr Phe Asp His Thr Lys Glu Gly Ser Lys Leu
165 170 175
Glu Ala Ala Val Pro Glu Phe Pro Thr Ile Lys Phe Lys Asp Ile Leu
180 185 190
Lys Thr Tyr Gly Ser Asn Pro Lys Ala Ile Cys Glu Thr Leu Thr Ala
195 200 205
Leu Leu Lys Glu Met Arg Ala Ser Ser Gly Val Ile Trp Asn Ser Cys
210 215 220
Lys Glu Leu Glu Gln Ser Glu Leu Gln Met Ile Cys Lys Glu Phe Pro
225 230 235 240
Val Pro His Phe Leu Ile Gly Pro Leu His Lys Tyr Phe Pro Ala Ser
245 250 255
Ser Ser Ser Leu Val Ala His Asp Pro Ser Ser Ile Ser Trp Leu Asn
260 265 270
Ser Lys Ala Pro Asn Ser Val Leu Tyr Val Ser Phe Gly Ser Ile Ser
275 280 285
Ser Met Asp Glu Ala Glu Phe Leu Glu Thr Ala Trp Gly Leu Ala Asn
290 295 300
Ser Met Gln Gln Phe Leu Trp Val Val Arg Pro Gly Ser Val Arg Gly
305 310 315 320
Ser Gln Trp Leu Glu Ser Leu Pro Asp Gly Phe Ile Asp Lys Leu Asp
325 330 335
Gly Arg Gly His Ile Val Lys Trp Ala Pro Gln Gln Glu Val Leu Ala
340 345 350
His Gln Ala Thr Gly Gly Phe Trp Thr His Cys Gly Trp Asn Ser Thr
355 360 365
Leu Glu Ser Met Cys Glu Gly Val Pro Met Ile Cys Ser His Gly Ile
370 375 380
Met Asp Gln Pro Ile Asn Ala Arg Tyr Val Thr Asp Val Trp Lys Val
385 390 395 400
Gly Ile Glu Leu Glu Lys Gly Phe Asp Ser Glu Glu Ile Lys Met Ala
405 410 415
Ile Arg Arg Leu Met Val Asp Lys Glu Gly Gln Glu Ile Arg Glu Arg
420 425 430
Ser Ser Arg Leu Lys Glu Ser Leu Ser Asn Cys Leu Lys Gln Gly Gly
435 440 445
Ser Ser His Asp Ser Val Glu Ser Leu Val Asp His Ile Leu Ser Phe
450 455 460
<210> 51
<211> 1380
<212> DNA
<213> Arabidopsis thaliana
<220>
<221> CDS
<222> (1)..(1380)
<400> 51
atg gaa gaa aga aag ggt aga aga atc atc atg ttc cca tta cca ttc 48
Met Glu Glu Arg Lys Gly Arg Arg Ile Ile Met Phe Pro Leu Pro Phe
1 5 10 15
cca ggt cac ttc aac cca atg att gaa ttg gct ggt atc ttt cat cac 96
Pro Gly His Phe Asn Pro Met Ile Glu Leu Ala Gly Ile Phe His His
20 25 30
cgt ggt ttc tcc gtt acc atc ttg cac act tct tac aac ttc cca gac 144
Arg Gly Phe Ser Val Thr Ile Leu His Thr Ser Tyr Asn Phe Pro Asp
35 40 45
cca tcc aga cac cca cac ttc act ttc aga acc att tct cac aac aag 192
Pro Ser Arg His Pro His Phe Thr Phe Arg Thr Ile Ser His Asn Lys
50 55 60
gaa ggt gaa gaa gat cca tta tct caa tct gaa acc tct tcc atg gac 240
Glu Gly Glu Glu Asp Pro Leu Ser Gln Ser Glu Thr Ser Ser Met Asp
65 70 75 80
ttg att gtc ttg gtc aga aga tta aag caa cgt tac gct gaa cca ttc 288
Leu Ile Val Leu Val Arg Arg Leu Lys Gln Arg Tyr Ala Glu Pro Phe
85 90 95
aga aag tcc gtt gct gct gaa gtc ggt ggt ggt gaa acc gtt tgt tgt 336
Arg Lys Ser Val Ala Ala Glu Val Gly Gly Gly Glu Thr Val Cys Cys
100 105 110
ttg gtt tct gat gct atc tgg ggt aag aac act gaa gtt gtt gct gaa 384
Leu Val Ser Asp Ala Ile Trp Gly Lys Asn Thr Glu Val Val Ala Glu
115 120 125
gaa atc ggt gtc aga aga gtt gtt ttg aga act ggt ggt gcc tct tct 432
Glu Ile Gly Val Arg Arg Val Val Leu Arg Thr Gly Gly Ala Ser Ser
130 135 140
ttc tgt gcc ttt gct gct ttc cca tta ttg aga gac aag ggt tac ttg 480
Phe Cys Ala Phe Ala Ala Phe Pro Leu Leu Arg Asp Lys Gly Tyr Leu
145 150 155 160
cca atc caa gat tct cgt ttg gat gaa cct gtt act gaa ttg cct cca 528
Pro Ile Gln Asp Ser Arg Leu Asp Glu Pro Val Thr Glu Leu Pro Pro
165 170 175
ttg aag gtc aag gac tta cca gtc atg gaa acc aat gaa cca gaa gaa 576
Leu Lys Val Lys Asp Leu Pro Val Met Glu Thr Asn Glu Pro Glu Glu
180 185 190
ttg tac aga gtt gtt aac gac atg gtt gaa ggt gct aaa tct tct tct 624
Leu Tyr Arg Val Val Asn Asp Met Val Glu Gly Ala Lys Ser Ser Ser
195 200 205
ggt gtc atc tgg aac act ttc gaa gat ttg gaa aga ttg tct ttg atg 672
Gly Val Ile Trp Asn Thr Phe Glu Asp Leu Glu Arg Leu Ser Leu Met
210 215 220
aac tgt tcc tcc aaa ttg caa gtt cca ttc ttc cca atc ggt cca ttc 720
Asn Cys Ser Ser Lys Leu Gln Val Pro Phe Phe Pro Ile Gly Pro Phe
225 230 235 240
cac aag tac tct gaa gat cca act cca aag act gaa aac aag gaa gat 768
His Lys Tyr Ser Glu Asp Pro Thr Pro Lys Thr Glu Asn Lys Glu Asp
245 250 255
acc gac tgg ttg gac aag caa gac cct caa tcc gtt gtc tac gcc tcc 816
Thr Asp Trp Leu Asp Lys Gln Asp Pro Gln Ser Val Val Tyr Ala Ser
260 265 270
ttt ggt tct ttg gcc gct att gaa gaa aag gaa ttc ttg gaa att gct 864
Phe Gly Ser Leu Ala Ala Ile Glu Glu Lys Glu Phe Leu Glu Ile Ala
275 280 285
tgg ggt cta aga aac tct gaa aga cca ttc tta tgg gtt gtt aga cca 912
Trp Gly Leu Arg Asn Ser Glu Arg Pro Phe Leu Trp Val Val Arg Pro
290 295 300
ggt tcc gtc cgt ggt act gaa tgg ttg gaa tct cta cca ttg ggt ttc 960
Gly Ser Val Arg Gly Thr Glu Trp Leu Glu Ser Leu Pro Leu Gly Phe
305 310 315 320
atg gaa aac atc ggt gac aag ggt aag att gtc aaa tgg gct aac caa 1008
Met Glu Asn Ile Gly Asp Lys Gly Lys Ile Val Lys Trp Ala Asn Gln
325 330 335
ttg gaa gtt ttg gct cac cca gcc att ggt gct ttc tgg acc cac tgt 1056
Leu Glu Val Leu Ala His Pro Ala Ile Gly Ala Phe Trp Thr His Cys
340 345 350
ggt tgg aac tcc act ttg gaa tcc atc tgt gaa ggt gtc cca atg atc 1104
Gly Trp Asn Ser Thr Leu Glu Ser Ile Cys Glu Gly Val Pro Met Ile
355 360 365
tgt acc tcc tgt ttt acc gac caa cat gtc aac gcc aga tac att gtc 1152
Cys Thr Ser Cys Phe Thr Asp Gln His Val Asn Ala Arg Tyr Ile Val
370 375 380
gac gtc tgg aga gtc ggt atg ttg ttg gaa aga tcc aag atg gaa aag 1200
Asp Val Trp Arg Val Gly Met Leu Leu Glu Arg Ser Lys Met Glu Lys
385 390 395 400
aag gaa atc gaa aag gtt ttg aga tct gtt atg atg gaa aag ggt gat 1248
Lys Glu Ile Glu Lys Val Leu Arg Ser Val Met Met Glu Lys Gly Asp
405 410 415
ggt ttg aga gaa aga tct ttg aaa ttg aag gaa aga gct gat ttc tgt 1296
Gly Leu Arg Glu Arg Ser Leu Lys Leu Lys Glu Arg Ala Asp Phe Cys
420 425 430
ttg tcc aag gac ggt tct tct tcc aag tac ttg gac aaa ttg gtt tcc 1344
Leu Ser Lys Asp Gly Ser Ser Ser Lys Tyr Leu Asp Lys Leu Val Ser
435 440 445
cac gtc tta tct ttc gac tct tac gct ttc gct tct 1380
His Val Leu Ser Phe Asp Ser Tyr Ala Phe Ala Ser
450 455 460
<210> 52
<211> 460
<212> PRT
<213> Arabidopsis thaliana
<400> 52
Met Glu Glu Arg Lys Gly Arg Arg Ile Ile Met Phe Pro Leu Pro Phe
1 5 10 15
Pro Gly His Phe Asn Pro Met Ile Glu Leu Ala Gly Ile Phe His His
20 25 30
Arg Gly Phe Ser Val Thr Ile Leu His Thr Ser Tyr Asn Phe Pro Asp
35 40 45
Pro Ser Arg His Pro His Phe Thr Phe Arg Thr Ile Ser His Asn Lys
50 55 60
Glu Gly Glu Glu Asp Pro Leu Ser Gln Ser Glu Thr Ser Ser Met Asp
65 70 75 80
Leu Ile Val Leu Val Arg Arg Leu Lys Gln Arg Tyr Ala Glu Pro Phe
85 90 95
Arg Lys Ser Val Ala Ala Glu Val Gly Gly Gly Glu Thr Val Cys Cys
100 105 110
Leu Val Ser Asp Ala Ile Trp Gly Lys Asn Thr Glu Val Val Ala Glu
115 120 125
Glu Ile Gly Val Arg Arg Val Val Leu Arg Thr Gly Gly Ala Ser Ser
130 135 140
Phe Cys Ala Phe Ala Ala Phe Pro Leu Leu Arg Asp Lys Gly Tyr Leu
145 150 155 160
Pro Ile Gln Asp Ser Arg Leu Asp Glu Pro Val Thr Glu Leu Pro Pro
165 170 175
Leu Lys Val Lys Asp Leu Pro Val Met Glu Thr Asn Glu Pro Glu Glu
180 185 190
Leu Tyr Arg Val Val Asn Asp Met Val Glu Gly Ala Lys Ser Ser Ser
195 200 205
Gly Val Ile Trp Asn Thr Phe Glu Asp Leu Glu Arg Leu Ser Leu Met
210 215 220
Asn Cys Ser Ser Lys Leu Gln Val Pro Phe Phe Pro Ile Gly Pro Phe
225 230 235 240
His Lys Tyr Ser Glu Asp Pro Thr Pro Lys Thr Glu Asn Lys Glu Asp
245 250 255
Thr Asp Trp Leu Asp Lys Gln Asp Pro Gln Ser Val Val Tyr Ala Ser
260 265 270
Phe Gly Ser Leu Ala Ala Ile Glu Glu Lys Glu Phe Leu Glu Ile Ala
275 280 285
Trp Gly Leu Arg Asn Ser Glu Arg Pro Phe Leu Trp Val Val Arg Pro
290 295 300
Gly Ser Val Arg Gly Thr Glu Trp Leu Glu Ser Leu Pro Leu Gly Phe
305 310 315 320
Met Glu Asn Ile Gly Asp Lys Gly Lys Ile Val Lys Trp Ala Asn Gln
325 330 335
Leu Glu Val Leu Ala His Pro Ala Ile Gly Ala Phe Trp Thr His Cys
340 345 350
Gly Trp Asn Ser Thr Leu Glu Ser Ile Cys Glu Gly Val Pro Met Ile
355 360 365
Cys Thr Ser Cys Phe Thr Asp Gln His Val Asn Ala Arg Tyr Ile Val
370 375 380
Asp Val Trp Arg Val Gly Met Leu Leu Glu Arg Ser Lys Met Glu Lys
385 390 395 400
Lys Glu Ile Glu Lys Val Leu Arg Ser Val Met Met Glu Lys Gly Asp
405 410 415
Gly Leu Arg Glu Arg Ser Leu Lys Leu Lys Glu Arg Ala Asp Phe Cys
420 425 430
Leu Ser Lys Asp Gly Ser Ser Ser Lys Tyr Leu Asp Lys Leu Val Ser
435 440 445
His Val Leu Ser Phe Asp Ser Tyr Ala Phe Ala Ser
450 455 460
<210> 53
<211> 2139
<212> DNA
<213> Gibberella fujikuroi
<220>
<221> CDS
<222> (1)..(2139)
<400> 53
atg gct gaa ttg gac act ttg gat atc gtc gtt ttg ggt gtt atc ttc 48
Met Ala Glu Leu Asp Thr Leu Asp Ile Val Val Leu Gly Val Ile Phe
1 5 10 15
ttg ggt act gtt gct tac ttc acc aag ggt aag ttg tgg ggt gtc acc 96
Leu Gly Thr Val Ala Tyr Phe Thr Lys Gly Lys Leu Trp Gly Val Thr
20 25 30
aag gat cca tac gct aac ggt ttt gcc gct ggt ggt gct tcc aag cca 144
Lys Asp Pro Tyr Ala Asn Gly Phe Ala Ala Gly Gly Ala Ser Lys Pro
35 40 45
ggt aga acc aga aac att gtt gaa gcc atg gaa gaa tct ggt aag aac 192
Gly Arg Thr Arg Asn Ile Val Glu Ala Met Glu Glu Ser Gly Lys Asn
50 55 60
tgt gtt gtt ttc tac ggt tct caa acc ggt act gct gaa gat tac gct 240
Cys Val Val Phe Tyr Gly Ser Gln Thr Gly Thr Ala Glu Asp Tyr Ala
65 70 75 80
tcc aga tta gct aag gaa ggt aag tcc aga ttc ggt ttg aac acc atg 288
Ser Arg Leu Ala Lys Glu Gly Lys Ser Arg Phe Gly Leu Asn Thr Met
85 90 95
att gct gac tta gaa gat tac gac ttc gac aac ttg gat acc gtt cca 336
Ile Ala Asp Leu Glu Asp Tyr Asp Phe Asp Asn Leu Asp Thr Val Pro
100 105 110
tct gac aac att gtc atg ttc gtt ttg gct acc tac ggt gaa ggt gaa 384
Ser Asp Asn Ile Val Met Phe Val Leu Ala Thr Tyr Gly Glu Gly Glu
115 120 125
cct acc gac aat gct gtt gac ttc tac gaa ttc atc act ggt gaa gat 432
Pro Thr Asp Asn Ala Val Asp Phe Tyr Glu Phe Ile Thr Gly Glu Asp
130 135 140
gct tct ttc aac gaa ggt aac gac cct cca ttg ggt aac ttg aat tat 480
Ala Ser Phe Asn Glu Gly Asn Asp Pro Pro Leu Gly Asn Leu Asn Tyr
145 150 155 160
gtc gct ttc ggt tta ggt aac aac acc tac gaa cac tac aac tcc atg 528
Val Ala Phe Gly Leu Gly Asn Asn Thr Tyr Glu His Tyr Asn Ser Met
165 170 175
gtt aga aat gtc aac aaa gct ttg gaa aag cta ggt gcc cac aga att 576
Val Arg Asn Val Asn Lys Ala Leu Glu Lys Leu Gly Ala His Arg Ile
180 185 190
ggt gaa gct ggt gaa ggt gac gac ggt gct ggt act atg gaa gaa gat 624
Gly Glu Ala Gly Glu Gly Asp Asp Gly Ala Gly Thr Met Glu Glu Asp
195 200 205
ttc ttg gct tgg aag gac cca atg tgg gaa gct ttg gct aag aag atg 672
Phe Leu Ala Trp Lys Asp Pro Met Trp Glu Ala Leu Ala Lys Lys Met
210 215 220
ggt ttg gaa gaa aga gaa gct gtt tac gaa cca atc ttt gct atc aac 720
Gly Leu Glu Glu Arg Glu Ala Val Tyr Glu Pro Ile Phe Ala Ile Asn
225 230 235 240
gaa aga gat gac ttg act cca gaa gct aac gaa gtt tac ttg ggt gaa 768
Glu Arg Asp Asp Leu Thr Pro Glu Ala Asn Glu Val Tyr Leu Gly Glu
245 250 255
cca aat aaa ttg cac ttg gaa ggt act gct aag ggt cct ttc aac tct 816
Pro Asn Lys Leu His Leu Glu Gly Thr Ala Lys Gly Pro Phe Asn Ser
260 265 270
cac aac cca tac att gct cca att gct gaa tct tac gaa ttg ttc tct 864
His Asn Pro Tyr Ile Ala Pro Ile Ala Glu Ser Tyr Glu Leu Phe Ser
275 280 285
gcc aag gac aga aac tgt ttg cac atg gaa att gat atc tct ggt tcc 912
Ala Lys Asp Arg Asn Cys Leu His Met Glu Ile Asp Ile Ser Gly Ser
290 295 300
aac ttg aaa tac gaa act ggt gac cac att gcc atc tgg cca acc aac 960
Asn Leu Lys Tyr Glu Thr Gly Asp His Ile Ala Ile Trp Pro Thr Asn
305 310 315 320
cca ggt gaa gaa gtc aac aaa ttc ttg gat atc ttg gac tta tcc ggt 1008
Pro Gly Glu Glu Val Asn Lys Phe Leu Asp Ile Leu Asp Leu Ser Gly
325 330 335
aag caa cac tct gtt gtt acc gtt aag gcc ttg gaa cca act gcc aag 1056
Lys Gln His Ser Val Val Thr Val Lys Ala Leu Glu Pro Thr Ala Lys
340 345 350
gtc cca ttc cca aac cca acc act tac gat gct atc cta aga tac cac 1104
Val Pro Phe Pro Asn Pro Thr Thr Tyr Asp Ala Ile Leu Arg Tyr His
355 360 365
ttg gaa atc tgt gcc cca gtt tcc cgt caa ttt gtc tct act ttg gct 1152
Leu Glu Ile Cys Ala Pro Val Ser Arg Gln Phe Val Ser Thr Leu Ala
370 375 380
gcc ttt gct cca aac gac gac atc aag gct gaa atg aac aga ttg ggt 1200
Ala Phe Ala Pro Asn Asp Asp Ile Lys Ala Glu Met Asn Arg Leu Gly
385 390 395 400
tct gac aag gac tac ttc cat gaa aag act ggt cca cac tac tac aac 1248
Ser Asp Lys Asp Tyr Phe His Glu Lys Thr Gly Pro His Tyr Tyr Asn
405 410 415
att gcc aga ttc tta gct tcc gtt tct aag ggt gaa aag tgg acc aag 1296
Ile Ala Arg Phe Leu Ala Ser Val Ser Lys Gly Glu Lys Trp Thr Lys
420 425 430
att cca ttt tcc gct ttc atc gaa ggt ttg acc aaa tta caa cca aga 1344
Ile Pro Phe Ser Ala Phe Ile Glu Gly Leu Thr Lys Leu Gln Pro Arg
435 440 445
tac tac tcc atc tcc tct tct tct ttg gtc caa cca aag aag atc tcc 1392
Tyr Tyr Ser Ile Ser Ser Ser Ser Leu Val Gln Pro Lys Lys Ile Ser
450 455 460
atc act gcc gtc gtt gaa tct caa caa att cca ggt aga gat gac cca 1440
Ile Thr Ala Val Val Glu Ser Gln Gln Ile Pro Gly Arg Asp Asp Pro
465 470 475 480
ttc aga ggt gtc gcc acc aac tac ttg ttc gct ttg aag caa aag caa 1488
Phe Arg Gly Val Ala Thr Asn Tyr Leu Phe Ala Leu Lys Gln Lys Gln
485 490 495
aac ggt gac cca aac cca gct cca ttc ggt caa tct tac gaa ttg acc 1536
Asn Gly Asp Pro Asn Pro Ala Pro Phe Gly Gln Ser Tyr Glu Leu Thr
500 505 510
ggt cca aga aac aaa tac gat ggt att cat gtt cca gtc cac gtc aga 1584
Gly Pro Arg Asn Lys Tyr Asp Gly Ile His Val Pro Val His Val Arg
515 520 525
cac tcc aac ttt aaa ttg cct tct gac cca ggt aag cca atc atc atg 1632
His Ser Asn Phe Lys Leu Pro Ser Asp Pro Gly Lys Pro Ile Ile Met
530 535 540
atc ggt cca ggt act ggt gtt gct cca ttc aga ggt ttc gtt caa gaa 1680
Ile Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly Phe Val Gln Glu
545 550 555 560
aga gcc aag caa gct aga gat ggt gtc gaa gtc ggt aag act ttg tta 1728
Arg Ala Lys Gln Ala Arg Asp Gly Val Glu Val Gly Lys Thr Leu Leu
565 570 575
ttc ttc ggt tgt cgt aaa tct act gaa gat ttc atg tac caa aag gaa 1776
Phe Phe Gly Cys Arg Lys Ser Thr Glu Asp Phe Met Tyr Gln Lys Glu
580 585 590
tgg caa gaa tac aaa gaa gct ttg ggt gac aag ttc gaa atg atc act 1824
Trp Gln Glu Tyr Lys Glu Ala Leu Gly Asp Lys Phe Glu Met Ile Thr
595 600 605
gct ttc tcc aga gaa ggt tcc aag aag gtt tac gtc caa cat cgt ttg 1872
Ala Phe Ser Arg Glu Gly Ser Lys Lys Val Tyr Val Gln His Arg Leu
610 615 620
aag gaa aga tcc aag gaa gtt tcc gac ttg ttg tct caa aag gcc tac 1920
Lys Glu Arg Ser Lys Glu Val Ser Asp Leu Leu Ser Gln Lys Ala Tyr
625 630 635 640
ttc tac gtt tgt ggt gac gct gct cac atg gct cgt gaa gtc aac acc 1968
Phe Tyr Val Cys Gly Asp Ala Ala His Met Ala Arg Glu Val Asn Thr
645 650 655
gtc ttg gct caa atc att gct gaa ggt aga ggt gtc tct gaa gct aag 2016
Val Leu Ala Gln Ile Ile Ala Glu Gly Arg Gly Val Ser Glu Ala Lys
660 665 670
ggt gaa gaa att gtc aag aac atg aga tct gcc aac caa tac caa gtt 2064
Gly Glu Glu Ile Val Lys Asn Met Arg Ser Ala Asn Gln Tyr Gln Val
675 680 685
tgt tct gat ttc gtt act tta cac tgt aag gaa acc acc tac gct aac 2112
Cys Ser Asp Phe Val Thr Leu His Cys Lys Glu Thr Thr Tyr Ala Asn
690 695 700
tct gaa ttg caa gaa gat gtc tgg tct 2139
Ser Glu Leu Gln Glu Asp Val Trp Ser
705 710
<210> 54
<211> 713
<212> PRT
<213> Gibberella fujikuroi
<400> 54
Met Ala Glu Leu Asp Thr Leu Asp Ile Val Val Leu Gly Val Ile Phe
1 5 10 15
Leu Gly Thr Val Ala Tyr Phe Thr Lys Gly Lys Leu Trp Gly Val Thr
20 25 30
Lys Asp Pro Tyr Ala Asn Gly Phe Ala Ala Gly Gly Ala Ser Lys Pro
35 40 45
Gly Arg Thr Arg Asn Ile Val Glu Ala Met Glu Glu Ser Gly Lys Asn
50 55 60
Cys Val Val Phe Tyr Gly Ser Gln Thr Gly Thr Ala Glu Asp Tyr Ala
65 70 75 80
Ser Arg Leu Ala Lys Glu Gly Lys Ser Arg Phe Gly Leu Asn Thr Met
85 90 95
Ile Ala Asp Leu Glu Asp Tyr Asp Phe Asp Asn Leu Asp Thr Val Pro
100 105 110
Ser Asp Asn Ile Val Met Phe Val Leu Ala Thr Tyr Gly Glu Gly Glu
115 120 125
Pro Thr Asp Asn Ala Val Asp Phe Tyr Glu Phe Ile Thr Gly Glu Asp
130 135 140
Ala Ser Phe Asn Glu Gly Asn Asp Pro Pro Leu Gly Asn Leu Asn Tyr
145 150 155 160
Val Ala Phe Gly Leu Gly Asn Asn Thr Tyr Glu His Tyr Asn Ser Met
165 170 175
Val Arg Asn Val Asn Lys Ala Leu Glu Lys Leu Gly Ala His Arg Ile
180 185 190
Gly Glu Ala Gly Glu Gly Asp Asp Gly Ala Gly Thr Met Glu Glu Asp
195 200 205
Phe Leu Ala Trp Lys Asp Pro Met Trp Glu Ala Leu Ala Lys Lys Met
210 215 220
Gly Leu Glu Glu Arg Glu Ala Val Tyr Glu Pro Ile Phe Ala Ile Asn
225 230 235 240
Glu Arg Asp Asp Leu Thr Pro Glu Ala Asn Glu Val Tyr Leu Gly Glu
245 250 255
Pro Asn Lys Leu His Leu Glu Gly Thr Ala Lys Gly Pro Phe Asn Ser
260 265 270
His Asn Pro Tyr Ile Ala Pro Ile Ala Glu Ser Tyr Glu Leu Phe Ser
275 280 285
Ala Lys Asp Arg Asn Cys Leu His Met Glu Ile Asp Ile Ser Gly Ser
290 295 300
Asn Leu Lys Tyr Glu Thr Gly Asp His Ile Ala Ile Trp Pro Thr Asn
305 310 315 320
Pro Gly Glu Glu Val Asn Lys Phe Leu Asp Ile Leu Asp Leu Ser Gly
325 330 335
Lys Gln His Ser Val Val Thr Val Lys Ala Leu Glu Pro Thr Ala Lys
340 345 350
Val Pro Phe Pro Asn Pro Thr Thr Tyr Asp Ala Ile Leu Arg Tyr His
355 360 365
Leu Glu Ile Cys Ala Pro Val Ser Arg Gln Phe Val Ser Thr Leu Ala
370 375 380
Ala Phe Ala Pro Asn Asp Asp Ile Lys Ala Glu Met Asn Arg Leu Gly
385 390 395 400
Ser Asp Lys Asp Tyr Phe His Glu Lys Thr Gly Pro His Tyr Tyr Asn
405 410 415
Ile Ala Arg Phe Leu Ala Ser Val Ser Lys Gly Glu Lys Trp Thr Lys
420 425 430
Ile Pro Phe Ser Ala Phe Ile Glu Gly Leu Thr Lys Leu Gln Pro Arg
435 440 445
Tyr Tyr Ser Ile Ser Ser Ser Ser Leu Val Gln Pro Lys Lys Ile Ser
450 455 460
Ile Thr Ala Val Val Glu Ser Gln Gln Ile Pro Gly Arg Asp Asp Pro
465 470 475 480
Phe Arg Gly Val Ala Thr Asn Tyr Leu Phe Ala Leu Lys Gln Lys Gln
485 490 495
Asn Gly Asp Pro Asn Pro Ala Pro Phe Gly Gln Ser Tyr Glu Leu Thr
500 505 510
Gly Pro Arg Asn Lys Tyr Asp Gly Ile His Val Pro Val His Val Arg
515 520 525
His Ser Asn Phe Lys Leu Pro Ser Asp Pro Gly Lys Pro Ile Ile Met
530 535 540
Ile Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly Phe Val Gln Glu
545 550 555 560
Arg Ala Lys Gln Ala Arg Asp Gly Val Glu Val Gly Lys Thr Leu Leu
565 570 575
Phe Phe Gly Cys Arg Lys Ser Thr Glu Asp Phe Met Tyr Gln Lys Glu
580 585 590
Trp Gln Glu Tyr Lys Glu Ala Leu Gly Asp Lys Phe Glu Met Ile Thr
595 600 605
Ala Phe Ser Arg Glu Gly Ser Lys Lys Val Tyr Val Gln His Arg Leu
610 615 620
Lys Glu Arg Ser Lys Glu Val Ser Asp Leu Leu Ser Gln Lys Ala Tyr
625 630 635 640
Phe Tyr Val Cys Gly Asp Ala Ala His Met Ala Arg Glu Val Asn Thr
645 650 655
Val Leu Ala Gln Ile Ile Ala Glu Gly Arg Gly Val Ser Glu Ala Lys
660 665 670
Gly Glu Glu Ile Val Lys Asn Met Arg Ser Ala Asn Gln Tyr Gln Val
675 680 685
Cys Ser Asp Phe Val Thr Leu His Cys Lys Glu Thr Thr Tyr Ala Asn
690 695 700
Ser Glu Leu Gln Glu Asp Val Trp Ser
705 710
<210> 55
<211> 2076
<212> DNA
<213> Arabidopsis thaliana
<220>
<221> CDS
<222> (1)..(2076)
<400> 55
atg acc tct gct tta tac gcc tcc gat ttg ttc aag caa ttg aag tcc 48
Met Thr Ser Ala Leu Tyr Ala Ser Asp Leu Phe Lys Gln Leu Lys Ser
1 5 10 15
atc atg ggt act gac tct ttg tcc gat gat gtt gtc tta gtc att gcc 96
Ile Met Gly Thr Asp Ser Leu Ser Asp Asp Val Val Leu Val Ile Ala
20 25 30
acc act tct ttg gct ttg gtt gct ggt ttc gtt gtc ttg ttg tgg aag 144
Thr Thr Ser Leu Ala Leu Val Ala Gly Phe Val Val Leu Leu Trp Lys
35 40 45
aaa acc acc gct gac aga tcc ggt gaa ttg aaa cct ttg atg att cca 192
Lys Thr Thr Ala Asp Arg Ser Gly Glu Leu Lys Pro Leu Met Ile Pro
50 55 60
aag tct ttg atg gcc aag gac gaa gat gac gac ttg gac ttg ggt tct 240
Lys Ser Leu Met Ala Lys Asp Glu Asp Asp Asp Leu Asp Leu Gly Ser
65 70 75 80
ggt aag acc aga gtc tcc att ttc ttc ggt act caa acc ggt act gct 288
Gly Lys Thr Arg Val Ser Ile Phe Phe Gly Thr Gln Thr Gly Thr Ala
85 90 95
gaa ggt ttc gct aag gct ttg tct gaa gaa att aaa gcc aga tac gaa 336
Glu Gly Phe Ala Lys Ala Leu Ser Glu Glu Ile Lys Ala Arg Tyr Glu
100 105 110
aag gct gcc gtt aag gtt atc gat ttg gat gac tac gct gct gac gat 384
Lys Ala Ala Val Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala Asp Asp
115 120 125
gac caa tac gaa gaa aag ttg aag aag gaa act ttg gct ttc ttc tgt 432
Asp Gln Tyr Glu Glu Lys Leu Lys Lys Glu Thr Leu Ala Phe Phe Cys
130 135 140
gtt gct acc tac ggt gac ggt gaa cca act gac aat gct gct aga ttc 480
Val Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe
145 150 155 160
tac aag tgg ttc acc gaa gaa aac gaa aga gac atc aaa ttg caa caa 528
Tyr Lys Trp Phe Thr Glu Glu Asn Glu Arg Asp Ile Lys Leu Gln Gln
165 170 175
ttg gct tac ggt gtc ttt gct ttg ggt aac aga caa tac gaa cat ttc 576
Leu Ala Tyr Gly Val Phe Ala Leu Gly Asn Arg Gln Tyr Glu His Phe
180 185 190
aac aag atc ggt att gtc ttg gac gaa gaa tta tgt aag aag ggt gcc 624
Asn Lys Ile Gly Ile Val Leu Asp Glu Glu Leu Cys Lys Lys Gly Ala
195 200 205
aag aga ttg att gaa gtc ggt ttg ggt gac gat gac caa tct att gaa 672
Lys Arg Leu Ile Glu Val Gly Leu Gly Asp Asp Asp Gln Ser Ile Glu
210 215 220
gat gac ttc aac gct tgg aag gaa tct ttg tgg tct gaa tta gac aag 720
Asp Asp Phe Asn Ala Trp Lys Glu Ser Leu Trp Ser Glu Leu Asp Lys
225 230 235 240
tta ttg aag gac gaa gat gac aag tcc gtt gct act cca tac act gct 768
Leu Leu Lys Asp Glu Asp Asp Lys Ser Val Ala Thr Pro Tyr Thr Ala
245 250 255
gtc atc cca gaa tac cgt gtc gtc acc cac gac cca aga ttc acc acc 816
Val Ile Pro Glu Tyr Arg Val Val Thr His Asp Pro Arg Phe Thr Thr
260 265 270
caa aag tcc atg gaa tcc aac gtt gcc aac ggt aac acc act att gac 864
Gln Lys Ser Met Glu Ser Asn Val Ala Asn Gly Asn Thr Thr Ile Asp
275 280 285
atc cac cac cca tgt cgt gtt gat gtt gct gtt caa aag gaa ttg cac 912
Ile His His Pro Cys Arg Val Asp Val Ala Val Gln Lys Glu Leu His
290 295 300
acc cac gaa tct gac cgt tct tgt att cac ttg gaa ttc gat atc tcc 960
Thr His Glu Ser Asp Arg Ser Cys Ile His Leu Glu Phe Asp Ile Ser
305 310 315 320
aga acc ggt atc act tac gaa act ggt gac cac gtc ggt gtc tac gct 1008
Arg Thr Gly Ile Thr Tyr Glu Thr Gly Asp His Val Gly Val Tyr Ala
325 330 335
gaa aac cac gtt gaa atc gtc gaa gaa gcc ggt aaa ttg ttg ggt cac 1056
Glu Asn His Val Glu Ile Val Glu Glu Ala Gly Lys Leu Leu Gly His
340 345 350
tct ttg gac ttg gtt ttc tcc atc cac gct gac aag gaa gat ggt tct 1104
Ser Leu Asp Leu Val Phe Ser Ile His Ala Asp Lys Glu Asp Gly Ser
355 360 365
cca tta gaa tcc gcc gtc cca cct cca ttc cca ggt cca tgt acc tta 1152
Pro Leu Glu Ser Ala Val Pro Pro Pro Phe Pro Gly Pro Cys Thr Leu
370 375 380
ggt act ggt cta gct cgt tac gct gat ttg ttg aac cca cca aga aag 1200
Gly Thr Gly Leu Ala Arg Tyr Ala Asp Leu Leu Asn Pro Pro Arg Lys
385 390 395 400
tct gct ttg gtt gct ttg gct gct tac gct act gaa cct tct gaa gct 1248
Ser Ala Leu Val Ala Leu Ala Ala Tyr Ala Thr Glu Pro Ser Glu Ala
405 410 415
gaa aag ttg aag cat ttg act tct cca gac ggt aag gat gaa tac tcc 1296
Glu Lys Leu Lys His Leu Thr Ser Pro Asp Gly Lys Asp Glu Tyr Ser
420 425 430
caa tgg atc gtt gct tct caa aga tct cta ttg gaa gtt atg gct gct 1344
Gln Trp Ile Val Ala Ser Gln Arg Ser Leu Leu Glu Val Met Ala Ala
435 440 445
ttc cca tct gcc aag cca cca ttg ggt gtt ttc ttt gct gcc att gct 1392
Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ala Ile Ala
450 455 460
cca aga ttg caa cca aga tac tac tcc att tct tct tct cca aga ttg 1440
Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg Leu
465 470 475 480
gct cca tct aga gtc cac gtc acc tct gct ttg gtt tac ggt cca act 1488
Ala Pro Ser Arg Val His Val Thr Ser Ala Leu Val Tyr Gly Pro Thr
485 490 495
cca acc ggt aga atc cac aaa ggt gtt tgt tcc acc tgg atg aag aac 1536
Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys Asn
500 505 510
gct gtt cca gct gaa aag tcc cac gaa tgt tcc ggt gct cca atc ttt 1584
Ala Val Pro Ala Glu Lys Ser His Glu Cys Ser Gly Ala Pro Ile Phe
515 520 525
atc aga gct tct aac ttc aag ttg cca tcc aac cca tct act cca atc 1632
Ile Arg Ala Ser Asn Phe Lys Leu Pro Ser Asn Pro Ser Thr Pro Ile
530 535 540
gtc atg gtt ggt cca ggt act ggt ttg gct cca ttc aga ggt ttc tta 1680
Val Met Val Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu
545 550 555 560
caa gaa aga atg gct ttg aaa gaa gat ggt gaa gaa ttg ggt tct tct 1728
Gln Glu Arg Met Ala Leu Lys Glu Asp Gly Glu Glu Leu Gly Ser Ser
565 570 575
cta ttg ttc ttc ggt tgt aga aac aga caa atg gac ttc atc tac gaa 1776
Leu Leu Phe Phe Gly Cys Arg Asn Arg Gln Met Asp Phe Ile Tyr Glu
580 585 590
gat gaa tta aac aac ttt gtc gac caa ggt gtt atc tcc gaa ttg atc 1824
Asp Glu Leu Asn Asn Phe Val Asp Gln Gly Val Ile Ser Glu Leu Ile
595 600 605
atg gcc ttc tct aga gaa ggt gct caa aag gaa tac gtt caa cat aaa 1872
Met Ala Phe Ser Arg Glu Gly Ala Gln Lys Glu Tyr Val Gln His Lys
610 615 620
atg atg gaa aag gct gct caa gtt tgg gac ttg atc aag gaa gaa ggt 1920
Met Met Glu Lys Ala Ala Gln Val Trp Asp Leu Ile Lys Glu Glu Gly
625 630 635 640
tac ttg tac gtt tgt ggt gat gcc aaa ggt atg gcc aga gat gtc cac 1968
Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val His
645 650 655
aga act ttg cac acc att gtc caa gaa caa gaa ggt gtt tcc tct tcc 2016
Arg Thr Leu His Thr Ile Val Gln Glu Gln Glu Gly Val Ser Ser Ser
660 665 670
gaa gct gaa gcc att gtc aag aag tta caa act gaa ggt cgt tat ttg 2064
Glu Ala Glu Ala Ile Val Lys Lys Leu Gln Thr Glu Gly Arg Tyr Leu
675 680 685
aga gat gtc tgg 2076
Arg Asp Val Trp
690
<210> 56
<211> 692
<212> PRT
<213> Arabidopsis thaliana
<400> 56
Met Thr Ser Ala Leu Tyr Ala Ser Asp Leu Phe Lys Gln Leu Lys Ser
1 5 10 15
Ile Met Gly Thr Asp Ser Leu Ser Asp Asp Val Val Leu Val Ile Ala
20 25 30
Thr Thr Ser Leu Ala Leu Val Ala Gly Phe Val Val Leu Leu Trp Lys
35 40 45
Lys Thr Thr Ala Asp Arg Ser Gly Glu Leu Lys Pro Leu Met Ile Pro
50 55 60
Lys Ser Leu Met Ala Lys Asp Glu Asp Asp Asp Leu Asp Leu Gly Ser
65 70 75 80
Gly Lys Thr Arg Val Ser Ile Phe Phe Gly Thr Gln Thr Gly Thr Ala
85 90 95
Glu Gly Phe Ala Lys Ala Leu Ser Glu Glu Ile Lys Ala Arg Tyr Glu
100 105 110
Lys Ala Ala Val Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala Asp Asp
115 120 125
Asp Gln Tyr Glu Glu Lys Leu Lys Lys Glu Thr Leu Ala Phe Phe Cys
130 135 140
Val Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe
145 150 155 160
Tyr Lys Trp Phe Thr Glu Glu Asn Glu Arg Asp Ile Lys Leu Gln Gln
165 170 175
Leu Ala Tyr Gly Val Phe Ala Leu Gly Asn Arg Gln Tyr Glu His Phe
180 185 190
Asn Lys Ile Gly Ile Val Leu Asp Glu Glu Leu Cys Lys Lys Gly Ala
195 200 205
Lys Arg Leu Ile Glu Val Gly Leu Gly Asp Asp Asp Gln Ser Ile Glu
210 215 220
Asp Asp Phe Asn Ala Trp Lys Glu Ser Leu Trp Ser Glu Leu Asp Lys
225 230 235 240
Leu Leu Lys Asp Glu Asp Asp Lys Ser Val Ala Thr Pro Tyr Thr Ala
245 250 255
Val Ile Pro Glu Tyr Arg Val Val Thr His Asp Pro Arg Phe Thr Thr
260 265 270
Gln Lys Ser Met Glu Ser Asn Val Ala Asn Gly Asn Thr Thr Ile Asp
275 280 285
Ile His His Pro Cys Arg Val Asp Val Ala Val Gln Lys Glu Leu His
290 295 300
Thr His Glu Ser Asp Arg Ser Cys Ile His Leu Glu Phe Asp Ile Ser
305 310 315 320
Arg Thr Gly Ile Thr Tyr Glu Thr Gly Asp His Val Gly Val Tyr Ala
325 330 335
Glu Asn His Val Glu Ile Val Glu Glu Ala Gly Lys Leu Leu Gly His
340 345 350
Ser Leu Asp Leu Val Phe Ser Ile His Ala Asp Lys Glu Asp Gly Ser
355 360 365
Pro Leu Glu Ser Ala Val Pro Pro Pro Phe Pro Gly Pro Cys Thr Leu
370 375 380
Gly Thr Gly Leu Ala Arg Tyr Ala Asp Leu Leu Asn Pro Pro Arg Lys
385 390 395 400
Ser Ala Leu Val Ala Leu Ala Ala Tyr Ala Thr Glu Pro Ser Glu Ala
405 410 415
Glu Lys Leu Lys His Leu Thr Ser Pro Asp Gly Lys Asp Glu Tyr Ser
420 425 430
Gln Trp Ile Val Ala Ser Gln Arg Ser Leu Leu Glu Val Met Ala Ala
435 440 445
Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ala Ile Ala
450 455 460
Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg Leu
465 470 475 480
Ala Pro Ser Arg Val His Val Thr Ser Ala Leu Val Tyr Gly Pro Thr
485 490 495
Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp Met Lys Asn
500 505 510
Ala Val Pro Ala Glu Lys Ser His Glu Cys Ser Gly Ala Pro Ile Phe
515 520 525
Ile Arg Ala Ser Asn Phe Lys Leu Pro Ser Asn Pro Ser Thr Pro Ile
530 535 540
Val Met Val Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe Leu
545 550 555 560
Gln Glu Arg Met Ala Leu Lys Glu Asp Gly Glu Glu Leu Gly Ser Ser
565 570 575
Leu Leu Phe Phe Gly Cys Arg Asn Arg Gln Met Asp Phe Ile Tyr Glu
580 585 590
Asp Glu Leu Asn Asn Phe Val Asp Gln Gly Val Ile Ser Glu Leu Ile
595 600 605
Met Ala Phe Ser Arg Glu Gly Ala Gln Lys Glu Tyr Val Gln His Lys
610 615 620
Met Met Glu Lys Ala Ala Gln Val Trp Asp Leu Ile Lys Glu Glu Gly
625 630 635 640
Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val His
645 650 655
Arg Thr Leu His Thr Ile Val Gln Glu Gln Glu Gly Val Ser Ser Ser
660 665 670
Glu Ala Glu Ala Ile Val Lys Lys Leu Gln Thr Glu Gly Arg Tyr Leu
675 680 685
Arg Asp Val Trp
690
<210> 57
<211> 2133
<212> DNA
<213> Arabidopsis thaliana
<220>
<221> CDS
<222> (1)..(2133)
<400> 57
atg tcc tct tct tct tct tct tct act tcc atg att gat ttg atg gct 48
Met Ser Ser Ser Ser Ser Ser Ser Thr Ser Met Ile Asp Leu Met Ala
1 5 10 15
gcc atc atc aag ggt gaa cca gtc att gtc tct gac cca gcc aac gct 96
Ala Ile Ile Lys Gly Glu Pro Val Ile Val Ser Asp Pro Ala Asn Ala
20 25 30
tct gct tac gaa tcc gtt gct gct gaa ttg tcc tcc atg ttg att gaa 144
Ser Ala Tyr Glu Ser Val Ala Ala Glu Leu Ser Ser Met Leu Ile Glu
35 40 45
aac aga caa ttc gct atg att gtc act act tcc att gct gtc ttg att 192
Asn Arg Gln Phe Ala Met Ile Val Thr Thr Ser Ile Ala Val Leu Ile
50 55 60
ggt tgt atc gtc atg ttg gtc tgg aga aga tcc ggt tcc ggt aac tcc 240
Gly Cys Ile Val Met Leu Val Trp Arg Arg Ser Gly Ser Gly Asn Ser
65 70 75 80
aag aga gtt gaa cca ttg aag cca tta gtc atc aag cca aga gaa gaa 288
Lys Arg Val Glu Pro Leu Lys Pro Leu Val Ile Lys Pro Arg Glu Glu
85 90 95
gaa att gat gac ggt aga aag aag gtc acc atc ttc ttt ggt act caa 336
Glu Ile Asp Asp Gly Arg Lys Lys Val Thr Ile Phe Phe Gly Thr Gln
100 105 110
acc ggt act gct gaa ggt ttt gct aag gct ttg ggt gaa gaa gcc aaa 384
Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Gly Glu Glu Ala Lys
115 120 125
gct aga tac gaa aag acc aga ttc aag atc gtt gac ttg gac gac tac 432
Ala Arg Tyr Glu Lys Thr Arg Phe Lys Ile Val Asp Leu Asp Asp Tyr
130 135 140
gct gct gat gac gac gaa tac gaa gaa aag ttg aag aag gaa gat gtt 480
Ala Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asp Val
145 150 155 160
gcc ttc ttc ttc ttg gct act tac ggt gat ggt gaa cca act gac aat 528
Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn
165 170 175
gct gcc aga ttc tac aaa tgg ttc acc gaa ggt aac gac aga ggt gaa 576
Ala Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asn Asp Arg Gly Glu
180 185 190
tgg tta aag aac ttg aaa tac ggt gtt ttc ggt cta ggt aac aga caa 624
Trp Leu Lys Asn Leu Lys Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln
195 200 205
tac gaa cac ttc aac aag gtt gcc aag gtt gtc gat gac atc ttg gtt 672
Tyr Glu His Phe Asn Lys Val Ala Lys Val Val Asp Asp Ile Leu Val
210 215 220
gaa caa ggt gct caa aga tta gtc caa gtc ggt ttg ggt gat gat gac 720
Glu Gln Gly Ala Gln Arg Leu Val Gln Val Gly Leu Gly Asp Asp Asp
225 230 235 240
caa tgt atc gaa gat gac ttc act gct tgg aga gaa gct ttg tgg cca 768
Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Ala Leu Trp Pro
245 250 255
gaa ttg gac acc atc tta aga gaa gaa ggt gat acc gct gtt gcc acc 816
Glu Leu Asp Thr Ile Leu Arg Glu Glu Gly Asp Thr Ala Val Ala Thr
260 265 270
cca tac act gct gct gtt ttg gaa tac aga gtt tct atc cac gac tct 864
Pro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Ser Ile His Asp Ser
275 280 285
gaa gat gcc aag ttc aac gac atc aac atg gct aac ggt aac ggt tac 912
Glu Asp Ala Lys Phe Asn Asp Ile Asn Met Ala Asn Gly Asn Gly Tyr
290 295 300
act gtt ttc gac gct caa cac cca tac aag gcc aat gtt gct gtc aag 960
Thr Val Phe Asp Ala Gln His Pro Tyr Lys Ala Asn Val Ala Val Lys
305 310 315 320
aga gaa ttg cac act cca gaa tct gat cgt tct tgt atc cac ttg gaa 1008
Arg Glu Leu His Thr Pro Glu Ser Asp Arg Ser Cys Ile His Leu Glu
325 330 335
ttt gac att gct ggt tct ggt ttg acc tac gaa acc ggt gac cac gtc 1056
Phe Asp Ile Ala Gly Ser Gly Leu Thr Tyr Glu Thr Gly Asp His Val
340 345 350
ggt gtc tta tgt gac aac ttg tct gaa act gtc gat gaa gct ttg aga 1104
Gly Val Leu Cys Asp Asn Leu Ser Glu Thr Val Asp Glu Ala Leu Arg
355 360 365
tta ttg gac atg tct cca gac act tat ttc tcc ttg cat gct gaa aag 1152
Leu Leu Asp Met Ser Pro Asp Thr Tyr Phe Ser Leu His Ala Glu Lys
370 375 380
gaa gat ggt act cca att tct tct tcc ttg cct cct cca ttc cca cca 1200
Glu Asp Gly Thr Pro Ile Ser Ser Ser Leu Pro Pro Pro Phe Pro Pro
385 390 395 400
tgt aac ttg aga acc gct tta acc aga tac gct tgt ttg cta tcc tct 1248
Cys Asn Leu Arg Thr Ala Leu Thr Arg Tyr Ala Cys Leu Leu Ser Ser
405 410 415
cca aag aag tcc gct ttg gtt gct ttg gct gct cac gct tct gac cca 1296
Pro Lys Lys Ser Ala Leu Val Ala Leu Ala Ala His Ala Ser Asp Pro
420 425 430
act gaa gct gaa aga ttg aaa cat ttg gct tcc cca gct ggt aag gat 1344
Thr Glu Ala Glu Arg Leu Lys His Leu Ala Ser Pro Ala Gly Lys Asp
435 440 445
gaa tac tcc aaa tgg gtt gtt gaa tct caa aga tct ttg ttg gaa gtc 1392
Glu Tyr Ser Lys Trp Val Val Glu Ser Gln Arg Ser Leu Leu Glu Val
450 455 460
atg gct gaa ttc cca tct gcc aag cca cca ttg ggt gtt ttc ttc gcc 1440
Met Ala Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala
465 470 475 480
ggt gtt gct cca aga ttg caa cca aga ttt tac tcc atc tct tct tct 1488
Gly Val Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser Ser Ser
485 490 495
cca aag att gct gaa acc aga att cac gtt acc tgt gcc ttg gtc tac 1536
Pro Lys Ile Ala Glu Thr Arg Ile His Val Thr Cys Ala Leu Val Tyr
500 505 510
gaa aag atg cca acc ggt aga att cac aag ggt gtt tgt tcc acc tgg 1584
Glu Lys Met Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp
515 520 525
atg aag aac gct gtt cca tac gaa aag tct gaa aac tgt tct tct gct 1632
Met Lys Asn Ala Val Pro Tyr Glu Lys Ser Glu Asn Cys Ser Ser Ala
530 535 540
cca atc ttc gtc cgt caa tcc aac ttc aag ttg cca tct gac tcc aag 1680
Pro Ile Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ser Asp Ser Lys
545 550 555 560
gtc cca atc atc atg atc ggt cca ggt act ggt tta gct cca ttc aga 1728
Val Pro Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg
565 570 575
ggt ttc ttg caa gaa aga ttg gcc tta gtt gaa tct ggt gtc gaa ttg 1776
Gly Phe Leu Gln Glu Arg Leu Ala Leu Val Glu Ser Gly Val Glu Leu
580 585 590
ggt cct tct gtt ttg ttc ttc ggt tgt aga aac cgt cgt atg gac ttc 1824
Gly Pro Ser Val Leu Phe Phe Gly Cys Arg Asn Arg Arg Met Asp Phe
595 600 605
atc tac gaa gaa gaa ttg caa aga ttt gtc gaa tct ggt gct ttg gct 1872
Ile Tyr Glu Glu Glu Leu Gln Arg Phe Val Glu Ser Gly Ala Leu Ala
610 615 620
gaa ttg tcc gtt gct ttc tct cgt gaa ggt cca acc aaa gaa tac gtt 1920
Glu Leu Ser Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr Val
625 630 635 640
caa cac aag atg atg gac aaa gcc tcc gac atc tgg aac atg atc tcc 1968
Gln His Lys Met Met Asp Lys Ala Ser Asp Ile Trp Asn Met Ile Ser
645 650 655
caa ggt gct tac ttg tac gtt tgt ggt gat gct aaa ggt atg gcc aga 2016
Gln Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg
660 665 670
gat gtc cac aga tct tta cat acc att gcc caa gaa caa ggt tcc atg 2064
Asp Val His Arg Ser Leu His Thr Ile Ala Gln Glu Gln Gly Ser Met
675 680 685
gac tcc acc aag gct gaa ggt ttc gtt aag aac ttg caa act tct ggt 2112
Asp Ser Thr Lys Ala Glu Gly Phe Val Lys Asn Leu Gln Thr Ser Gly
690 695 700
cgt tac ttg aga gat gtt tgg 2133
Arg Tyr Leu Arg Asp Val Trp
705 710
<210> 58
<211> 711
<212> PRT
<213> Arabidopsis thaliana
<400> 58
Met Ser Ser Ser Ser Ser Ser Ser Thr Ser Met Ile Asp Leu Met Ala
1 5 10 15
Ala Ile Ile Lys Gly Glu Pro Val Ile Val Ser Asp Pro Ala Asn Ala
20 25 30
Ser Ala Tyr Glu Ser Val Ala Ala Glu Leu Ser Ser Met Leu Ile Glu
35 40 45
Asn Arg Gln Phe Ala Met Ile Val Thr Thr Ser Ile Ala Val Leu Ile
50 55 60
Gly Cys Ile Val Met Leu Val Trp Arg Arg Ser Gly Ser Gly Asn Ser
65 70 75 80
Lys Arg Val Glu Pro Leu Lys Pro Leu Val Ile Lys Pro Arg Glu Glu
85 90 95
Glu Ile Asp Asp Gly Arg Lys Lys Val Thr Ile Phe Phe Gly Thr Gln
100 105 110
Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Gly Glu Glu Ala Lys
115 120 125
Ala Arg Tyr Glu Lys Thr Arg Phe Lys Ile Val Asp Leu Asp Asp Tyr
130 135 140
Ala Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asp Val
145 150 155 160
Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn
165 170 175
Ala Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Asn Asp Arg Gly Glu
180 185 190
Trp Leu Lys Asn Leu Lys Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln
195 200 205
Tyr Glu His Phe Asn Lys Val Ala Lys Val Val Asp Asp Ile Leu Val
210 215 220
Glu Gln Gly Ala Gln Arg Leu Val Gln Val Gly Leu Gly Asp Asp Asp
225 230 235 240
Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Arg Glu Ala Leu Trp Pro
245 250 255
Glu Leu Asp Thr Ile Leu Arg Glu Glu Gly Asp Thr Ala Val Ala Thr
260 265 270
Pro Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Ser Ile His Asp Ser
275 280 285
Glu Asp Ala Lys Phe Asn Asp Ile Asn Met Ala Asn Gly Asn Gly Tyr
290 295 300
Thr Val Phe Asp Ala Gln His Pro Tyr Lys Ala Asn Val Ala Val Lys
305 310 315 320
Arg Glu Leu His Thr Pro Glu Ser Asp Arg Ser Cys Ile His Leu Glu
325 330 335
Phe Asp Ile Ala Gly Ser Gly Leu Thr Tyr Glu Thr Gly Asp His Val
340 345 350
Gly Val Leu Cys Asp Asn Leu Ser Glu Thr Val Asp Glu Ala Leu Arg
355 360 365
Leu Leu Asp Met Ser Pro Asp Thr Tyr Phe Ser Leu His Ala Glu Lys
370 375 380
Glu Asp Gly Thr Pro Ile Ser Ser Ser Leu Pro Pro Pro Phe Pro Pro
385 390 395 400
Cys Asn Leu Arg Thr Ala Leu Thr Arg Tyr Ala Cys Leu Leu Ser Ser
405 410 415
Pro Lys Lys Ser Ala Leu Val Ala Leu Ala Ala His Ala Ser Asp Pro
420 425 430
Thr Glu Ala Glu Arg Leu Lys His Leu Ala Ser Pro Ala Gly Lys Asp
435 440 445
Glu Tyr Ser Lys Trp Val Val Glu Ser Gln Arg Ser Leu Leu Glu Val
450 455 460
Met Ala Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala
465 470 475 480
Gly Val Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser Ser Ser
485 490 495
Pro Lys Ile Ala Glu Thr Arg Ile His Val Thr Cys Ala Leu Val Tyr
500 505 510
Glu Lys Met Pro Thr Gly Arg Ile His Lys Gly Val Cys Ser Thr Trp
515 520 525
Met Lys Asn Ala Val Pro Tyr Glu Lys Ser Glu Asn Cys Ser Ser Ala
530 535 540
Pro Ile Phe Val Arg Gln Ser Asn Phe Lys Leu Pro Ser Asp Ser Lys
545 550 555 560
Val Pro Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg
565 570 575
Gly Phe Leu Gln Glu Arg Leu Ala Leu Val Glu Ser Gly Val Glu Leu
580 585 590
Gly Pro Ser Val Leu Phe Phe Gly Cys Arg Asn Arg Arg Met Asp Phe
595 600 605
Ile Tyr Glu Glu Glu Leu Gln Arg Phe Val Glu Ser Gly Ala Leu Ala
610 615 620
Glu Leu Ser Val Ala Phe Ser Arg Glu Gly Pro Thr Lys Glu Tyr Val
625 630 635 640
Gln His Lys Met Met Asp Lys Ala Ser Asp Ile Trp Asn Met Ile Ser
645 650 655
Gln Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg
660 665 670
Asp Val His Arg Ser Leu His Thr Ile Ala Gln Glu Gln Gly Ser Met
675 680 685
Asp Ser Thr Lys Ala Glu Gly Phe Val Lys Asn Leu Gln Thr Ser Gly
690 695 700
Arg Tyr Leu Arg Asp Val Trp
705 710
<210> 59
<211> 2361
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(2361)
<400> 59
atg aag act ggt ttc atc tct cca gct acc gtc ttc cac cac aga atc 48
Met Lys Thr Gly Phe Ile Ser Pro Ala Thr Val Phe His His Arg Ile
1 5 10 15
tct cca gct acc act ttc aga cac cac ttg tcc cca gct acc act aac 96
Ser Pro Ala Thr Thr Phe Arg His His Leu Ser Pro Ala Thr Thr Asn
20 25 30
tct act ggt att gtt gct ttg aga gac atc aac ttc aga tgt aaa gct 144
Ser Thr Gly Ile Val Ala Leu Arg Asp Ile Asn Phe Arg Cys Lys Ala
35 40 45
gtt tcc aag gaa tac tct gac ttg ttg caa aag gat gaa gcc tcc ttc 192
Val Ser Lys Glu Tyr Ser Asp Leu Leu Gln Lys Asp Glu Ala Ser Phe
50 55 60
acc aaa tgg gat gat gac aaa gtt aag gac cat tta gac act aac aag 240
Thr Lys Trp Asp Asp Asp Lys Val Lys Asp His Leu Asp Thr Asn Lys
65 70 75 80
aac ttg tac cca aac gat gaa atc aag gaa ttc gtc gaa tct gtc aaa 288
Asn Leu Tyr Pro Asn Asp Glu Ile Lys Glu Phe Val Glu Ser Val Lys
85 90 95
gct atg ttc ggt tcc atg aat gat ggt gaa atc aac gtt tcc gct tac 336
Ala Met Phe Gly Ser Met Asn Asp Gly Glu Ile Asn Val Ser Ala Tyr
100 105 110
gac acc gct tgg gtt gct ttg gtt caa gac gtt gat ggt tcc ggt tct 384
Asp Thr Ala Trp Val Ala Leu Val Gln Asp Val Asp Gly Ser Gly Ser
115 120 125
cca caa ttc cca tct tct ttg gaa tgg att gcc aac aac caa ttg tct 432
Pro Gln Phe Pro Ser Ser Leu Glu Trp Ile Ala Asn Asn Gln Leu Ser
130 135 140
gat ggt tct tgg ggt gac cat ttg tta ttc tct gct cac gac aga att 480
Asp Gly Ser Trp Gly Asp His Leu Leu Phe Ser Ala His Asp Arg Ile
145 150 155 160
att aac act tta gct tgt gtc att gct ttg act tcc tgg aat gtc cat 528
Ile Asn Thr Leu Ala Cys Val Ile Ala Leu Thr Ser Trp Asn Val His
165 170 175
cca tcc aag tgt gaa aag ggt ttg aac ttc ttg aga gaa aac atc tgt 576
Pro Ser Lys Cys Glu Lys Gly Leu Asn Phe Leu Arg Glu Asn Ile Cys
180 185 190
aag ttg gaa gat gaa aat gct gaa cac atg cca att ggt ttc gaa gtt 624
Lys Leu Glu Asp Glu Asn Ala Glu His Met Pro Ile Gly Phe Glu Val
195 200 205
acc ttc cca tct ttg att gat atc gcc aag aag ttg aac atc gaa gtc 672
Thr Phe Pro Ser Leu Ile Asp Ile Ala Lys Lys Leu Asn Ile Glu Val
210 215 220
cca gaa gac acc cca gct ttg aag gaa atc tac gcc aga aga gat atc 720
Pro Glu Asp Thr Pro Ala Leu Lys Glu Ile Tyr Ala Arg Arg Asp Ile
225 230 235 240
aag ttg acc aaa atc cca atg gaa gtt ttg cac aag gtt cca acc acc 768
Lys Leu Thr Lys Ile Pro Met Glu Val Leu His Lys Val Pro Thr Thr
245 250 255
ttg ttg cac tct ttg gaa ggt atg cca gac ttg gaa tgg gaa aag ttg 816
Leu Leu His Ser Leu Glu Gly Met Pro Asp Leu Glu Trp Glu Lys Leu
260 265 270
tta aag ttg caa tgt aag gac ggt tct ttc ttg ttc tct cca tct tct 864
Leu Lys Leu Gln Cys Lys Asp Gly Ser Phe Leu Phe Ser Pro Ser Ser
275 280 285
acc gcc ttt gct ttg atg caa act aag gac gaa aag tgt cta caa tac 912
Thr Ala Phe Ala Leu Met Gln Thr Lys Asp Glu Lys Cys Leu Gln Tyr
290 295 300
tta act aat atc gtt acc aaa ttc aac ggt ggt gtc cca aac gtt tac 960
Leu Thr Asn Ile Val Thr Lys Phe Asn Gly Gly Val Pro Asn Val Tyr
305 310 315 320
cct gtt gac ttg ttt gaa cac atc tgg gtt gtt gac aga ttg caa cgt 1008
Pro Val Asp Leu Phe Glu His Ile Trp Val Val Asp Arg Leu Gln Arg
325 330 335
ttg ggt att gct cgt tat ttc aag tct gaa atc aag gac tgt gtt gaa 1056
Leu Gly Ile Ala Arg Tyr Phe Lys Ser Glu Ile Lys Asp Cys Val Glu
340 345 350
tac atc aac aag tac tgg act aag aac ggt atc tgt tgg gct cgt aac 1104
Tyr Ile Asn Lys Tyr Trp Thr Lys Asn Gly Ile Cys Trp Ala Arg Asn
355 360 365
acc cac gtt caa gat atc gac gac act gct atg ggt ttc aga gtc ttg 1152
Thr His Val Gln Asp Ile Asp Asp Thr Ala Met Gly Phe Arg Val Leu
370 375 380
aga gct cat ggt tac gat gtc acc cca gat gtc ttc aga caa ttc gaa 1200
Arg Ala His Gly Tyr Asp Val Thr Pro Asp Val Phe Arg Gln Phe Glu
385 390 395 400
aag gat ggt aag ttc gtt tgt ttt gcc ggt caa tcc act caa gcc gtc 1248
Lys Asp Gly Lys Phe Val Cys Phe Ala Gly Gln Ser Thr Gln Ala Val
405 410 415
act ggt atg ttc aac gtc tac aga gct tct caa atg ttg ttc cca ggt 1296
Thr Gly Met Phe Asn Val Tyr Arg Ala Ser Gln Met Leu Phe Pro Gly
420 425 430
gaa aga atc cta gaa gac gct aag aag ttc tcc tac aac tac ttg aaa 1344
Glu Arg Ile Leu Glu Asp Ala Lys Lys Phe Ser Tyr Asn Tyr Leu Lys
435 440 445
gaa aag caa tct act aac gaa ttg ttg gac aaa tgg atc att gcc aaa 1392
Glu Lys Gln Ser Thr Asn Glu Leu Leu Asp Lys Trp Ile Ile Ala Lys
450 455 460
gac tta cca ggt gaa gtc ggt tac gct ttg gat att cca tgg tac gct 1440
Asp Leu Pro Gly Glu Val Gly Tyr Ala Leu Asp Ile Pro Trp Tyr Ala
465 470 475 480
tct cta cca aga tta gaa acc aga tac tac ttg gaa caa tac ggt ggt 1488
Ser Leu Pro Arg Leu Glu Thr Arg Tyr Tyr Leu Glu Gln Tyr Gly Gly
485 490 495
gaa gac gat gtc tgg atc ggt aag acc ttg tac aga atg ggt tac gtt 1536
Glu Asp Asp Val Trp Ile Gly Lys Thr Leu Tyr Arg Met Gly Tyr Val
500 505 510
tcc aac aac act tac ttg gaa atg gcc aaa ttg gac tac aac aac tac 1584
Ser Asn Asn Thr Tyr Leu Glu Met Ala Lys Leu Asp Tyr Asn Asn Tyr
515 520 525
gtc gcc gtc tta caa ttg gaa tgg tac acc att caa caa tgg tac gtt 1632
Val Ala Val Leu Gln Leu Glu Trp Tyr Thr Ile Gln Gln Trp Tyr Val
530 535 540
gac att ggt att gaa aag ttt gaa tcc gac aac atc aag tcc gtc ttg 1680
Asp Ile Gly Ile Glu Lys Phe Glu Ser Asp Asn Ile Lys Ser Val Leu
545 550 555 560
gtt tcc tac tac ttg gct gct gct tcc atc ttt gaa cca gaa aga tcc 1728
Val Ser Tyr Tyr Leu Ala Ala Ala Ser Ile Phe Glu Pro Glu Arg Ser
565 570 575
aag gaa aga att gct tgg gct aag acc acc atc ttg gtt gac aag atc 1776
Lys Glu Arg Ile Ala Trp Ala Lys Thr Thr Ile Leu Val Asp Lys Ile
580 585 590
act tct att ttc gac tct tcc caa tct tcc aag gaa gat atc acc gct 1824
Thr Ser Ile Phe Asp Ser Ser Gln Ser Ser Lys Glu Asp Ile Thr Ala
595 600 605
ttc att gac aaa ttc aga aac aag tct tct tcc aag aag cac tcc att 1872
Phe Ile Asp Lys Phe Arg Asn Lys Ser Ser Ser Lys Lys His Ser Ile
610 615 620
aac ggt gaa cca tgg cac gaa gtt atg gtt gct ttg aag aag act ttg 1920
Asn Gly Glu Pro Trp His Glu Val Met Val Ala Leu Lys Lys Thr Leu
625 630 635 640
cac ggt ttt gct ttg gat gct ttg atg act cac tct caa gat att cac 1968
His Gly Phe Ala Leu Asp Ala Leu Met Thr His Ser Gln Asp Ile His
645 650 655
cct caa tta cac caa gct tgg gaa atg tgg tta acc aag ttg caa gat 2016
Pro Gln Leu His Gln Ala Trp Glu Met Trp Leu Thr Lys Leu Gln Asp
660 665 670
ggt gtc gat gtc act gct gaa ttg atg gtt caa atg atc aac atg act 2064
Gly Val Asp Val Thr Ala Glu Leu Met Val Gln Met Ile Asn Met Thr
675 680 685
gcc ggt aga tgg gtt tct aag gaa ttg ttg act cac cct caa tac caa 2112
Ala Gly Arg Trp Val Ser Lys Glu Leu Leu Thr His Pro Gln Tyr Gln
690 695 700
cgt ttg tcc acc gtc acc aac tct gtc tgt cac gac atc act aag ttg 2160
Arg Leu Ser Thr Val Thr Asn Ser Val Cys His Asp Ile Thr Lys Leu
705 710 715 720
cac aac ttc aaa gaa aac tcc act act gtc gat tct aag gtt caa gaa 2208
His Asn Phe Lys Glu Asn Ser Thr Thr Val Asp Ser Lys Val Gln Glu
725 730 735
ttg gtt caa tta gtt ttc tct gac acc cca gat gac ttg gac caa gac 2256
Leu Val Gln Leu Val Phe Ser Asp Thr Pro Asp Asp Leu Asp Gln Asp
740 745 750
atg aag caa act ttc ttg act gtc atg aag acc ttc tac tac aag gct 2304
Met Lys Gln Thr Phe Leu Thr Val Met Lys Thr Phe Tyr Tyr Lys Ala
755 760 765
tgg tgt gac cca aac acc atc aac gac cat att tct aag gtc ttc gaa 2352
Trp Cys Asp Pro Asn Thr Ile Asn Asp His Ile Ser Lys Val Phe Glu
770 775 780
att gtt atc 2361
Ile Val Ile
785
<210> 60
<211> 787
<212> PRT
<213> Stevia rebaudiana
<400> 60
Met Lys Thr Gly Phe Ile Ser Pro Ala Thr Val Phe His His Arg Ile
1 5 10 15
Ser Pro Ala Thr Thr Phe Arg His His Leu Ser Pro Ala Thr Thr Asn
20 25 30
Ser Thr Gly Ile Val Ala Leu Arg Asp Ile Asn Phe Arg Cys Lys Ala
35 40 45
Val Ser Lys Glu Tyr Ser Asp Leu Leu Gln Lys Asp Glu Ala Ser Phe
50 55 60
Thr Lys Trp Asp Asp Asp Lys Val Lys Asp His Leu Asp Thr Asn Lys
65 70 75 80
Asn Leu Tyr Pro Asn Asp Glu Ile Lys Glu Phe Val Glu Ser Val Lys
85 90 95
Ala Met Phe Gly Ser Met Asn Asp Gly Glu Ile Asn Val Ser Ala Tyr
100 105 110
Asp Thr Ala Trp Val Ala Leu Val Gln Asp Val Asp Gly Ser Gly Ser
115 120 125
Pro Gln Phe Pro Ser Ser Leu Glu Trp Ile Ala Asn Asn Gln Leu Ser
130 135 140
Asp Gly Ser Trp Gly Asp His Leu Leu Phe Ser Ala His Asp Arg Ile
145 150 155 160
Ile Asn Thr Leu Ala Cys Val Ile Ala Leu Thr Ser Trp Asn Val His
165 170 175
Pro Ser Lys Cys Glu Lys Gly Leu Asn Phe Leu Arg Glu Asn Ile Cys
180 185 190
Lys Leu Glu Asp Glu Asn Ala Glu His Met Pro Ile Gly Phe Glu Val
195 200 205
Thr Phe Pro Ser Leu Ile Asp Ile Ala Lys Lys Leu Asn Ile Glu Val
210 215 220
Pro Glu Asp Thr Pro Ala Leu Lys Glu Ile Tyr Ala Arg Arg Asp Ile
225 230 235 240
Lys Leu Thr Lys Ile Pro Met Glu Val Leu His Lys Val Pro Thr Thr
245 250 255
Leu Leu His Ser Leu Glu Gly Met Pro Asp Leu Glu Trp Glu Lys Leu
260 265 270
Leu Lys Leu Gln Cys Lys Asp Gly Ser Phe Leu Phe Ser Pro Ser Ser
275 280 285
Thr Ala Phe Ala Leu Met Gln Thr Lys Asp Glu Lys Cys Leu Gln Tyr
290 295 300
Leu Thr Asn Ile Val Thr Lys Phe Asn Gly Gly Val Pro Asn Val Tyr
305 310 315 320
Pro Val Asp Leu Phe Glu His Ile Trp Val Val Asp Arg Leu Gln Arg
325 330 335
Leu Gly Ile Ala Arg Tyr Phe Lys Ser Glu Ile Lys Asp Cys Val Glu
340 345 350
Tyr Ile Asn Lys Tyr Trp Thr Lys Asn Gly Ile Cys Trp Ala Arg Asn
355 360 365
Thr His Val Gln Asp Ile Asp Asp Thr Ala Met Gly Phe Arg Val Leu
370 375 380
Arg Ala His Gly Tyr Asp Val Thr Pro Asp Val Phe Arg Gln Phe Glu
385 390 395 400
Lys Asp Gly Lys Phe Val Cys Phe Ala Gly Gln Ser Thr Gln Ala Val
405 410 415
Thr Gly Met Phe Asn Val Tyr Arg Ala Ser Gln Met Leu Phe Pro Gly
420 425 430
Glu Arg Ile Leu Glu Asp Ala Lys Lys Phe Ser Tyr Asn Tyr Leu Lys
435 440 445
Glu Lys Gln Ser Thr Asn Glu Leu Leu Asp Lys Trp Ile Ile Ala Lys
450 455 460
Asp Leu Pro Gly Glu Val Gly Tyr Ala Leu Asp Ile Pro Trp Tyr Ala
465 470 475 480
Ser Leu Pro Arg Leu Glu Thr Arg Tyr Tyr Leu Glu Gln Tyr Gly Gly
485 490 495
Glu Asp Asp Val Trp Ile Gly Lys Thr Leu Tyr Arg Met Gly Tyr Val
500 505 510
Ser Asn Asn Thr Tyr Leu Glu Met Ala Lys Leu Asp Tyr Asn Asn Tyr
515 520 525
Val Ala Val Leu Gln Leu Glu Trp Tyr Thr Ile Gln Gln Trp Tyr Val
530 535 540
Asp Ile Gly Ile Glu Lys Phe Glu Ser Asp Asn Ile Lys Ser Val Leu
545 550 555 560
Val Ser Tyr Tyr Leu Ala Ala Ala Ser Ile Phe Glu Pro Glu Arg Ser
565 570 575
Lys Glu Arg Ile Ala Trp Ala Lys Thr Thr Ile Leu Val Asp Lys Ile
580 585 590
Thr Ser Ile Phe Asp Ser Ser Gln Ser Ser Lys Glu Asp Ile Thr Ala
595 600 605
Phe Ile Asp Lys Phe Arg Asn Lys Ser Ser Ser Lys Lys His Ser Ile
610 615 620
Asn Gly Glu Pro Trp His Glu Val Met Val Ala Leu Lys Lys Thr Leu
625 630 635 640
His Gly Phe Ala Leu Asp Ala Leu Met Thr His Ser Gln Asp Ile His
645 650 655
Pro Gln Leu His Gln Ala Trp Glu Met Trp Leu Thr Lys Leu Gln Asp
660 665 670
Gly Val Asp Val Thr Ala Glu Leu Met Val Gln Met Ile Asn Met Thr
675 680 685
Ala Gly Arg Trp Val Ser Lys Glu Leu Leu Thr His Pro Gln Tyr Gln
690 695 700
Arg Leu Ser Thr Val Thr Asn Ser Val Cys His Asp Ile Thr Lys Leu
705 710 715 720
His Asn Phe Lys Glu Asn Ser Thr Thr Val Asp Ser Lys Val Gln Glu
725 730 735
Leu Val Gln Leu Val Phe Ser Asp Thr Pro Asp Asp Leu Asp Gln Asp
740 745 750
Met Lys Gln Thr Phe Leu Thr Val Met Lys Thr Phe Tyr Tyr Lys Ala
755 760 765
Trp Cys Asp Pro Asn Thr Ile Asn Asp His Ile Ser Lys Val Phe Glu
770 775 780
Ile Val Ile
785
<210> 61
<211> 2229
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(2229)
<400> 61
atg tgt aaa gct gtt tcc aag gaa tac tct gac ttg ttg caa aag gat 48
Met Cys Lys Ala Val Ser Lys Glu Tyr Ser Asp Leu Leu Gln Lys Asp
1 5 10 15
gaa gcc tcc ttc acc aaa tgg gat gat gac aaa gtt aag gac cat tta 96
Glu Ala Ser Phe Thr Lys Trp Asp Asp Asp Lys Val Lys Asp His Leu
20 25 30
gac act aac aag aac ttg tac cca aac gat gaa atc aag gaa ttc gtc 144
Asp Thr Asn Lys Asn Leu Tyr Pro Asn Asp Glu Ile Lys Glu Phe Val
35 40 45
gaa tct gtc aaa gct atg ttc ggt tcc atg aat gat ggt gaa atc aac 192
Glu Ser Val Lys Ala Met Phe Gly Ser Met Asn Asp Gly Glu Ile Asn
50 55 60
gtt tcc gct tac gac acc gct tgg gtt gct ttg gtt caa gac gtt gat 240
Val Ser Ala Tyr Asp Thr Ala Trp Val Ala Leu Val Gln Asp Val Asp
65 70 75 80
ggt tcc ggt tct cca caa ttc cca tct tct ttg gaa tgg att gcc aac 288
Gly Ser Gly Ser Pro Gln Phe Pro Ser Ser Leu Glu Trp Ile Ala Asn
85 90 95
aac caa ttg tct gat ggt tct tgg ggt gac cat ttg tta ttc tct gct 336
Asn Gln Leu Ser Asp Gly Ser Trp Gly Asp His Leu Leu Phe Ser Ala
100 105 110
cac gac aga att att aac act tta gct tgt gtc att gct ttg act tcc 384
His Asp Arg Ile Ile Asn Thr Leu Ala Cys Val Ile Ala Leu Thr Ser
115 120 125
tgg aat gtc cat cca tcc aag tgt gaa aag ggt ttg aac ttc ttg aga 432
Trp Asn Val His Pro Ser Lys Cys Glu Lys Gly Leu Asn Phe Leu Arg
130 135 140
gaa aac atc tgt aag ttg gaa gat gaa aat gct gaa cac atg cca att 480
Glu Asn Ile Cys Lys Leu Glu Asp Glu Asn Ala Glu His Met Pro Ile
145 150 155 160
ggt ttc gaa gtt acc ttc cca tct ttg att gat atc gcc aag aag ttg 528
Gly Phe Glu Val Thr Phe Pro Ser Leu Ile Asp Ile Ala Lys Lys Leu
165 170 175
aac atc gaa gtc cca gaa gac acc cca gct ttg aag gaa atc tac gcc 576
Asn Ile Glu Val Pro Glu Asp Thr Pro Ala Leu Lys Glu Ile Tyr Ala
180 185 190
aga aga gat atc aag ttg acc aaa atc cca atg gaa gtt ttg cac aag 624
Arg Arg Asp Ile Lys Leu Thr Lys Ile Pro Met Glu Val Leu His Lys
195 200 205
gtt cca acc acc ttg ttg cac tct ttg gaa ggt atg cca gac ttg gaa 672
Val Pro Thr Thr Leu Leu His Ser Leu Glu Gly Met Pro Asp Leu Glu
210 215 220
tgg gaa aag ttg tta aag ttg caa tgt aag gac ggt tct ttc ttg ttc 720
Trp Glu Lys Leu Leu Lys Leu Gln Cys Lys Asp Gly Ser Phe Leu Phe
225 230 235 240
tct cca tct tct acc gcc ttt gct ttg atg caa act aag gac gaa aag 768
Ser Pro Ser Ser Thr Ala Phe Ala Leu Met Gln Thr Lys Asp Glu Lys
245 250 255
tgt cta caa tac tta act aat atc gtt acc aaa ttc aac ggt ggt gtc 816
Cys Leu Gln Tyr Leu Thr Asn Ile Val Thr Lys Phe Asn Gly Gly Val
260 265 270
cca aac gtt tac cct gtt gac ttg ttt gaa cac atc tgg gtt gtt gac 864
Pro Asn Val Tyr Pro Val Asp Leu Phe Glu His Ile Trp Val Val Asp
275 280 285
aga ttg caa cgt ttg ggt att gct cgt tat ttc aag tct gaa atc aag 912
Arg Leu Gln Arg Leu Gly Ile Ala Arg Tyr Phe Lys Ser Glu Ile Lys
290 295 300
gac tgt gtt gaa tac atc aac aag tac tgg act aag aac ggt atc tgt 960
Asp Cys Val Glu Tyr Ile Asn Lys Tyr Trp Thr Lys Asn Gly Ile Cys
305 310 315 320
tgg gct cgt aac acc cac gtt caa gat atc gac gac act gct atg ggt 1008
Trp Ala Arg Asn Thr His Val Gln Asp Ile Asp Asp Thr Ala Met Gly
325 330 335
ttc aga gtc ttg aga gct cat ggt tac gat gtc acc cca gat gtc ttc 1056
Phe Arg Val Leu Arg Ala His Gly Tyr Asp Val Thr Pro Asp Val Phe
340 345 350
aga caa ttc gaa aag gat ggt aag ttc gtt tgt ttt gcc ggt caa tcc 1104
Arg Gln Phe Glu Lys Asp Gly Lys Phe Val Cys Phe Ala Gly Gln Ser
355 360 365
act caa gcc gtc act ggt atg ttc aac gtc tac aga gct tct caa atg 1152
Thr Gln Ala Val Thr Gly Met Phe Asn Val Tyr Arg Ala Ser Gln Met
370 375 380
ttg ttc cca ggt gaa aga atc cta gaa gac gct aag aag ttc tcc tac 1200
Leu Phe Pro Gly Glu Arg Ile Leu Glu Asp Ala Lys Lys Phe Ser Tyr
385 390 395 400
aac tac ttg aaa gaa aag caa tct act aac gaa ttg ttg gac aaa tgg 1248
Asn Tyr Leu Lys Glu Lys Gln Ser Thr Asn Glu Leu Leu Asp Lys Trp
405 410 415
atc att gcc aaa gac tta cca ggt gaa gtc ggt tac gct ttg gat att 1296
Ile Ile Ala Lys Asp Leu Pro Gly Glu Val Gly Tyr Ala Leu Asp Ile
420 425 430
cca tgg tac gct tct cta cca aga tta gaa acc aga tac tac ttg gaa 1344
Pro Trp Tyr Ala Ser Leu Pro Arg Leu Glu Thr Arg Tyr Tyr Leu Glu
435 440 445
caa tac ggt ggt gaa gac gat gtc tgg atc ggt aag acc ttg tac aga 1392
Gln Tyr Gly Gly Glu Asp Asp Val Trp Ile Gly Lys Thr Leu Tyr Arg
450 455 460
atg ggt tac gtt tcc aac aac act tac ttg gaa atg gcc aaa ttg gac 1440
Met Gly Tyr Val Ser Asn Asn Thr Tyr Leu Glu Met Ala Lys Leu Asp
465 470 475 480
tac aac aac tac gtc gcc gtc tta caa ttg gaa tgg tac acc att caa 1488
Tyr Asn Asn Tyr Val Ala Val Leu Gln Leu Glu Trp Tyr Thr Ile Gln
485 490 495
caa tgg tac gtt gac att ggt att gaa aag ttt gaa tcc gac aac atc 1536
Gln Trp Tyr Val Asp Ile Gly Ile Glu Lys Phe Glu Ser Asp Asn Ile
500 505 510
aag tcc gtc ttg gtt tcc tac tac ttg gct gct gct tcc atc ttt gaa 1584
Lys Ser Val Leu Val Ser Tyr Tyr Leu Ala Ala Ala Ser Ile Phe Glu
515 520 525
cca gaa aga tcc aag gaa aga att gct tgg gct aag acc acc atc ttg 1632
Pro Glu Arg Ser Lys Glu Arg Ile Ala Trp Ala Lys Thr Thr Ile Leu
530 535 540
gtt gac aag atc act tct att ttc gac tct tcc caa tct tcc aag gaa 1680
Val Asp Lys Ile Thr Ser Ile Phe Asp Ser Ser Gln Ser Ser Lys Glu
545 550 555 560
gat atc acc gct ttc att gac aaa ttc aga aac aag tct tct tcc aag 1728
Asp Ile Thr Ala Phe Ile Asp Lys Phe Arg Asn Lys Ser Ser Ser Lys
565 570 575
aag cac tcc att aac ggt gaa cca tgg cac gaa gtt atg gtt gct ttg 1776
Lys His Ser Ile Asn Gly Glu Pro Trp His Glu Val Met Val Ala Leu
580 585 590
aag aag act ttg cac ggt ttt gct ttg gat gct ttg atg act cac tct 1824
Lys Lys Thr Leu His Gly Phe Ala Leu Asp Ala Leu Met Thr His Ser
595 600 605
caa gat att cac cct caa tta cac caa gct tgg gaa atg tgg tta acc 1872
Gln Asp Ile His Pro Gln Leu His Gln Ala Trp Glu Met Trp Leu Thr
610 615 620
aag ttg caa gat ggt gtc gat gtc act gct gaa ttg atg gtt caa atg 1920
Lys Leu Gln Asp Gly Val Asp Val Thr Ala Glu Leu Met Val Gln Met
625 630 635 640
atc aac atg act gcc ggt aga tgg gtt tct aag gaa ttg ttg act cac 1968
Ile Asn Met Thr Ala Gly Arg Trp Val Ser Lys Glu Leu Leu Thr His
645 650 655
cct caa tac caa cgt ttg tcc acc gtc acc aac tct gtc tgt cac gac 2016
Pro Gln Tyr Gln Arg Leu Ser Thr Val Thr Asn Ser Val Cys His Asp
660 665 670
atc act aag ttg cac aac ttc aaa gaa aac tcc act act gtc gat tct 2064
Ile Thr Lys Leu His Asn Phe Lys Glu Asn Ser Thr Thr Val Asp Ser
675 680 685
aag gtt caa gaa ttg gtt caa tta gtt ttc tct gac acc cca gat gac 2112
Lys Val Gln Glu Leu Val Gln Leu Val Phe Ser Asp Thr Pro Asp Asp
690 695 700
ttg gac caa gac atg aag caa act ttc ttg act gtc atg aag acc ttc 2160
Leu Asp Gln Asp Met Lys Gln Thr Phe Leu Thr Val Met Lys Thr Phe
705 710 715 720
tac tac aag gct tgg tgt gac cca aac acc atc aac gac cat att tct 2208
Tyr Tyr Lys Ala Trp Cys Asp Pro Asn Thr Ile Asn Asp His Ile Ser
725 730 735
aag gtc ttc gaa att gtt atc 2229
Lys Val Phe Glu Ile Val Ile
740
<210> 62
<211> 743
<212> PRT
<213> Stevia rebaudiana
<400> 62
Met Cys Lys Ala Val Ser Lys Glu Tyr Ser Asp Leu Leu Gln Lys Asp
1 5 10 15
Glu Ala Ser Phe Thr Lys Trp Asp Asp Asp Lys Val Lys Asp His Leu
20 25 30
Asp Thr Asn Lys Asn Leu Tyr Pro Asn Asp Glu Ile Lys Glu Phe Val
35 40 45
Glu Ser Val Lys Ala Met Phe Gly Ser Met Asn Asp Gly Glu Ile Asn
50 55 60
Val Ser Ala Tyr Asp Thr Ala Trp Val Ala Leu Val Gln Asp Val Asp
65 70 75 80
Gly Ser Gly Ser Pro Gln Phe Pro Ser Ser Leu Glu Trp Ile Ala Asn
85 90 95
Asn Gln Leu Ser Asp Gly Ser Trp Gly Asp His Leu Leu Phe Ser Ala
100 105 110
His Asp Arg Ile Ile Asn Thr Leu Ala Cys Val Ile Ala Leu Thr Ser
115 120 125
Trp Asn Val His Pro Ser Lys Cys Glu Lys Gly Leu Asn Phe Leu Arg
130 135 140
Glu Asn Ile Cys Lys Leu Glu Asp Glu Asn Ala Glu His Met Pro Ile
145 150 155 160
Gly Phe Glu Val Thr Phe Pro Ser Leu Ile Asp Ile Ala Lys Lys Leu
165 170 175
Asn Ile Glu Val Pro Glu Asp Thr Pro Ala Leu Lys Glu Ile Tyr Ala
180 185 190
Arg Arg Asp Ile Lys Leu Thr Lys Ile Pro Met Glu Val Leu His Lys
195 200 205
Val Pro Thr Thr Leu Leu His Ser Leu Glu Gly Met Pro Asp Leu Glu
210 215 220
Trp Glu Lys Leu Leu Lys Leu Gln Cys Lys Asp Gly Ser Phe Leu Phe
225 230 235 240
Ser Pro Ser Ser Thr Ala Phe Ala Leu Met Gln Thr Lys Asp Glu Lys
245 250 255
Cys Leu Gln Tyr Leu Thr Asn Ile Val Thr Lys Phe Asn Gly Gly Val
260 265 270
Pro Asn Val Tyr Pro Val Asp Leu Phe Glu His Ile Trp Val Val Asp
275 280 285
Arg Leu Gln Arg Leu Gly Ile Ala Arg Tyr Phe Lys Ser Glu Ile Lys
290 295 300
Asp Cys Val Glu Tyr Ile Asn Lys Tyr Trp Thr Lys Asn Gly Ile Cys
305 310 315 320
Trp Ala Arg Asn Thr His Val Gln Asp Ile Asp Asp Thr Ala Met Gly
325 330 335
Phe Arg Val Leu Arg Ala His Gly Tyr Asp Val Thr Pro Asp Val Phe
340 345 350
Arg Gln Phe Glu Lys Asp Gly Lys Phe Val Cys Phe Ala Gly Gln Ser
355 360 365
Thr Gln Ala Val Thr Gly Met Phe Asn Val Tyr Arg Ala Ser Gln Met
370 375 380
Leu Phe Pro Gly Glu Arg Ile Leu Glu Asp Ala Lys Lys Phe Ser Tyr
385 390 395 400
Asn Tyr Leu Lys Glu Lys Gln Ser Thr Asn Glu Leu Leu Asp Lys Trp
405 410 415
Ile Ile Ala Lys Asp Leu Pro Gly Glu Val Gly Tyr Ala Leu Asp Ile
420 425 430
Pro Trp Tyr Ala Ser Leu Pro Arg Leu Glu Thr Arg Tyr Tyr Leu Glu
435 440 445
Gln Tyr Gly Gly Glu Asp Asp Val Trp Ile Gly Lys Thr Leu Tyr Arg
450 455 460
Met Gly Tyr Val Ser Asn Asn Thr Tyr Leu Glu Met Ala Lys Leu Asp
465 470 475 480
Tyr Asn Asn Tyr Val Ala Val Leu Gln Leu Glu Trp Tyr Thr Ile Gln
485 490 495
Gln Trp Tyr Val Asp Ile Gly Ile Glu Lys Phe Glu Ser Asp Asn Ile
500 505 510
Lys Ser Val Leu Val Ser Tyr Tyr Leu Ala Ala Ala Ser Ile Phe Glu
515 520 525
Pro Glu Arg Ser Lys Glu Arg Ile Ala Trp Ala Lys Thr Thr Ile Leu
530 535 540
Val Asp Lys Ile Thr Ser Ile Phe Asp Ser Ser Gln Ser Ser Lys Glu
545 550 555 560
Asp Ile Thr Ala Phe Ile Asp Lys Phe Arg Asn Lys Ser Ser Ser Lys
565 570 575
Lys His Ser Ile Asn Gly Glu Pro Trp His Glu Val Met Val Ala Leu
580 585 590
Lys Lys Thr Leu His Gly Phe Ala Leu Asp Ala Leu Met Thr His Ser
595 600 605
Gln Asp Ile His Pro Gln Leu His Gln Ala Trp Glu Met Trp Leu Thr
610 615 620
Lys Leu Gln Asp Gly Val Asp Val Thr Ala Glu Leu Met Val Gln Met
625 630 635 640
Ile Asn Met Thr Ala Gly Arg Trp Val Ser Lys Glu Leu Leu Thr His
645 650 655
Pro Gln Tyr Gln Arg Leu Ser Thr Val Thr Asn Ser Val Cys His Asp
660 665 670
Ile Thr Lys Leu His Asn Phe Lys Glu Asn Ser Thr Thr Val Asp Ser
675 680 685
Lys Val Gln Glu Leu Val Gln Leu Val Phe Ser Asp Thr Pro Asp Asp
690 695 700
Leu Asp Gln Asp Met Lys Gln Thr Phe Leu Thr Val Met Lys Thr Phe
705 710 715 720
Tyr Tyr Lys Ala Trp Cys Asp Pro Asn Thr Ile Asn Asp His Ile Ser
725 730 735
Lys Val Phe Glu Ile Val Ile
740
<210> 63
<211> 2352
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(2352)
<400> 63
atg aac ttg tct cta tgt att gct tct cca tta ttg acc aaa tct aac 48
Met Asn Leu Ser Leu Cys Ile Ala Ser Pro Leu Leu Thr Lys Ser Asn
1 5 10 15
aga cca gcc gct cta tcc gct att cac act gcc tcc act tct cac ggt 96
Arg Pro Ala Ala Leu Ser Ala Ile His Thr Ala Ser Thr Ser His Gly
20 25 30
ggt caa acc aac cca acc aac ttg att att gac acc acc aag gaa aga 144
Gly Gln Thr Asn Pro Thr Asn Leu Ile Ile Asp Thr Thr Lys Glu Arg
35 40 45
atc caa aag caa ttc aag aat gtt gaa atc tcc gtt tcc tcc tac gac 192
Ile Gln Lys Gln Phe Lys Asn Val Glu Ile Ser Val Ser Ser Tyr Asp
50 55 60
act gct tgg gtt gcc atg gtt cca tct cca aac tcc cca aag tct cca 240
Thr Ala Trp Val Ala Met Val Pro Ser Pro Asn Ser Pro Lys Ser Pro
65 70 75 80
tgt ttc cca gaa tgt ttg aac tgg tta atc aac aac caa ttg aac gat 288
Cys Phe Pro Glu Cys Leu Asn Trp Leu Ile Asn Asn Gln Leu Asn Asp
85 90 95
ggt tcc tgg ggt tta gtc aat cac acc cac aac cac aat cac cca ttg 336
Gly Ser Trp Gly Leu Val Asn His Thr His Asn His Asn His Pro Leu
100 105 110
ttg aag gac tct cta tcc tcc act ttg gct tgt atc gtt gct ttg aag 384
Leu Lys Asp Ser Leu Ser Ser Thr Leu Ala Cys Ile Val Ala Leu Lys
115 120 125
aga tgg aac gtt ggt gaa gac caa atc aac aag ggt ttg tcc ttt att 432
Arg Trp Asn Val Gly Glu Asp Gln Ile Asn Lys Gly Leu Ser Phe Ile
130 135 140
gaa tcc aac ttg gct tct gct act gaa aag tcc caa cca tct cct atc 480
Glu Ser Asn Leu Ala Ser Ala Thr Glu Lys Ser Gln Pro Ser Pro Ile
145 150 155 160
ggt ttt gac atc att ttc cca ggt tta ttg gaa tac gct aag aac ttg 528
Gly Phe Asp Ile Ile Phe Pro Gly Leu Leu Glu Tyr Ala Lys Asn Leu
165 170 175
gac atc aac tta tta tct aag caa acc gat ttc tcc ttg atg ttg cac 576
Asp Ile Asn Leu Leu Ser Lys Gln Thr Asp Phe Ser Leu Met Leu His
180 185 190
aag aga gaa ttg gaa caa aag aga tgt cac tcc aac gaa atg gac ggt 624
Lys Arg Glu Leu Glu Gln Lys Arg Cys His Ser Asn Glu Met Asp Gly
195 200 205
tac ttg gct tac att tct gaa ggt ttg ggt aac ttg tac gac tgg aac 672
Tyr Leu Ala Tyr Ile Ser Glu Gly Leu Gly Asn Leu Tyr Asp Trp Asn
210 215 220
atg gtc aag aaa tac caa atg aag aac ggt tcc gtt ttc aac tct cca 720
Met Val Lys Lys Tyr Gln Met Lys Asn Gly Ser Val Phe Asn Ser Pro
225 230 235 240
tct gct acc gct gct gct ttc atc aac cat caa aac cca ggt tgt ttg 768
Ser Ala Thr Ala Ala Ala Phe Ile Asn His Gln Asn Pro Gly Cys Leu
245 250 255
aac tac ttg aac tct ttg ttg gac aaa ttc ggt aac gct gtt cca act 816
Asn Tyr Leu Asn Ser Leu Leu Asp Lys Phe Gly Asn Ala Val Pro Thr
260 265 270
gtc tac cca cac gat ttg ttt atc aga tta tcc atg gtt gac acc att 864
Val Tyr Pro His Asp Leu Phe Ile Arg Leu Ser Met Val Asp Thr Ile
275 280 285
gaa cgt ttg ggt att tct cat cac ttc aga gtc gaa atc aag aac gtt 912
Glu Arg Leu Gly Ile Ser His His Phe Arg Val Glu Ile Lys Asn Val
290 295 300
ttg gat gaa act tac aga tgt tgg gtt gaa aga gat gaa caa atc ttc 960
Leu Asp Glu Thr Tyr Arg Cys Trp Val Glu Arg Asp Glu Gln Ile Phe
305 310 315 320
atg gat gtc gtc act tgt gcc ttg gcc ttc aga tta ttg aga att aac 1008
Met Asp Val Val Thr Cys Ala Leu Ala Phe Arg Leu Leu Arg Ile Asn
325 330 335
ggt tac gaa gtt tct cca gac cca ttg gct gaa atc act aac gaa ttg 1056
Gly Tyr Glu Val Ser Pro Asp Pro Leu Ala Glu Ile Thr Asn Glu Leu
340 345 350
gct ttg aag gac gaa tac gcc gct ttg gaa act tac cat gcc tct cac 1104
Ala Leu Lys Asp Glu Tyr Ala Ala Leu Glu Thr Tyr His Ala Ser His
355 360 365
atc tta tac caa gaa gac ttg tcc tct ggt aag caa atc ttg aag tct 1152
Ile Leu Tyr Gln Glu Asp Leu Ser Ser Gly Lys Gln Ile Leu Lys Ser
370 375 380
gct gac ttc ttg aag gaa att atc tct act gat tct aac aga ttg tcc 1200
Ala Asp Phe Leu Lys Glu Ile Ile Ser Thr Asp Ser Asn Arg Leu Ser
385 390 395 400
aag ttg att cac aag gaa gtt gaa aac gcc ttg aaa ttc cca atc aac 1248
Lys Leu Ile His Lys Glu Val Glu Asn Ala Leu Lys Phe Pro Ile Asn
405 410 415
act ggt ttg gaa aga att aac acc aga aga aac atc caa tta tac aac 1296
Thr Gly Leu Glu Arg Ile Asn Thr Arg Arg Asn Ile Gln Leu Tyr Asn
420 425 430
gtt gac aac act aga atc ttg aag act act tat cac tct tcc aac atc 1344
Val Asp Asn Thr Arg Ile Leu Lys Thr Thr Tyr His Ser Ser Asn Ile
435 440 445
tcc aac act gac tac ttg aga ttg gct gtc gaa gat ttc tac acc tgt 1392
Ser Asn Thr Asp Tyr Leu Arg Leu Ala Val Glu Asp Phe Tyr Thr Cys
450 455 460
caa tct att tac aga gaa gaa ttg aag ggt ttg gaa aga tgg gtt gtc 1440
Gln Ser Ile Tyr Arg Glu Glu Leu Lys Gly Leu Glu Arg Trp Val Val
465 470 475 480
gaa aac aaa ttg gac caa ttg aaa ttt gct aga caa aag acc gcc tac 1488
Glu Asn Lys Leu Asp Gln Leu Lys Phe Ala Arg Gln Lys Thr Ala Tyr
485 490 495
tgt tac ttc tcc gtt gct gcc act ttg tcc tct cca gaa tta tct gac 1536
Cys Tyr Phe Ser Val Ala Ala Thr Leu Ser Ser Pro Glu Leu Ser Asp
500 505 510
gcc aga atc tcc tgg gct aag aat ggt atc ttg acc acc gtt gtc gat 1584
Ala Arg Ile Ser Trp Ala Lys Asn Gly Ile Leu Thr Thr Val Val Asp
515 520 525
gac ttc ttc gat att ggt ggt acc att gac gaa ttg acc aac ttg att 1632
Asp Phe Phe Asp Ile Gly Gly Thr Ile Asp Glu Leu Thr Asn Leu Ile
530 535 540
caa tgt gtt gaa aag tgg aac gtc gat gtc gat aag gac tgt tgt tct 1680
Gln Cys Val Glu Lys Trp Asn Val Asp Val Asp Lys Asp Cys Cys Ser
545 550 555 560
gaa cac gtc aga atc tta ttc ttg gct ttg aaa gat gct atc tgt tgg 1728
Glu His Val Arg Ile Leu Phe Leu Ala Leu Lys Asp Ala Ile Cys Trp
565 570 575
atc ggt gac gaa gct ttc aaa tgg caa gct cgt gac gtt acc tct cac 1776
Ile Gly Asp Glu Ala Phe Lys Trp Gln Ala Arg Asp Val Thr Ser His
580 585 590
gtc atc caa acc tgg ttg gaa ttg atg aac tct atg ttg aga gaa gcc 1824
Val Ile Gln Thr Trp Leu Glu Leu Met Asn Ser Met Leu Arg Glu Ala
595 600 605
atc tgg acc cgt gat gct tac gtc cca act ttg aac gaa tac atg gaa 1872
Ile Trp Thr Arg Asp Ala Tyr Val Pro Thr Leu Asn Glu Tyr Met Glu
610 615 620
aat gct tac gtt tct ttc gct ttg ggt cca att gtc aag cct gct att 1920
Asn Ala Tyr Val Ser Phe Ala Leu Gly Pro Ile Val Lys Pro Ala Ile
625 630 635 640
tac ttc gtt ggt cca aag ttg tcc gaa gaa att gtt gaa tct tct gaa 1968
Tyr Phe Val Gly Pro Lys Leu Ser Glu Glu Ile Val Glu Ser Ser Glu
645 650 655
tac cac aac ttg ttc aaa ttg atg tct act caa ggt cgt ttg ttg aac 2016
Tyr His Asn Leu Phe Lys Leu Met Ser Thr Gln Gly Arg Leu Leu Asn
660 665 670
gat atc cac tct ttc aag cgt gaa ttc aag gaa ggt aag ttg aat gct 2064
Asp Ile His Ser Phe Lys Arg Glu Phe Lys Glu Gly Lys Leu Asn Ala
675 680 685
gtt gct ttg cat ttg tct aac ggt gaa tct ggt aag gtc gaa gaa gaa 2112
Val Ala Leu His Leu Ser Asn Gly Glu Ser Gly Lys Val Glu Glu Glu
690 695 700
gtt gtc gaa gaa atg atg atg atg atc aag aac aag aga aag gaa ttg 2160
Val Val Glu Glu Met Met Met Met Ile Lys Asn Lys Arg Lys Glu Leu
705 710 715 720
atg aag ttg atc ttt gaa gaa aac ggt tct att gtc cca aga gct tgt 2208
Met Lys Leu Ile Phe Glu Glu Asn Gly Ser Ile Val Pro Arg Ala Cys
725 730 735
aag gat gct ttc tgg aac atg tgt cac gtc ttg aac ttc ttc tac gct 2256
Lys Asp Ala Phe Trp Asn Met Cys His Val Leu Asn Phe Phe Tyr Ala
740 745 750
aac gat gac ggt ttc act ggt aac acc atc tta gac acc gtc aag gac 2304
Asn Asp Asp Gly Phe Thr Gly Asn Thr Ile Leu Asp Thr Val Lys Asp
755 760 765
atc att tac aac cca tta gtc ttg gtt aac gaa aac gaa gaa caa aga 2352
Ile Ile Tyr Asn Pro Leu Val Leu Val Asn Glu Asn Glu Glu Gln Arg
770 775 780
<210> 64
<211> 784
<212> PRT
<213> Stevia rebaudiana
<400> 64
Met Asn Leu Ser Leu Cys Ile Ala Ser Pro Leu Leu Thr Lys Ser Asn
1 5 10 15
Arg Pro Ala Ala Leu Ser Ala Ile His Thr Ala Ser Thr Ser His Gly
20 25 30
Gly Gln Thr Asn Pro Thr Asn Leu Ile Ile Asp Thr Thr Lys Glu Arg
35 40 45
Ile Gln Lys Gln Phe Lys Asn Val Glu Ile Ser Val Ser Ser Tyr Asp
50 55 60
Thr Ala Trp Val Ala Met Val Pro Ser Pro Asn Ser Pro Lys Ser Pro
65 70 75 80
Cys Phe Pro Glu Cys Leu Asn Trp Leu Ile Asn Asn Gln Leu Asn Asp
85 90 95
Gly Ser Trp Gly Leu Val Asn His Thr His Asn His Asn His Pro Leu
100 105 110
Leu Lys Asp Ser Leu Ser Ser Thr Leu Ala Cys Ile Val Ala Leu Lys
115 120 125
Arg Trp Asn Val Gly Glu Asp Gln Ile Asn Lys Gly Leu Ser Phe Ile
130 135 140
Glu Ser Asn Leu Ala Ser Ala Thr Glu Lys Ser Gln Pro Ser Pro Ile
145 150 155 160
Gly Phe Asp Ile Ile Phe Pro Gly Leu Leu Glu Tyr Ala Lys Asn Leu
165 170 175
Asp Ile Asn Leu Leu Ser Lys Gln Thr Asp Phe Ser Leu Met Leu His
180 185 190
Lys Arg Glu Leu Glu Gln Lys Arg Cys His Ser Asn Glu Met Asp Gly
195 200 205
Tyr Leu Ala Tyr Ile Ser Glu Gly Leu Gly Asn Leu Tyr Asp Trp Asn
210 215 220
Met Val Lys Lys Tyr Gln Met Lys Asn Gly Ser Val Phe Asn Ser Pro
225 230 235 240
Ser Ala Thr Ala Ala Ala Phe Ile Asn His Gln Asn Pro Gly Cys Leu
245 250 255
Asn Tyr Leu Asn Ser Leu Leu Asp Lys Phe Gly Asn Ala Val Pro Thr
260 265 270
Val Tyr Pro His Asp Leu Phe Ile Arg Leu Ser Met Val Asp Thr Ile
275 280 285
Glu Arg Leu Gly Ile Ser His His Phe Arg Val Glu Ile Lys Asn Val
290 295 300
Leu Asp Glu Thr Tyr Arg Cys Trp Val Glu Arg Asp Glu Gln Ile Phe
305 310 315 320
Met Asp Val Val Thr Cys Ala Leu Ala Phe Arg Leu Leu Arg Ile Asn
325 330 335
Gly Tyr Glu Val Ser Pro Asp Pro Leu Ala Glu Ile Thr Asn Glu Leu
340 345 350
Ala Leu Lys Asp Glu Tyr Ala Ala Leu Glu Thr Tyr His Ala Ser His
355 360 365
Ile Leu Tyr Gln Glu Asp Leu Ser Ser Gly Lys Gln Ile Leu Lys Ser
370 375 380
Ala Asp Phe Leu Lys Glu Ile Ile Ser Thr Asp Ser Asn Arg Leu Ser
385 390 395 400
Lys Leu Ile His Lys Glu Val Glu Asn Ala Leu Lys Phe Pro Ile Asn
405 410 415
Thr Gly Leu Glu Arg Ile Asn Thr Arg Arg Asn Ile Gln Leu Tyr Asn
420 425 430
Val Asp Asn Thr Arg Ile Leu Lys Thr Thr Tyr His Ser Ser Asn Ile
435 440 445
Ser Asn Thr Asp Tyr Leu Arg Leu Ala Val Glu Asp Phe Tyr Thr Cys
450 455 460
Gln Ser Ile Tyr Arg Glu Glu Leu Lys Gly Leu Glu Arg Trp Val Val
465 470 475 480
Glu Asn Lys Leu Asp Gln Leu Lys Phe Ala Arg Gln Lys Thr Ala Tyr
485 490 495
Cys Tyr Phe Ser Val Ala Ala Thr Leu Ser Ser Pro Glu Leu Ser Asp
500 505 510
Ala Arg Ile Ser Trp Ala Lys Asn Gly Ile Leu Thr Thr Val Val Asp
515 520 525
Asp Phe Phe Asp Ile Gly Gly Thr Ile Asp Glu Leu Thr Asn Leu Ile
530 535 540
Gln Cys Val Glu Lys Trp Asn Val Asp Val Asp Lys Asp Cys Cys Ser
545 550 555 560
Glu His Val Arg Ile Leu Phe Leu Ala Leu Lys Asp Ala Ile Cys Trp
565 570 575
Ile Gly Asp Glu Ala Phe Lys Trp Gln Ala Arg Asp Val Thr Ser His
580 585 590
Val Ile Gln Thr Trp Leu Glu Leu Met Asn Ser Met Leu Arg Glu Ala
595 600 605
Ile Trp Thr Arg Asp Ala Tyr Val Pro Thr Leu Asn Glu Tyr Met Glu
610 615 620
Asn Ala Tyr Val Ser Phe Ala Leu Gly Pro Ile Val Lys Pro Ala Ile
625 630 635 640
Tyr Phe Val Gly Pro Lys Leu Ser Glu Glu Ile Val Glu Ser Ser Glu
645 650 655
Tyr His Asn Leu Phe Lys Leu Met Ser Thr Gln Gly Arg Leu Leu Asn
660 665 670
Asp Ile His Ser Phe Lys Arg Glu Phe Lys Glu Gly Lys Leu Asn Ala
675 680 685
Val Ala Leu His Leu Ser Asn Gly Glu Ser Gly Lys Val Glu Glu Glu
690 695 700
Val Val Glu Glu Met Met Met Met Ile Lys Asn Lys Arg Lys Glu Leu
705 710 715 720
Met Lys Leu Ile Phe Glu Glu Asn Gly Ser Ile Val Pro Arg Ala Cys
725 730 735
Lys Asp Ala Phe Trp Asn Met Cys His Val Leu Asn Phe Phe Tyr Ala
740 745 750
Asn Asp Asp Gly Phe Thr Gly Asn Thr Ile Leu Asp Thr Val Lys Asp
755 760 765
Ile Ile Tyr Asn Pro Leu Val Leu Val Asn Glu Asn Glu Glu Gln Arg
770 775 780
<210> 65
<211> 2271
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(2271)
<400> 65
atg act tct cac ggt ggt caa acc aac cca acc aac ttg att att gac 48
Met Thr Ser His Gly Gly Gln Thr Asn Pro Thr Asn Leu Ile Ile Asp
1 5 10 15
acc acc aag gaa aga atc caa aag caa ttc aag aat gtt gaa atc tcc 96
Thr Thr Lys Glu Arg Ile Gln Lys Gln Phe Lys Asn Val Glu Ile Ser
20 25 30
gtt tcc tcc tac gac act gct tgg gtt gcc atg gtt cca tct cca aac 144
Val Ser Ser Tyr Asp Thr Ala Trp Val Ala Met Val Pro Ser Pro Asn
35 40 45
tcc cca aag tct cca tgt ttc cca gaa tgt ttg aac tgg tta atc aac 192
Ser Pro Lys Ser Pro Cys Phe Pro Glu Cys Leu Asn Trp Leu Ile Asn
50 55 60
aac caa ttg aac gat ggt tcc tgg ggt tta gtc aat cac acc cac aac 240
Asn Gln Leu Asn Asp Gly Ser Trp Gly Leu Val Asn His Thr His Asn
65 70 75 80
cac aat cac cca ttg ttg aag gac tct cta tcc tcc act ttg gct tgt 288
His Asn His Pro Leu Leu Lys Asp Ser Leu Ser Ser Thr Leu Ala Cys
85 90 95
atc gtt gct ttg aag aga tgg aac gtt ggt gaa gac caa atc aac aag 336
Ile Val Ala Leu Lys Arg Trp Asn Val Gly Glu Asp Gln Ile Asn Lys
100 105 110
ggt ttg tcc ttt att gaa tcc aac ttg gct tct gct act gaa aag tcc 384
Gly Leu Ser Phe Ile Glu Ser Asn Leu Ala Ser Ala Thr Glu Lys Ser
115 120 125
caa cca tct cct atc ggt ttt gac atc att ttc cca ggt tta ttg gaa 432
Gln Pro Ser Pro Ile Gly Phe Asp Ile Ile Phe Pro Gly Leu Leu Glu
130 135 140
tac gct aag aac ttg gac atc aac tta tta tct aag caa acc gat ttc 480
Tyr Ala Lys Asn Leu Asp Ile Asn Leu Leu Ser Lys Gln Thr Asp Phe
145 150 155 160
tcc ttg atg ttg cac aag aga gaa ttg gaa caa aag aga tgt cac tcc 528
Ser Leu Met Leu His Lys Arg Glu Leu Glu Gln Lys Arg Cys His Ser
165 170 175
aac gaa atg gac ggt tac ttg gct tac att tct gaa ggt ttg ggt aac 576
Asn Glu Met Asp Gly Tyr Leu Ala Tyr Ile Ser Glu Gly Leu Gly Asn
180 185 190
ttg tac gac tgg aac atg gtc aag aaa tac caa atg aag aac ggt tcc 624
Leu Tyr Asp Trp Asn Met Val Lys Lys Tyr Gln Met Lys Asn Gly Ser
195 200 205
gtt ttc aac tct cca tct gct acc gct gct gct ttc atc aac cat caa 672
Val Phe Asn Ser Pro Ser Ala Thr Ala Ala Ala Phe Ile Asn His Gln
210 215 220
aac cca ggt tgt ttg aac tac ttg aac tct ttg ttg gac aaa ttc ggt 720
Asn Pro Gly Cys Leu Asn Tyr Leu Asn Ser Leu Leu Asp Lys Phe Gly
225 230 235 240
aac gct gtt cca act gtc tac cca cac gat ttg ttt atc aga tta tcc 768
Asn Ala Val Pro Thr Val Tyr Pro His Asp Leu Phe Ile Arg Leu Ser
245 250 255
atg gtt gac acc att gaa cgt ttg ggt att tct cat cac ttc aga gtc 816
Met Val Asp Thr Ile Glu Arg Leu Gly Ile Ser His His Phe Arg Val
260 265 270
gaa atc aag aac gtt ttg gat gaa act tac aga tgt tgg gtt gaa aga 864
Glu Ile Lys Asn Val Leu Asp Glu Thr Tyr Arg Cys Trp Val Glu Arg
275 280 285
gat gaa caa atc ttc atg gat gtc gtc act tgt gcc ttg gcc ttc aga 912
Asp Glu Gln Ile Phe Met Asp Val Val Thr Cys Ala Leu Ala Phe Arg
290 295 300
tta ttg aga att aac ggt tac gaa gtt tct cca gac cca ttg gct gaa 960
Leu Leu Arg Ile Asn Gly Tyr Glu Val Ser Pro Asp Pro Leu Ala Glu
305 310 315 320
atc act aac gaa ttg gct ttg aag gac gaa tac gcc gct ttg gaa act 1008
Ile Thr Asn Glu Leu Ala Leu Lys Asp Glu Tyr Ala Ala Leu Glu Thr
325 330 335
tac cat gcc tct cac atc tta tac caa gaa gac ttg tcc tct ggt aag 1056
Tyr His Ala Ser His Ile Leu Tyr Gln Glu Asp Leu Ser Ser Gly Lys
340 345 350
caa atc ttg aag tct gct gac ttc ttg aag gaa att atc tct act gat 1104
Gln Ile Leu Lys Ser Ala Asp Phe Leu Lys Glu Ile Ile Ser Thr Asp
355 360 365
tct aac aga ttg tcc aag ttg att cac aag gaa gtt gaa aac gcc ttg 1152
Ser Asn Arg Leu Ser Lys Leu Ile His Lys Glu Val Glu Asn Ala Leu
370 375 380
aaa ttc cca atc aac act ggt ttg gaa aga att aac acc aga aga aac 1200
Lys Phe Pro Ile Asn Thr Gly Leu Glu Arg Ile Asn Thr Arg Arg Asn
385 390 395 400
atc caa tta tac aac gtt gac aac act aga atc ttg aag act act tat 1248
Ile Gln Leu Tyr Asn Val Asp Asn Thr Arg Ile Leu Lys Thr Thr Tyr
405 410 415
cac tct tcc aac atc tcc aac act gac tac ttg aga ttg gct gtc gaa 1296
His Ser Ser Asn Ile Ser Asn Thr Asp Tyr Leu Arg Leu Ala Val Glu
420 425 430
gat ttc tac acc tgt caa tct att tac aga gaa gaa ttg aag ggt ttg 1344
Asp Phe Tyr Thr Cys Gln Ser Ile Tyr Arg Glu Glu Leu Lys Gly Leu
435 440 445
gaa aga tgg gtt gtc gaa aac aaa ttg gac caa ttg aaa ttt gct aga 1392
Glu Arg Trp Val Val Glu Asn Lys Leu Asp Gln Leu Lys Phe Ala Arg
450 455 460
caa aag acc gcc tac tgt tac ttc tcc gtt gct gcc act ttg tcc tct 1440
Gln Lys Thr Ala Tyr Cys Tyr Phe Ser Val Ala Ala Thr Leu Ser Ser
465 470 475 480
cca gaa tta tct gac gcc aga atc tcc tgg gct aag aat ggt atc ttg 1488
Pro Glu Leu Ser Asp Ala Arg Ile Ser Trp Ala Lys Asn Gly Ile Leu
485 490 495
acc acc gtt gtc gat gac ttc ttc gat att ggt ggt acc att gac gaa 1536
Thr Thr Val Val Asp Asp Phe Phe Asp Ile Gly Gly Thr Ile Asp Glu
500 505 510
ttg acc aac ttg att caa tgt gtt gaa aag tgg aac gtc gat gtc gat 1584
Leu Thr Asn Leu Ile Gln Cys Val Glu Lys Trp Asn Val Asp Val Asp
515 520 525
aag gac tgt tgt tct gaa cac gtc aga atc tta ttc ttg gct ttg aaa 1632
Lys Asp Cys Cys Ser Glu His Val Arg Ile Leu Phe Leu Ala Leu Lys
530 535 540
gat gct atc tgt tgg atc ggt gac gaa gct ttc aaa tgg caa gct cgt 1680
Asp Ala Ile Cys Trp Ile Gly Asp Glu Ala Phe Lys Trp Gln Ala Arg
545 550 555 560
gac gtt acc tct cac gtc atc caa acc tgg ttg gaa ttg atg aac tct 1728
Asp Val Thr Ser His Val Ile Gln Thr Trp Leu Glu Leu Met Asn Ser
565 570 575
atg ttg aga gaa gcc atc tgg acc cgt gat gct tac gtc cca act ttg 1776
Met Leu Arg Glu Ala Ile Trp Thr Arg Asp Ala Tyr Val Pro Thr Leu
580 585 590
aac gaa tac atg gaa aat gct tac gtt tct ttc gct ttg ggt cca att 1824
Asn Glu Tyr Met Glu Asn Ala Tyr Val Ser Phe Ala Leu Gly Pro Ile
595 600 605
gtc aag cct gct att tac ttc gtt ggt cca aag ttg tcc gaa gaa att 1872
Val Lys Pro Ala Ile Tyr Phe Val Gly Pro Lys Leu Ser Glu Glu Ile
610 615 620
gtt gaa tct tct gaa tac cac aac ttg ttc aaa ttg atg tct act caa 1920
Val Glu Ser Ser Glu Tyr His Asn Leu Phe Lys Leu Met Ser Thr Gln
625 630 635 640
ggt cgt ttg ttg aac gat atc cac tct ttc aag cgt gaa ttc aag gaa 1968
Gly Arg Leu Leu Asn Asp Ile His Ser Phe Lys Arg Glu Phe Lys Glu
645 650 655
ggt aag ttg aat gct gtt gct ttg cat ttg tct aac ggt gaa tct ggt 2016
Gly Lys Leu Asn Ala Val Ala Leu His Leu Ser Asn Gly Glu Ser Gly
660 665 670
aag gtc gaa gaa gaa gtt gtc gaa gaa atg atg atg atg atc aag aac 2064
Lys Val Glu Glu Glu Val Val Glu Glu Met Met Met Met Ile Lys Asn
675 680 685
aag aga aag gaa ttg atg aag ttg atc ttt gaa gaa aac ggt tct att 2112
Lys Arg Lys Glu Leu Met Lys Leu Ile Phe Glu Glu Asn Gly Ser Ile
690 695 700
gtc cca aga gct tgt aag gat gct ttc tgg aac atg tgt cac gtc ttg 2160
Val Pro Arg Ala Cys Lys Asp Ala Phe Trp Asn Met Cys His Val Leu
705 710 715 720
aac ttc ttc tac gct aac gat gac ggt ttc act ggt aac acc atc tta 2208
Asn Phe Phe Tyr Ala Asn Asp Asp Gly Phe Thr Gly Asn Thr Ile Leu
725 730 735
gac acc gtc aag gac atc att tac aac cca tta gtc ttg gtt aac gaa 2256
Asp Thr Val Lys Asp Ile Ile Tyr Asn Pro Leu Val Leu Val Asn Glu
740 745 750
aac gaa gaa caa aga 2271
Asn Glu Glu Gln Arg
755
<210> 66
<211> 757
<212> PRT
<213> Stevia rebaudiana
<400> 66
Met Thr Ser His Gly Gly Gln Thr Asn Pro Thr Asn Leu Ile Ile Asp
1 5 10 15
Thr Thr Lys Glu Arg Ile Gln Lys Gln Phe Lys Asn Val Glu Ile Ser
20 25 30
Val Ser Ser Tyr Asp Thr Ala Trp Val Ala Met Val Pro Ser Pro Asn
35 40 45
Ser Pro Lys Ser Pro Cys Phe Pro Glu Cys Leu Asn Trp Leu Ile Asn
50 55 60
Asn Gln Leu Asn Asp Gly Ser Trp Gly Leu Val Asn His Thr His Asn
65 70 75 80
His Asn His Pro Leu Leu Lys Asp Ser Leu Ser Ser Thr Leu Ala Cys
85 90 95
Ile Val Ala Leu Lys Arg Trp Asn Val Gly Glu Asp Gln Ile Asn Lys
100 105 110
Gly Leu Ser Phe Ile Glu Ser Asn Leu Ala Ser Ala Thr Glu Lys Ser
115 120 125
Gln Pro Ser Pro Ile Gly Phe Asp Ile Ile Phe Pro Gly Leu Leu Glu
130 135 140
Tyr Ala Lys Asn Leu Asp Ile Asn Leu Leu Ser Lys Gln Thr Asp Phe
145 150 155 160
Ser Leu Met Leu His Lys Arg Glu Leu Glu Gln Lys Arg Cys His Ser
165 170 175
Asn Glu Met Asp Gly Tyr Leu Ala Tyr Ile Ser Glu Gly Leu Gly Asn
180 185 190
Leu Tyr Asp Trp Asn Met Val Lys Lys Tyr Gln Met Lys Asn Gly Ser
195 200 205
Val Phe Asn Ser Pro Ser Ala Thr Ala Ala Ala Phe Ile Asn His Gln
210 215 220
Asn Pro Gly Cys Leu Asn Tyr Leu Asn Ser Leu Leu Asp Lys Phe Gly
225 230 235 240
Asn Ala Val Pro Thr Val Tyr Pro His Asp Leu Phe Ile Arg Leu Ser
245 250 255
Met Val Asp Thr Ile Glu Arg Leu Gly Ile Ser His His Phe Arg Val
260 265 270
Glu Ile Lys Asn Val Leu Asp Glu Thr Tyr Arg Cys Trp Val Glu Arg
275 280 285
Asp Glu Gln Ile Phe Met Asp Val Val Thr Cys Ala Leu Ala Phe Arg
290 295 300
Leu Leu Arg Ile Asn Gly Tyr Glu Val Ser Pro Asp Pro Leu Ala Glu
305 310 315 320
Ile Thr Asn Glu Leu Ala Leu Lys Asp Glu Tyr Ala Ala Leu Glu Thr
325 330 335
Tyr His Ala Ser His Ile Leu Tyr Gln Glu Asp Leu Ser Ser Gly Lys
340 345 350
Gln Ile Leu Lys Ser Ala Asp Phe Leu Lys Glu Ile Ile Ser Thr Asp
355 360 365
Ser Asn Arg Leu Ser Lys Leu Ile His Lys Glu Val Glu Asn Ala Leu
370 375 380
Lys Phe Pro Ile Asn Thr Gly Leu Glu Arg Ile Asn Thr Arg Arg Asn
385 390 395 400
Ile Gln Leu Tyr Asn Val Asp Asn Thr Arg Ile Leu Lys Thr Thr Tyr
405 410 415
His Ser Ser Asn Ile Ser Asn Thr Asp Tyr Leu Arg Leu Ala Val Glu
420 425 430
Asp Phe Tyr Thr Cys Gln Ser Ile Tyr Arg Glu Glu Leu Lys Gly Leu
435 440 445
Glu Arg Trp Val Val Glu Asn Lys Leu Asp Gln Leu Lys Phe Ala Arg
450 455 460
Gln Lys Thr Ala Tyr Cys Tyr Phe Ser Val Ala Ala Thr Leu Ser Ser
465 470 475 480
Pro Glu Leu Ser Asp Ala Arg Ile Ser Trp Ala Lys Asn Gly Ile Leu
485 490 495
Thr Thr Val Val Asp Asp Phe Phe Asp Ile Gly Gly Thr Ile Asp Glu
500 505 510
Leu Thr Asn Leu Ile Gln Cys Val Glu Lys Trp Asn Val Asp Val Asp
515 520 525
Lys Asp Cys Cys Ser Glu His Val Arg Ile Leu Phe Leu Ala Leu Lys
530 535 540
Asp Ala Ile Cys Trp Ile Gly Asp Glu Ala Phe Lys Trp Gln Ala Arg
545 550 555 560
Asp Val Thr Ser His Val Ile Gln Thr Trp Leu Glu Leu Met Asn Ser
565 570 575
Met Leu Arg Glu Ala Ile Trp Thr Arg Asp Ala Tyr Val Pro Thr Leu
580 585 590
Asn Glu Tyr Met Glu Asn Ala Tyr Val Ser Phe Ala Leu Gly Pro Ile
595 600 605
Val Lys Pro Ala Ile Tyr Phe Val Gly Pro Lys Leu Ser Glu Glu Ile
610 615 620
Val Glu Ser Ser Glu Tyr His Asn Leu Phe Lys Leu Met Ser Thr Gln
625 630 635 640
Gly Arg Leu Leu Asn Asp Ile His Ser Phe Lys Arg Glu Phe Lys Glu
645 650 655
Gly Lys Leu Asn Ala Val Ala Leu His Leu Ser Asn Gly Glu Ser Gly
660 665 670
Lys Val Glu Glu Glu Val Val Glu Glu Met Met Met Met Ile Lys Asn
675 680 685
Lys Arg Lys Glu Leu Met Lys Leu Ile Phe Glu Glu Asn Gly Ser Ile
690 695 700
Val Pro Arg Ala Cys Lys Asp Ala Phe Trp Asn Met Cys His Val Leu
705 710 715 720
Asn Phe Phe Tyr Ala Asn Asp Asp Gly Phe Thr Gly Asn Thr Ile Leu
725 730 735
Asp Thr Val Lys Asp Ile Ile Tyr Asn Pro Leu Val Leu Val Asn Glu
740 745 750
Asn Glu Glu Gln Arg
755
<210> 67
<211> 1539
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1539)
<400> 67
atg gat gct gtc act ggt ttg ttg acc gtc cca gct acc gcc atc acc 48
Met Asp Ala Val Thr Gly Leu Leu Thr Val Pro Ala Thr Ala Ile Thr
1 5 10 15
att ggt ggt act gct gtt gct ttg gct gtt gct ttg atc ttc tgg tac 96
Ile Gly Gly Thr Ala Val Ala Leu Ala Val Ala Leu Ile Phe Trp Tyr
20 25 30
ttg aaa tct tac act tct gcc aga aga tct caa tct aac cat tta cca 144
Leu Lys Ser Tyr Thr Ser Ala Arg Arg Ser Gln Ser Asn His Leu Pro
35 40 45
aga gtt cca gaa gtt cca ggt gtt cca ttg ttg ggt aac ttg ttg caa 192
Arg Val Pro Glu Val Pro Gly Val Pro Leu Leu Gly Asn Leu Leu Gln
50 55 60
ttg aaa gaa aag aag cct tac atg act ttc acc aga tgg gct gct acc 240
Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Arg Trp Ala Ala Thr
65 70 75 80
tac ggt cca att tac tct atc aag acc ggt gct acc tcc atg gtt gtt 288
Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr Ser Met Val Val
85 90 95
gtt tcc tcc aac gaa att gcc aag gaa gct tta gtc acc aga ttc caa 336
Val Ser Ser Asn Glu Ile Ala Lys Glu Ala Leu Val Thr Arg Phe Gln
100 105 110
tcc atc tct acc aga aac ttg tcc aag gct ttg aag gtc ttg act gct 384
Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys Val Leu Thr Ala
115 120 125
gac aag acc atg gtt gcc atg tct gac tac gat gac tac cac aag act 432
Asp Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp Tyr His Lys Thr
130 135 140
gtc aaa cgt cac atc ttg act gct gtt ttg ggt cca aat gct caa aag 480
Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro Asn Ala Gln Lys
145 150 155 160
aag cac aga att cac aga gat atc atg atg gac aac atc tct act caa 528
Lys His Arg Ile His Arg Asp Ile Met Met Asp Asn Ile Ser Thr Gln
165 170 175
ttg cat gaa ttt gtc aag aac aac cca gaa caa gaa gaa gtc gat ttg 576
Leu His Glu Phe Val Lys Asn Asn Pro Glu Gln Glu Glu Val Asp Leu
180 185 190
aga aag atc ttc caa tct gaa ttg ttc ggt ttg gcc atg aga caa gct 624
Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala Met Arg Gln Ala
195 200 205
ttg ggt aaa gat gtc gaa tct tta tac gtc gaa gat ttg aag atc acc 672
Leu Gly Lys Asp Val Glu Ser Leu Tyr Val Glu Asp Leu Lys Ile Thr
210 215 220
atg aac aga gat gaa atc ttc caa gtt ttg gtt gtc gac cca atg atg 720
Met Asn Arg Asp Glu Ile Phe Gln Val Leu Val Val Asp Pro Met Met
225 230 235 240
ggt gcc att gat gtt gac tgg aga gat ttc ttc cca tac ttg aaa tgg 768
Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp
245 250 255
gtt cca aac aag aag ttc gaa aat acc att caa caa atg tac atc cgt 816
Val Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln Met Tyr Ile Arg
260 265 270
cgt gaa gct gtc atg aag tct tta atc aag gaa aac aag aag aga att 864
Arg Glu Ala Val Met Lys Ser Leu Ile Lys Glu Asn Lys Lys Arg Ile
275 280 285
gct tct ggt gaa aag tta aac tcc tac att gac tat ttg ttg tcc gaa 912
Ala Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu
290 295 300
gct caa act ttg act gac caa caa tta ttg atg tct tta tgg gaa cca 960
Ala Gln Thr Leu Thr Asp Gln Gln Leu Leu Met Ser Leu Trp Glu Pro
305 310 315 320
atc att gaa tct tct gac acc acc atg gtc act act gaa tgg gct atg 1008
Ile Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met
325 330 335
tac gaa ttg gcc aag aac cca aaa tta caa gac cgt ttg tac aga gat 1056
Tyr Glu Leu Ala Lys Asn Pro Lys Leu Gln Asp Arg Leu Tyr Arg Asp
340 345 350
atc aaa tcc gtc tgt ggt tcc gaa aag atc act gaa gaa cac ttg tct 1104
Ile Lys Ser Val Cys Gly Ser Glu Lys Ile Thr Glu Glu His Leu Ser
355 360 365
caa ttg cca tac atc act gct atc ttc cac gaa act ttg aga aga cac 1152
Gln Leu Pro Tyr Ile Thr Ala Ile Phe His Glu Thr Leu Arg Arg His
370 375 380
tct cca gtt cca atc att cca ttg aga cac gtc cac gaa gac act gtt 1200
Ser Pro Val Pro Ile Ile Pro Leu Arg His Val His Glu Asp Thr Val
385 390 395 400
ttg ggt ggt tac cac gtt cca gct ggt act gaa ttg gct gtc aac atc 1248
Leu Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Val Asn Ile
405 410 415
tac ggt tgt aac atg gac aag aac gtc tgg gaa aac cca gaa gaa tgg 1296
Tyr Gly Cys Asn Met Asp Lys Asn Val Trp Glu Asn Pro Glu Glu Trp
420 425 430
aac cca gaa aga ttc atg aag gaa aac gaa acc att gac ttc caa aag 1344
Asn Pro Glu Arg Phe Met Lys Glu Asn Glu Thr Ile Asp Phe Gln Lys
435 440 445
acc atg gct ttc ggt ggt ggt aag aga gtt tgt gcc ggt tcc ttg caa 1392
Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln
450 455 460
gct cta ttg acc gct tcc att ggt att ggt aga atg gtt caa gaa ttt 1440
Ala Leu Leu Thr Ala Ser Ile Gly Ile Gly Arg Met Val Gln Glu Phe
465 470 475 480
gaa tgg aag ttg aag gac atg acc caa gaa gaa gtt aac acc att ggt 1488
Glu Trp Lys Leu Lys Asp Met Thr Gln Glu Glu Val Asn Thr Ile Gly
485 490 495
ttg act act caa atg tta aga cca ttg aga gcc atc atc aag cct cgc 1536
Leu Thr Thr Gln Met Leu Arg Pro Leu Arg Ala Ile Ile Lys Pro Arg
500 505 510
atc 1539
Ile
<210> 68
<211> 513
<212> PRT
<213> Stevia rebaudiana
<400> 68
Met Asp Ala Val Thr Gly Leu Leu Thr Val Pro Ala Thr Ala Ile Thr
1 5 10 15
Ile Gly Gly Thr Ala Val Ala Leu Ala Val Ala Leu Ile Phe Trp Tyr
20 25 30
Leu Lys Ser Tyr Thr Ser Ala Arg Arg Ser Gln Ser Asn His Leu Pro
35 40 45
Arg Val Pro Glu Val Pro Gly Val Pro Leu Leu Gly Asn Leu Leu Gln
50 55 60
Leu Lys Glu Lys Lys Pro Tyr Met Thr Phe Thr Arg Trp Ala Ala Thr
65 70 75 80
Tyr Gly Pro Ile Tyr Ser Ile Lys Thr Gly Ala Thr Ser Met Val Val
85 90 95
Val Ser Ser Asn Glu Ile Ala Lys Glu Ala Leu Val Thr Arg Phe Gln
100 105 110
Ser Ile Ser Thr Arg Asn Leu Ser Lys Ala Leu Lys Val Leu Thr Ala
115 120 125
Asp Lys Thr Met Val Ala Met Ser Asp Tyr Asp Asp Tyr His Lys Thr
130 135 140
Val Lys Arg His Ile Leu Thr Ala Val Leu Gly Pro Asn Ala Gln Lys
145 150 155 160
Lys His Arg Ile His Arg Asp Ile Met Met Asp Asn Ile Ser Thr Gln
165 170 175
Leu His Glu Phe Val Lys Asn Asn Pro Glu Gln Glu Glu Val Asp Leu
180 185 190
Arg Lys Ile Phe Gln Ser Glu Leu Phe Gly Leu Ala Met Arg Gln Ala
195 200 205
Leu Gly Lys Asp Val Glu Ser Leu Tyr Val Glu Asp Leu Lys Ile Thr
210 215 220
Met Asn Arg Asp Glu Ile Phe Gln Val Leu Val Val Asp Pro Met Met
225 230 235 240
Gly Ala Ile Asp Val Asp Trp Arg Asp Phe Phe Pro Tyr Leu Lys Trp
245 250 255
Val Pro Asn Lys Lys Phe Glu Asn Thr Ile Gln Gln Met Tyr Ile Arg
260 265 270
Arg Glu Ala Val Met Lys Ser Leu Ile Lys Glu Asn Lys Lys Arg Ile
275 280 285
Ala Ser Gly Glu Lys Leu Asn Ser Tyr Ile Asp Tyr Leu Leu Ser Glu
290 295 300
Ala Gln Thr Leu Thr Asp Gln Gln Leu Leu Met Ser Leu Trp Glu Pro
305 310 315 320
Ile Ile Glu Ser Ser Asp Thr Thr Met Val Thr Thr Glu Trp Ala Met
325 330 335
Tyr Glu Leu Ala Lys Asn Pro Lys Leu Gln Asp Arg Leu Tyr Arg Asp
340 345 350
Ile Lys Ser Val Cys Gly Ser Glu Lys Ile Thr Glu Glu His Leu Ser
355 360 365
Gln Leu Pro Tyr Ile Thr Ala Ile Phe His Glu Thr Leu Arg Arg His
370 375 380
Ser Pro Val Pro Ile Ile Pro Leu Arg His Val His Glu Asp Thr Val
385 390 395 400
Leu Gly Gly Tyr His Val Pro Ala Gly Thr Glu Leu Ala Val Asn Ile
405 410 415
Tyr Gly Cys Asn Met Asp Lys Asn Val Trp Glu Asn Pro Glu Glu Trp
420 425 430
Asn Pro Glu Arg Phe Met Lys Glu Asn Glu Thr Ile Asp Phe Gln Lys
435 440 445
Thr Met Ala Phe Gly Gly Gly Lys Arg Val Cys Ala Gly Ser Leu Gln
450 455 460
Ala Leu Leu Thr Ala Ser Ile Gly Ile Gly Arg Met Val Gln Glu Phe
465 470 475 480
Glu Trp Lys Leu Lys Asp Met Thr Gln Glu Glu Val Asn Thr Ile Gly
485 490 495
Leu Thr Thr Gln Met Leu Arg Pro Leu Arg Ala Ile Ile Lys Pro Arg
500 505 510
Ile
<210> 69
<211> 1566
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1566)
<400> 69
atg ggt ttg ttc cca ttg gaa gac tct tac gct ttg gtt ttc gaa ggt 48
Met Gly Leu Phe Pro Leu Glu Asp Ser Tyr Ala Leu Val Phe Glu Gly
1 5 10 15
ttg gcc atc act ttg gct ttg tac tac cta ttg tct ttc atc tac aag 96
Leu Ala Ile Thr Leu Ala Leu Tyr Tyr Leu Leu Ser Phe Ile Tyr Lys
20 25 30
acc tcc aag aag acc tgt acc cca cca aag gct tct ggt gaa cac cca 144
Thr Ser Lys Lys Thr Cys Thr Pro Pro Lys Ala Ser Gly Glu His Pro
35 40 45
att acc ggt cat tta aac ttg ttg tct ggt tct tcc ggt ttg cca cac 192
Ile Thr Gly His Leu Asn Leu Leu Ser Gly Ser Ser Gly Leu Pro His
50 55 60
ttg gct ttg gct tct ttg gct gac aga tgt ggt cct atc ttc acc atc 240
Leu Ala Leu Ala Ser Leu Ala Asp Arg Cys Gly Pro Ile Phe Thr Ile
65 70 75 80
aga tta ggt atc aga aga gtt ttg gtt gtt tcc aac tgg gaa att gcc 288
Arg Leu Gly Ile Arg Arg Val Leu Val Val Ser Asn Trp Glu Ile Ala
85 90 95
aag gaa atc ttt acc acc cac gac ttg att gtc tcc aac aga cca aag 336
Lys Glu Ile Phe Thr Thr His Asp Leu Ile Val Ser Asn Arg Pro Lys
100 105 110
tac ttg gct gcc aag atc tta ggt ttc aac tac gtt tct ttc tct ttc 384
Tyr Leu Ala Ala Lys Ile Leu Gly Phe Asn Tyr Val Ser Phe Ser Phe
115 120 125
gct cca tac ggt cca tac tgg gtt ggt atc aga aag att att gct act 432
Ala Pro Tyr Gly Pro Tyr Trp Val Gly Ile Arg Lys Ile Ile Ala Thr
130 135 140
aaa ttg atg tct tct tct aga ttg caa aag ttg caa ttc gtc cgt gtc 480
Lys Leu Met Ser Ser Ser Arg Leu Gln Lys Leu Gln Phe Val Arg Val
145 150 155 160
ttt gaa ttg gaa aac tcc atg aaa tcc atc aga gaa tcc tgg aag gaa 528
Phe Glu Leu Glu Asn Ser Met Lys Ser Ile Arg Glu Ser Trp Lys Glu
165 170 175
aag aag gat gaa gaa ggt aag gtc ttg gtt gaa atg aag aag tgg ttc 576
Lys Lys Asp Glu Glu Gly Lys Val Leu Val Glu Met Lys Lys Trp Phe
180 185 190
tgg gaa ttg aat atg aac att gtc ttg aga acc gtt gct ggt aag caa 624
Trp Glu Leu Asn Met Asn Ile Val Leu Arg Thr Val Ala Gly Lys Gln
195 200 205
tac act ggt act gtt gac gat gct gat gcc aag aga att tct gaa ttg 672
Tyr Thr Gly Thr Val Asp Asp Ala Asp Ala Lys Arg Ile Ser Glu Leu
210 215 220
ttc aga gaa tgg ttc cac tac act ggt aga ttc gtt gtc ggt gat gct 720
Phe Arg Glu Trp Phe His Tyr Thr Gly Arg Phe Val Val Gly Asp Ala
225 230 235 240
ttc cca ttt cta ggt tgg ttg gat tta ggt ggt tac aag aag acc atg 768
Phe Pro Phe Leu Gly Trp Leu Asp Leu Gly Gly Tyr Lys Lys Thr Met
245 250 255
gaa ttg gtt gcc tcc aga tta gat tcc atg gtc agc aaa tgg ttg gac 816
Glu Leu Val Ala Ser Arg Leu Asp Ser Met Val Ser Lys Trp Leu Asp
260 265 270
gaa cac aga aag aag caa gct aac gat gac aag aag gaa gac atg gac 864
Glu His Arg Lys Lys Gln Ala Asn Asp Asp Lys Lys Glu Asp Met Asp
275 280 285
ttc atg gac atc atg atc tcc atg act gaa gct aac tct cca ttg gaa 912
Phe Met Asp Ile Met Ile Ser Met Thr Glu Ala Asn Ser Pro Leu Glu
290 295 300
ggt tac ggt act gac acc atc atc aag acc act tgt atg act ttg att 960
Gly Tyr Gly Thr Asp Thr Ile Ile Lys Thr Thr Cys Met Thr Leu Ile
305 310 315 320
gtt tcc ggt gtc gac acc act tct atc gtt ttg acc tgg gct ttg tct 1008
Val Ser Gly Val Asp Thr Thr Ser Ile Val Leu Thr Trp Ala Leu Ser
325 330 335
ttg ttg ttg aac aac aga gac act tta aag aag gct caa gaa gaa ttg 1056
Leu Leu Leu Asn Asn Arg Asp Thr Leu Lys Lys Ala Gln Glu Glu Leu
340 345 350
gac atg tgt gtc ggt aag ggt aga caa gtc aat gaa tct gat ttg gtc 1104
Asp Met Cys Val Gly Lys Gly Arg Gln Val Asn Glu Ser Asp Leu Val
355 360 365
aac tta atc tat tta gaa gct gtc ttg aaa gaa gct ttg aga ttg tac 1152
Asn Leu Ile Tyr Leu Glu Ala Val Leu Lys Glu Ala Leu Arg Leu Tyr
370 375 380
cca gct gct ttc ttg ggt ggt cct cgt gct ttc ttg gaa gac tgt acc 1200
Pro Ala Ala Phe Leu Gly Gly Pro Arg Ala Phe Leu Glu Asp Cys Thr
385 390 395 400
gtt gcc ggt tac aga att cca aag ggt act tgt ttg ttg atc aac atg 1248
Val Ala Gly Tyr Arg Ile Pro Lys Gly Thr Cys Leu Leu Ile Asn Met
405 410 415
tgg aaa ttg cac aga gat cca aac atc tgg tct gac cca tgt gaa ttc 1296
Trp Lys Leu His Arg Asp Pro Asn Ile Trp Ser Asp Pro Cys Glu Phe
420 425 430
aaa cca gaa aga ttc ttg act cca aac caa aag gat gtt gat gtc att 1344
Lys Pro Glu Arg Phe Leu Thr Pro Asn Gln Lys Asp Val Asp Val Ile
435 440 445
ggt atg gac ttc gaa ttg att cca ttc ggt gct ggt cgt cgt tac tgt 1392
Gly Met Asp Phe Glu Leu Ile Pro Phe Gly Ala Gly Arg Arg Tyr Cys
450 455 460
cca ggt acc aga tta gct ttg caa atg ttg cac att gtc ttg gcc act 1440
Pro Gly Thr Arg Leu Ala Leu Gln Met Leu His Ile Val Leu Ala Thr
465 470 475 480
cta tta caa aac ttt gaa atg tcc act cca aac gat gct cca gtt gac 1488
Leu Leu Gln Asn Phe Glu Met Ser Thr Pro Asn Asp Ala Pro Val Asp
485 490 495
atg act gct tcc gtt ggt atg acc aac gcc aag gct tct cca ttg gaa 1536
Met Thr Ala Ser Val Gly Met Thr Asn Ala Lys Ala Ser Pro Leu Glu
500 505 510
gtt ttg ttg tct cca aga gtc aaa tgg tct 1566
Val Leu Leu Ser Pro Arg Val Lys Trp Ser
515 520
<210> 70
<211> 522
<212> PRT
<213> Stevia rebaudiana
<400> 70
Met Gly Leu Phe Pro Leu Glu Asp Ser Tyr Ala Leu Val Phe Glu Gly
1 5 10 15
Leu Ala Ile Thr Leu Ala Leu Tyr Tyr Leu Leu Ser Phe Ile Tyr Lys
20 25 30
Thr Ser Lys Lys Thr Cys Thr Pro Pro Lys Ala Ser Gly Glu His Pro
35 40 45
Ile Thr Gly His Leu Asn Leu Leu Ser Gly Ser Ser Gly Leu Pro His
50 55 60
Leu Ala Leu Ala Ser Leu Ala Asp Arg Cys Gly Pro Ile Phe Thr Ile
65 70 75 80
Arg Leu Gly Ile Arg Arg Val Leu Val Val Ser Asn Trp Glu Ile Ala
85 90 95
Lys Glu Ile Phe Thr Thr His Asp Leu Ile Val Ser Asn Arg Pro Lys
100 105 110
Tyr Leu Ala Ala Lys Ile Leu Gly Phe Asn Tyr Val Ser Phe Ser Phe
115 120 125
Ala Pro Tyr Gly Pro Tyr Trp Val Gly Ile Arg Lys Ile Ile Ala Thr
130 135 140
Lys Leu Met Ser Ser Ser Arg Leu Gln Lys Leu Gln Phe Val Arg Val
145 150 155 160
Phe Glu Leu Glu Asn Ser Met Lys Ser Ile Arg Glu Ser Trp Lys Glu
165 170 175
Lys Lys Asp Glu Glu Gly Lys Val Leu Val Glu Met Lys Lys Trp Phe
180 185 190
Trp Glu Leu Asn Met Asn Ile Val Leu Arg Thr Val Ala Gly Lys Gln
195 200 205
Tyr Thr Gly Thr Val Asp Asp Ala Asp Ala Lys Arg Ile Ser Glu Leu
210 215 220
Phe Arg Glu Trp Phe His Tyr Thr Gly Arg Phe Val Val Gly Asp Ala
225 230 235 240
Phe Pro Phe Leu Gly Trp Leu Asp Leu Gly Gly Tyr Lys Lys Thr Met
245 250 255
Glu Leu Val Ala Ser Arg Leu Asp Ser Met Val Ser Lys Trp Leu Asp
260 265 270
Glu His Arg Lys Lys Gln Ala Asn Asp Asp Lys Lys Glu Asp Met Asp
275 280 285
Phe Met Asp Ile Met Ile Ser Met Thr Glu Ala Asn Ser Pro Leu Glu
290 295 300
Gly Tyr Gly Thr Asp Thr Ile Ile Lys Thr Thr Cys Met Thr Leu Ile
305 310 315 320
Val Ser Gly Val Asp Thr Thr Ser Ile Val Leu Thr Trp Ala Leu Ser
325 330 335
Leu Leu Leu Asn Asn Arg Asp Thr Leu Lys Lys Ala Gln Glu Glu Leu
340 345 350
Asp Met Cys Val Gly Lys Gly Arg Gln Val Asn Glu Ser Asp Leu Val
355 360 365
Asn Leu Ile Tyr Leu Glu Ala Val Leu Lys Glu Ala Leu Arg Leu Tyr
370 375 380
Pro Ala Ala Phe Leu Gly Gly Pro Arg Ala Phe Leu Glu Asp Cys Thr
385 390 395 400
Val Ala Gly Tyr Arg Ile Pro Lys Gly Thr Cys Leu Leu Ile Asn Met
405 410 415
Trp Lys Leu His Arg Asp Pro Asn Ile Trp Ser Asp Pro Cys Glu Phe
420 425 430
Lys Pro Glu Arg Phe Leu Thr Pro Asn Gln Lys Asp Val Asp Val Ile
435 440 445
Gly Met Asp Phe Glu Leu Ile Pro Phe Gly Ala Gly Arg Arg Tyr Cys
450 455 460
Pro Gly Thr Arg Leu Ala Leu Gln Met Leu His Ile Val Leu Ala Thr
465 470 475 480
Leu Leu Gln Asn Phe Glu Met Ser Thr Pro Asn Asp Ala Pro Val Asp
485 490 495
Met Thr Ala Ser Val Gly Met Thr Asn Ala Lys Ala Ser Pro Leu Glu
500 505 510
Val Leu Leu Ser Pro Arg Val Lys Trp Ser
515 520
<210> 71
<211> 1443
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1443)
<400> 71
atg gac gct atg gcc acc act gaa aag aag cct cac gtt atc ttt att 48
Met Asp Ala Met Ala Thr Thr Glu Lys Lys Pro His Val Ile Phe Ile
1 5 10 15
cca ttc cca gct caa tct cat atc aag gct atg ttg aaa ttg gct caa 96
Pro Phe Pro Ala Gln Ser His Ile Lys Ala Met Leu Lys Leu Ala Gln
20 25 30
tta ttg cac cac aag ggt ttg caa atc act ttt gtc aac acc gac ttc 144
Leu Leu His His Lys Gly Leu Gln Ile Thr Phe Val Asn Thr Asp Phe
35 40 45
att cac aac caa ttc ttg gaa tct tct ggt cct cac tgt ttg gac ggt 192
Ile His Asn Gln Phe Leu Glu Ser Ser Gly Pro His Cys Leu Asp Gly
50 55 60
gct cca ggt ttc aga ttc gaa acc att cca gat ggt gtt tcc cac tct 240
Ala Pro Gly Phe Arg Phe Glu Thr Ile Pro Asp Gly Val Ser His Ser
65 70 75 80
cca gaa gcc tcc atc cca atc aga gaa tcc ttg ttg aga tct att gaa 288
Pro Glu Ala Ser Ile Pro Ile Arg Glu Ser Leu Leu Arg Ser Ile Glu
85 90 95
acc aac ttc ttg gac cgt ttc atc gat ttg gtt acc aaa ttg cca gac 336
Thr Asn Phe Leu Asp Arg Phe Ile Asp Leu Val Thr Lys Leu Pro Asp
100 105 110
cca cca acc tgt atc att tct gac ggt ttc ttg tcc gtt ttc acc atc 384
Pro Pro Thr Cys Ile Ile Ser Asp Gly Phe Leu Ser Val Phe Thr Ile
115 120 125
gat gct gcc aag aaa ttg ggt att cca gtc atg atg tac tgg act ttg 432
Asp Ala Ala Lys Lys Leu Gly Ile Pro Val Met Met Tyr Trp Thr Leu
130 135 140
gct gct tgt ggt ttc atg ggt ttc tac cat att cac tct ttg att gaa 480
Ala Ala Cys Gly Phe Met Gly Phe Tyr His Ile His Ser Leu Ile Glu
145 150 155 160
aag ggt ttc gct cca tta aag gat gct tct tac ttg acc aac ggt tac 528
Lys Gly Phe Ala Pro Leu Lys Asp Ala Ser Tyr Leu Thr Asn Gly Tyr
165 170 175
ttg gac acc gtc att gac tgg gtt cca ggt atg gaa ggt atc aga ttg 576
Leu Asp Thr Val Ile Asp Trp Val Pro Gly Met Glu Gly Ile Arg Leu
180 185 190
aaa gat ttc cca ttg gac tgg tct act gac ttg aat gac aag gtc ttg 624
Lys Asp Phe Pro Leu Asp Trp Ser Thr Asp Leu Asn Asp Lys Val Leu
195 200 205
atg ttc act act gaa gct cca caa aga tct cat aag gtt tct cac cac 672
Met Phe Thr Thr Glu Ala Pro Gln Arg Ser His Lys Val Ser His His
210 215 220
atc ttc cac act ttc gat gaa tta gaa cca tct atc atc aag act cta 720
Ile Phe His Thr Phe Asp Glu Leu Glu Pro Ser Ile Ile Lys Thr Leu
225 230 235 240
tcc ttg aga tac aac cat atc tac acc att ggt cca tta caa ttg ttg 768
Ser Leu Arg Tyr Asn His Ile Tyr Thr Ile Gly Pro Leu Gln Leu Leu
245 250 255
ttg gac caa atc cca gaa gaa aag aag caa acc ggt atc act tct ttg 816
Leu Asp Gln Ile Pro Glu Glu Lys Lys Gln Thr Gly Ile Thr Ser Leu
260 265 270
cac ggt tac tct tta gtc aag gaa gaa cca gaa tgt ttc caa tgg tta 864
His Gly Tyr Ser Leu Val Lys Glu Glu Pro Glu Cys Phe Gln Trp Leu
275 280 285
caa tcc aag gaa cca aac tct gtt gtc tac gtt aac ttt ggt tcc acc 912
Gln Ser Lys Glu Pro Asn Ser Val Val Tyr Val Asn Phe Gly Ser Thr
290 295 300
act gtt atg tcc ttg gaa gat atg act gaa ttt ggt tgg ggt ttg gct 960
Thr Val Met Ser Leu Glu Asp Met Thr Glu Phe Gly Trp Gly Leu Ala
305 310 315 320
aac tct aac cac tac ttc tta tgg atc atc aga tct aac ttg gtc att 1008
Asn Ser Asn His Tyr Phe Leu Trp Ile Ile Arg Ser Asn Leu Val Ile
325 330 335
ggt gaa aac gcc gtt ttg cct cca gaa ttg gaa gaa cac atc aag aag 1056
Gly Glu Asn Ala Val Leu Pro Pro Glu Leu Glu Glu His Ile Lys Lys
340 345 350
aga ggt ttc att gct tcc tgg tgt tct caa gaa aag gtc ttg aag cac 1104
Arg Gly Phe Ile Ala Ser Trp Cys Ser Gln Glu Lys Val Leu Lys His
355 360 365
cca tct gtt ggt ggt ttc ttg acc cac tgt ggt tgg ggt tcc acc att 1152
Pro Ser Val Gly Gly Phe Leu Thr His Cys Gly Trp Gly Ser Thr Ile
370 375 380
gaa tcc cta tct gct ggt gtt cca atg atc tgt tgg cca tac tcc tgg 1200
Glu Ser Leu Ser Ala Gly Val Pro Met Ile Cys Trp Pro Tyr Ser Trp
385 390 395 400
gac caa ttg act aac tgt cgt tac atc tgt aag gaa tgg gaa gtt ggt 1248
Asp Gln Leu Thr Asn Cys Arg Tyr Ile Cys Lys Glu Trp Glu Val Gly
405 410 415
ttg gaa atg ggt act aag gtc aag aga gat gaa gtc aag aga tta gtc 1296
Leu Glu Met Gly Thr Lys Val Lys Arg Asp Glu Val Lys Arg Leu Val
420 425 430
caa gaa ttg atg ggt gaa ggt ggt cac aag atg aga aac aaa gcc aag 1344
Gln Glu Leu Met Gly Glu Gly Gly His Lys Met Arg Asn Lys Ala Lys
435 440 445
gac tgg aag gaa aag gcc aga att gct att gct cca aac ggt tct tcc 1392
Asp Trp Lys Glu Lys Ala Arg Ile Ala Ile Ala Pro Asn Gly Ser Ser
450 455 460
tcc ttg aac atc gat aaa atg gtt aag gaa atc act gtc ttg gct cga 1440
Ser Leu Asn Ile Asp Lys Met Val Lys Glu Ile Thr Val Leu Ala Arg
465 470 475 480
aac 1443
Asn
<210> 72
<211> 481
<212> PRT
<213> Stevia rebaudiana
<400> 72
Met Asp Ala Met Ala Thr Thr Glu Lys Lys Pro His Val Ile Phe Ile
1 5 10 15
Pro Phe Pro Ala Gln Ser His Ile Lys Ala Met Leu Lys Leu Ala Gln
20 25 30
Leu Leu His His Lys Gly Leu Gln Ile Thr Phe Val Asn Thr Asp Phe
35 40 45
Ile His Asn Gln Phe Leu Glu Ser Ser Gly Pro His Cys Leu Asp Gly
50 55 60
Ala Pro Gly Phe Arg Phe Glu Thr Ile Pro Asp Gly Val Ser His Ser
65 70 75 80
Pro Glu Ala Ser Ile Pro Ile Arg Glu Ser Leu Leu Arg Ser Ile Glu
85 90 95
Thr Asn Phe Leu Asp Arg Phe Ile Asp Leu Val Thr Lys Leu Pro Asp
100 105 110
Pro Pro Thr Cys Ile Ile Ser Asp Gly Phe Leu Ser Val Phe Thr Ile
115 120 125
Asp Ala Ala Lys Lys Leu Gly Ile Pro Val Met Met Tyr Trp Thr Leu
130 135 140
Ala Ala Cys Gly Phe Met Gly Phe Tyr His Ile His Ser Leu Ile Glu
145 150 155 160
Lys Gly Phe Ala Pro Leu Lys Asp Ala Ser Tyr Leu Thr Asn Gly Tyr
165 170 175
Leu Asp Thr Val Ile Asp Trp Val Pro Gly Met Glu Gly Ile Arg Leu
180 185 190
Lys Asp Phe Pro Leu Asp Trp Ser Thr Asp Leu Asn Asp Lys Val Leu
195 200 205
Met Phe Thr Thr Glu Ala Pro Gln Arg Ser His Lys Val Ser His His
210 215 220
Ile Phe His Thr Phe Asp Glu Leu Glu Pro Ser Ile Ile Lys Thr Leu
225 230 235 240
Ser Leu Arg Tyr Asn His Ile Tyr Thr Ile Gly Pro Leu Gln Leu Leu
245 250 255
Leu Asp Gln Ile Pro Glu Glu Lys Lys Gln Thr Gly Ile Thr Ser Leu
260 265 270
His Gly Tyr Ser Leu Val Lys Glu Glu Pro Glu Cys Phe Gln Trp Leu
275 280 285
Gln Ser Lys Glu Pro Asn Ser Val Val Tyr Val Asn Phe Gly Ser Thr
290 295 300
Thr Val Met Ser Leu Glu Asp Met Thr Glu Phe Gly Trp Gly Leu Ala
305 310 315 320
Asn Ser Asn His Tyr Phe Leu Trp Ile Ile Arg Ser Asn Leu Val Ile
325 330 335
Gly Glu Asn Ala Val Leu Pro Pro Glu Leu Glu Glu His Ile Lys Lys
340 345 350
Arg Gly Phe Ile Ala Ser Trp Cys Ser Gln Glu Lys Val Leu Lys His
355 360 365
Pro Ser Val Gly Gly Phe Leu Thr His Cys Gly Trp Gly Ser Thr Ile
370 375 380
Glu Ser Leu Ser Ala Gly Val Pro Met Ile Cys Trp Pro Tyr Ser Trp
385 390 395 400
Asp Gln Leu Thr Asn Cys Arg Tyr Ile Cys Lys Glu Trp Glu Val Gly
405 410 415
Leu Glu Met Gly Thr Lys Val Lys Arg Asp Glu Val Lys Arg Leu Val
420 425 430
Gln Glu Leu Met Gly Glu Gly Gly His Lys Met Arg Asn Lys Ala Lys
435 440 445
Asp Trp Lys Glu Lys Ala Arg Ile Ala Ile Ala Pro Asn Gly Ser Ser
450 455 460
Ser Leu Asn Ile Asp Lys Met Val Lys Glu Ile Thr Val Leu Ala Arg
465 470 475 480
Asn
<210> 73
<211> 1380
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1380)
<400> 73
atg gct gaa caa caa aag atc aag aaa tct cca cac gtc ttg ttg att 48
Met Ala Glu Gln Gln Lys Ile Lys Lys Ser Pro His Val Leu Leu Ile
1 5 10 15
cca ttc cca ttg caa ggt cac atc aac cca ttc atc caa ttc ggt aag 96
Pro Phe Pro Leu Gln Gly His Ile Asn Pro Phe Ile Gln Phe Gly Lys
20 25 30
aga ttg att tcc aag ggt gtc aag acc act tta gtc acc act att cac 144
Arg Leu Ile Ser Lys Gly Val Lys Thr Thr Leu Val Thr Thr Ile His
35 40 45
act tta aac tcc act tta aac cac tct aac act act acc acc tct att 192
Thr Leu Asn Ser Thr Leu Asn His Ser Asn Thr Thr Thr Thr Ser Ile
50 55 60
gaa atc caa gcc att tct gac ggt tgt gac gaa ggt ggt ttc atg tct 240
Glu Ile Gln Ala Ile Ser Asp Gly Cys Asp Glu Gly Gly Phe Met Ser
65 70 75 80
gct ggt gaa tct tac ttg gaa act ttc aag caa gtc ggt tcc aag tct 288
Ala Gly Glu Ser Tyr Leu Glu Thr Phe Lys Gln Val Gly Ser Lys Ser
85 90 95
ttg gct gat ttg atc aag aaa ttg caa tcc gaa ggt act acc atc gat 336
Leu Ala Asp Leu Ile Lys Lys Leu Gln Ser Glu Gly Thr Thr Ile Asp
100 105 110
gct atc atc tac gac tcc atg act gaa tgg gtt ttg gat gtt gcc att 384
Ala Ile Ile Tyr Asp Ser Met Thr Glu Trp Val Leu Asp Val Ala Ile
115 120 125
gaa ttt ggt att gac ggt ggt tct ttc ttc acc caa gcc tgt gtt gtt 432
Glu Phe Gly Ile Asp Gly Gly Ser Phe Phe Thr Gln Ala Cys Val Val
130 135 140
aac tct ttg tac tac cac gtc cac aag ggt ttg atc tct cta cca tta 480
Asn Ser Leu Tyr Tyr His Val His Lys Gly Leu Ile Ser Leu Pro Leu
145 150 155 160
ggt gaa acc gtt tcc gtc cca ggt ttc cca gtc ttg caa aga tgg gaa 528
Gly Glu Thr Val Ser Val Pro Gly Phe Pro Val Leu Gln Arg Trp Glu
165 170 175
act cca ttg atc tta caa aac cat gaa caa atc caa tct cca tgg tcc 576
Thr Pro Leu Ile Leu Gln Asn His Glu Gln Ile Gln Ser Pro Trp Ser
180 185 190
caa atg ttg ttt ggt caa ttc gct aac att gac caa gct aga tgg gtt 624
Gln Met Leu Phe Gly Gln Phe Ala Asn Ile Asp Gln Ala Arg Trp Val
195 200 205
ttc acc aac tct ttc tac aag ttg gaa gaa gaa gtc att gaa tgg acc 672
Phe Thr Asn Ser Phe Tyr Lys Leu Glu Glu Glu Val Ile Glu Trp Thr
210 215 220
aga aag atc tgg aac ttg aag gtt atc ggt cca act cta cca tcc atg 720
Arg Lys Ile Trp Asn Leu Lys Val Ile Gly Pro Thr Leu Pro Ser Met
225 230 235 240
tac ttg gac aag aga ttg gat gac gac aag gac aac ggt ttc aac ttg 768
Tyr Leu Asp Lys Arg Leu Asp Asp Asp Lys Asp Asn Gly Phe Asn Leu
245 250 255
tac aag gct aac cat cac gaa tgt atg aac tgg ttg gat gac aag cca 816
Tyr Lys Ala Asn His His Glu Cys Met Asn Trp Leu Asp Asp Lys Pro
260 265 270
aag gaa tct gtt gtt tac gtt gct ttc ggt tct ttg gtc aag cat ggt 864
Lys Glu Ser Val Val Tyr Val Ala Phe Gly Ser Leu Val Lys His Gly
275 280 285
cca gaa caa gtt gaa gaa atc acc aga gct ttg att gac tcc gat gtt 912
Pro Glu Gln Val Glu Glu Ile Thr Arg Ala Leu Ile Asp Ser Asp Val
290 295 300
aac ttc tta tgg gtt atc aag cac aag gaa gaa ggt aaa ttg cca gaa 960
Asn Phe Leu Trp Val Ile Lys His Lys Glu Glu Gly Lys Leu Pro Glu
305 310 315 320
aac ttg tct gaa gtt atc aag acc ggt aag ggt ttg att gtt gct tgg 1008
Asn Leu Ser Glu Val Ile Lys Thr Gly Lys Gly Leu Ile Val Ala Trp
325 330 335
tgt aag caa ttg gat gtt ttg gct cac gaa tcc gtc ggt tgt ttc gtc 1056
Cys Lys Gln Leu Asp Val Leu Ala His Glu Ser Val Gly Cys Phe Val
340 345 350
act cac tgt ggt ttc aac tct act ttg gaa gct atc tcc ttg ggt gtt 1104
Thr His Cys Gly Phe Asn Ser Thr Leu Glu Ala Ile Ser Leu Gly Val
355 360 365
cca gtt gtt gcc atg cct caa ttc tct gac caa acc acc aac gcc aaa 1152
Pro Val Val Ala Met Pro Gln Phe Ser Asp Gln Thr Thr Asn Ala Lys
370 375 380
ttg ttg gat gaa atc ttg ggt gtc ggt gtc cgt gtc aag gct gat gaa 1200
Leu Leu Asp Glu Ile Leu Gly Val Gly Val Arg Val Lys Ala Asp Glu
385 390 395 400
aac ggt att gtt aga aga ggt aac tta gct tcc tgt atc aag atg atc 1248
Asn Gly Ile Val Arg Arg Gly Asn Leu Ala Ser Cys Ile Lys Met Ile
405 410 415
atg gaa gaa gaa cgt ggt gtc att atc aga aag aat gct gtc aaa tgg 1296
Met Glu Glu Glu Arg Gly Val Ile Ile Arg Lys Asn Ala Val Lys Trp
420 425 430
aag gac ttg gct aag gtt gct gtc cac gaa ggt ggt tcc tct gac aat 1344
Lys Asp Leu Ala Lys Val Ala Val His Glu Gly Gly Ser Ser Asp Asn
435 440 445
gac att gtt gaa ttt gtc tct gaa ttg atc aaa gcg 1380
Asp Ile Val Glu Phe Val Ser Glu Leu Ile Lys Ala
450 455 460
<210> 74
<211> 460
<212> PRT
<213> Stevia rebaudiana
<400> 74
Met Ala Glu Gln Gln Lys Ile Lys Lys Ser Pro His Val Leu Leu Ile
1 5 10 15
Pro Phe Pro Leu Gln Gly His Ile Asn Pro Phe Ile Gln Phe Gly Lys
20 25 30
Arg Leu Ile Ser Lys Gly Val Lys Thr Thr Leu Val Thr Thr Ile His
35 40 45
Thr Leu Asn Ser Thr Leu Asn His Ser Asn Thr Thr Thr Thr Ser Ile
50 55 60
Glu Ile Gln Ala Ile Ser Asp Gly Cys Asp Glu Gly Gly Phe Met Ser
65 70 75 80
Ala Gly Glu Ser Tyr Leu Glu Thr Phe Lys Gln Val Gly Ser Lys Ser
85 90 95
Leu Ala Asp Leu Ile Lys Lys Leu Gln Ser Glu Gly Thr Thr Ile Asp
100 105 110
Ala Ile Ile Tyr Asp Ser Met Thr Glu Trp Val Leu Asp Val Ala Ile
115 120 125
Glu Phe Gly Ile Asp Gly Gly Ser Phe Phe Thr Gln Ala Cys Val Val
130 135 140
Asn Ser Leu Tyr Tyr His Val His Lys Gly Leu Ile Ser Leu Pro Leu
145 150 155 160
Gly Glu Thr Val Ser Val Pro Gly Phe Pro Val Leu Gln Arg Trp Glu
165 170 175
Thr Pro Leu Ile Leu Gln Asn His Glu Gln Ile Gln Ser Pro Trp Ser
180 185 190
Gln Met Leu Phe Gly Gln Phe Ala Asn Ile Asp Gln Ala Arg Trp Val
195 200 205
Phe Thr Asn Ser Phe Tyr Lys Leu Glu Glu Glu Val Ile Glu Trp Thr
210 215 220
Arg Lys Ile Trp Asn Leu Lys Val Ile Gly Pro Thr Leu Pro Ser Met
225 230 235 240
Tyr Leu Asp Lys Arg Leu Asp Asp Asp Lys Asp Asn Gly Phe Asn Leu
245 250 255
Tyr Lys Ala Asn His His Glu Cys Met Asn Trp Leu Asp Asp Lys Pro
260 265 270
Lys Glu Ser Val Val Tyr Val Ala Phe Gly Ser Leu Val Lys His Gly
275 280 285
Pro Glu Gln Val Glu Glu Ile Thr Arg Ala Leu Ile Asp Ser Asp Val
290 295 300
Asn Phe Leu Trp Val Ile Lys His Lys Glu Glu Gly Lys Leu Pro Glu
305 310 315 320
Asn Leu Ser Glu Val Ile Lys Thr Gly Lys Gly Leu Ile Val Ala Trp
325 330 335
Cys Lys Gln Leu Asp Val Leu Ala His Glu Ser Val Gly Cys Phe Val
340 345 350
Thr His Cys Gly Phe Asn Ser Thr Leu Glu Ala Ile Ser Leu Gly Val
355 360 365
Pro Val Val Ala Met Pro Gln Phe Ser Asp Gln Thr Thr Asn Ala Lys
370 375 380
Leu Leu Asp Glu Ile Leu Gly Val Gly Val Arg Val Lys Ala Asp Glu
385 390 395 400
Asn Gly Ile Val Arg Arg Gly Asn Leu Ala Ser Cys Ile Lys Met Ile
405 410 415
Met Glu Glu Glu Arg Gly Val Ile Ile Arg Lys Asn Ala Val Lys Trp
420 425 430
Lys Asp Leu Ala Lys Val Ala Val His Glu Gly Gly Ser Ser Asp Asn
435 440 445
Asp Ile Val Glu Phe Val Ser Glu Leu Ile Lys Ala
450 455 460
<210> 75
<211> 1374
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1374)
<400> 75
atg gaa aac aag act gaa acc act gtt aga aga aga aga aga atc atc 48
Met Glu Asn Lys Thr Glu Thr Thr Val Arg Arg Arg Arg Arg Ile Ile
1 5 10 15
tta ttc cca gtt cca ttc caa ggt cac att aac cca atc ttg caa ttg 96
Leu Phe Pro Val Pro Phe Gln Gly His Ile Asn Pro Ile Leu Gln Leu
20 25 30
gct aac gtc tta tac tcc aag ggt ttc tcc atc acc atc ttc cac acc 144
Ala Asn Val Leu Tyr Ser Lys Gly Phe Ser Ile Thr Ile Phe His Thr
35 40 45
aac ttc aac aaa cct aaa act tcc aac tac cca cac ttc acc ttc aga 192
Asn Phe Asn Lys Pro Lys Thr Ser Asn Tyr Pro His Phe Thr Phe Arg
50 55 60
ttt atc ttg gac aac gac cca caa gat gaa aga att tct aac ttg cca 240
Phe Ile Leu Asp Asn Asp Pro Gln Asp Glu Arg Ile Ser Asn Leu Pro
65 70 75 80
acc cat ggt cca ttg gcc ggt atg aga att cca atc atc aac gaa cac 288
Thr His Gly Pro Leu Ala Gly Met Arg Ile Pro Ile Ile Asn Glu His
85 90 95
ggt gct gac gaa ttg aga aga gaa ttg gaa ttg ttg atg ttg gct tct 336
Gly Ala Asp Glu Leu Arg Arg Glu Leu Glu Leu Leu Met Leu Ala Ser
100 105 110
gaa gaa gat gaa gaa gtc tct tgt ttg atc act gat gct tta tgg tac 384
Glu Glu Asp Glu Glu Val Ser Cys Leu Ile Thr Asp Ala Leu Trp Tyr
115 120 125
ttt gct caa tct gtt gct gac tct ttg aac ttg aga aga tta gtc ttg 432
Phe Ala Gln Ser Val Ala Asp Ser Leu Asn Leu Arg Arg Leu Val Leu
130 135 140
atg acc tct tct ttg ttc aac ttc cac gct cac gtt tct cta cca caa 480
Met Thr Ser Ser Leu Phe Asn Phe His Ala His Val Ser Leu Pro Gln
145 150 155 160
ttt gat gaa ttg ggt tac ttg gac cca gat gac aag acc aga ttg gaa 528
Phe Asp Glu Leu Gly Tyr Leu Asp Pro Asp Asp Lys Thr Arg Leu Glu
165 170 175
gaa caa gcc tcc ggt ttc cca atg ttg aag gtc aag gat atc aag tct 576
Glu Gln Ala Ser Gly Phe Pro Met Leu Lys Val Lys Asp Ile Lys Ser
180 185 190
gcc tac tcc aac tgg caa atc ttg aag gaa att ttg ggt aag atg atc 624
Ala Tyr Ser Asn Trp Gln Ile Leu Lys Glu Ile Leu Gly Lys Met Ile
195 200 205
aag caa acc aag gct tct tct ggt gtc atc tgg aac tcc ttc aag gaa 672
Lys Gln Thr Lys Ala Ser Ser Gly Val Ile Trp Asn Ser Phe Lys Glu
210 215 220
ttg gaa gaa tct gaa ttg gaa acc gtc atc aga gaa att cca gct cca 720
Leu Glu Glu Ser Glu Leu Glu Thr Val Ile Arg Glu Ile Pro Ala Pro
225 230 235 240
tct ttc ttg att cca tta cca aag cat ttg act gct tcc tcc tct tct 768
Ser Phe Leu Ile Pro Leu Pro Lys His Leu Thr Ala Ser Ser Ser Ser
245 250 255
cta ttg gac cac gac aga act gtt ttc caa tgg ttg gac caa caa cca 816
Leu Leu Asp His Asp Arg Thr Val Phe Gln Trp Leu Asp Gln Gln Pro
260 265 270
cca tct tcc gtc tta tac gtt tcc ttt ggt tcc act tct gaa gtt gac 864
Pro Ser Ser Val Leu Tyr Val Ser Phe Gly Ser Thr Ser Glu Val Asp
275 280 285
gaa aag gac ttc ttg gaa att gct cgt ggt ttg gtt gac tcc aag caa 912
Glu Lys Asp Phe Leu Glu Ile Ala Arg Gly Leu Val Asp Ser Lys Gln
290 295 300
tct ttc tta tgg gtt gtc aga cca ggt ttc gtc aag ggt tcc acc tgg 960
Ser Phe Leu Trp Val Val Arg Pro Gly Phe Val Lys Gly Ser Thr Trp
305 310 315 320
gtt gaa cct ttg cca gac ggt ttc ttg ggt gaa aga ggt aga att gtc 1008
Val Glu Pro Leu Pro Asp Gly Phe Leu Gly Glu Arg Gly Arg Ile Val
325 330 335
aaa tgg gtt cca caa caa gaa gtt ttg gct cac ggt gcc att ggt gct 1056
Lys Trp Val Pro Gln Gln Glu Val Leu Ala His Gly Ala Ile Gly Ala
340 345 350
ttc tgg act cac tct ggt tgg aac tct act ttg gaa tcc gtt tgt gaa 1104
Phe Trp Thr His Ser Gly Trp Asn Ser Thr Leu Glu Ser Val Cys Glu
355 360 365
ggt gtt cca atg att ttc tct gac ttc ggt ttg gac caa cca ttg aat 1152
Gly Val Pro Met Ile Phe Ser Asp Phe Gly Leu Asp Gln Pro Leu Asn
370 375 380
gct cgt tac atg tcc gat gtt ttg aag gtt ggt gtc tac ttg gaa aac 1200
Ala Arg Tyr Met Ser Asp Val Leu Lys Val Gly Val Tyr Leu Glu Asn
385 390 395 400
ggt tgg gaa cgt ggt gaa att gct aac gcc atc aga aga gtc atg gtc 1248
Gly Trp Glu Arg Gly Glu Ile Ala Asn Ala Ile Arg Arg Val Met Val
405 410 415
gat gaa gaa ggt gaa tac atc aga caa aat gct cgt gtc ttg aaa caa 1296
Asp Glu Glu Gly Glu Tyr Ile Arg Gln Asn Ala Arg Val Leu Lys Gln
420 425 430
aag gct gat gtt tct ttg atg aag ggt ggt tct tct tac gaa tct ttg 1344
Lys Ala Asp Val Ser Leu Met Lys Gly Gly Ser Ser Tyr Glu Ser Leu
435 440 445
gaa tct ttg gtt tcc tac atc tcc agt ctc 1374
Glu Ser Leu Val Ser Tyr Ile Ser Ser Leu
450 455
<210> 76
<211> 458
<212> PRT
<213> Stevia rebaudiana
<400> 76
Met Glu Asn Lys Thr Glu Thr Thr Val Arg Arg Arg Arg Arg Ile Ile
1 5 10 15
Leu Phe Pro Val Pro Phe Gln Gly His Ile Asn Pro Ile Leu Gln Leu
20 25 30
Ala Asn Val Leu Tyr Ser Lys Gly Phe Ser Ile Thr Ile Phe His Thr
35 40 45
Asn Phe Asn Lys Pro Lys Thr Ser Asn Tyr Pro His Phe Thr Phe Arg
50 55 60
Phe Ile Leu Asp Asn Asp Pro Gln Asp Glu Arg Ile Ser Asn Leu Pro
65 70 75 80
Thr His Gly Pro Leu Ala Gly Met Arg Ile Pro Ile Ile Asn Glu His
85 90 95
Gly Ala Asp Glu Leu Arg Arg Glu Leu Glu Leu Leu Met Leu Ala Ser
100 105 110
Glu Glu Asp Glu Glu Val Ser Cys Leu Ile Thr Asp Ala Leu Trp Tyr
115 120 125
Phe Ala Gln Ser Val Ala Asp Ser Leu Asn Leu Arg Arg Leu Val Leu
130 135 140
Met Thr Ser Ser Leu Phe Asn Phe His Ala His Val Ser Leu Pro Gln
145 150 155 160
Phe Asp Glu Leu Gly Tyr Leu Asp Pro Asp Asp Lys Thr Arg Leu Glu
165 170 175
Glu Gln Ala Ser Gly Phe Pro Met Leu Lys Val Lys Asp Ile Lys Ser
180 185 190
Ala Tyr Ser Asn Trp Gln Ile Leu Lys Glu Ile Leu Gly Lys Met Ile
195 200 205
Lys Gln Thr Lys Ala Ser Ser Gly Val Ile Trp Asn Ser Phe Lys Glu
210 215 220
Leu Glu Glu Ser Glu Leu Glu Thr Val Ile Arg Glu Ile Pro Ala Pro
225 230 235 240
Ser Phe Leu Ile Pro Leu Pro Lys His Leu Thr Ala Ser Ser Ser Ser
245 250 255
Leu Leu Asp His Asp Arg Thr Val Phe Gln Trp Leu Asp Gln Gln Pro
260 265 270
Pro Ser Ser Val Leu Tyr Val Ser Phe Gly Ser Thr Ser Glu Val Asp
275 280 285
Glu Lys Asp Phe Leu Glu Ile Ala Arg Gly Leu Val Asp Ser Lys Gln
290 295 300
Ser Phe Leu Trp Val Val Arg Pro Gly Phe Val Lys Gly Ser Thr Trp
305 310 315 320
Val Glu Pro Leu Pro Asp Gly Phe Leu Gly Glu Arg Gly Arg Ile Val
325 330 335
Lys Trp Val Pro Gln Gln Glu Val Leu Ala His Gly Ala Ile Gly Ala
340 345 350
Phe Trp Thr His Ser Gly Trp Asn Ser Thr Leu Glu Ser Val Cys Glu
355 360 365
Gly Val Pro Met Ile Phe Ser Asp Phe Gly Leu Asp Gln Pro Leu Asn
370 375 380
Ala Arg Tyr Met Ser Asp Val Leu Lys Val Gly Val Tyr Leu Glu Asn
385 390 395 400
Gly Trp Glu Arg Gly Glu Ile Ala Asn Ala Ile Arg Arg Val Met Val
405 410 415
Asp Glu Glu Gly Glu Tyr Ile Arg Gln Asn Ala Arg Val Leu Lys Gln
420 425 430
Lys Ala Asp Val Ser Leu Met Lys Gly Gly Ser Ser Tyr Glu Ser Leu
435 440 445
Glu Ser Leu Val Ser Tyr Ile Ser Ser Leu
450 455
<210> 77
<211> 2130
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(2130)
<400> 77
atg caa tct gat tcc gtt aag gtt tct cca ttc gac tta gtc tct gct 48
Met Gln Ser Asp Ser Val Lys Val Ser Pro Phe Asp Leu Val Ser Ala
1 5 10 15
gcc atg aac ggt aag gct atg gaa aag ttg aac gct tct gaa tct gag 96
Ala Met Asn Gly Lys Ala Met Glu Lys Leu Asn Ala Ser Glu Ser Glu
20 25 30
gac cca act act tta cca gct tta aag atg ttg gtt gaa aac aga gaa 144
Asp Pro Thr Thr Leu Pro Ala Leu Lys Met Leu Val Glu Asn Arg Glu
35 40 45
ttg ttg act tta ttc acc acc tcc ttt gct gtc ttg att ggt tgt ttg 192
Leu Leu Thr Leu Phe Thr Thr Ser Phe Ala Val Leu Ile Gly Cys Leu
50 55 60
gtt ttc ttg atg tgg aga aga tct tct tct aag aag ttg gtt caa gac 240
Val Phe Leu Met Trp Arg Arg Ser Ser Ser Lys Lys Leu Val Gln Asp
65 70 75 80
cca gtt cct caa gtc atc gtc gtc aag aag aag gaa aag gaa tct gaa 288
Pro Val Pro Gln Val Ile Val Val Lys Lys Lys Glu Lys Glu Ser Glu
85 90 95
gtc gat gac ggt aag aag aaa gtt tcc atc ttc tac ggt act caa acc 336
Val Asp Asp Gly Lys Lys Lys Val Ser Ile Phe Tyr Gly Thr Gln Thr
100 105 110
ggt act gct gaa ggt ttc gct aaa gct tta gtt gaa gaa gct aag gtc 384
Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Val Glu Glu Ala Lys Val
115 120 125
aga tac gaa aag acc tct ttc aag gtt atc gat ttg gac gac tac gcc 432
Arg Tyr Glu Lys Thr Ser Phe Lys Val Ile Asp Leu Asp Asp Tyr Ala
130 135 140
gct gac gac gac gaa tac gaa gaa aag ttg aag aag gaa tct cta gct 480
Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Ser Leu Ala
145 150 155 160
ttc ttc ttc ttg gct acc tac ggt gac ggt gaa cca acc gac aat gct 528
Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala
165 170 175
gcc aac ttc tac aaa tgg ttc acc gaa ggt gat gac aag ggt gaa tgg 576
Ala Asn Phe Tyr Lys Trp Phe Thr Glu Gly Asp Asp Lys Gly Glu Trp
180 185 190
ttg aag aaa ttg caa tac ggt gtt ttc ggt tta ggt aac aga caa tac 624
Leu Lys Lys Leu Gln Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr
195 200 205
gaa cac ttc aac aag att gcc atc gtc gtt gat gac aaa ttg act gaa 672
Glu His Phe Asn Lys Ile Ala Ile Val Val Asp Asp Lys Leu Thr Glu
210 215 220
atg ggt gcc aag aga ttg gtt cca gtc ggt ttg ggt gat gat gac caa 720
Met Gly Ala Lys Arg Leu Val Pro Val Gly Leu Gly Asp Asp Asp Gln
225 230 235 240
tgt att gaa gat gac ttt acc gct tgg aag gaa tta gtt tgg cca gaa 768
Cys Ile Glu Asp Asp Phe Thr Ala Trp Lys Glu Leu Val Trp Pro Glu
245 250 255
tta gac caa ttg ttg aga gat gaa gat gac acc tct gtt acc acc cca 816
Leu Asp Gln Leu Leu Arg Asp Glu Asp Asp Thr Ser Val Thr Thr Pro
260 265 270
tac acc gcc gct gtc ttg gaa tac cgt gtc gtt tac cat gac aaa cca 864
Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Val Tyr His Asp Lys Pro
275 280 285
gct gac tct tac gct gaa gat caa acc cac acc aac ggt cac gtt gtc 912
Ala Asp Ser Tyr Ala Glu Asp Gln Thr His Thr Asn Gly His Val Val
290 295 300
cac gac gct caa cac cca tcc aga tct aac gtt gct ttc aag aag gaa 960
His Asp Ala Gln His Pro Ser Arg Ser Asn Val Ala Phe Lys Lys Glu
305 310 315 320
ttg cac act tcc caa tct gat cgt tct tgt acc cac ttg gaa ttt gac 1008
Leu His Thr Ser Gln Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp
325 330 335
atc tcc cac acc ggt ttg tct tac gaa acc ggt gac cat gtt ggt gtc 1056
Ile Ser His Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly Val
340 345 350
tac tct gaa aac ttg tct gaa gtt gtc gac gaa gct ttg aaa ttg ttg 1104
Tyr Ser Glu Asn Leu Ser Glu Val Val Asp Glu Ala Leu Lys Leu Leu
355 360 365
ggt ttg tct cca gac act tac ttc tcc gtc cac gct gac aaa gaa gat 1152
Gly Leu Ser Pro Asp Thr Tyr Phe Ser Val His Ala Asp Lys Glu Asp
370 375 380
ggt act cca atc ggt ggt gct tct ttg cct cca cca ttc cca cca tgt 1200
Gly Thr Pro Ile Gly Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro Cys
385 390 395 400
act cta aga gat gct ttg acc cgt tat gct gac gtt cta tcc tct cca 1248
Thr Leu Arg Asp Ala Leu Thr Arg Tyr Ala Asp Val Leu Ser Ser Pro
405 410 415
aag aag gtt gct ttg ttg gct ttg gct gct cat gct tct gat cca tct 1296
Lys Lys Val Ala Leu Leu Ala Leu Ala Ala His Ala Ser Asp Pro Ser
420 425 430
gaa gct gac aga ttg aag ttc ttg gct tcc cca gcc ggt aag gac gaa 1344
Glu Ala Asp Arg Leu Lys Phe Leu Ala Ser Pro Ala Gly Lys Asp Glu
435 440 445
tac gct caa tgg att gtt gcc aac caa aga tcc ttg ttg gaa gtc atg 1392
Tyr Ala Gln Trp Ile Val Ala Asn Gln Arg Ser Leu Leu Glu Val Met
450 455 460
caa tct ttc cca tct gcc aag cct cca ttg ggt gtt ttc ttt gcc gct 1440
Gln Ser Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ala
465 470 475 480
gtt gcc cca aga ttg caa cca aga tac tac tcc atc tcc tct tct cca 1488
Val Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro
485 490 495
aag atg tcc cca aac aga att cac gtc act tgt gct ttg gtt tac gaa 1536
Lys Met Ser Pro Asn Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu
500 505 510
acc act cca gct ggt aga att cac aga ggt ttg tgt tcc acc tgg atg 1584
Thr Thr Pro Ala Gly Arg Ile His Arg Gly Leu Cys Ser Thr Trp Met
515 520 525
aag aat gcc gtc cca tta act gaa tct cca gac tgt tcc caa gct tct 1632
Lys Asn Ala Val Pro Leu Thr Glu Ser Pro Asp Cys Ser Gln Ala Ser
530 535 540
atc ttc gtc aga act tcc aac ttc aga ttg cca gtc gat cca aag gtc 1680
Ile Phe Val Arg Thr Ser Asn Phe Arg Leu Pro Val Asp Pro Lys Val
545 550 555 560
cca gtt atc atg atc ggt cca ggt act ggt ttg gct cca ttc aga ggt 1728
Pro Val Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly
565 570 575
ttc ttg caa gaa aga tta gct ttg aaa gaa tct ggt act gaa ttg ggt 1776
Phe Leu Gln Glu Arg Leu Ala Leu Lys Glu Ser Gly Thr Glu Leu Gly
580 585 590
tcc tcc atc ttc ttc ttc ggt tgt aga aac aga aag gtc gat ttt atc 1824
Ser Ser Ile Phe Phe Phe Gly Cys Arg Asn Arg Lys Val Asp Phe Ile
595 600 605
tac gaa gat gaa ttg aac aac ttc gtt gaa act ggt gct ttg tct gaa 1872
Tyr Glu Asp Glu Leu Asn Asn Phe Val Glu Thr Gly Ala Leu Ser Glu
610 615 620
ttg att gtc gct ttc tcc aga gaa ggt act gcc aag gaa tac gtc caa 1920
Leu Ile Val Ala Phe Ser Arg Glu Gly Thr Ala Lys Glu Tyr Val Gln
625 630 635 640
cac aag atg tcc caa aag gct tct gat atc tgg aag ttg ttg tct gaa 1968
His Lys Met Ser Gln Lys Ala Ser Asp Ile Trp Lys Leu Leu Ser Glu
645 650 655
ggt gct tac ttg tac gtt tgt ggt gat gct aaa ggt atg gcc aag gac 2016
Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Lys Asp
660 665 670
gtt cac aga act ttg cac act att gtt caa gaa caa ggt tct ttg gac 2064
Val His Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu Asp
675 680 685
tcc tcc aag gct gaa tta tac gtc aag aac ttg caa atg tcc ggt cgt 2112
Ser Ser Lys Ala Glu Leu Tyr Val Lys Asn Leu Gln Met Ser Gly Arg
690 695 700
tac ttg aga gat gta tgg 2130
Tyr Leu Arg Asp Val Trp
705 710
<210> 78
<211> 710
<212> PRT
<213> Stevia rebaudiana
<400> 78
Met Gln Ser Asp Ser Val Lys Val Ser Pro Phe Asp Leu Val Ser Ala
1 5 10 15
Ala Met Asn Gly Lys Ala Met Glu Lys Leu Asn Ala Ser Glu Ser Glu
20 25 30
Asp Pro Thr Thr Leu Pro Ala Leu Lys Met Leu Val Glu Asn Arg Glu
35 40 45
Leu Leu Thr Leu Phe Thr Thr Ser Phe Ala Val Leu Ile Gly Cys Leu
50 55 60
Val Phe Leu Met Trp Arg Arg Ser Ser Ser Lys Lys Leu Val Gln Asp
65 70 75 80
Pro Val Pro Gln Val Ile Val Val Lys Lys Lys Glu Lys Glu Ser Glu
85 90 95
Val Asp Asp Gly Lys Lys Lys Val Ser Ile Phe Tyr Gly Thr Gln Thr
100 105 110
Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Val Glu Glu Ala Lys Val
115 120 125
Arg Tyr Glu Lys Thr Ser Phe Lys Val Ile Asp Leu Asp Asp Tyr Ala
130 135 140
Ala Asp Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu Ser Leu Ala
145 150 155 160
Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala
165 170 175
Ala Asn Phe Tyr Lys Trp Phe Thr Glu Gly Asp Asp Lys Gly Glu Trp
180 185 190
Leu Lys Lys Leu Gln Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr
195 200 205
Glu His Phe Asn Lys Ile Ala Ile Val Val Asp Asp Lys Leu Thr Glu
210 215 220
Met Gly Ala Lys Arg Leu Val Pro Val Gly Leu Gly Asp Asp Asp Gln
225 230 235 240
Cys Ile Glu Asp Asp Phe Thr Ala Trp Lys Glu Leu Val Trp Pro Glu
245 250 255
Leu Asp Gln Leu Leu Arg Asp Glu Asp Asp Thr Ser Val Thr Thr Pro
260 265 270
Tyr Thr Ala Ala Val Leu Glu Tyr Arg Val Val Tyr His Asp Lys Pro
275 280 285
Ala Asp Ser Tyr Ala Glu Asp Gln Thr His Thr Asn Gly His Val Val
290 295 300
His Asp Ala Gln His Pro Ser Arg Ser Asn Val Ala Phe Lys Lys Glu
305 310 315 320
Leu His Thr Ser Gln Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asp
325 330 335
Ile Ser His Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly Val
340 345 350
Tyr Ser Glu Asn Leu Ser Glu Val Val Asp Glu Ala Leu Lys Leu Leu
355 360 365
Gly Leu Ser Pro Asp Thr Tyr Phe Ser Val His Ala Asp Lys Glu Asp
370 375 380
Gly Thr Pro Ile Gly Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro Cys
385 390 395 400
Thr Leu Arg Asp Ala Leu Thr Arg Tyr Ala Asp Val Leu Ser Ser Pro
405 410 415
Lys Lys Val Ala Leu Leu Ala Leu Ala Ala His Ala Ser Asp Pro Ser
420 425 430
Glu Ala Asp Arg Leu Lys Phe Leu Ala Ser Pro Ala Gly Lys Asp Glu
435 440 445
Tyr Ala Gln Trp Ile Val Ala Asn Gln Arg Ser Leu Leu Glu Val Met
450 455 460
Gln Ser Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Ala
465 470 475 480
Val Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro
485 490 495
Lys Met Ser Pro Asn Arg Ile His Val Thr Cys Ala Leu Val Tyr Glu
500 505 510
Thr Thr Pro Ala Gly Arg Ile His Arg Gly Leu Cys Ser Thr Trp Met
515 520 525
Lys Asn Ala Val Pro Leu Thr Glu Ser Pro Asp Cys Ser Gln Ala Ser
530 535 540
Ile Phe Val Arg Thr Ser Asn Phe Arg Leu Pro Val Asp Pro Lys Val
545 550 555 560
Pro Val Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly
565 570 575
Phe Leu Gln Glu Arg Leu Ala Leu Lys Glu Ser Gly Thr Glu Leu Gly
580 585 590
Ser Ser Ile Phe Phe Phe Gly Cys Arg Asn Arg Lys Val Asp Phe Ile
595 600 605
Tyr Glu Asp Glu Leu Asn Asn Phe Val Glu Thr Gly Ala Leu Ser Glu
610 615 620
Leu Ile Val Ala Phe Ser Arg Glu Gly Thr Ala Lys Glu Tyr Val Gln
625 630 635 640
His Lys Met Ser Gln Lys Ala Ser Asp Ile Trp Lys Leu Leu Ser Glu
645 650 655
Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Lys Asp
660 665 670
Val His Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu Asp
675 680 685
Ser Ser Lys Ala Glu Leu Tyr Val Lys Asn Leu Gln Met Ser Gly Arg
690 695 700
Tyr Leu Arg Asp Val Trp
705 710
<210> 79
<211> 1575
<212> DNA
<213> Saccharomyces cerevisiae
<220>
<221> CDS
<222> (1)..(1575)
<400> 79
atg gac caa ttg gtc aag act gaa gtc acc aag aaa tct ttc act gct 48
Met Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr Ala
1 5 10 15
cca gtc caa aag gct tcc act cca gtt ttg acc aac aag acc gtc atc 96
Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asn Lys Thr Val Ile
20 25 30
tcc ggt tcc aag gtt aaa tct ttg tcc tct gct caa tct tcc tcc tct 144
Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser Ser
35 40 45
ggt cca tct tct tct tct gaa gaa gat gat tcc aga gat atc gaa tct 192
Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu Ser
50 55 60
ttg gac aag aaa atc aga cca ttg gaa gaa ttg gaa gct cta ttg tcc 240
Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Leu Leu Ser
65 70 75 80
tct ggt aac act aag caa tta aag aac aag gaa gtt gct gct ttg gtt 288
Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu Val
85 90 95
atc cac ggt aaa ttg cca ttg tac gct ttg gaa aag aaa tta ggt gac 336
Ile His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly Asp
100 105 110
acc acc aga gct gtt gct gtc aga aga aag gct ttg tcc att ttg gct 384
Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu Ala
115 120 125
gaa gct cca gtc ttg gct tcc gac aga tta cca tac aag aac tac gac 432
Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr Asp
130 135 140
tac gac cgt gtc ttt ggt gct tgt tgt gaa aat gtc att ggt tac atg 480
Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr Met
145 150 155 160
cca tta cca gtt ggt gtc att ggt cca ttg gtt atc gac ggt act tct 528
Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Ile Asp Gly Thr Ser
165 170 175
tac cac atc cca atg gct acc act gaa ggt tgt ttg gtt gct tct gcc 576
Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser Ala
180 185 190
atg aga ggt tgt aag gcc atc aac gct ggt ggt ggt gct acc acc gtt 624
Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr Val
195 200 205
ttg act aag gat ggt atg acc aga ggt cct gtt gtc aga ttc cca act 672
Leu Thr Lys Asp Gly Met Thr Arg Gly Pro Val Val Arg Phe Pro Thr
210 215 220
ttg aag aga tct ggt gct tgt aag atc tgg ttg gat tct gaa gaa ggt 720
Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu Gly
225 230 235 240
caa aac gcc atc aag aag gct ttc aac tcc act tcc aga ttc gct aga 768
Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala Arg
245 250 255
ttg caa cac att caa act tgt tta gct ggt gac ttg ttg ttc atg aga 816
Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met Arg
260 265 270
ttc aga acc acc act ggt gac gct atg ggt atg aac atg atc tcc aag 864
Phe Arg Thr Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser Lys
275 280 285
ggt gtt gaa tac tct ttg aag caa atg gtt gaa gaa tac ggt tgg gaa 912
Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp Glu
290 295 300
gat atg gaa gtt gtc tct gtt tct ggt aac tac tgt acc gac aag aag 960
Asp Met Glu Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys Lys
305 310 315 320
cca gct gcc atc aac tgg atc gaa ggt cgt ggt aag tcc gtt gtt gct 1008
Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val Ala
325 330 335
gaa gct acc att cca ggt gac gtt gtc aga aag gtt ttg aaa tct gat 1056
Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser Asp
340 345 350
gtt tct gct tta gtc gaa ttg aac att gcc aag aac ttg gtc ggt tct 1104
Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly Ser
355 360 365
gcc atg gct ggt tcc gtc ggt ggt ttc aac gct cat gcc gct aac ttg 1152
Ala Met Ala Gly Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn Leu
370 375 380
gtc act gct gtt ttc ttg gct tta ggt caa gat cca gct caa aat gtc 1200
Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn Val
385 390 395 400
gaa tcc tct aac tgt atc act ttg atg aag gaa gtt gac ggt gat ttg 1248
Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp Leu
405 410 415
aga att tct gtt tcc atg cca tcc att gaa gtc ggt act atc ggt ggt 1296
Arg Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly Gly
420 425 430
ggt act gtc ttg gaa cca caa ggt gcc atg ttg gac ttg ttg ggt gtt 1344
Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly Val
435 440 445
cgt ggt cca cac gct acc gct cca ggt act aac gcc aga caa ttg gcc 1392
Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu Ala
450 455 460
aga att gtt gcc tgt gcc gtc ttg gct ggt gaa ttg tct cta tgt gcc 1440
Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys Ala
465 470 475 480
gct ttg gct gct ggt cac ttg gtt caa tct cac atg acc cac aac aga 1488
Ala Leu Ala Ala Gly His Leu Val Gln Ser His Met Thr His Asn Arg
485 490 495
aag cct gct gaa cca acc aaa cca aac aac ttg gat gct act gac att 1536
Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu Asp Ala Thr Asp Ile
500 505 510
aac aga tta aag gac ggt tct gtc acc tgt atc aag tct 1575
Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser
515 520 525
<210> 80
<211> 525
<212> PRT
<213> Saccharomyces cerevisiae
<400> 80
Met Asp Gln Leu Val Lys Thr Glu Val Thr Lys Lys Ser Phe Thr Ala
1 5 10 15
Pro Val Gln Lys Ala Ser Thr Pro Val Leu Thr Asn Lys Thr Val Ile
20 25 30
Ser Gly Ser Lys Val Lys Ser Leu Ser Ser Ala Gln Ser Ser Ser Ser
35 40 45
Gly Pro Ser Ser Ser Ser Glu Glu Asp Asp Ser Arg Asp Ile Glu Ser
50 55 60
Leu Asp Lys Lys Ile Arg Pro Leu Glu Glu Leu Glu Ala Leu Leu Ser
65 70 75 80
Ser Gly Asn Thr Lys Gln Leu Lys Asn Lys Glu Val Ala Ala Leu Val
85 90 95
Ile His Gly Lys Leu Pro Leu Tyr Ala Leu Glu Lys Lys Leu Gly Asp
100 105 110
Thr Thr Arg Ala Val Ala Val Arg Arg Lys Ala Leu Ser Ile Leu Ala
115 120 125
Glu Ala Pro Val Leu Ala Ser Asp Arg Leu Pro Tyr Lys Asn Tyr Asp
130 135 140
Tyr Asp Arg Val Phe Gly Ala Cys Cys Glu Asn Val Ile Gly Tyr Met
145 150 155 160
Pro Leu Pro Val Gly Val Ile Gly Pro Leu Val Ile Asp Gly Thr Ser
165 170 175
Tyr His Ile Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala Ser Ala
180 185 190
Met Arg Gly Cys Lys Ala Ile Asn Ala Gly Gly Gly Ala Thr Thr Val
195 200 205
Leu Thr Lys Asp Gly Met Thr Arg Gly Pro Val Val Arg Phe Pro Thr
210 215 220
Leu Lys Arg Ser Gly Ala Cys Lys Ile Trp Leu Asp Ser Glu Glu Gly
225 230 235 240
Gln Asn Ala Ile Lys Lys Ala Phe Asn Ser Thr Ser Arg Phe Ala Arg
245 250 255
Leu Gln His Ile Gln Thr Cys Leu Ala Gly Asp Leu Leu Phe Met Arg
260 265 270
Phe Arg Thr Thr Thr Gly Asp Ala Met Gly Met Asn Met Ile Ser Lys
275 280 285
Gly Val Glu Tyr Ser Leu Lys Gln Met Val Glu Glu Tyr Gly Trp Glu
290 295 300
Asp Met Glu Val Val Ser Val Ser Gly Asn Tyr Cys Thr Asp Lys Lys
305 310 315 320
Pro Ala Ala Ile Asn Trp Ile Glu Gly Arg Gly Lys Ser Val Val Ala
325 330 335
Glu Ala Thr Ile Pro Gly Asp Val Val Arg Lys Val Leu Lys Ser Asp
340 345 350
Val Ser Ala Leu Val Glu Leu Asn Ile Ala Lys Asn Leu Val Gly Ser
355 360 365
Ala Met Ala Gly Ser Val Gly Gly Phe Asn Ala His Ala Ala Asn Leu
370 375 380
Val Thr Ala Val Phe Leu Ala Leu Gly Gln Asp Pro Ala Gln Asn Val
385 390 395 400
Glu Ser Ser Asn Cys Ile Thr Leu Met Lys Glu Val Asp Gly Asp Leu
405 410 415
Arg Ile Ser Val Ser Met Pro Ser Ile Glu Val Gly Thr Ile Gly Gly
420 425 430
Gly Thr Val Leu Glu Pro Gln Gly Ala Met Leu Asp Leu Leu Gly Val
435 440 445
Arg Gly Pro His Ala Thr Ala Pro Gly Thr Asn Ala Arg Gln Leu Ala
450 455 460
Arg Ile Val Ala Cys Ala Val Leu Ala Gly Glu Leu Ser Leu Cys Ala
465 470 475 480
Ala Leu Ala Ala Gly His Leu Val Gln Ser His Met Thr His Asn Arg
485 490 495
Lys Pro Ala Glu Pro Thr Lys Pro Asn Asn Leu Asp Ala Thr Asp Ile
500 505 510
Asn Arg Leu Lys Asp Gly Ser Val Thr Cys Ile Lys Ser
515 520 525
<210> 81
<211> 1056
<212> DNA
<213> Saccharomyces cerevisiae
<220>
<221> CDS
<222> (1)..(1056)
<400> 81
atg gct tct gaa aag gaa atc aga aga gaa cgt ttc ttg aat gtt ttc 48
Met Ala Ser Glu Lys Glu Ile Arg Arg Glu Arg Phe Leu Asn Val Phe
1 5 10 15
cca aaa ttg gtt gaa gaa ttg aac gct tct cta tta gct tac ggt atg 96
Pro Lys Leu Val Glu Glu Leu Asn Ala Ser Leu Leu Ala Tyr Gly Met
20 25 30
cca aag gaa gct tgt gac tgg tac gct cac tct ttg aac tac aac acc 144
Pro Lys Glu Ala Cys Asp Trp Tyr Ala His Ser Leu Asn Tyr Asn Thr
35 40 45
cca ggt ggt aag ttg aac aga ggt cta tcc gtt gtt gac acc tac gcc 192
Pro Gly Gly Lys Leu Asn Arg Gly Leu Ser Val Val Asp Thr Tyr Ala
50 55 60
att ttg tcc aac aag acc gtc gaa caa tta ggt caa gaa gaa tac gaa 240
Ile Leu Ser Asn Lys Thr Val Glu Gln Leu Gly Gln Glu Glu Tyr Glu
65 70 75 80
aag gtt gcc atc tta ggt tgg tgt atc gaa ttg ttg caa gct tac ttc 288
Lys Val Ala Ile Leu Gly Trp Cys Ile Glu Leu Leu Gln Ala Tyr Phe
85 90 95
ttg gtt gct gat gac atg atg gac aaa tct atc acc aga aga ggt caa 336
Leu Val Ala Asp Asp Met Met Asp Lys Ser Ile Thr Arg Arg Gly Gln
100 105 110
cca tgt tgg tac aag gtt cca gaa gtc ggt gaa att gcc atc aac gat 384
Pro Cys Trp Tyr Lys Val Pro Glu Val Gly Glu Ile Ala Ile Asn Asp
115 120 125
gct ttc atg ttg gaa gct gcc atc tac aag ttg ttg aag tct cac ttc 432
Ala Phe Met Leu Glu Ala Ala Ile Tyr Lys Leu Leu Lys Ser His Phe
130 135 140
aga aac gaa aag tac tac att gac atc act gaa tta ttc cac gaa gtt 480
Arg Asn Glu Lys Tyr Tyr Ile Asp Ile Thr Glu Leu Phe His Glu Val
145 150 155 160
act ttc caa acc gaa ttg ggt caa ttg atg gac ttg att acc gct cca 528
Thr Phe Gln Thr Glu Leu Gly Gln Leu Met Asp Leu Ile Thr Ala Pro
165 170 175
gaa gat aag gtc gat ttg tcc aaa ttt tcc ttg aag aaa cac tct ttc 576
Glu Asp Lys Val Asp Leu Ser Lys Phe Ser Leu Lys Lys His Ser Phe
180 185 190
att gtc act ttc aag act gct tac tac tcc ttt tac ttg cct gtt gct 624
Ile Val Thr Phe Lys Thr Ala Tyr Tyr Ser Phe Tyr Leu Pro Val Ala
195 200 205
ttg gcc atg tat gtc gct ggt atc acc gat gaa aag gac ttg aag caa 672
Leu Ala Met Tyr Val Ala Gly Ile Thr Asp Glu Lys Asp Leu Lys Gln
210 215 220
gct cgt gat gtc ttg att cca tta ggt gaa tac ttc caa atc caa gat 720
Ala Arg Asp Val Leu Ile Pro Leu Gly Glu Tyr Phe Gln Ile Gln Asp
225 230 235 240
gac tac ttg gac tgt ttc ggt act cca gaa caa atc ggt aag att ggt 768
Asp Tyr Leu Asp Cys Phe Gly Thr Pro Glu Gln Ile Gly Lys Ile Gly
245 250 255
act gat atc caa gac aac aag tgt tcc tgg gtt atc aac aag gct ttg 816
Thr Asp Ile Gln Asp Asn Lys Cys Ser Trp Val Ile Asn Lys Ala Leu
260 265 270
gaa ttg gct tct gct gaa caa aga aag act ttg gac gaa aac tac ggt 864
Glu Leu Ala Ser Ala Glu Gln Arg Lys Thr Leu Asp Glu Asn Tyr Gly
275 280 285
aag aag gac tct gtt gct gaa gct aag tgt aag aag atc ttc aac gat 912
Lys Lys Asp Ser Val Ala Glu Ala Lys Cys Lys Lys Ile Phe Asn Asp
290 295 300
ttg aaa att gaa caa tta tac cat gaa tac gaa gaa tct att gcc aag 960
Leu Lys Ile Glu Gln Leu Tyr His Glu Tyr Glu Glu Ser Ile Ala Lys
305 310 315 320
gac ttg aaa gcc aag atc tct caa gtc gac gaa tcc aga ggt ttc aag 1008
Asp Leu Lys Ala Lys Ile Ser Gln Val Asp Glu Ser Arg Gly Phe Lys
325 330 335
gct gat gtc ttg act gct ttc ttg aac aag gtc tac aag aga tca aaa 1056
Ala Asp Val Leu Thr Ala Phe Leu Asn Lys Val Tyr Lys Arg Ser Lys
340 345 350
<210> 82
<211> 352
<212> PRT
<213> Saccharomyces cerevisiae
<400> 82
Met Ala Ser Glu Lys Glu Ile Arg Arg Glu Arg Phe Leu Asn Val Phe
1 5 10 15
Pro Lys Leu Val Glu Glu Leu Asn Ala Ser Leu Leu Ala Tyr Gly Met
20 25 30
Pro Lys Glu Ala Cys Asp Trp Tyr Ala His Ser Leu Asn Tyr Asn Thr
35 40 45
Pro Gly Gly Lys Leu Asn Arg Gly Leu Ser Val Val Asp Thr Tyr Ala
50 55 60
Ile Leu Ser Asn Lys Thr Val Glu Gln Leu Gly Gln Glu Glu Tyr Glu
65 70 75 80
Lys Val Ala Ile Leu Gly Trp Cys Ile Glu Leu Leu Gln Ala Tyr Phe
85 90 95
Leu Val Ala Asp Asp Met Met Asp Lys Ser Ile Thr Arg Arg Gly Gln
100 105 110
Pro Cys Trp Tyr Lys Val Pro Glu Val Gly Glu Ile Ala Ile Asn Asp
115 120 125
Ala Phe Met Leu Glu Ala Ala Ile Tyr Lys Leu Leu Lys Ser His Phe
130 135 140
Arg Asn Glu Lys Tyr Tyr Ile Asp Ile Thr Glu Leu Phe His Glu Val
145 150 155 160
Thr Phe Gln Thr Glu Leu Gly Gln Leu Met Asp Leu Ile Thr Ala Pro
165 170 175
Glu Asp Lys Val Asp Leu Ser Lys Phe Ser Leu Lys Lys His Ser Phe
180 185 190
Ile Val Thr Phe Lys Thr Ala Tyr Tyr Ser Phe Tyr Leu Pro Val Ala
195 200 205
Leu Ala Met Tyr Val Ala Gly Ile Thr Asp Glu Lys Asp Leu Lys Gln
210 215 220
Ala Arg Asp Val Leu Ile Pro Leu Gly Glu Tyr Phe Gln Ile Gln Asp
225 230 235 240
Asp Tyr Leu Asp Cys Phe Gly Thr Pro Glu Gln Ile Gly Lys Ile Gly
245 250 255
Thr Asp Ile Gln Asp Asn Lys Cys Ser Trp Val Ile Asn Lys Ala Leu
260 265 270
Glu Leu Ala Ser Ala Glu Gln Arg Lys Thr Leu Asp Glu Asn Tyr Gly
275 280 285
Lys Lys Asp Ser Val Ala Glu Ala Lys Cys Lys Lys Ile Phe Asn Asp
290 295 300
Leu Lys Ile Glu Gln Leu Tyr His Glu Tyr Glu Glu Ser Ile Ala Lys
305 310 315 320
Asp Leu Lys Ala Lys Ile Ser Gln Val Asp Glu Ser Arg Gly Phe Lys
325 330 335
Ala Asp Val Leu Thr Ala Phe Leu Asn Lys Val Tyr Lys Arg Ser Lys
340 345 350
<210> 83
<211> 1005
<212> DNA
<213> Saccharomyces cerevisiae
<220>
<221> CDS
<222> (1)..(1005)
<400> 83
atg gaa gct aag att gac gaa ttg atc aac aac gac cct gtc tgg tcc 48
Met Glu Ala Lys Ile Asp Glu Leu Ile Asn Asn Asp Pro Val Trp Ser
1 5 10 15
tct caa aac gaa tct ttg atc tcc aag cca tac aac cac atc ttg ttg 96
Ser Gln Asn Glu Ser Leu Ile Ser Lys Pro Tyr Asn His Ile Leu Leu
20 25 30
aag cca ggt aag aac ttc aga tta aac ttg att gtt caa atc aac aga 144
Lys Pro Gly Lys Asn Phe Arg Leu Asn Leu Ile Val Gln Ile Asn Arg
35 40 45
gtt atg aac ttg cca aag gac caa ttg gcc att gtt tcc caa att gtc 192
Val Met Asn Leu Pro Lys Asp Gln Leu Ala Ile Val Ser Gln Ile Val
50 55 60
gaa ttg ttg cac aac tcc tct cta ttg atc gat gac att gaa gat aat 240
Glu Leu Leu His Asn Ser Ser Leu Leu Ile Asp Asp Ile Glu Asp Asn
65 70 75 80
gct cca tta aga aga ggt caa acc act tct cat ttg att ttc ggt gtc 288
Ala Pro Leu Arg Arg Gly Gln Thr Thr Ser His Leu Ile Phe Gly Val
85 90 95
cca tcc acc atc aac act gct aac tac atg tac ttc aga gcc atg caa 336
Pro Ser Thr Ile Asn Thr Ala Asn Tyr Met Tyr Phe Arg Ala Met Gln
100 105 110
ttg gtt tct caa ttg acc acc aag gaa cca tta tac cac aac ttg atc 384
Leu Val Ser Gln Leu Thr Thr Lys Glu Pro Leu Tyr His Asn Leu Ile
115 120 125
act atc ttt aac gaa gaa ttg att aac ttg cac cgt ggt caa ggt ttg 432
Thr Ile Phe Asn Glu Glu Leu Ile Asn Leu His Arg Gly Gln Gly Leu
130 135 140
gac atc tac tgg aga gat ttc ttg cca gaa att att cca act caa gaa 480
Asp Ile Tyr Trp Arg Asp Phe Leu Pro Glu Ile Ile Pro Thr Gln Glu
145 150 155 160
atg tac ttg aac atg gtc atg aac aag act ggt ggt tta ttc aga ttg 528
Met Tyr Leu Asn Met Val Met Asn Lys Thr Gly Gly Leu Phe Arg Leu
165 170 175
act tta cgt ttg atg gaa gct ttg tct cca tct tcc cac cac ggt cac 576
Thr Leu Arg Leu Met Glu Ala Leu Ser Pro Ser Ser His His Gly His
180 185 190
tct ttg gtt cca ttc atc aat cta tta ggt atc atc tac caa atc aga 624
Ser Leu Val Pro Phe Ile Asn Leu Leu Gly Ile Ile Tyr Gln Ile Arg
195 200 205
gat gat tac ttg aac ttg aag gac ttc caa atg tcc tct gaa aag ggt 672
Asp Asp Tyr Leu Asn Leu Lys Asp Phe Gln Met Ser Ser Glu Lys Gly
210 215 220
ttc gct gaa gat atc act gaa ggt aaa ttg tct ttc cca att gtc cac 720
Phe Ala Glu Asp Ile Thr Glu Gly Lys Leu Ser Phe Pro Ile Val His
225 230 235 240
gcc ttg aac ttt acc aag acc aag ggt caa act gaa caa cac aac gaa 768
Ala Leu Asn Phe Thr Lys Thr Lys Gly Gln Thr Glu Gln His Asn Glu
245 250 255
att ttg aga atc tta ttg ttg aga act tct gac aag gac atc aag ttg 816
Ile Leu Arg Ile Leu Leu Leu Arg Thr Ser Asp Lys Asp Ile Lys Leu
260 265 270
aaa ttg atc caa atc ttg gaa ttc gat acc aac tct ttg gct tac acc 864
Lys Leu Ile Gln Ile Leu Glu Phe Asp Thr Asn Ser Leu Ala Tyr Thr
275 280 285
aag aac ttc atc aac caa ttg gtt aac atg atc aag aat gac aac gaa 912
Lys Asn Phe Ile Asn Gln Leu Val Asn Met Ile Lys Asn Asp Asn Glu
290 295 300
aac aaa tac ttg cca gac ttg gct tcc cac tcc gat acc gct acc aac 960
Asn Lys Tyr Leu Pro Asp Leu Ala Ser His Ser Asp Thr Ala Thr Asn
305 310 315 320
ttg cac gac gaa ttg ttg tac att att gac cat ttg tct gag tta 1005
Leu His Asp Glu Leu Leu Tyr Ile Ile Asp His Leu Ser Glu Leu
325 330 335
<210> 84
<211> 335
<212> PRT
<213> Saccharomyces cerevisiae
<400> 84
Met Glu Ala Lys Ile Asp Glu Leu Ile Asn Asn Asp Pro Val Trp Ser
1 5 10 15
Ser Gln Asn Glu Ser Leu Ile Ser Lys Pro Tyr Asn His Ile Leu Leu
20 25 30
Lys Pro Gly Lys Asn Phe Arg Leu Asn Leu Ile Val Gln Ile Asn Arg
35 40 45
Val Met Asn Leu Pro Lys Asp Gln Leu Ala Ile Val Ser Gln Ile Val
50 55 60
Glu Leu Leu His Asn Ser Ser Leu Leu Ile Asp Asp Ile Glu Asp Asn
65 70 75 80
Ala Pro Leu Arg Arg Gly Gln Thr Thr Ser His Leu Ile Phe Gly Val
85 90 95
Pro Ser Thr Ile Asn Thr Ala Asn Tyr Met Tyr Phe Arg Ala Met Gln
100 105 110
Leu Val Ser Gln Leu Thr Thr Lys Glu Pro Leu Tyr His Asn Leu Ile
115 120 125
Thr Ile Phe Asn Glu Glu Leu Ile Asn Leu His Arg Gly Gln Gly Leu
130 135 140
Asp Ile Tyr Trp Arg Asp Phe Leu Pro Glu Ile Ile Pro Thr Gln Glu
145 150 155 160
Met Tyr Leu Asn Met Val Met Asn Lys Thr Gly Gly Leu Phe Arg Leu
165 170 175
Thr Leu Arg Leu Met Glu Ala Leu Ser Pro Ser Ser His His Gly His
180 185 190
Ser Leu Val Pro Phe Ile Asn Leu Leu Gly Ile Ile Tyr Gln Ile Arg
195 200 205
Asp Asp Tyr Leu Asn Leu Lys Asp Phe Gln Met Ser Ser Glu Lys Gly
210 215 220
Phe Ala Glu Asp Ile Thr Glu Gly Lys Leu Ser Phe Pro Ile Val His
225 230 235 240
Ala Leu Asn Phe Thr Lys Thr Lys Gly Gln Thr Glu Gln His Asn Glu
245 250 255
Ile Leu Arg Ile Leu Leu Leu Arg Thr Ser Asp Lys Asp Ile Lys Leu
260 265 270
Lys Leu Ile Gln Ile Leu Glu Phe Asp Thr Asn Ser Leu Ala Tyr Thr
275 280 285
Lys Asn Phe Ile Asn Gln Leu Val Asn Met Ile Lys Asn Asp Asn Glu
290 295 300
Asn Lys Tyr Leu Pro Asp Leu Ala Ser His Ser Asp Thr Ala Thr Asn
305 310 315 320
Leu His Asp Glu Leu Leu Tyr Ile Ile Asp His Leu Ser Glu Leu
325 330 335
<210> 85
<211> 1575
<212> DNA
<213> Gibberella fujikuroi
<220>
<221> CDS
<222> (1)..(1575)
<400> 85
atg tcc aag tct aac tcc atg aac tcc act tct cac gaa act tta ttc 48
Met Ser Lys Ser Asn Ser Met Asn Ser Thr Ser His Glu Thr Leu Phe
1 5 10 15
caa caa ttg gtt ttg ggt ttg gac aga atg cca ttg atg gat gtc cac 96
Gln Gln Leu Val Leu Gly Leu Asp Arg Met Pro Leu Met Asp Val His
20 25 30
tgg ttg atc tac gtt gct ttc ggt gct tgg tta tgt tcc tac gtc att 144
Trp Leu Ile Tyr Val Ala Phe Gly Ala Trp Leu Cys Ser Tyr Val Ile
35 40 45
cac gtt ttg tcc tct tct tct acc gtc aag gtt cca gtt gtc ggt tac 192
His Val Leu Ser Ser Ser Ser Thr Val Lys Val Pro Val Val Gly Tyr
50 55 60
aga tcc gtt ttc gaa cca acc tgg tta ttg aga tta aga ttt gtc tgg 240
Arg Ser Val Phe Glu Pro Thr Trp Leu Leu Arg Leu Arg Phe Val Trp
65 70 75 80
gaa ggt ggt tcc att att ggt caa ggt tac aac aaa ttc aag gac tct 288
Glu Gly Gly Ser Ile Ile Gly Gln Gly Tyr Asn Lys Phe Lys Asp Ser
85 90 95
atc ttc caa gtc aga aag ttg ggt act gac att gtt atc atc cca cca 336
Ile Phe Gln Val Arg Lys Leu Gly Thr Asp Ile Val Ile Ile Pro Pro
100 105 110
aac tac atc gat gaa gtc aga aag ttg tcc caa gac aag acc aga tct 384
Asn Tyr Ile Asp Glu Val Arg Lys Leu Ser Gln Asp Lys Thr Arg Ser
115 120 125
gtt gaa cca ttc atc aac gat ttc gct ggt caa tac acc aga ggt atg 432
Val Glu Pro Phe Ile Asn Asp Phe Ala Gly Gln Tyr Thr Arg Gly Met
130 135 140
gtc ttt cta caa tct gat ttg caa aac cgt gtc atc caa caa aga ttg 480
Val Phe Leu Gln Ser Asp Leu Gln Asn Arg Val Ile Gln Gln Arg Leu
145 150 155 160
act cca aag ttg gtt tct ttg act aag gtc atg aag gaa gaa ttg gac 528
Thr Pro Lys Leu Val Ser Leu Thr Lys Val Met Lys Glu Glu Leu Asp
165 170 175
tac gct ttg acc aag gaa atg cca gac atg aag aac gac gaa tgg gtt 576
Tyr Ala Leu Thr Lys Glu Met Pro Asp Met Lys Asn Asp Glu Trp Val
180 185 190
gaa gtt gac att tct tcc atc atg gtc aga ttg atc tcc aga atc tct 624
Glu Val Asp Ile Ser Ser Ile Met Val Arg Leu Ile Ser Arg Ile Ser
195 200 205
gcc cgt gtt ttc ttg ggt cca gaa cac tgt cgt aac caa gaa tgg ttg 672
Ala Arg Val Phe Leu Gly Pro Glu His Cys Arg Asn Gln Glu Trp Leu
210 215 220
acc acc act gct gaa tac tct gaa tct tta ttc atc act ggt ttc atc 720
Thr Thr Thr Ala Glu Tyr Ser Glu Ser Leu Phe Ile Thr Gly Phe Ile
225 230 235 240
ttg aga gtt gtc cca cac atc tta aga cca ttc att gct cca ttg ttg 768
Leu Arg Val Val Pro His Ile Leu Arg Pro Phe Ile Ala Pro Leu Leu
245 250 255
cct tct tac aga act ttg ttg aga aat gtc tct tct ggt aga aga gtt 816
Pro Ser Tyr Arg Thr Leu Leu Arg Asn Val Ser Ser Gly Arg Arg Val
260 265 270
atc ggt gat atc atc aga tct caa caa ggt gat ggt aac gaa gat atc 864
Ile Gly Asp Ile Ile Arg Ser Gln Gln Gly Asp Gly Asn Glu Asp Ile
275 280 285
ttg tcc tgg atg aga gat gct gct acc ggt gaa gaa aag caa att gac 912
Leu Ser Trp Met Arg Asp Ala Ala Thr Gly Glu Glu Lys Gln Ile Asp
290 295 300
aac att gct caa aga atg ttg atc ttg tct ttg gct tcc att cac acc 960
Asn Ile Ala Gln Arg Met Leu Ile Leu Ser Leu Ala Ser Ile His Thr
305 310 315 320
acc gcc atg acc atg acc cat gcc atg tac gac ttg tgt gcc tgt cca 1008
Thr Ala Met Thr Met Thr His Ala Met Tyr Asp Leu Cys Ala Cys Pro
325 330 335
gaa tac att gaa cca tta cgt gac gaa gtc aaa tcc gtt gtt ggt gct 1056
Glu Tyr Ile Glu Pro Leu Arg Asp Glu Val Lys Ser Val Val Gly Ala
340 345 350
tct ggt tgg gac aag act gct ttg aac aga ttc cac aag ttg gac tct 1104
Ser Gly Trp Asp Lys Thr Ala Leu Asn Arg Phe His Lys Leu Asp Ser
355 360 365
ttc ttg aag gaa tct caa aga ttc aac cca gtt ttc ttg ttg act ttc 1152
Phe Leu Lys Glu Ser Gln Arg Phe Asn Pro Val Phe Leu Leu Thr Phe
370 375 380
aac aga atc tac cat caa tcc atg act ttg tcc gat ggt acc aac att 1200
Asn Arg Ile Tyr His Gln Ser Met Thr Leu Ser Asp Gly Thr Asn Ile
385 390 395 400
cca tct ggt acc aga att gct gtt cca tct cac gct atg ttg caa gat 1248
Pro Ser Gly Thr Arg Ile Ala Val Pro Ser His Ala Met Leu Gln Asp
405 410 415
tct gct cac gtt cca ggt cca act cct cca act gaa ttt gac ggt ttc 1296
Ser Ala His Val Pro Gly Pro Thr Pro Pro Thr Glu Phe Asp Gly Phe
420 425 430
aga tac tcc aag atc aga tct gac tct aac tat gct caa aag tac ttg 1344
Arg Tyr Ser Lys Ile Arg Ser Asp Ser Asn Tyr Ala Gln Lys Tyr Leu
435 440 445
ttc tcc atg acc gat tct tcc aac atg gct ttc ggt tac ggt aag tac 1392
Phe Ser Met Thr Asp Ser Ser Asn Met Ala Phe Gly Tyr Gly Lys Tyr
450 455 460
gct tgt cca ggt cgt ttc tac gcc tcc aac gaa atg aaa ttg act ttg 1440
Ala Cys Pro Gly Arg Phe Tyr Ala Ser Asn Glu Met Lys Leu Thr Leu
465 470 475 480
gcc att ttg ttg ttg caa ttt gaa ttc aaa ttg cca gat ggt aag ggt 1488
Ala Ile Leu Leu Leu Gln Phe Glu Phe Lys Leu Pro Asp Gly Lys Gly
485 490 495
aga cca aga aac atc act atc gac tct gac atg att cca gac cca aga 1536
Arg Pro Arg Asn Ile Thr Ile Asp Ser Asp Met Ile Pro Asp Pro Arg
500 505 510
gct aga tta tgt gtc aga aag aga tct cta cgt gac gaa 1575
Ala Arg Leu Cys Val Arg Lys Arg Ser Leu Arg Asp Glu
515 520 525
<210> 86
<211> 525
<212> PRT
<213> Gibberella fujikuroi
<400> 86
Met Ser Lys Ser Asn Ser Met Asn Ser Thr Ser His Glu Thr Leu Phe
1 5 10 15
Gln Gln Leu Val Leu Gly Leu Asp Arg Met Pro Leu Met Asp Val His
20 25 30
Trp Leu Ile Tyr Val Ala Phe Gly Ala Trp Leu Cys Ser Tyr Val Ile
35 40 45
His Val Leu Ser Ser Ser Ser Thr Val Lys Val Pro Val Val Gly Tyr
50 55 60
Arg Ser Val Phe Glu Pro Thr Trp Leu Leu Arg Leu Arg Phe Val Trp
65 70 75 80
Glu Gly Gly Ser Ile Ile Gly Gln Gly Tyr Asn Lys Phe Lys Asp Ser
85 90 95
Ile Phe Gln Val Arg Lys Leu Gly Thr Asp Ile Val Ile Ile Pro Pro
100 105 110
Asn Tyr Ile Asp Glu Val Arg Lys Leu Ser Gln Asp Lys Thr Arg Ser
115 120 125
Val Glu Pro Phe Ile Asn Asp Phe Ala Gly Gln Tyr Thr Arg Gly Met
130 135 140
Val Phe Leu Gln Ser Asp Leu Gln Asn Arg Val Ile Gln Gln Arg Leu
145 150 155 160
Thr Pro Lys Leu Val Ser Leu Thr Lys Val Met Lys Glu Glu Leu Asp
165 170 175
Tyr Ala Leu Thr Lys Glu Met Pro Asp Met Lys Asn Asp Glu Trp Val
180 185 190
Glu Val Asp Ile Ser Ser Ile Met Val Arg Leu Ile Ser Arg Ile Ser
195 200 205
Ala Arg Val Phe Leu Gly Pro Glu His Cys Arg Asn Gln Glu Trp Leu
210 215 220
Thr Thr Thr Ala Glu Tyr Ser Glu Ser Leu Phe Ile Thr Gly Phe Ile
225 230 235 240
Leu Arg Val Val Pro His Ile Leu Arg Pro Phe Ile Ala Pro Leu Leu
245 250 255
Pro Ser Tyr Arg Thr Leu Leu Arg Asn Val Ser Ser Gly Arg Arg Val
260 265 270
Ile Gly Asp Ile Ile Arg Ser Gln Gln Gly Asp Gly Asn Glu Asp Ile
275 280 285
Leu Ser Trp Met Arg Asp Ala Ala Thr Gly Glu Glu Lys Gln Ile Asp
290 295 300
Asn Ile Ala Gln Arg Met Leu Ile Leu Ser Leu Ala Ser Ile His Thr
305 310 315 320
Thr Ala Met Thr Met Thr His Ala Met Tyr Asp Leu Cys Ala Cys Pro
325 330 335
Glu Tyr Ile Glu Pro Leu Arg Asp Glu Val Lys Ser Val Val Gly Ala
340 345 350
Ser Gly Trp Asp Lys Thr Ala Leu Asn Arg Phe His Lys Leu Asp Ser
355 360 365
Phe Leu Lys Glu Ser Gln Arg Phe Asn Pro Val Phe Leu Leu Thr Phe
370 375 380
Asn Arg Ile Tyr His Gln Ser Met Thr Leu Ser Asp Gly Thr Asn Ile
385 390 395 400
Pro Ser Gly Thr Arg Ile Ala Val Pro Ser His Ala Met Leu Gln Asp
405 410 415
Ser Ala His Val Pro Gly Pro Thr Pro Pro Thr Glu Phe Asp Gly Phe
420 425 430
Arg Tyr Ser Lys Ile Arg Ser Asp Ser Asn Tyr Ala Gln Lys Tyr Leu
435 440 445
Phe Ser Met Thr Asp Ser Ser Asn Met Ala Phe Gly Tyr Gly Lys Tyr
450 455 460
Ala Cys Pro Gly Arg Phe Tyr Ala Ser Asn Glu Met Lys Leu Thr Leu
465 470 475 480
Ala Ile Leu Leu Leu Gln Phe Glu Phe Lys Leu Pro Asp Gly Lys Gly
485 490 495
Arg Pro Arg Asn Ile Thr Ile Asp Ser Asp Met Ile Pro Asp Pro Arg
500 505 510
Ala Arg Leu Cys Val Arg Lys Arg Ser Leu Arg Asp Glu
515 520 525
<210> 87
<211> 1419
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1419)
<400> 87
atg gcc act tct gac tcc atc gtt gat gac aga aag caa ttg cac gtt 48
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
gct act ttc cca tgg tta gct ttc ggt cac att ttg cca tac ttg caa 96
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Tyr Leu Gln
20 25 30
ttg tcc aaa ttg att gct gaa aag ggt cac aaa gtc tct ttc ttg tcc 144
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
acc acc aga aac atc caa aga tta tct tct cac att tct cca ttg atc 192
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
aac gtt gtc caa ttg act tta cca aga gtt caa gaa ttg cca gaa gat 240
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
gct gaa gct acc acc gat gtc cat cca gaa gat atc cca tac ttg aag 288
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Pro Tyr Leu Lys
85 90 95
aag gct tct gat ggt ttg caa cca gaa gtc act aga ttc ttg gaa caa 336
Lys Ala Ser Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
cac tct cca gac tgg att atc tac gac tac act cac tac tgg tta cca 384
His Ser Pro Asp Trp Ile Ile Tyr Asp Tyr Thr His Tyr Trp Leu Pro
115 120 125
tcc att gct gcc tct ttg ggt atc tct cgt gct cat ttc tcc gtt acc 432
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala His Phe Ser Val Thr
130 135 140
act cca tgg gct att gct tac atg ggt cca tct gct gat gct atg atc 480
Thr Pro Trp Ala Ile Ala Tyr Met Gly Pro Ser Ala Asp Ala Met Ile
145 150 155 160
aac ggt tct gat ggt aga acc act gtc gaa gac ttg acc act cca cca 528
Asn Gly Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
aaa tgg ttc cca ttc cca acc aag gtc tgt tgg aga aag cac gat ttg 576
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
gcc aga tta gtt cca tac aag gcc cca ggt atc tct gac ggt tac aga 624
Ala Arg Leu Val Pro Tyr Lys Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
atg ggt tta gtc ttg aag ggt tct gac tgt ttg ttg tcc aag tgt tac 672
Met Gly Leu Val Leu Lys Gly Ser Asp Cys Leu Leu Ser Lys Cys Tyr
210 215 220
cat gaa ttc ggt act caa tgg tta cct ttg ttg gaa act ttg cac caa 720
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
gtc cca gtt gtt cca gtt ggt ttg ttg cct cct gaa atc cca ggt gac 768
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp
245 250 255
gaa aag gac gaa acc tgg gtt tcc atc aag aaa tgg tta gat ggt aag 816
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
caa aag ggt tcc gtt gtc tac gtt gct ttg ggt tct gaa gtc ttg gtt 864
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Val Leu Val
275 280 285
tct caa act gaa gtt gtt gaa ttg gct ttg ggt ttg gaa ttg tcc ggt 912
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
cta cca ttt gtc tgg gct tac aga aag cca aag ggt cca gct aag tct 960
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
gac tct gtt gaa ttg cca gat ggt ttc gtc gaa aga acc aga gac aga 1008
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
ggt ttg gtc tgg act tcc tgg gct cca caa ttg aga att ttg tcc cac 1056
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
gaa tct gtt tgt ggt ttc ttg acc cac tgt ggt tct ggt tcc att gtc 1104
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
gaa ggt ttg atg ttc ggt cac cca ttg atc atg ttg cca atc ttc ggt 1152
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Gly
370 375 380
gac caa cca ttg aac gct aga tta ttg gaa gac aag caa gtc ggt att 1200
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
gaa atc cca aga aac gaa gaa gac ggt tgt ttg acc aag gaa tct gtt 1248
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
gcc cgt tct cta aga tct gtt gtt gtc gaa aag gaa ggt gaa atc tac 1296
Ala Arg Ser Leu Arg Ser Val Val Val Glu Lys Glu Gly Glu Ile Tyr
420 425 430
aag gct aac gct aga gaa ttg tcc aag atc tac aac gat acc aag gtc 1344
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
gaa aag gaa tat gtc tcc caa ttt gtt gac tac ttg gaa aag aac gct 1392
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala
450 455 460
cgt gcc gtt gcc att gac cac gaa agc 1419
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 88
<211> 473
<212> PRT
<213> Stevia rebaudiana
<400> 88
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Tyr Leu Gln
20 25 30
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Pro Tyr Leu Lys
85 90 95
Lys Ala Ser Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
His Ser Pro Asp Trp Ile Ile Tyr Asp Tyr Thr His Tyr Trp Leu Pro
115 120 125
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala His Phe Ser Val Thr
130 135 140
Thr Pro Trp Ala Ile Ala Tyr Met Gly Pro Ser Ala Asp Ala Met Ile
145 150 155 160
Asn Gly Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
Ala Arg Leu Val Pro Tyr Lys Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
Met Gly Leu Val Leu Lys Gly Ser Asp Cys Leu Leu Ser Lys Cys Tyr
210 215 220
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp
245 250 255
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Val Leu Val
275 280 285
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Gly
370 375 380
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
Ala Arg Ser Leu Arg Ser Val Val Val Glu Lys Glu Gly Glu Ile Tyr
420 425 430
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala
450 455 460
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 89
<211> 1575
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1575)
<400> 89
atg ggt ttg ttc cca tta gag gat tcc tac gcg ctg gtc ttt gaa gga 48
Met Gly Leu Phe Pro Leu Glu Asp Ser Tyr Ala Leu Val Phe Glu Gly
1 5 10 15
cta gca ata aca ctg gct ttg tac tat cta ctg tct ttc atc tac aaa 96
Leu Ala Ile Thr Leu Ala Leu Tyr Tyr Leu Leu Ser Phe Ile Tyr Lys
20 25 30
aca tct aaa aag aca tgt aca cct cct aaa gca tct ggt gaa atc att 144
Thr Ser Lys Lys Thr Cys Thr Pro Pro Lys Ala Ser Gly Glu Ile Ile
35 40 45
cca att aca gga atc ata ttg aat ctg cta tct ggc tca agt ggt cta 192
Pro Ile Thr Gly Ile Ile Leu Asn Leu Leu Ser Gly Ser Ser Gly Leu
50 55 60
cct att atc tta gca ctt gcc tct tta gca gac aga tgt ggt cct att 240
Pro Ile Ile Leu Ala Leu Ala Ser Leu Ala Asp Arg Cys Gly Pro Ile
65 70 75 80
ttc acc att agg ctg ggt att agg aga gtg cta gta gta tca aat tgg 288
Phe Thr Ile Arg Leu Gly Ile Arg Arg Val Leu Val Val Ser Asn Trp
85 90 95
gaa atc gct aag gag att ttc act acc cac gat ttg ata gtt tct aat 336
Glu Ile Ala Lys Glu Ile Phe Thr Thr His Asp Leu Ile Val Ser Asn
100 105 110
aga cca aaa tac tta gcc gct aag att ctt ggt ttc aat tat gtt tca 384
Arg Pro Lys Tyr Leu Ala Ala Lys Ile Leu Gly Phe Asn Tyr Val Ser
115 120 125
ttc tct ttc gct cca tac ggc cca tat tgg gtc gga atc aga aag att 432
Phe Ser Phe Ala Pro Tyr Gly Pro Tyr Trp Val Gly Ile Arg Lys Ile
130 135 140
att gct aca aaa cta atg tct tct tcc aga ctt cag aag ttg caa ttt 480
Ile Ala Thr Lys Leu Met Ser Ser Ser Arg Leu Gln Lys Leu Gln Phe
145 150 155 160
gta aga gtt ttt gaa cta gaa aac tct atg aaa tct atc aga gaa tca 528
Val Arg Val Phe Glu Leu Glu Asn Ser Met Lys Ser Ile Arg Glu Ser
165 170 175
tgg aag gag aaa aag gat gaa gag gga aag gta tta gtt gag atg aaa 576
Trp Lys Glu Lys Lys Asp Glu Glu Gly Lys Val Leu Val Glu Met Lys
180 185 190
aag tgg ttc tgg gaa ctg aat atg aac ata gtg tta agg aca gtt gct 624
Lys Trp Phe Trp Glu Leu Asn Met Asn Ile Val Leu Arg Thr Val Ala
195 200 205
ggt aaa caa tac act ggt aca gtt gat gat gcc gat gca aag cgt atc 672
Gly Lys Gln Tyr Thr Gly Thr Val Asp Asp Ala Asp Ala Lys Arg Ile
210 215 220
tcc gag tta ttc aga gaa tgg ttt cac tac act ggc aga ttt gtc gtt 720
Ser Glu Leu Phe Arg Glu Trp Phe His Tyr Thr Gly Arg Phe Val Val
225 230 235 240
gga gat gct ttt cct ttt cta ggt tgg ttg gac ctg ggc gga tac aaa 768
Gly Asp Ala Phe Pro Phe Leu Gly Trp Leu Asp Leu Gly Gly Tyr Lys
245 250 255
aag aca atg gaa tta gtt gct agt aga ttg gac tca atg gtc agt aaa 816
Lys Thr Met Glu Leu Val Ala Ser Arg Leu Asp Ser Met Val Ser Lys
260 265 270
tgg tta gat gag cat cgt aaa aag caa gct aac gat gac aaa aag gag 864
Trp Leu Asp Glu His Arg Lys Lys Gln Ala Asn Asp Asp Lys Lys Glu
275 280 285
gat atg gat ttc atg gat atc atg atc tcc atg aca gaa gca aat tca 912
Asp Met Asp Phe Met Asp Ile Met Ile Ser Met Thr Glu Ala Asn Ser
290 295 300
cca ctt gaa gga tac ggc act gat act att atc aag acc aca tgt atg 960
Pro Leu Glu Gly Tyr Gly Thr Asp Thr Ile Ile Lys Thr Thr Cys Met
305 310 315 320
act ttg att gtt tca gga gtt gat aca acc tca atc gta ctt act tgg 1008
Thr Leu Ile Val Ser Gly Val Asp Thr Thr Ser Ile Val Leu Thr Trp
325 330 335
gcc tta tca ctt ttg tta aac aac aga gat act ttg aaa aag gca caa 1056
Ala Leu Ser Leu Leu Leu Asn Asn Arg Asp Thr Leu Lys Lys Ala Gln
340 345 350
gag gaa tta gat atg tgc gta ggt aaa gga aga caa gtc aac gag tct 1104
Glu Glu Leu Asp Met Cys Val Gly Lys Gly Arg Gln Val Asn Glu Ser
355 360 365
gat ctt gtt aac ttg ata tac ttg gaa gca gtg ctt aaa gag gct tta 1152
Asp Leu Val Asn Leu Ile Tyr Leu Glu Ala Val Leu Lys Glu Ala Leu
370 375 380
aga ctt tac cca gca gcg ttc tta ggc gga cca aga gca ttc ttg gaa 1200
Arg Leu Tyr Pro Ala Ala Phe Leu Gly Gly Pro Arg Ala Phe Leu Glu
385 390 395 400
gat tgt act gtt gct ggt tat aga att cca aag ggc acc tgc ttg ttg 1248
Asp Cys Thr Val Ala Gly Tyr Arg Ile Pro Lys Gly Thr Cys Leu Leu
405 410 415
att aac atg tgg aaa ctg cat aga gat cca aac att tgg agt gat cct 1296
Ile Asn Met Trp Lys Leu His Arg Asp Pro Asn Ile Trp Ser Asp Pro
420 425 430
tgc gaa ttc aag cca gaa aga ttt ttg aca cct aat caa aag gat gtt 1344
Cys Glu Phe Lys Pro Glu Arg Phe Leu Thr Pro Asn Gln Lys Asp Val
435 440 445
gat gtg atc ggt atg gat ttc gaa ttg ata cca ttt ggt gcc ggc aga 1392
Asp Val Ile Gly Met Asp Phe Glu Leu Ile Pro Phe Gly Ala Gly Arg
450 455 460
aga tat tgt cca ggt act aga ttg gct tta cag atg ttg cat atc gta 1440
Arg Tyr Cys Pro Gly Thr Arg Leu Ala Leu Gln Met Leu His Ile Val
465 470 475 480
tta gcg aca ttg ctg caa aac ttc gaa atg tca aca cca aac gat gcg 1488
Leu Ala Thr Leu Leu Gln Asn Phe Glu Met Ser Thr Pro Asn Asp Ala
485 490 495
cca gtc gat atg act gct tct gtt ggc atg aca aat gcc aaa gca tca 1536
Pro Val Asp Met Thr Ala Ser Val Gly Met Thr Asn Ala Lys Ala Ser
500 505 510
cct tta gaa gtc ttg cta tca cct cgt gtt aaa tgg tcc 1575
Pro Leu Glu Val Leu Leu Ser Pro Arg Val Lys Trp Ser
515 520 525
<210> 90
<211> 525
<212> PRT
<213> Stevia rebaudiana
<400> 90
Met Gly Leu Phe Pro Leu Glu Asp Ser Tyr Ala Leu Val Phe Glu Gly
1 5 10 15
Leu Ala Ile Thr Leu Ala Leu Tyr Tyr Leu Leu Ser Phe Ile Tyr Lys
20 25 30
Thr Ser Lys Lys Thr Cys Thr Pro Pro Lys Ala Ser Gly Glu Ile Ile
35 40 45
Pro Ile Thr Gly Ile Ile Leu Asn Leu Leu Ser Gly Ser Ser Gly Leu
50 55 60
Pro Ile Ile Leu Ala Leu Ala Ser Leu Ala Asp Arg Cys Gly Pro Ile
65 70 75 80
Phe Thr Ile Arg Leu Gly Ile Arg Arg Val Leu Val Val Ser Asn Trp
85 90 95
Glu Ile Ala Lys Glu Ile Phe Thr Thr His Asp Leu Ile Val Ser Asn
100 105 110
Arg Pro Lys Tyr Leu Ala Ala Lys Ile Leu Gly Phe Asn Tyr Val Ser
115 120 125
Phe Ser Phe Ala Pro Tyr Gly Pro Tyr Trp Val Gly Ile Arg Lys Ile
130 135 140
Ile Ala Thr Lys Leu Met Ser Ser Ser Arg Leu Gln Lys Leu Gln Phe
145 150 155 160
Val Arg Val Phe Glu Leu Glu Asn Ser Met Lys Ser Ile Arg Glu Ser
165 170 175
Trp Lys Glu Lys Lys Asp Glu Glu Gly Lys Val Leu Val Glu Met Lys
180 185 190
Lys Trp Phe Trp Glu Leu Asn Met Asn Ile Val Leu Arg Thr Val Ala
195 200 205
Gly Lys Gln Tyr Thr Gly Thr Val Asp Asp Ala Asp Ala Lys Arg Ile
210 215 220
Ser Glu Leu Phe Arg Glu Trp Phe His Tyr Thr Gly Arg Phe Val Val
225 230 235 240
Gly Asp Ala Phe Pro Phe Leu Gly Trp Leu Asp Leu Gly Gly Tyr Lys
245 250 255
Lys Thr Met Glu Leu Val Ala Ser Arg Leu Asp Ser Met Val Ser Lys
260 265 270
Trp Leu Asp Glu His Arg Lys Lys Gln Ala Asn Asp Asp Lys Lys Glu
275 280 285
Asp Met Asp Phe Met Asp Ile Met Ile Ser Met Thr Glu Ala Asn Ser
290 295 300
Pro Leu Glu Gly Tyr Gly Thr Asp Thr Ile Ile Lys Thr Thr Cys Met
305 310 315 320
Thr Leu Ile Val Ser Gly Val Asp Thr Thr Ser Ile Val Leu Thr Trp
325 330 335
Ala Leu Ser Leu Leu Leu Asn Asn Arg Asp Thr Leu Lys Lys Ala Gln
340 345 350
Glu Glu Leu Asp Met Cys Val Gly Lys Gly Arg Gln Val Asn Glu Ser
355 360 365
Asp Leu Val Asn Leu Ile Tyr Leu Glu Ala Val Leu Lys Glu Ala Leu
370 375 380
Arg Leu Tyr Pro Ala Ala Phe Leu Gly Gly Pro Arg Ala Phe Leu Glu
385 390 395 400
Asp Cys Thr Val Ala Gly Tyr Arg Ile Pro Lys Gly Thr Cys Leu Leu
405 410 415
Ile Asn Met Trp Lys Leu His Arg Asp Pro Asn Ile Trp Ser Asp Pro
420 425 430
Cys Glu Phe Lys Pro Glu Arg Phe Leu Thr Pro Asn Gln Lys Asp Val
435 440 445
Asp Val Ile Gly Met Asp Phe Glu Leu Ile Pro Phe Gly Ala Gly Arg
450 455 460
Arg Tyr Cys Pro Gly Thr Arg Leu Ala Leu Gln Met Leu His Ile Val
465 470 475 480
Leu Ala Thr Leu Leu Gln Asn Phe Glu Met Ser Thr Pro Asn Asp Ala
485 490 495
Pro Val Asp Met Thr Ala Ser Val Gly Met Thr Asn Ala Lys Ala Ser
500 505 510
Pro Leu Glu Val Leu Leu Ser Pro Arg Val Lys Trp Ser
515 520 525
<210> 91
<211> 1428
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1428)
<400> 91
atg ata caa gtt tta act cca att cta ctc ttc ctc atc ttc ttc gtt 48
Met Ile Gln Val Leu Thr Pro Ile Leu Leu Phe Leu Ile Phe Phe Val
1 5 10 15
ttc tgg aaa gtc tac aaa cat caa aag act aaa atc aat cta cca cca 96
Phe Trp Lys Val Tyr Lys His Gln Lys Thr Lys Ile Asn Leu Pro Pro
20 25 30
ggt tcc ttc ggc tgg cca ttt ttg ggt gaa acc tta gcc tta ctt aga 144
Gly Ser Phe Gly Trp Pro Phe Leu Gly Glu Thr Leu Ala Leu Leu Arg
35 40 45
gca ggc tgg gat tct gag cca gaa aga ttc gta aga gag cgt atc aaa 192
Ala Gly Trp Asp Ser Glu Pro Glu Arg Phe Val Arg Glu Arg Ile Lys
50 55 60
aag cat gga tct cca ctt gtt ttc aag aca tca cta ttt gga gac aga 240
Lys His Gly Ser Pro Leu Val Phe Lys Thr Ser Leu Phe Gly Asp Arg
65 70 75 80
ttc gct gtt ctt tgc ggt cca gct ggt aat aag ttt ttg ttc tgc aac 288
Phe Ala Val Leu Cys Gly Pro Ala Gly Asn Lys Phe Leu Phe Cys Asn
85 90 95
gaa aac aaa tta gtg gca tct tgg tgg cca gtc cct gta agg aag ttg 336
Glu Asn Lys Leu Val Ala Ser Trp Trp Pro Val Pro Val Arg Lys Leu
100 105 110
ttc ggt aaa agt tta ctc aca ata aga gga gat gaa gca aaa tgg atg 384
Phe Gly Lys Ser Leu Leu Thr Ile Arg Gly Asp Glu Ala Lys Trp Met
115 120 125
aga aaa atg cta ttg tct tac ttg ggt cca gat gca ttt gcc aca cat 432
Arg Lys Met Leu Leu Ser Tyr Leu Gly Pro Asp Ala Phe Ala Thr His
130 135 140
tat gcc gtt act atg gat gtt gta aca cgt aga cat att gat gtc cat 480
Tyr Ala Val Thr Met Asp Val Val Thr Arg Arg His Ile Asp Val His
145 150 155 160
tgg agg ggc aag gag gaa gtt aat gta ttt caa aca gtt aag ttg tac 528
Trp Arg Gly Lys Glu Glu Val Asn Val Phe Gln Thr Val Lys Leu Tyr
165 170 175
gca ttc gaa tta gct tgt aga tta ttc atg aac cta gat gac cca aac 576
Ala Phe Glu Leu Ala Cys Arg Leu Phe Met Asn Leu Asp Asp Pro Asn
180 185 190
cac atc gcg aaa ctc ggt agt ctt ttc aac att ttc ctc aaa ggg atc 624
His Ile Ala Lys Leu Gly Ser Leu Phe Asn Ile Phe Leu Lys Gly Ile
195 200 205
atc gag ctt cct ata gac gtt cct gga act aga ttt tac tcc agt aaa 672
Ile Glu Leu Pro Ile Asp Val Pro Gly Thr Arg Phe Tyr Ser Ser Lys
210 215 220
aag gcc gca gct gcc att aga att gaa ttg aaa aag ctc att aaa gct 720
Lys Ala Ala Ala Ala Ile Arg Ile Glu Leu Lys Lys Leu Ile Lys Ala
225 230 235 240
aga aaa ctc gaa ttg aag gag ggt aag gcg tct tct tca cag gac ttg 768
Arg Lys Leu Glu Leu Lys Glu Gly Lys Ala Ser Ser Ser Gln Asp Leu
245 250 255
ctt tct cat cta tta aca tca cct gat gag aat ggg atg ttc ttg aca 816
Leu Ser His Leu Leu Thr Ser Pro Asp Glu Asn Gly Met Phe Leu Thr
260 265 270
gaa gag gaa ata gtc gat aac att cta ctt ttg tta ttc gct ggt cac 864
Glu Glu Glu Ile Val Asp Asn Ile Leu Leu Leu Leu Phe Ala Gly His
275 280 285
gat acc tct gca cta tca ata aca ctt ttg atg aaa acc tta ggt gaa 912
Asp Thr Ser Ala Leu Ser Ile Thr Leu Leu Met Lys Thr Leu Gly Glu
290 295 300
cac agt gat gtg tac gac aag gtt ttg aag gaa caa tta gaa att tcc 960
His Ser Asp Val Tyr Asp Lys Val Leu Lys Glu Gln Leu Glu Ile Ser
305 310 315 320
aaa aca aag gag gct tgg gaa tca cta aag tgg gaa gat atc cag aag 1008
Lys Thr Lys Glu Ala Trp Glu Ser Leu Lys Trp Glu Asp Ile Gln Lys
325 330 335
atg aag tac tca tgg tca gta atc tgt gaa gtc atg aga ttg aat cct 1056
Met Lys Tyr Ser Trp Ser Val Ile Cys Glu Val Met Arg Leu Asn Pro
340 345 350
cct gtc ata ggg aca tac aga gag gcg ttg gtt gat atc gac tat gct 1104
Pro Val Ile Gly Thr Tyr Arg Glu Ala Leu Val Asp Ile Asp Tyr Ala
355 360 365
ggt tac act atc cca aaa gga tgg aag ttg cat tgg tca gct gtt tct 1152
Gly Tyr Thr Ile Pro Lys Gly Trp Lys Leu His Trp Ser Ala Val Ser
370 375 380
act caa aga gat gaa gcc aat ttc gaa gat gta act aga ttc gat cca 1200
Thr Gln Arg Asp Glu Ala Asn Phe Glu Asp Val Thr Arg Phe Asp Pro
385 390 395 400
tcc aga ttt gaa ggg gca ggc cct act cca ttc aca ttt gtg cct ttc 1248
Ser Arg Phe Glu Gly Ala Gly Pro Thr Pro Phe Thr Phe Val Pro Phe
405 410 415
ggt gga ggt cct aga atg tgt tta ggc aaa gag ttt gcc agg tta gaa 1296
Gly Gly Gly Pro Arg Met Cys Leu Gly Lys Glu Phe Ala Arg Leu Glu
420 425 430
gtg tta gca ttt ctc cac aac att gtt acc aac ttt aag tgg gat ctt 1344
Val Leu Ala Phe Leu His Asn Ile Val Thr Asn Phe Lys Trp Asp Leu
435 440 445
cta atc cct gat gag aag atc gaa tat gat cca atg gct act cca gct 1392
Leu Ile Pro Asp Glu Lys Ile Glu Tyr Asp Pro Met Ala Thr Pro Ala
450 455 460
aag ggc ttg cca att aga ctt cat cca cac caa gtc 1428
Lys Gly Leu Pro Ile Arg Leu His Pro His Gln Val
465 470 475
<210> 92
<211> 476
<212> PRT
<213> Stevia rebaudiana
<400> 92
Met Ile Gln Val Leu Thr Pro Ile Leu Leu Phe Leu Ile Phe Phe Val
1 5 10 15
Phe Trp Lys Val Tyr Lys His Gln Lys Thr Lys Ile Asn Leu Pro Pro
20 25 30
Gly Ser Phe Gly Trp Pro Phe Leu Gly Glu Thr Leu Ala Leu Leu Arg
35 40 45
Ala Gly Trp Asp Ser Glu Pro Glu Arg Phe Val Arg Glu Arg Ile Lys
50 55 60
Lys His Gly Ser Pro Leu Val Phe Lys Thr Ser Leu Phe Gly Asp Arg
65 70 75 80
Phe Ala Val Leu Cys Gly Pro Ala Gly Asn Lys Phe Leu Phe Cys Asn
85 90 95
Glu Asn Lys Leu Val Ala Ser Trp Trp Pro Val Pro Val Arg Lys Leu
100 105 110
Phe Gly Lys Ser Leu Leu Thr Ile Arg Gly Asp Glu Ala Lys Trp Met
115 120 125
Arg Lys Met Leu Leu Ser Tyr Leu Gly Pro Asp Ala Phe Ala Thr His
130 135 140
Tyr Ala Val Thr Met Asp Val Val Thr Arg Arg His Ile Asp Val His
145 150 155 160
Trp Arg Gly Lys Glu Glu Val Asn Val Phe Gln Thr Val Lys Leu Tyr
165 170 175
Ala Phe Glu Leu Ala Cys Arg Leu Phe Met Asn Leu Asp Asp Pro Asn
180 185 190
His Ile Ala Lys Leu Gly Ser Leu Phe Asn Ile Phe Leu Lys Gly Ile
195 200 205
Ile Glu Leu Pro Ile Asp Val Pro Gly Thr Arg Phe Tyr Ser Ser Lys
210 215 220
Lys Ala Ala Ala Ala Ile Arg Ile Glu Leu Lys Lys Leu Ile Lys Ala
225 230 235 240
Arg Lys Leu Glu Leu Lys Glu Gly Lys Ala Ser Ser Ser Gln Asp Leu
245 250 255
Leu Ser His Leu Leu Thr Ser Pro Asp Glu Asn Gly Met Phe Leu Thr
260 265 270
Glu Glu Glu Ile Val Asp Asn Ile Leu Leu Leu Leu Phe Ala Gly His
275 280 285
Asp Thr Ser Ala Leu Ser Ile Thr Leu Leu Met Lys Thr Leu Gly Glu
290 295 300
His Ser Asp Val Tyr Asp Lys Val Leu Lys Glu Gln Leu Glu Ile Ser
305 310 315 320
Lys Thr Lys Glu Ala Trp Glu Ser Leu Lys Trp Glu Asp Ile Gln Lys
325 330 335
Met Lys Tyr Ser Trp Ser Val Ile Cys Glu Val Met Arg Leu Asn Pro
340 345 350
Pro Val Ile Gly Thr Tyr Arg Glu Ala Leu Val Asp Ile Asp Tyr Ala
355 360 365
Gly Tyr Thr Ile Pro Lys Gly Trp Lys Leu His Trp Ser Ala Val Ser
370 375 380
Thr Gln Arg Asp Glu Ala Asn Phe Glu Asp Val Thr Arg Phe Asp Pro
385 390 395 400
Ser Arg Phe Glu Gly Ala Gly Pro Thr Pro Phe Thr Phe Val Pro Phe
405 410 415
Gly Gly Gly Pro Arg Met Cys Leu Gly Lys Glu Phe Ala Arg Leu Glu
420 425 430
Val Leu Ala Phe Leu His Asn Ile Val Thr Asn Phe Lys Trp Asp Leu
435 440 445
Leu Ile Pro Asp Glu Lys Ile Glu Tyr Asp Pro Met Ala Thr Pro Ala
450 455 460
Lys Gly Leu Pro Ile Arg Leu His Pro His Gln Val
465 470 475
<210> 93
<211> 1575
<212> DNA
<213> Arabidopsis thaliana
<220>
<221> CDS
<222> (1)..(1575)
<400> 93
atg gag tct tta gtg gtt cat aca gta aat gct atc tgg tgt att gta 48
Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val
1 5 10 15
atc gtc ggg att ttc tca gtt ggt tat cac gtt tac ggt aga gct gtg 96
Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val
20 25 30
gtc gaa caa tgg aga atg aga aga tca ctg aag cta caa ggt gtt aaa 144
Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys
35 40 45
ggc cca cca cca tcc atc ttc aat ggt aac gtt tca gaa atg caa cgt 192
Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg
50 55 60
atc caa tcc gaa gct aaa cac tgc tct ggc gat aac att atc tca cat 240
Ile Gln Ser Glu Ala Lys His Cys Ser Gly Asp Asn Ile Ile Ser His
65 70 75 80
gat tat tct tct tca tta ttc cca cac ttc gat cac tgg aga aaa cag 288
Asp Tyr Ser Ser Ser Leu Phe Pro His Phe Asp His Trp Arg Lys Gln
85 90 95
tac ggc aga atc tac aca tac tct act gga tta aag caa cac ttg tac 336
Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Lys Gln His Leu Tyr
100 105 110
atc aat cat cca gaa atg gtg aag gag cta tct cag act aac aca ttg 384
Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Thr Leu
115 120 125
aac ttg ggt aga atc acc cat ata acc aaa aga ttg aat cct atc tta 432
Asn Leu Gly Arg Ile Thr His Ile Thr Lys Arg Leu Asn Pro Ile Leu
130 135 140
ggt aac gga atc ata acc tct aat ggt cct cat tgg gcc cat cag cgt 480
Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg
145 150 155 160
aga att atc gcc tac gag ttt act cat gat aag atc aag ggt atg gtt 528
Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Ile Lys Gly Met Val
165 170 175
ggt ttg atg gtt gag tct gct atg cct atg ttg aat aag tgg gag gag 576
Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu
180 185 190
atg gta aag aga ggc gga gaa atg gga tgc gac ata aga gtt gat gag 624
Met Val Lys Arg Gly Gly Glu Met Gly Cys Asp Ile Arg Val Asp Glu
195 200 205
gac ttg aaa gat gtt tca gca gat gtg att gca aaa gcc tgt ttc gga 672
Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly
210 215 220
tcc tca ttt tct aaa ggt aag gct att ttc tct atg ata aga gat ttg 720
Ser Ser Phe Ser Lys Gly Lys Ala Ile Phe Ser Met Ile Arg Asp Leu
225 230 235 240
ctt aca gct atc aca aag aga agt gtt cta ttc aga ttc aac gga ttc 768
Leu Thr Ala Ile Thr Lys Arg Ser Val Leu Phe Arg Phe Asn Gly Phe
245 250 255
act gat atg gtc ttt ggg agt aaa aag cat ggt gac gtt gat ata gac 816
Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp
260 265 270
gct tta gaa atg gaa ttg gaa tca tcc att tgg gaa act gtc aag gaa 864
Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu
275 280 285
cgt gaa ata gaa tgt aaa gat act cac aaa aag gat ctg atg caa ttg 912
Arg Glu Ile Glu Cys Lys Asp Thr His Lys Lys Asp Leu Met Gln Leu
290 295 300
att ttg gaa ggg gca atg cgt tca tgt gac ggt aac ctt tgg gat aaa 960
Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys
305 310 315 320
tca gca tat aga aga ttt gtt gta gat aat tgt aaa tct atc tac ttc 1008
Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Phe
325 330 335
gca ggg cat gat agt aca gct gtc tca gtg tca tgg tgt ttg atg tta 1056
Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu
340 345 350
ctg gcc cta aac cca tca tgg caa gtt aag atc cgt gat gaa att ctg 1104
Leu Ala Leu Asn Pro Ser Trp Gln Val Lys Ile Arg Asp Glu Ile Leu
355 360 365
tct tct tgc aaa aat ggt att cca gat gcc gaa agt atc cca aac ctt 1152
Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu
370 375 380
aaa aca gtg act atg gtt att caa gag aca atg aga tta tac cct cca 1200
Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro
385 390 395 400
gca cca atc gtc ggg aga gaa gcc tct aaa gat atc aga ttg ggc gat 1248
Ala Pro Ile Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp
405 410 415
cta gtt gtt cct aaa ggc gtc tgt ata tgg aca cta ata cca gct tta 1296
Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu
420 425 430
cac aga gat cct gag att tgg gga cca gat gca aac gat ttc aaa cca 1344
His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro
435 440 445
gaa aga ttt tct gaa gga att tca aag gct tgt aag tat cct caa agt 1392
Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ser
450 455 460
tac att cca ttt ggt ctg ggt cct aga aca tgc gtt ggt aaa aac ttt 1440
Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe
465 470 475 480
ggc atg atg gaa gta aag gtt ctt gtt tcc ctg att gtc tcc aag ttc 1488
Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe
485 490 495
tct ttc act cta tct cct acc tac caa cat agt cct agt cac aaa ctt 1536
Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu
500 505 510
tta gta gaa cca caa cat ggg gtg gta att aga gtg gtt 1575
Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val
515 520 525
<210> 94
<211> 525
<212> PRT
<213> Arabidopsis thaliana
<400> 94
Met Glu Ser Leu Val Val His Thr Val Asn Ala Ile Trp Cys Ile Val
1 5 10 15
Ile Val Gly Ile Phe Ser Val Gly Tyr His Val Tyr Gly Arg Ala Val
20 25 30
Val Glu Gln Trp Arg Met Arg Arg Ser Leu Lys Leu Gln Gly Val Lys
35 40 45
Gly Pro Pro Pro Ser Ile Phe Asn Gly Asn Val Ser Glu Met Gln Arg
50 55 60
Ile Gln Ser Glu Ala Lys His Cys Ser Gly Asp Asn Ile Ile Ser His
65 70 75 80
Asp Tyr Ser Ser Ser Leu Phe Pro His Phe Asp His Trp Arg Lys Gln
85 90 95
Tyr Gly Arg Ile Tyr Thr Tyr Ser Thr Gly Leu Lys Gln His Leu Tyr
100 105 110
Ile Asn His Pro Glu Met Val Lys Glu Leu Ser Gln Thr Asn Thr Leu
115 120 125
Asn Leu Gly Arg Ile Thr His Ile Thr Lys Arg Leu Asn Pro Ile Leu
130 135 140
Gly Asn Gly Ile Ile Thr Ser Asn Gly Pro His Trp Ala His Gln Arg
145 150 155 160
Arg Ile Ile Ala Tyr Glu Phe Thr His Asp Lys Ile Lys Gly Met Val
165 170 175
Gly Leu Met Val Glu Ser Ala Met Pro Met Leu Asn Lys Trp Glu Glu
180 185 190
Met Val Lys Arg Gly Gly Glu Met Gly Cys Asp Ile Arg Val Asp Glu
195 200 205
Asp Leu Lys Asp Val Ser Ala Asp Val Ile Ala Lys Ala Cys Phe Gly
210 215 220
Ser Ser Phe Ser Lys Gly Lys Ala Ile Phe Ser Met Ile Arg Asp Leu
225 230 235 240
Leu Thr Ala Ile Thr Lys Arg Ser Val Leu Phe Arg Phe Asn Gly Phe
245 250 255
Thr Asp Met Val Phe Gly Ser Lys Lys His Gly Asp Val Asp Ile Asp
260 265 270
Ala Leu Glu Met Glu Leu Glu Ser Ser Ile Trp Glu Thr Val Lys Glu
275 280 285
Arg Glu Ile Glu Cys Lys Asp Thr His Lys Lys Asp Leu Met Gln Leu
290 295 300
Ile Leu Glu Gly Ala Met Arg Ser Cys Asp Gly Asn Leu Trp Asp Lys
305 310 315 320
Ser Ala Tyr Arg Arg Phe Val Val Asp Asn Cys Lys Ser Ile Tyr Phe
325 330 335
Ala Gly His Asp Ser Thr Ala Val Ser Val Ser Trp Cys Leu Met Leu
340 345 350
Leu Ala Leu Asn Pro Ser Trp Gln Val Lys Ile Arg Asp Glu Ile Leu
355 360 365
Ser Ser Cys Lys Asn Gly Ile Pro Asp Ala Glu Ser Ile Pro Asn Leu
370 375 380
Lys Thr Val Thr Met Val Ile Gln Glu Thr Met Arg Leu Tyr Pro Pro
385 390 395 400
Ala Pro Ile Val Gly Arg Glu Ala Ser Lys Asp Ile Arg Leu Gly Asp
405 410 415
Leu Val Val Pro Lys Gly Val Cys Ile Trp Thr Leu Ile Pro Ala Leu
420 425 430
His Arg Asp Pro Glu Ile Trp Gly Pro Asp Ala Asn Asp Phe Lys Pro
435 440 445
Glu Arg Phe Ser Glu Gly Ile Ser Lys Ala Cys Lys Tyr Pro Gln Ser
450 455 460
Tyr Ile Pro Phe Gly Leu Gly Pro Arg Thr Cys Val Gly Lys Asn Phe
465 470 475 480
Gly Met Met Glu Val Lys Val Leu Val Ser Leu Ile Val Ser Lys Phe
485 490 495
Ser Phe Thr Leu Ser Pro Thr Tyr Gln His Ser Pro Ser His Lys Leu
500 505 510
Leu Val Glu Pro Gln His Gly Val Val Ile Arg Val Val
515 520 525
<210> 95
<211> 1587
<212> DNA
<213> Vitis vinifera
<220>
<221> CDS
<222> (1)..(1587)
<400> 95
atg tac ttc cta cta caa tac ctc aac atc aca acc gtt ggt gtc ttt 48
Met Tyr Phe Leu Leu Gln Tyr Leu Asn Ile Thr Thr Val Gly Val Phe
1 5 10 15
gcc aca ttg ttt ctc tct tat tgt tta ctt ctc tgg aga agt aga gcg 96
Ala Thr Leu Phe Leu Ser Tyr Cys Leu Leu Leu Trp Arg Ser Arg Ala
20 25 30
ggt aac aaa aag att gcc cca gaa gct gcc gct gca tgg cct att atc 144
Gly Asn Lys Lys Ile Ala Pro Glu Ala Ala Ala Ala Trp Pro Ile Ile
35 40 45
ggc cac ctc cac tta ctt gca ggt gga tcc cat caa cta cca cat att 192
Gly His Leu His Leu Leu Ala Gly Gly Ser His Gln Leu Pro His Ile
50 55 60
aca ttg ggt aac atg gca gat aag tac ggt cct gta ttc aca atc aga 240
Thr Leu Gly Asn Met Ala Asp Lys Tyr Gly Pro Val Phe Thr Ile Arg
65 70 75 80
ata ggc ttg cat aga gct gta gtt gtc tca tct tgg gaa atg gca aag 288
Ile Gly Leu His Arg Ala Val Val Val Ser Ser Trp Glu Met Ala Lys
85 90 95
gaa tgt tca aca gct aat gat caa gtg tct tct tca aga cct gaa cta 336
Glu Cys Ser Thr Ala Asn Asp Gln Val Ser Ser Ser Arg Pro Glu Leu
100 105 110
tta gct tct aag ttg ttg ggt tat aac tac gcc atg ttt ggt ttt tca 384
Leu Ala Ser Lys Leu Leu Gly Tyr Asn Tyr Ala Met Phe Gly Phe Ser
115 120 125
cca tac ggt tca tac tgg aga gaa atg aga aag atc atc tct ctc gaa 432
Pro Tyr Gly Ser Tyr Trp Arg Glu Met Arg Lys Ile Ile Ser Leu Glu
130 135 140
tta cta tct aat tcc aga ttg gaa cta ttg aaa gat gtt aga gcc tca 480
Leu Leu Ser Asn Ser Arg Leu Glu Leu Leu Lys Asp Val Arg Ala Ser
145 150 155 160
gaa gtt gtc aca tct att aag gaa cta tac aaa ttg tgg gcg gaa aag 528
Glu Val Val Thr Ser Ile Lys Glu Leu Tyr Lys Leu Trp Ala Glu Lys
165 170 175
aag aat gag tca gga ttg gtt tct gtc gag atg aaa caa tgg ttc gga 576
Lys Asn Glu Ser Gly Leu Val Ser Val Glu Met Lys Gln Trp Phe Gly
180 185 190
gat ttg act tta aac gtg atc ttg aga atg gtg gct ggt aaa aga tac 624
Asp Leu Thr Leu Asn Val Ile Leu Arg Met Val Ala Gly Lys Arg Tyr
195 200 205
ttc tcc gcg agt gac gct tca gaa aac aaa cag gcc cag cgt tgt aga 672
Phe Ser Ala Ser Asp Ala Ser Glu Asn Lys Gln Ala Gln Arg Cys Arg
210 215 220
aga gtc ttc aga gaa ttc ttc cat ctc tcc ggc ttg ttt gtg gtt gct 720
Arg Val Phe Arg Glu Phe Phe His Leu Ser Gly Leu Phe Val Val Ala
225 230 235 240
gat gct ata cct ttt ctt gga tgg ctc gat tgg gga aga cac gag aag 768
Asp Ala Ile Pro Phe Leu Gly Trp Leu Asp Trp Gly Arg His Glu Lys
245 250 255
acc ttg aaa aag acc gcc ata gaa atg gat tcc atc gcc cag gag tgg 816
Thr Leu Lys Lys Thr Ala Ile Glu Met Asp Ser Ile Ala Gln Glu Trp
260 265 270
ctt gag gaa cat aga cgt aga aaa gat tct gga gat gat aat tct acc 864
Leu Glu Glu His Arg Arg Arg Lys Asp Ser Gly Asp Asp Asn Ser Thr
275 280 285
caa gat ttc atg gac gtt atg caa tct gtg cta gat ggc aaa aat cta 912
Gln Asp Phe Met Asp Val Met Gln Ser Val Leu Asp Gly Lys Asn Leu
290 295 300
ggc gga tac gat gct gat acg att aac aag gct aca tgc tta act ctt 960
Gly Gly Tyr Asp Ala Asp Thr Ile Asn Lys Ala Thr Cys Leu Thr Leu
305 310 315 320
ata tca ggt ggc agt gat act act gta gtt tct ttg aca tgg gct ctt 1008
Ile Ser Gly Gly Ser Asp Thr Thr Val Val Ser Leu Thr Trp Ala Leu
325 330 335
agt ctt gtg tta aac aat aga gat act ttg aaa aag gca cag gaa gag 1056
Ser Leu Val Leu Asn Asn Arg Asp Thr Leu Lys Lys Ala Gln Glu Glu
340 345 350
tta gac atc caa gtc ggt aag gaa aga ttg gtt aac gag caa gac atc 1104
Leu Asp Ile Gln Val Gly Lys Glu Arg Leu Val Asn Glu Gln Asp Ile
355 360 365
agt aag tta gtt tac ttg caa gca ata gta aaa gag aca ctc aga ctt 1152
Ser Lys Leu Val Tyr Leu Gln Ala Ile Val Lys Glu Thr Leu Arg Leu
370 375 380
tat cca cca ggt cct ttg ggt ggt ttg aga caa ttc act gaa gat tgt 1200
Tyr Pro Pro Gly Pro Leu Gly Gly Leu Arg Gln Phe Thr Glu Asp Cys
385 390 395 400
aca cta ggt ggc tat cac gtt tca aaa gga act aga tta atc atg aac 1248
Thr Leu Gly Gly Tyr His Val Ser Lys Gly Thr Arg Leu Ile Met Asn
405 410 415
tta tcc aag att caa aaa gat cca cgt att tgg tct gat cct act gaa 1296
Leu Ser Lys Ile Gln Lys Asp Pro Arg Ile Trp Ser Asp Pro Thr Glu
420 425 430
ttc caa cca gag aga ttc ctt acg act cat aaa gat gtc gat cca cgt 1344
Phe Gln Pro Glu Arg Phe Leu Thr Thr His Lys Asp Val Asp Pro Arg
435 440 445
ggt aaa cac ttt gaa ttc att cca ttc ggt gca gga aga cgt gca tgt 1392
Gly Lys His Phe Glu Phe Ile Pro Phe Gly Ala Gly Arg Arg Ala Cys
450 455 460
cct ggt atc aca ttc gga tta caa gta cta cat cta aca ttg gca tct 1440
Pro Gly Ile Thr Phe Gly Leu Gln Val Leu His Leu Thr Leu Ala Ser
465 470 475 480
ttc ttg cat gcg ttt gaa ttt tca aca cca tca aat gag cag gtt aac 1488
Phe Leu His Ala Phe Glu Phe Ser Thr Pro Ser Asn Glu Gln Val Asn
485 490 495
atg aga gaa tca tta ggt ctt acg aat atg aaa tct acc cca tta gaa 1536
Met Arg Glu Ser Leu Gly Leu Thr Asn Met Lys Ser Thr Pro Leu Glu
500 505 510
gtt ttg att tct cca aga cta tcc ctt aat tgc ttc aac ctt atg aaa 1584
Val Leu Ile Ser Pro Arg Leu Ser Leu Asn Cys Phe Asn Leu Met Lys
515 520 525
att 1587
Ile
<210> 96
<211> 529
<212> PRT
<213> Vitis vinifera
<400> 96
Met Tyr Phe Leu Leu Gln Tyr Leu Asn Ile Thr Thr Val Gly Val Phe
1 5 10 15
Ala Thr Leu Phe Leu Ser Tyr Cys Leu Leu Leu Trp Arg Ser Arg Ala
20 25 30
Gly Asn Lys Lys Ile Ala Pro Glu Ala Ala Ala Ala Trp Pro Ile Ile
35 40 45
Gly His Leu His Leu Leu Ala Gly Gly Ser His Gln Leu Pro His Ile
50 55 60
Thr Leu Gly Asn Met Ala Asp Lys Tyr Gly Pro Val Phe Thr Ile Arg
65 70 75 80
Ile Gly Leu His Arg Ala Val Val Val Ser Ser Trp Glu Met Ala Lys
85 90 95
Glu Cys Ser Thr Ala Asn Asp Gln Val Ser Ser Ser Arg Pro Glu Leu
100 105 110
Leu Ala Ser Lys Leu Leu Gly Tyr Asn Tyr Ala Met Phe Gly Phe Ser
115 120 125
Pro Tyr Gly Ser Tyr Trp Arg Glu Met Arg Lys Ile Ile Ser Leu Glu
130 135 140
Leu Leu Ser Asn Ser Arg Leu Glu Leu Leu Lys Asp Val Arg Ala Ser
145 150 155 160
Glu Val Val Thr Ser Ile Lys Glu Leu Tyr Lys Leu Trp Ala Glu Lys
165 170 175
Lys Asn Glu Ser Gly Leu Val Ser Val Glu Met Lys Gln Trp Phe Gly
180 185 190
Asp Leu Thr Leu Asn Val Ile Leu Arg Met Val Ala Gly Lys Arg Tyr
195 200 205
Phe Ser Ala Ser Asp Ala Ser Glu Asn Lys Gln Ala Gln Arg Cys Arg
210 215 220
Arg Val Phe Arg Glu Phe Phe His Leu Ser Gly Leu Phe Val Val Ala
225 230 235 240
Asp Ala Ile Pro Phe Leu Gly Trp Leu Asp Trp Gly Arg His Glu Lys
245 250 255
Thr Leu Lys Lys Thr Ala Ile Glu Met Asp Ser Ile Ala Gln Glu Trp
260 265 270
Leu Glu Glu His Arg Arg Arg Lys Asp Ser Gly Asp Asp Asn Ser Thr
275 280 285
Gln Asp Phe Met Asp Val Met Gln Ser Val Leu Asp Gly Lys Asn Leu
290 295 300
Gly Gly Tyr Asp Ala Asp Thr Ile Asn Lys Ala Thr Cys Leu Thr Leu
305 310 315 320
Ile Ser Gly Gly Ser Asp Thr Thr Val Val Ser Leu Thr Trp Ala Leu
325 330 335
Ser Leu Val Leu Asn Asn Arg Asp Thr Leu Lys Lys Ala Gln Glu Glu
340 345 350
Leu Asp Ile Gln Val Gly Lys Glu Arg Leu Val Asn Glu Gln Asp Ile
355 360 365
Ser Lys Leu Val Tyr Leu Gln Ala Ile Val Lys Glu Thr Leu Arg Leu
370 375 380
Tyr Pro Pro Gly Pro Leu Gly Gly Leu Arg Gln Phe Thr Glu Asp Cys
385 390 395 400
Thr Leu Gly Gly Tyr His Val Ser Lys Gly Thr Arg Leu Ile Met Asn
405 410 415
Leu Ser Lys Ile Gln Lys Asp Pro Arg Ile Trp Ser Asp Pro Thr Glu
420 425 430
Phe Gln Pro Glu Arg Phe Leu Thr Thr His Lys Asp Val Asp Pro Arg
435 440 445
Gly Lys His Phe Glu Phe Ile Pro Phe Gly Ala Gly Arg Arg Ala Cys
450 455 460
Pro Gly Ile Thr Phe Gly Leu Gln Val Leu His Leu Thr Leu Ala Ser
465 470 475 480
Phe Leu His Ala Phe Glu Phe Ser Thr Pro Ser Asn Glu Gln Val Asn
485 490 495
Met Arg Glu Ser Leu Gly Leu Thr Asn Met Lys Ser Thr Pro Leu Glu
500 505 510
Val Leu Ile Ser Pro Arg Leu Ser Leu Asn Cys Phe Asn Leu Met Lys
515 520 525
Ile
<210> 97
<211> 1437
<212> DNA
<213> Medicago truncatula
<220>
<221> CDS
<222> (1)..(1437)
<400> 97
atg gaa cct aac ttt tac ttg tca tta cta ttg ttg ttc gtg acc ttc 48
Met Glu Pro Asn Phe Tyr Leu Ser Leu Leu Leu Leu Phe Val Thr Phe
1 5 10 15
att tct tta agt ctg ttt ttc atc ttt tac aaa caa aag tcc cca ttg 96
Ile Ser Leu Ser Leu Phe Phe Ile Phe Tyr Lys Gln Lys Ser Pro Leu
20 25 30
aat ttg cca cca ggg aaa atg ggt tac cct atc ata ggt gaa agt tta 144
Asn Leu Pro Pro Gly Lys Met Gly Tyr Pro Ile Ile Gly Glu Ser Leu
35 40 45
gaa ttc cta tcc aca ggc tgg aag gga cat cct gaa aag ttc ata ttt 192
Glu Phe Leu Ser Thr Gly Trp Lys Gly His Pro Glu Lys Phe Ile Phe
50 55 60
gat aga atg cgt aag tac agt agt gag tta ttc aag act tct att gta 240
Asp Arg Met Arg Lys Tyr Ser Ser Glu Leu Phe Lys Thr Ser Ile Val
65 70 75 80
ggc gaa tcc aca gtt gtt tgc tgt ggg gca gct agt aac aaa ttc cta 288
Gly Glu Ser Thr Val Val Cys Cys Gly Ala Ala Ser Asn Lys Phe Leu
85 90 95
ttc tct aac gaa aac aaa ctg gta act gcc tgg tgg cca gat tct gtt 336
Phe Ser Asn Glu Asn Lys Leu Val Thr Ala Trp Trp Pro Asp Ser Val
100 105 110
aac aaa atc ttc cca aca act tca ctg gat tct aat ttg aag gag gaa 384
Asn Lys Ile Phe Pro Thr Thr Ser Leu Asp Ser Asn Leu Lys Glu Glu
115 120 125
tct ata aag atg aga aag ttg ctg cca cag ttc ttc aaa cca gaa gca 432
Ser Ile Lys Met Arg Lys Leu Leu Pro Gln Phe Phe Lys Pro Glu Ala
130 135 140
ctt caa aga tac gtc ggc gtt atg gat gta atc gca caa aga cat ttt 480
Leu Gln Arg Tyr Val Gly Val Met Asp Val Ile Ala Gln Arg His Phe
145 150 155 160
gtc act cac tgg gac aac aaa aat gag atc aca gtt tat cca ctt gct 528
Val Thr His Trp Asp Asn Lys Asn Glu Ile Thr Val Tyr Pro Leu Ala
165 170 175
aaa aga tac act ttc ttg ctt gcg tgt aga ctg ttc atg tct gtt gag 576
Lys Arg Tyr Thr Phe Leu Leu Ala Cys Arg Leu Phe Met Ser Val Glu
180 185 190
gat gaa aat cat gtg gcg aaa ttc tca gac cca ttc caa cta atc gct 624
Asp Glu Asn His Val Ala Lys Phe Ser Asp Pro Phe Gln Leu Ile Ala
195 200 205
gca ggc atc att tca ctt cct atc gat ctt cct ggt act cca ttc aac 672
Ala Gly Ile Ile Ser Leu Pro Ile Asp Leu Pro Gly Thr Pro Phe Asn
210 215 220
aag gcc ata aag gct tca aat ttc att aga aaa gag ctg ata aag att 720
Lys Ala Ile Lys Ala Ser Asn Phe Ile Arg Lys Glu Leu Ile Lys Ile
225 230 235 240
atc aaa caa aga cgt gtt gat ctg gca gag ggt aca gca tct cca acc 768
Ile Lys Gln Arg Arg Val Asp Leu Ala Glu Gly Thr Ala Ser Pro Thr
245 250 255
cag gat atc ttg tca cat atg cta tta aca tct gat gaa aac ggt aaa 816
Gln Asp Ile Leu Ser His Met Leu Leu Thr Ser Asp Glu Asn Gly Lys
260 265 270
tct atg aac gag ttg aac att gcc gac aag att ctt gga cta ttg ata 864
Ser Met Asn Glu Leu Asn Ile Ala Asp Lys Ile Leu Gly Leu Leu Ile
275 280 285
gga ggc cac gat aca gct tca gta gct tgc aca ttt cta gtg aag tac 912
Gly Gly His Asp Thr Ala Ser Val Ala Cys Thr Phe Leu Val Lys Tyr
290 295 300
tta gga gaa tta cca cat atc tac gat aaa gtc tac caa gag caa atg 960
Leu Gly Glu Leu Pro His Ile Tyr Asp Lys Val Tyr Gln Glu Gln Met
305 310 315 320
gaa att gcc aag tcc aaa cct gct ggg gaa ttg ttg aat tgg gat gac 1008
Glu Ile Ala Lys Ser Lys Pro Ala Gly Glu Leu Leu Asn Trp Asp Asp
325 330 335
ttg aaa aag atg aag tat tca tgg aat gtg gca tgt gag gta atg aga 1056
Leu Lys Lys Met Lys Tyr Ser Trp Asn Val Ala Cys Glu Val Met Arg
340 345 350
ttg tca cca cct tta caa ggt ggt ttt aga gag gct ata act gac ttt 1104
Leu Ser Pro Pro Leu Gln Gly Gly Phe Arg Glu Ala Ile Thr Asp Phe
355 360 365
atg ttt aac ggt ttc tct att cca aaa ggg tgg aag tta tac tgg tcc 1152
Met Phe Asn Gly Phe Ser Ile Pro Lys Gly Trp Lys Leu Tyr Trp Ser
370 375 380
gcc aac tct aca cac aaa aat gca gaa tgt ttc cca atg cct gag aaa 1200
Ala Asn Ser Thr His Lys Asn Ala Glu Cys Phe Pro Met Pro Glu Lys
385 390 395 400
ttc gat cct acc aga ttt gaa ggt aat ggt cca gcg cct tat aca ttt 1248
Phe Asp Pro Thr Arg Phe Glu Gly Asn Gly Pro Ala Pro Tyr Thr Phe
405 410 415
gta cca ttc ggt gga ggc cct aga atg tgt cct gga aag gaa tac gct 1296
Val Pro Phe Gly Gly Gly Pro Arg Met Cys Pro Gly Lys Glu Tyr Ala
420 425 430
aga tta gaa atc ttg gtt ttc atg cat aat ctg gtc aaa cgt ttt aag 1344
Arg Leu Glu Ile Leu Val Phe Met His Asn Leu Val Lys Arg Phe Lys
435 440 445
tgg gaa aag gtt att cca gac gaa aag att att gtc gat cca ttc cca 1392
Trp Glu Lys Val Ile Pro Asp Glu Lys Ile Ile Val Asp Pro Phe Pro
450 455 460
atc cca gct aaa gat ctt cca atc cgt ttg tat cct cac aaa gct 1437
Ile Pro Ala Lys Asp Leu Pro Ile Arg Leu Tyr Pro His Lys Ala
465 470 475
<210> 98
<211> 479
<212> PRT
<213> Medicago truncatula
<400> 98
Met Glu Pro Asn Phe Tyr Leu Ser Leu Leu Leu Leu Phe Val Thr Phe
1 5 10 15
Ile Ser Leu Ser Leu Phe Phe Ile Phe Tyr Lys Gln Lys Ser Pro Leu
20 25 30
Asn Leu Pro Pro Gly Lys Met Gly Tyr Pro Ile Ile Gly Glu Ser Leu
35 40 45
Glu Phe Leu Ser Thr Gly Trp Lys Gly His Pro Glu Lys Phe Ile Phe
50 55 60
Asp Arg Met Arg Lys Tyr Ser Ser Glu Leu Phe Lys Thr Ser Ile Val
65 70 75 80
Gly Glu Ser Thr Val Val Cys Cys Gly Ala Ala Ser Asn Lys Phe Leu
85 90 95
Phe Ser Asn Glu Asn Lys Leu Val Thr Ala Trp Trp Pro Asp Ser Val
100 105 110
Asn Lys Ile Phe Pro Thr Thr Ser Leu Asp Ser Asn Leu Lys Glu Glu
115 120 125
Ser Ile Lys Met Arg Lys Leu Leu Pro Gln Phe Phe Lys Pro Glu Ala
130 135 140
Leu Gln Arg Tyr Val Gly Val Met Asp Val Ile Ala Gln Arg His Phe
145 150 155 160
Val Thr His Trp Asp Asn Lys Asn Glu Ile Thr Val Tyr Pro Leu Ala
165 170 175
Lys Arg Tyr Thr Phe Leu Leu Ala Cys Arg Leu Phe Met Ser Val Glu
180 185 190
Asp Glu Asn His Val Ala Lys Phe Ser Asp Pro Phe Gln Leu Ile Ala
195 200 205
Ala Gly Ile Ile Ser Leu Pro Ile Asp Leu Pro Gly Thr Pro Phe Asn
210 215 220
Lys Ala Ile Lys Ala Ser Asn Phe Ile Arg Lys Glu Leu Ile Lys Ile
225 230 235 240
Ile Lys Gln Arg Arg Val Asp Leu Ala Glu Gly Thr Ala Ser Pro Thr
245 250 255
Gln Asp Ile Leu Ser His Met Leu Leu Thr Ser Asp Glu Asn Gly Lys
260 265 270
Ser Met Asn Glu Leu Asn Ile Ala Asp Lys Ile Leu Gly Leu Leu Ile
275 280 285
Gly Gly His Asp Thr Ala Ser Val Ala Cys Thr Phe Leu Val Lys Tyr
290 295 300
Leu Gly Glu Leu Pro His Ile Tyr Asp Lys Val Tyr Gln Glu Gln Met
305 310 315 320
Glu Ile Ala Lys Ser Lys Pro Ala Gly Glu Leu Leu Asn Trp Asp Asp
325 330 335
Leu Lys Lys Met Lys Tyr Ser Trp Asn Val Ala Cys Glu Val Met Arg
340 345 350
Leu Ser Pro Pro Leu Gln Gly Gly Phe Arg Glu Ala Ile Thr Asp Phe
355 360 365
Met Phe Asn Gly Phe Ser Ile Pro Lys Gly Trp Lys Leu Tyr Trp Ser
370 375 380
Ala Asn Ser Thr His Lys Asn Ala Glu Cys Phe Pro Met Pro Glu Lys
385 390 395 400
Phe Asp Pro Thr Arg Phe Glu Gly Asn Gly Pro Ala Pro Tyr Thr Phe
405 410 415
Val Pro Phe Gly Gly Gly Pro Arg Met Cys Pro Gly Lys Glu Tyr Ala
420 425 430
Arg Leu Glu Ile Leu Val Phe Met His Asn Leu Val Lys Arg Phe Lys
435 440 445
Trp Glu Lys Val Ile Pro Asp Glu Lys Ile Ile Val Asp Pro Phe Pro
450 455 460
Ile Pro Ala Lys Asp Leu Pro Ile Arg Leu Tyr Pro His Lys Ala
465 470 475
<210> 99
<211> 1419
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1419)
<400> 99
atg gct aca tct gat tct att gtt gat gac agg aag cag ttg cat gtg 48
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
gct act ttc cct tgg ctt gct ttc ggt cat ata ctg cct tac cta caa 96
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Tyr Leu Gln
20 25 30
cta tca aaa ctg ata gct gaa aaa gga cat aaa gtg tca ttc ctt tca 144
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
aca act aga aac att caa aga tta tct tcc cac ata tca cca ttg att 192
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
aac gtc gtt caa ttg aca ctt cca aga gta cag gaa tta cca gaa gat 240
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
gct gaa gct aca aca gat gtg cat cct gaa gat atc cct tac ttg aaa 288
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Pro Tyr Leu Lys
85 90 95
aag gca tcc gat gga tta cag cct gag gtc act aga ttc ctt gag caa 336
Lys Ala Ser Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
cac agt cca gat tgg atc ata tac gac tac act cac tat tgg ttg cct 384
His Ser Pro Asp Trp Ile Ile Tyr Asp Tyr Thr His Tyr Trp Leu Pro
115 120 125
tca att gca gca tca cta ggc att tct agg gca cat ttc agt gta acc 432
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala His Phe Ser Val Thr
130 135 140
aca cct tgg gcc att gct tac atg ggt cca tcc gct gat gct atg att 480
Thr Pro Trp Ala Ile Ala Tyr Met Gly Pro Ser Ala Asp Ala Met Ile
145 150 155 160
aac ggc agt gat ggt aga act acc gtt gaa gat ttg aca acc cca cca 528
Asn Gly Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
aag tgg ttt cca ttt cca act aaa gtc tgt tgg aga aaa cac gac tta 576
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
gca aga ctg gtt cca tac aag gca cca gga atc tca gac ggc tat aga 624
Ala Arg Leu Val Pro Tyr Lys Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
atg ggt tta gtc ctt aaa ggg tct gac tgc cta ttg tct aag tgt tac 672
Met Gly Leu Val Leu Lys Gly Ser Asp Cys Leu Leu Ser Lys Cys Tyr
210 215 220
cat gag ttt ggg aca caa tgg cta cca ctt ttg gaa aca tta cac caa 720
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
gtt cct gtc gta cca gtt ggt cta tta cct cca gaa atc cct ggt gat 768
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp
245 250 255
gag aag gac gag act tgg gtt tca atc aaa aag tgg tta gac ggg aag 816
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
caa aaa ggc tca gtg gta tat gtg gca ctg ggt tcc gaa gtt tta gta 864
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Val Leu Val
275 280 285
tct caa aca gaa gtt gtg gaa ctt gcc tta ggt ttg gaa cta tct gga 912
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
ttg cca ttt gtc tgg gcc tac aga aaa cca aaa ggc cct gca aag tcc 960
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
gat tca gtt gaa ttg cca gac ggc ttt gtc gag aga act aga gat aga 1008
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
ggg ttg gta tgg act tca tgg gct cca caa ttg aga atc ctg agt cac 1056
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
gaa tct gtg tgc ggt ttc cta aca cat tgt ggt tct ggt tct ata gtt 1104
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
gaa gga ctg atg ttt ggt cat cca ctt atc atg ttg cca atc ttt ggt 1152
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Gly
370 375 380
gac cag cct ttg aat gca cgt ctg tta gaa gat aaa caa gtt gga att 1200
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
gaa atc cca cgt aat gag gaa gat gga tgt tta acc aag gag tct gtg 1248
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
gcc aga tca tta cgt tcc gtt gtc gtt gaa aag gaa ggc gaa atc tac 1296
Ala Arg Ser Leu Arg Ser Val Val Val Glu Lys Glu Gly Glu Ile Tyr
420 425 430
aag gcc aat gcc cgt gaa ctt tca aag atc tac aat gac aca aaa gta 1344
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
gag aag gaa tat gtt tct caa ttt gta gat tac cta gag aaa aac gct 1392
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala
450 455 460
aga gcc gta gct att gat cat gaa tcc 1419
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 100
<211> 473
<212> PRT
<213> Stevia rebaudiana
<400> 100
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Tyr Leu Gln
20 25 30
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Pro Tyr Leu Lys
85 90 95
Lys Ala Ser Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
His Ser Pro Asp Trp Ile Ile Tyr Asp Tyr Thr His Tyr Trp Leu Pro
115 120 125
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala His Phe Ser Val Thr
130 135 140
Thr Pro Trp Ala Ile Ala Tyr Met Gly Pro Ser Ala Asp Ala Met Ile
145 150 155 160
Asn Gly Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
Ala Arg Leu Val Pro Tyr Lys Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
Met Gly Leu Val Leu Lys Gly Ser Asp Cys Leu Leu Ser Lys Cys Tyr
210 215 220
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp
245 250 255
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Val Leu Val
275 280 285
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Gly
370 375 380
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
Ala Arg Ser Leu Arg Ser Val Val Val Glu Lys Glu Gly Glu Ile Tyr
420 425 430
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala
450 455 460
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 101
<211> 1377
<212> DNA
<213> Ipomoea purpurea
<220>
<221> CDS
<222> (1)..(1377)
<400> 101
atg ggt tct caa gct acc acc tac cac atg gcc atg tac cca tgg ttc 48
Met Gly Ser Gln Ala Thr Thr Tyr His Met Ala Met Tyr Pro Trp Phe
1 5 10 15
ggt gtc ggt cat ttg act ggt ttc ttc aga ttg gct aac aag ttg gct 96
Gly Val Gly His Leu Thr Gly Phe Phe Arg Leu Ala Asn Lys Leu Ala
20 25 30
ggt aag ggt cac aga atc tct ttc ttg att cca aag aac act caa tcc 144
Gly Lys Gly His Arg Ile Ser Phe Leu Ile Pro Lys Asn Thr Gln Ser
35 40 45
aaa ttg gaa tct ttc aac ttg cac cca cat ttg atc tct ttc gtt cca 192
Lys Leu Glu Ser Phe Asn Leu His Pro His Leu Ile Ser Phe Val Pro
50 55 60
att gtt gtt cca tcc att cca ggt cta cca cca ggt gct gaa acc acc 240
Ile Val Val Pro Ser Ile Pro Gly Leu Pro Pro Gly Ala Glu Thr Thr
65 70 75 80
tct gat gtc cca ttc cca tct acc cat ttg ttg atg gaa gct atg gac 288
Ser Asp Val Pro Phe Pro Ser Thr His Leu Leu Met Glu Ala Met Asp
85 90 95
aag act caa aac gat att gaa atc atc ttg aag gac ttg aag gtc gat 336
Lys Thr Gln Asn Asp Ile Glu Ile Ile Leu Lys Asp Leu Lys Val Asp
100 105 110
gtc gtt ttc tac gat ttc acc cac tgg tta cca tct ttg gcc aga aag 384
Val Val Phe Tyr Asp Phe Thr His Trp Leu Pro Ser Leu Ala Arg Lys
115 120 125
atc ggt atc aaa tcc gtc ttt tac tcc acc att tct cca ttg atg cac 432
Ile Gly Ile Lys Ser Val Phe Tyr Ser Thr Ile Ser Pro Leu Met His
130 135 140
ggt tac gct ttg tcc cca gaa aga aga gtt gtc ggt aag caa ttg act 480
Gly Tyr Ala Leu Ser Pro Glu Arg Arg Val Val Gly Lys Gln Leu Thr
145 150 155 160
gaa gct gac atg atg aag gct cca gct tct ttc cca gac cca tcc atc 528
Glu Ala Asp Met Met Lys Ala Pro Ala Ser Phe Pro Asp Pro Ser Ile
165 170 175
aaa ttg cac gct cac gaa gcc aga ggt ttc acc gcc aga acc gtt atg 576
Lys Leu His Ala His Glu Ala Arg Gly Phe Thr Ala Arg Thr Val Met
180 185 190
aag ttc ggt ggt gat atc act ttc ttc gat cgt att ttc act gct gtt 624
Lys Phe Gly Gly Asp Ile Thr Phe Phe Asp Arg Ile Phe Thr Ala Val
195 200 205
tct gaa tct gat ggt ttg gct tac tcc act tgt cgt gaa atc gaa ggt 672
Ser Glu Ser Asp Gly Leu Ala Tyr Ser Thr Cys Arg Glu Ile Glu Gly
210 215 220
caa ttc tgt gac tac att gaa act caa ttc caa aag cca gtc tta ttg 720
Gln Phe Cys Asp Tyr Ile Glu Thr Gln Phe Gln Lys Pro Val Leu Leu
225 230 235 240
gct ggt cca gct ttg cca gtt cct tcc aag tcc acc atg gaa caa aaa 768
Ala Gly Pro Ala Leu Pro Val Pro Ser Lys Ser Thr Met Glu Gln Lys
245 250 255
tgg tct gac tgg tta ggt aag ttc aag gaa ggt tct gtc att tac tgt 816
Trp Ser Asp Trp Leu Gly Lys Phe Lys Glu Gly Ser Val Ile Tyr Cys
260 265 270
gct ttc ggt tct gaa tgt acc ttg aga aag gac aaa ttc caa gaa cta 864
Ala Phe Gly Ser Glu Cys Thr Leu Arg Lys Asp Lys Phe Gln Glu Leu
275 280 285
tta tgg ggt ttg gaa ttg act ggt atg cca ttc ttt gct gct ttg aag 912
Leu Trp Gly Leu Glu Leu Thr Gly Met Pro Phe Phe Ala Ala Leu Lys
290 295 300
cct cca ttt gaa act gaa tct gtt gaa gct gcc atc cca gaa gaa ttg 960
Pro Pro Phe Glu Thr Glu Ser Val Glu Ala Ala Ile Pro Glu Glu Leu
305 310 315 320
aag gaa aag atc caa ggt aga ggt atc gtt cac ggt gaa tgg gtt caa 1008
Lys Glu Lys Ile Gln Gly Arg Gly Ile Val His Gly Glu Trp Val Gln
325 330 335
caa caa ttg ttt ttg caa cac cca tct gtt ggt tgt ttc gtt tcc cac 1056
Gln Gln Leu Phe Leu Gln His Pro Ser Val Gly Cys Phe Val Ser His
340 345 350
tgt ggt tgg gct tct ttg tcc gaa gct ttg gtc aat gac tgt caa atc 1104
Cys Gly Trp Ala Ser Leu Ser Glu Ala Leu Val Asn Asp Cys Gln Ile
355 360 365
gtc ttg ttg cca caa gtc ggt gac caa att att aac gcc aga atc atg 1152
Val Leu Leu Pro Gln Val Gly Asp Gln Ile Ile Asn Ala Arg Ile Met
370 375 380
tcc gtt tct ttg aag gtt ggt gtt gaa gtc gaa aag ggt gaa gaa gat 1200
Ser Val Ser Leu Lys Val Gly Val Glu Val Glu Lys Gly Glu Glu Asp
385 390 395 400
ggt gtt ttc tcc aga gaa tct gtc tgt aag gcc gtc aag gct gtc atg 1248
Gly Val Phe Ser Arg Glu Ser Val Cys Lys Ala Val Lys Ala Val Met
405 410 415
gat gaa aag tct gaa atc ggt aga gaa gtt aga ggt aac cac gac aaa 1296
Asp Glu Lys Ser Glu Ile Gly Arg Glu Val Arg Gly Asn His Asp Lys
420 425 430
ttg aga ggt ttc ttg atg aac gct gac ttg gac tcc aag tac atg gac 1344
Leu Arg Gly Phe Leu Met Asn Ala Asp Leu Asp Ser Lys Tyr Met Asp
435 440 445
tct ttc aac caa aag ttg caa gac tta tta ggg 1377
Ser Phe Asn Gln Lys Leu Gln Asp Leu Leu Gly
450 455
<210> 102
<211> 459
<212> PRT
<213> Ipomoea purpurea
<400> 102
Met Gly Ser Gln Ala Thr Thr Tyr His Met Ala Met Tyr Pro Trp Phe
1 5 10 15
Gly Val Gly His Leu Thr Gly Phe Phe Arg Leu Ala Asn Lys Leu Ala
20 25 30
Gly Lys Gly His Arg Ile Ser Phe Leu Ile Pro Lys Asn Thr Gln Ser
35 40 45
Lys Leu Glu Ser Phe Asn Leu His Pro His Leu Ile Ser Phe Val Pro
50 55 60
Ile Val Val Pro Ser Ile Pro Gly Leu Pro Pro Gly Ala Glu Thr Thr
65 70 75 80
Ser Asp Val Pro Phe Pro Ser Thr His Leu Leu Met Glu Ala Met Asp
85 90 95
Lys Thr Gln Asn Asp Ile Glu Ile Ile Leu Lys Asp Leu Lys Val Asp
100 105 110
Val Val Phe Tyr Asp Phe Thr His Trp Leu Pro Ser Leu Ala Arg Lys
115 120 125
Ile Gly Ile Lys Ser Val Phe Tyr Ser Thr Ile Ser Pro Leu Met His
130 135 140
Gly Tyr Ala Leu Ser Pro Glu Arg Arg Val Val Gly Lys Gln Leu Thr
145 150 155 160
Glu Ala Asp Met Met Lys Ala Pro Ala Ser Phe Pro Asp Pro Ser Ile
165 170 175
Lys Leu His Ala His Glu Ala Arg Gly Phe Thr Ala Arg Thr Val Met
180 185 190
Lys Phe Gly Gly Asp Ile Thr Phe Phe Asp Arg Ile Phe Thr Ala Val
195 200 205
Ser Glu Ser Asp Gly Leu Ala Tyr Ser Thr Cys Arg Glu Ile Glu Gly
210 215 220
Gln Phe Cys Asp Tyr Ile Glu Thr Gln Phe Gln Lys Pro Val Leu Leu
225 230 235 240
Ala Gly Pro Ala Leu Pro Val Pro Ser Lys Ser Thr Met Glu Gln Lys
245 250 255
Trp Ser Asp Trp Leu Gly Lys Phe Lys Glu Gly Ser Val Ile Tyr Cys
260 265 270
Ala Phe Gly Ser Glu Cys Thr Leu Arg Lys Asp Lys Phe Gln Glu Leu
275 280 285
Leu Trp Gly Leu Glu Leu Thr Gly Met Pro Phe Phe Ala Ala Leu Lys
290 295 300
Pro Pro Phe Glu Thr Glu Ser Val Glu Ala Ala Ile Pro Glu Glu Leu
305 310 315 320
Lys Glu Lys Ile Gln Gly Arg Gly Ile Val His Gly Glu Trp Val Gln
325 330 335
Gln Gln Leu Phe Leu Gln His Pro Ser Val Gly Cys Phe Val Ser His
340 345 350
Cys Gly Trp Ala Ser Leu Ser Glu Ala Leu Val Asn Asp Cys Gln Ile
355 360 365
Val Leu Leu Pro Gln Val Gly Asp Gln Ile Ile Asn Ala Arg Ile Met
370 375 380
Ser Val Ser Leu Lys Val Gly Val Glu Val Glu Lys Gly Glu Glu Asp
385 390 395 400
Gly Val Phe Ser Arg Glu Ser Val Cys Lys Ala Val Lys Ala Val Met
405 410 415
Asp Glu Lys Ser Glu Ile Gly Arg Glu Val Arg Gly Asn His Asp Lys
420 425 430
Leu Arg Gly Phe Leu Met Asn Ala Asp Leu Asp Ser Lys Tyr Met Asp
435 440 445
Ser Phe Asn Gln Lys Leu Gln Asp Leu Leu Gly
450 455
<210> 103
<211> 1314
<212> DNA
<213> Bellis perennis
<220>
<221> CDS
<222> (1)..(1314)
<400> 103
atg gac tct aag att gac tcc aag act ttc aga gtt gtt atg ttg cca 48
Met Asp Ser Lys Ile Asp Ser Lys Thr Phe Arg Val Val Met Leu Pro
1 5 10 15
tgg ttg gct tac tcc cac atc tct tct ttc ttg gtt ttc gct aag aga 96
Trp Leu Ala Tyr Ser His Ile Ser Ser Phe Leu Val Phe Ala Lys Arg
20 25 30
ttg acc aac cac aac ttc cac atc tac atc tgt tcc tct caa acc aac 144
Leu Thr Asn His Asn Phe His Ile Tyr Ile Cys Ser Ser Gln Thr Asn
35 40 45
atg caa tac ttg aag aac aac ttg act tct caa tac tcc aag tcc atc 192
Met Gln Tyr Leu Lys Asn Asn Leu Thr Ser Gln Tyr Ser Lys Ser Ile
50 55 60
caa ttg att gaa ttg aac ttg cca tct tct tct gaa ttg cca ttg caa 240
Gln Leu Ile Glu Leu Asn Leu Pro Ser Ser Ser Glu Leu Pro Leu Gln
65 70 75 80
tac cac acc acc cac ggt cta cca cct cat ttg acc aag act cta tcc 288
Tyr His Thr Thr His Gly Leu Pro Pro His Leu Thr Lys Thr Leu Ser
85 90 95
gat gac tac caa aaa tct ggt cca gac ttc gaa acc atc ttg atc aaa 336
Asp Asp Tyr Gln Lys Ser Gly Pro Asp Phe Glu Thr Ile Leu Ile Lys
100 105 110
ttg aac cct cac tta gtc atc tac gac ttc aac caa tta tgg gct cca 384
Leu Asn Pro His Leu Val Ile Tyr Asp Phe Asn Gln Leu Trp Ala Pro
115 120 125
gaa gtt gct tcc act ttg cac att cca tct atc caa ttg ttg tcc ggt 432
Glu Val Ala Ser Thr Leu His Ile Pro Ser Ile Gln Leu Leu Ser Gly
130 135 140
tgt gtt gct ttg tac gct ttg gat gct cat ttg tac acc aag cca ttg 480
Cys Val Ala Leu Tyr Ala Leu Asp Ala His Leu Tyr Thr Lys Pro Leu
145 150 155 160
gat gaa aac ttg gcc aag ttc cca ttc cca gaa atc tac cca aag aac 528
Asp Glu Asn Leu Ala Lys Phe Pro Phe Pro Glu Ile Tyr Pro Lys Asn
165 170 175
aga gat atc cca aag ggt ggt tcc aaa tac atc gaa aga ttt gtc gac 576
Arg Asp Ile Pro Lys Gly Gly Ser Lys Tyr Ile Glu Arg Phe Val Asp
180 185 190
tgt atg aga aga tct tgt gaa atc atc ttg gtc aga tcc act atg gaa 624
Cys Met Arg Arg Ser Cys Glu Ile Ile Leu Val Arg Ser Thr Met Glu
195 200 205
ttg gaa ggt aag tac att gac tac ttg tct aag act tta ggt aag aag 672
Leu Glu Gly Lys Tyr Ile Asp Tyr Leu Ser Lys Thr Leu Gly Lys Lys
210 215 220
gtc tta cca gtt ggt cca tta gtc caa gaa gct tct ttg ttg caa gat 720
Val Leu Pro Val Gly Pro Leu Val Gln Glu Ala Ser Leu Leu Gln Asp
225 230 235 240
gac cac atc tgg atc atg aaa tgg ttg gac aag aag gaa gaa tcc tct 768
Asp His Ile Trp Ile Met Lys Trp Leu Asp Lys Lys Glu Glu Ser Ser
245 250 255
gtt gtc ttt gtt tgt ttc ggt tct gaa tac att ttg tct gac aac gaa 816
Val Val Phe Val Cys Phe Gly Ser Glu Tyr Ile Leu Ser Asp Asn Glu
260 265 270
atc gaa gac att gct tac ggt ttg gaa ttg tct caa gtc tct ttc gtc 864
Ile Glu Asp Ile Ala Tyr Gly Leu Glu Leu Ser Gln Val Ser Phe Val
275 280 285
tgg gct atc aga gcc aag acc tct gct ttg aat ggt ttc att gac cgt 912
Trp Ala Ile Arg Ala Lys Thr Ser Ala Leu Asn Gly Phe Ile Asp Arg
290 295 300
gtt ggt gac aag ggt ttg gtc att gac aaa tgg gtt cca caa gct aac 960
Val Gly Asp Lys Gly Leu Val Ile Asp Lys Trp Val Pro Gln Ala Asn
305 310 315 320
atc ttg tcc cac tct tcc act ggt ggt ttc atc tcc cac tgt ggt tgg 1008
Ile Leu Ser His Ser Ser Thr Gly Gly Phe Ile Ser His Cys Gly Trp
325 330 335
tcc tct acc atg gaa tcc atc aga tac ggt gtt cca atc att gct atg 1056
Ser Ser Thr Met Glu Ser Ile Arg Tyr Gly Val Pro Ile Ile Ala Met
340 345 350
cca atg caa ttc gat caa cca tac aac gcc aga ttg atg gaa act gtc 1104
Pro Met Gln Phe Asp Gln Pro Tyr Asn Ala Arg Leu Met Glu Thr Val
355 360 365
ggt gct ggt att gaa gtt ggt aga gat ggt gaa ggt aga ttg aag cgt 1152
Gly Ala Gly Ile Glu Val Gly Arg Asp Gly Glu Gly Arg Leu Lys Arg
370 375 380
gaa gaa att gcc gct gtt gtc aga aag gtt gtt gtt gaa gat tct ggt 1200
Glu Glu Ile Ala Ala Val Val Arg Lys Val Val Val Glu Asp Ser Gly
385 390 395 400
gaa tct atc aga gaa aag gcc aag gaa tta ggt gaa att atg aag aag 1248
Glu Ser Ile Arg Glu Lys Ala Lys Glu Leu Gly Glu Ile Met Lys Lys
405 410 415
aac atg gaa gct gaa gtc gat ggt att gtc att gaa aac tta gtc aaa 1296
Asn Met Glu Ala Glu Val Asp Gly Ile Val Ile Glu Asn Leu Val Lys
420 425 430
ttg tgt gaa atg aac aac 1314
Leu Cys Glu Met Asn Asn
435
<210> 104
<211> 438
<212> PRT
<213> Bellis perennis
<400> 104
Met Asp Ser Lys Ile Asp Ser Lys Thr Phe Arg Val Val Met Leu Pro
1 5 10 15
Trp Leu Ala Tyr Ser His Ile Ser Ser Phe Leu Val Phe Ala Lys Arg
20 25 30
Leu Thr Asn His Asn Phe His Ile Tyr Ile Cys Ser Ser Gln Thr Asn
35 40 45
Met Gln Tyr Leu Lys Asn Asn Leu Thr Ser Gln Tyr Ser Lys Ser Ile
50 55 60
Gln Leu Ile Glu Leu Asn Leu Pro Ser Ser Ser Glu Leu Pro Leu Gln
65 70 75 80
Tyr His Thr Thr His Gly Leu Pro Pro His Leu Thr Lys Thr Leu Ser
85 90 95
Asp Asp Tyr Gln Lys Ser Gly Pro Asp Phe Glu Thr Ile Leu Ile Lys
100 105 110
Leu Asn Pro His Leu Val Ile Tyr Asp Phe Asn Gln Leu Trp Ala Pro
115 120 125
Glu Val Ala Ser Thr Leu His Ile Pro Ser Ile Gln Leu Leu Ser Gly
130 135 140
Cys Val Ala Leu Tyr Ala Leu Asp Ala His Leu Tyr Thr Lys Pro Leu
145 150 155 160
Asp Glu Asn Leu Ala Lys Phe Pro Phe Pro Glu Ile Tyr Pro Lys Asn
165 170 175
Arg Asp Ile Pro Lys Gly Gly Ser Lys Tyr Ile Glu Arg Phe Val Asp
180 185 190
Cys Met Arg Arg Ser Cys Glu Ile Ile Leu Val Arg Ser Thr Met Glu
195 200 205
Leu Glu Gly Lys Tyr Ile Asp Tyr Leu Ser Lys Thr Leu Gly Lys Lys
210 215 220
Val Leu Pro Val Gly Pro Leu Val Gln Glu Ala Ser Leu Leu Gln Asp
225 230 235 240
Asp His Ile Trp Ile Met Lys Trp Leu Asp Lys Lys Glu Glu Ser Ser
245 250 255
Val Val Phe Val Cys Phe Gly Ser Glu Tyr Ile Leu Ser Asp Asn Glu
260 265 270
Ile Glu Asp Ile Ala Tyr Gly Leu Glu Leu Ser Gln Val Ser Phe Val
275 280 285
Trp Ala Ile Arg Ala Lys Thr Ser Ala Leu Asn Gly Phe Ile Asp Arg
290 295 300
Val Gly Asp Lys Gly Leu Val Ile Asp Lys Trp Val Pro Gln Ala Asn
305 310 315 320
Ile Leu Ser His Ser Ser Thr Gly Gly Phe Ile Ser His Cys Gly Trp
325 330 335
Ser Ser Thr Met Glu Ser Ile Arg Tyr Gly Val Pro Ile Ile Ala Met
340 345 350
Pro Met Gln Phe Asp Gln Pro Tyr Asn Ala Arg Leu Met Glu Thr Val
355 360 365
Gly Ala Gly Ile Glu Val Gly Arg Asp Gly Glu Gly Arg Leu Lys Arg
370 375 380
Glu Glu Ile Ala Ala Val Val Arg Lys Val Val Val Glu Asp Ser Gly
385 390 395 400
Glu Ser Ile Arg Glu Lys Ala Lys Glu Leu Gly Glu Ile Met Lys Lys
405 410 415
Asn Met Glu Ala Glu Val Asp Gly Ile Val Ile Glu Asn Leu Val Lys
420 425 430
Leu Cys Glu Met Asn Asn
435
<210> 105
<211> 1419
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1419)
<400> 105
atg gcc act tct gac tct att gtc gat gac aga aag caa ttg cac gtt 48
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
gcc act ttc cca tgg ttg gcc ttc ggt cac att ttg cca tac ttg caa 96
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Tyr Leu Gln
20 25 30
ttg tcc aag ttg att gct gaa aag ggt cac aag gtt tct ttc ttg tcc 144
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
act acc aga aac atc caa aga tta tct tct cac atc tct cca tta atc 192
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
aac gtt gtc caa ttg act tta cca aga gtt caa gaa ttg cca gaa gat 240
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
gct gaa gct acc acc gat gtc cat cca gaa gat atc cca tac ttg aag 288
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Pro Tyr Leu Lys
85 90 95
aag gct tct gac ggt ttg caa cct gaa gtc acc cgt ttc ttg gaa caa 336
Lys Ala Ser Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
cac tct cca gac tgg atc atc tac gac tac act cac tac tgg tta cct 384
His Ser Pro Asp Trp Ile Ile Tyr Asp Tyr Thr His Tyr Trp Leu Pro
115 120 125
tcc att gct gct tct ttg ggt atc tcc cgt gct cat ttc tcc gtc acc 432
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala His Phe Ser Val Thr
130 135 140
act cca tgg gct att gct tac atg ggt cca tct gct gat gct atg atc 480
Thr Pro Trp Ala Ile Ala Tyr Met Gly Pro Ser Ala Asp Ala Met Ile
145 150 155 160
aac ggt tct gat ggt aga acc act gtt gaa gat ttg acc act cca cca 528
Asn Gly Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
aaa tgg ttc cca ttc cca act aag gtc tgt tgg aga aag cac gat ttg 576
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
gcc aga ttg gtt cca tac aag gct cca ggt atc tct gat ggt tac aga 624
Ala Arg Leu Val Pro Tyr Lys Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
atg ggt ttg gtt ttg aaa ggt tct gac tgt ttg ttg tcc aaa tgt tac 672
Met Gly Leu Val Leu Lys Gly Ser Asp Cys Leu Leu Ser Lys Cys Tyr
210 215 220
cat gaa ttc ggt act caa tgg tta cca ttg ttg gaa act ttg cac caa 720
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
gtt cca gtt gtt cca gtc ggt cta tta cca cca gaa gtc cca ggt gac 768
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Val Pro Gly Asp
245 250 255
gaa aag gac gaa acc tgg gtt tcc atc aag aaa tgg tta gat ggt aag 816
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
caa aag ggt tcc gtt gtc tac gtt gct ttg ggt tct gaa gtc ttg gtt 864
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Val Leu Val
275 280 285
tct caa act gaa gtt gtc gaa ttg gct tta ggt ttg gaa ttg tcc ggt 912
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
ttg cct ttc gtc tgg gct tac aga aag cca aag ggt cca gct aaa tct 960
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
gac tcc gtc gaa tta cca gac ggt ttc gtt gaa aga acc cgt gac aga 1008
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
ggt ttg gtc tgg act tcc tgg gct cca caa ttg aga atc ttg tcc cac 1056
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
gaa tct gtt tgt ggt ttc ttg acc cac tgt ggt tct ggt tcc att gtc 1104
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
gaa ggt ttg atg ttt ggt cac cca ttg atc atg ttg cca atc ttc ggt 1152
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Gly
370 375 380
gac caa cca ttg aac gcc aga tta ttg gaa gac aag caa gtc ggt att 1200
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
gaa att cca aga aac gaa gaa gat ggt tgt ttg acc aag gaa tct gtt 1248
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
gcc aga tct ttg aga tcc gtt gtt gtc gaa aag gaa ggt gaa atc tac 1296
Ala Arg Ser Leu Arg Ser Val Val Val Glu Lys Glu Gly Glu Ile Tyr
420 425 430
aag gct aac gct aga gaa ttg tct aag atc tac aac gac acc aag gtt 1344
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
gaa aag gaa tac gtt tct caa ttt gtc gac tac ttg gaa aag aac acc 1392
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Thr
450 455 460
aga gct gtt gcc att gac cac gaa agt 1419
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 106
<211> 473
<212> PRT
<213> Stevia rebaudiana
<400> 106
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Tyr Leu Gln
20 25 30
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Pro Tyr Leu Lys
85 90 95
Lys Ala Ser Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
His Ser Pro Asp Trp Ile Ile Tyr Asp Tyr Thr His Tyr Trp Leu Pro
115 120 125
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala His Phe Ser Val Thr
130 135 140
Thr Pro Trp Ala Ile Ala Tyr Met Gly Pro Ser Ala Asp Ala Met Ile
145 150 155 160
Asn Gly Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
Ala Arg Leu Val Pro Tyr Lys Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
Met Gly Leu Val Leu Lys Gly Ser Asp Cys Leu Leu Ser Lys Cys Tyr
210 215 220
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Val Pro Gly Asp
245 250 255
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Val Leu Val
275 280 285
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Gly
370 375 380
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
Ala Arg Ser Leu Arg Ser Val Val Val Glu Lys Glu Gly Glu Ile Tyr
420 425 430
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Thr
450 455 460
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 107
<211> 1455
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1455)
<400> 107
atg tac aac gtc acc tac cat caa aac tcc aag gct atg gct act tct 48
Met Tyr Asn Val Thr Tyr His Gln Asn Ser Lys Ala Met Ala Thr Ser
1 5 10 15
gac tct att gtc gat gac aga aag caa ttg cac gtt gcc act ttc cca 96
Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val Ala Thr Phe Pro
20 25 30
tgg tta gct ttc ggt cac atc tta cca ttc ttg caa ttg tcc aaa ttg 144
Trp Leu Ala Phe Gly His Ile Leu Pro Phe Leu Gln Leu Ser Lys Leu
35 40 45
att gct gaa aag ggt cac aag gtt tct ttc ttg tct acc act aga aac 192
Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser Thr Thr Arg Asn
50 55 60
att caa aga ttg tct tcc cac atc tct cca ttg atc aac gtt gtt caa 240
Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile Asn Val Val Gln
65 70 75 80
ttg act cta cca aga gtc caa gaa tta cca gaa gat gct gaa gct acc 288
Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp Ala Glu Ala Thr
85 90 95
acc gat gtc cat cca gaa gat atc caa tac ttg aag aag gct gtt gac 336
Thr Asp Val His Pro Glu Asp Ile Gln Tyr Leu Lys Lys Ala Val Asp
100 105 110
ggt ttg caa cca gaa gtt acc aga ttc ttg gaa caa cac tct cca gac 384
Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln His Ser Pro Asp
115 120 125
tgg atc atc tac gac ttc acc cac tac tgg tta cct tcc att gct gcc 432
Trp Ile Ile Tyr Asp Phe Thr His Tyr Trp Leu Pro Ser Ile Ala Ala
130 135 140
tcc ttg ggt atc tcc aga gct tac ttc tgt gtt atc act cca tgg acc 480
Ser Leu Gly Ile Ser Arg Ala Tyr Phe Cys Val Ile Thr Pro Trp Thr
145 150 155 160
att gct tac ttg gct cca tct tct gat gct atg atc aac gac tct gac 528
Ile Ala Tyr Leu Ala Pro Ser Ser Asp Ala Met Ile Asn Asp Ser Asp
165 170 175
ggt aga acc act gtc gaa gat ttg acc act cca cct aaa tgg ttc cca 576
Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro Lys Trp Phe Pro
180 185 190
ttc cca act aag gtc tgt tgg aga aag cat gac ttg gcc aga atg gaa 624
Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu Ala Arg Met Glu
195 200 205
cca tac gaa gct cca ggt atc tcc gat ggt tac aga atg ggt atg gtt 672
Pro Tyr Glu Ala Pro Gly Ile Ser Asp Gly Tyr Arg Met Gly Met Val
210 215 220
ttc aag ggt tct gac tgt ttg ttg ttc aag tgt tac cac gaa ttc ggt 720
Phe Lys Gly Ser Asp Cys Leu Leu Phe Lys Cys Tyr His Glu Phe Gly
225 230 235 240
act caa tgg tta cca ttg ttg gaa act ttg cac caa gtc cca gtt gtc 768
Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln Val Pro Val Val
245 250 255
cca gtc ggt ttg ttg cca cca gaa atc cca ggt gac gaa aag gac gaa 816
Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp Glu Lys Asp Glu
260 265 270
acc tgg gtt tcc atc aag aaa tgg ttg gat ggt aag caa aag ggt tcc 864
Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys Gln Lys Gly Ser
275 280 285
gtt gtc tac gtt gcc ttg ggt tct gaa gct tta gtc tct caa act gaa 912
Val Val Tyr Val Ala Leu Gly Ser Glu Ala Leu Val Ser Gln Thr Glu
290 295 300
gtt gtt gaa ttg gct ttg ggt ttg gaa ttg tct ggt cta cca ttt gtc 960
Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly Leu Pro Phe Val
305 310 315 320
tgg gct tac aga aag cca aag ggt cca gcc aaa tct gac tcc gtt gaa 1008
Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser Asp Ser Val Glu
325 330 335
ttg cca gat ggt ttc gtc gaa cgt acc aga gac aga ggt tta gtc tgg 1056
Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg Gly Leu Val Trp
340 345 350
act tcc tgg gct cct caa ttg aga atc ttg tct cac gaa tct gtc tgt 1104
Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His Glu Ser Val Cys
355 360 365
ggt ttc ttg act cac tgt ggt tct ggt tcc att gtc gaa ggt ttg atg 1152
Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val Glu Gly Leu Met
370 375 380
ttc ggt cac cca ttg att atg ttg cca att ttc tgt gac caa cca ttg 1200
Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Cys Asp Gln Pro Leu
385 390 395 400
aac gct aga tta ttg gaa gac aaa caa gtc ggt att gaa atc cca aga 1248
Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile Glu Ile Pro Arg
405 410 415
aac gaa gaa gat ggt tgt ttg acc aag gaa tct gtt gct cgt tct ttg 1296
Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val Ala Arg Ser Leu
420 425 430
aga tct gtt gtt gtc gaa aac gaa ggt gaa atc tac aag gcc aat gct 1344
Arg Ser Val Val Val Glu Asn Glu Gly Glu Ile Tyr Lys Ala Asn Ala
435 440 445
cgt gct ttg tcc aag atc tac aac gat acc aag gtt gaa aag gaa tac 1392
Arg Ala Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val Glu Lys Glu Tyr
450 455 460
gtt tct caa ttt gtt gac tac ttg gaa aag aac gcc aga gct gtt gcc 1440
Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala Arg Ala Val Ala
465 470 475 480
att gac cac gaa tcc 1455
Ile Asp His Glu Ser
485
<210> 108
<211> 485
<212> PRT
<213> Stevia rebaudiana
<400> 108
Met Tyr Asn Val Thr Tyr His Gln Asn Ser Lys Ala Met Ala Thr Ser
1 5 10 15
Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val Ala Thr Phe Pro
20 25 30
Trp Leu Ala Phe Gly His Ile Leu Pro Phe Leu Gln Leu Ser Lys Leu
35 40 45
Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser Thr Thr Arg Asn
50 55 60
Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile Asn Val Val Gln
65 70 75 80
Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp Ala Glu Ala Thr
85 90 95
Thr Asp Val His Pro Glu Asp Ile Gln Tyr Leu Lys Lys Ala Val Asp
100 105 110
Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln His Ser Pro Asp
115 120 125
Trp Ile Ile Tyr Asp Phe Thr His Tyr Trp Leu Pro Ser Ile Ala Ala
130 135 140
Ser Leu Gly Ile Ser Arg Ala Tyr Phe Cys Val Ile Thr Pro Trp Thr
145 150 155 160
Ile Ala Tyr Leu Ala Pro Ser Ser Asp Ala Met Ile Asn Asp Ser Asp
165 170 175
Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro Lys Trp Phe Pro
180 185 190
Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu Ala Arg Met Glu
195 200 205
Pro Tyr Glu Ala Pro Gly Ile Ser Asp Gly Tyr Arg Met Gly Met Val
210 215 220
Phe Lys Gly Ser Asp Cys Leu Leu Phe Lys Cys Tyr His Glu Phe Gly
225 230 235 240
Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln Val Pro Val Val
245 250 255
Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp Glu Lys Asp Glu
260 265 270
Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys Gln Lys Gly Ser
275 280 285
Val Val Tyr Val Ala Leu Gly Ser Glu Ala Leu Val Ser Gln Thr Glu
290 295 300
Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly Leu Pro Phe Val
305 310 315 320
Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser Asp Ser Val Glu
325 330 335
Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg Gly Leu Val Trp
340 345 350
Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His Glu Ser Val Cys
355 360 365
Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val Glu Gly Leu Met
370 375 380
Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Cys Asp Gln Pro Leu
385 390 395 400
Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile Glu Ile Pro Arg
405 410 415
Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val Ala Arg Ser Leu
420 425 430
Arg Ser Val Val Val Glu Asn Glu Gly Glu Ile Tyr Lys Ala Asn Ala
435 440 445
Arg Ala Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val Glu Lys Glu Tyr
450 455 460
Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala Arg Ala Val Ala
465 470 475 480
Ile Asp His Glu Ser
485
<210> 109
<211> 1419
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1419)
<400> 109
atg gcc act tct gac tcc att gtt gat gac aga aag caa ttg cac gtt 48
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
gct act ttc cca tgg ttg gcc ttt ggt cac att ttg cct ttc ttg caa 96
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Phe Leu Gln
20 25 30
ttg tct aaa ttg att gct gaa aag ggt cac aag gtt tcc ttc ttg tct 144
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
acc acc aga aac atc caa aga tta tct tcc cac atc tct cct ttg atc 192
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
aac gtt gtt caa ttg act cta cca cgt gtc caa gaa tta cca gaa gac 240
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
gct gaa gct act acc gat gtc cac cca gaa gat atc caa tac ttg aag 288
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Gln Tyr Leu Lys
85 90 95
aag gct gtc gat ggt ttg caa cca gaa gtc acc aga ttc ttg gaa caa 336
Lys Ala Val Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
cac tct cca gac tgg atc atc tac gac ttc act cac tac tgg ttg cca 384
His Ser Pro Asp Trp Ile Ile Tyr Asp Phe Thr His Tyr Trp Leu Pro
115 120 125
tcc att gct gct tct ttg ggt atc tcc aga gct tac ttc tgt gtc atc 432
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala Tyr Phe Cys Val Ile
130 135 140
act cca tgg acc att gct tac ttg gct cca tct tct gat gct atg att 480
Thr Pro Trp Thr Ile Ala Tyr Leu Ala Pro Ser Ser Asp Ala Met Ile
145 150 155 160
aac gat tct gat ggt aga acc acc gtc gaa gac ttg acc act cca cca 528
Asn Asp Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
aaa tgg ttc cca ttc cca acc aag gtc tgt tgg aga aag cac gac ttg 576
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
gcc aga atg gaa cca tac gaa gct cca ggt atc tcc gat ggt tac aga 624
Ala Arg Met Glu Pro Tyr Glu Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
atg ggt atg gtt ttc aag ggt tcc gac tgt cta tta ttc aaa tgt tac 672
Met Gly Met Val Phe Lys Gly Ser Asp Cys Leu Leu Phe Lys Cys Tyr
210 215 220
cat gaa ttt ggt act caa tgg tta cca ttg ttg gaa act ttg cac caa 720
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
gtc cca gtt gtt cca gtt ggt ttg ttg cct cca gaa atc cca ggt gat 768
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp
245 250 255
gaa aag gac gaa acc tgg gtt tcc atc aag aaa tgg ttg gac ggt aag 816
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
caa aag ggt tct gtt gtc tac gtt gct ttg ggt tct gaa gct ttg gtt 864
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Ala Leu Val
275 280 285
tct caa act gaa gtt gtc gaa ttg gct tta ggt ttg gaa ttg tcc ggt 912
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
ttg cca ttc gtc tgg gct tac aga aag cca aag ggt cca gct aag tct 960
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
gac tct gtt gaa ttg cca gat ggt ttc gtt gaa aga acc aga gac aga 1008
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
ggt ttg gtc tgg act tct tgg gct cca caa ttg aga att ttg tcc cac 1056
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
gaa tct gtc tgt ggt ttc ttg act cac tgt ggt tct ggt tcc att gtc 1104
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
gaa ggt ttg atg ttc ggt cat cca ttg atc atg ttg cca tta ttc ggt 1152
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Leu Phe Gly
370 375 380
gac caa cca ttg aac gct aga tta ttg gaa gac aaa caa gtc ggt att 1200
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
gaa att cca aga aac gaa gaa gat ggt tgt ttg acc aag gaa tcc gtt 1248
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
gct cgt tct tta cgt tct gtc gtt gtc gaa aac gaa ggt gaa atc tac 1296
Ala Arg Ser Leu Arg Ser Val Val Val Glu Asn Glu Gly Glu Ile Tyr
420 425 430
aag gcc aat gcc aga gaa ttg tcc aag atc tac aac gac acc aag gtt 1344
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
gaa aag gaa tac gtt tct caa ttt gtc gac tac ttg gaa aag aac gcc 1392
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala
450 455 460
aga gct gtt gcc atc gac cac gaa tcc 1419
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 110
<211> 473
<212> PRT
<213> Stevia rebaudiana
<400> 110
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Phe Leu Gln
20 25 30
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Gln Tyr Leu Lys
85 90 95
Lys Ala Val Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
His Ser Pro Asp Trp Ile Ile Tyr Asp Phe Thr His Tyr Trp Leu Pro
115 120 125
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala Tyr Phe Cys Val Ile
130 135 140
Thr Pro Trp Thr Ile Ala Tyr Leu Ala Pro Ser Ser Asp Ala Met Ile
145 150 155 160
Asn Asp Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
Ala Arg Met Glu Pro Tyr Glu Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
Met Gly Met Val Phe Lys Gly Ser Asp Cys Leu Leu Phe Lys Cys Tyr
210 215 220
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp
245 250 255
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Ala Leu Val
275 280 285
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Leu Phe Gly
370 375 380
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
Ala Arg Ser Leu Arg Ser Val Val Val Glu Asn Glu Gly Glu Ile Tyr
420 425 430
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala
450 455 460
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 111
<211> 1356
<212> DNA
<213> Populus trichocarpa
<220>
<221> CDS
<222> (1)..(1356)
<400> 111
atg gac gaa cct cac gaa ttg cac att gct atg ttc cca tgg ttg gcc 48
Met Asp Glu Pro His Glu Leu His Ile Ala Met Phe Pro Trp Leu Ala
1 5 10 15
ttt ggt cac atc att cca ttc ttg gaa tta gcc aag ttg att gct caa 96
Phe Gly His Ile Ile Pro Phe Leu Glu Leu Ala Lys Leu Ile Ala Gln
20 25 30
aga ggt cac aag atc tct ttc atc tcc act cca aga aac atc caa aga 144
Arg Gly His Lys Ile Ser Phe Ile Ser Thr Pro Arg Asn Ile Gln Arg
35 40 45
tta cca acc atc cca cca aac ttg acc cca aga atc aac tta gtc tct 192
Leu Pro Thr Ile Pro Pro Asn Leu Thr Pro Arg Ile Asn Leu Val Ser
50 55 60
ttg gct ttg cca cac gtt gaa aac ttg cca aac aac gct gaa gcc act 240
Leu Ala Leu Pro His Val Glu Asn Leu Pro Asn Asn Ala Glu Ala Thr
65 70 75 80
gct gac ttg cca ttc gac aag atc cca tac ttg aag att gct tac gac 288
Ala Asp Leu Pro Phe Asp Lys Ile Pro Tyr Leu Lys Ile Ala Tyr Asp
85 90 95
aga ttg caa gac tct ttg ttc cat ttc ttg cac tct tct tct cca gac 336
Arg Leu Gln Asp Ser Leu Phe His Phe Leu His Ser Ser Ser Pro Asp
100 105 110
tgg atc atc ttc gat ttt gcc tcc tac tgg tta cca gaa att gct acc 384
Trp Ile Ile Phe Asp Phe Ala Ser Tyr Trp Leu Pro Glu Ile Ala Thr
115 120 125
aaa ttg ggt atc tcc ggt gtt ttg ttc tcc atc ttc ggt gct tgg act 432
Lys Leu Gly Ile Ser Gly Val Leu Phe Ser Ile Phe Gly Ala Trp Thr
130 135 140
tta tct ttc gct ggt cca tct tac tct gcc att ttg aac ggt gat gac 480
Leu Ser Phe Ala Gly Pro Ser Tyr Ser Ala Ile Leu Asn Gly Asp Asp
145 150 155 160
cca aga acc gaa cca caa cat ttc acc gtt cca cca aaa tgg gtc act 528
Pro Arg Thr Glu Pro Gln His Phe Thr Val Pro Pro Lys Trp Val Thr
165 170 175
ttc cca tcc aag gtt gct ttc cgt atc cac gaa gct aag aga ttc ttg 576
Phe Pro Ser Lys Val Ala Phe Arg Ile His Glu Ala Lys Arg Phe Leu
180 185 190
gtt caa atc gaa gct aac tct tct ggt gtc act gac atc ttc aga tgg 624
Val Gln Ile Glu Ala Asn Ser Ser Gly Val Thr Asp Ile Phe Arg Trp
195 200 205
ggt tcc gtt ttg gct ggt tgt gat gtc att gcc gtc aga tct tgt ttg 672
Gly Ser Val Leu Ala Gly Cys Asp Val Ile Ala Val Arg Ser Cys Leu
210 215 220
gaa ttg gaa gct gac ttc ttg aga ttg gtc gaa gat ttg cac tgt aag 720
Glu Leu Glu Ala Asp Phe Leu Arg Leu Val Glu Asp Leu His Cys Lys
225 230 235 240
cca gtt atc cca gtc ggt cta tta cca cct cca gct caa tgt tct gaa 768
Pro Val Ile Pro Val Gly Leu Leu Pro Pro Pro Ala Gln Cys Ser Glu
245 250 255
ggt ggt tcc aga gaa ggt ggt gtc gac gaa aaa tgg gtt acc att tct 816
Gly Gly Ser Arg Glu Gly Gly Val Asp Glu Lys Trp Val Thr Ile Ser
260 265 270
gaa tgg ttg gac aag caa act caa ggt tcc gtt gtc tac att gct ttc 864
Glu Trp Leu Asp Lys Gln Thr Gln Gly Ser Val Val Tyr Ile Ala Phe
275 280 285
ggt tct gaa ttg acc atc aac caa aat gaa atc act gaa ttg gct ttg 912
Gly Ser Glu Leu Thr Ile Asn Gln Asn Glu Ile Thr Glu Leu Ala Leu
290 295 300
ggt tta gaa ttg tct ggt ttg cca ttc ttc tgg gct ttc aga aac aga 960
Gly Leu Glu Leu Ser Gly Leu Pro Phe Phe Trp Ala Phe Arg Asn Arg
305 310 315 320
gat gac tcc gtt aga ttg cca gat ggt ttc gat gaa aga gtc aag ggt 1008
Asp Asp Ser Val Arg Leu Pro Asp Gly Phe Asp Glu Arg Val Lys Gly
325 330 335
cgt ggt gtt gtc tgg act tcc tgg gct cct caa ttg aga atc atg gct 1056
Arg Gly Val Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Met Ala
340 345 350
cac gaa tcc gtt ggt ggt ttc ttg act cac tgt ggt tac tcc tct gtt 1104
His Glu Ser Val Gly Gly Phe Leu Thr His Cys Gly Tyr Ser Ser Val
355 360 365
atc gaa gct ttg tcc ttc ggt ttg gct tta atc atg ttg cca ttt gcc 1152
Ile Glu Ala Leu Ser Phe Gly Leu Ala Leu Ile Met Leu Pro Phe Ala
370 375 380
att gac caa ggt ttg att gcc cgt gtt ttc gaa ggt aag aag gtc ggt 1200
Ile Asp Gln Gly Leu Ile Ala Arg Val Phe Glu Gly Lys Lys Val Gly
385 390 395 400
att gaa gtc cca aga gat gaa caa gat ggt tcc ttc acc aga aac tct 1248
Ile Glu Val Pro Arg Asp Glu Gln Asp Gly Ser Phe Thr Arg Asn Ser
405 410 415
gtt gct gaa tct cta aga ttg gtc att gtt gac aag gaa ggt tct gct 1296
Val Ala Glu Ser Leu Arg Leu Val Ile Val Asp Lys Glu Gly Ser Ala
420 425 430
tac aga gaa aac gct aag caa caa atg gtt act ttg ttt ggt ttg acc 1344
Tyr Arg Glu Asn Ala Lys Gln Gln Met Val Thr Leu Phe Gly Leu Thr
435 440 445
tac acc atg acg 1356
Tyr Thr Met Thr
450
<210> 112
<211> 452
<212> PRT
<213> Populus trichocarpa
<400> 112
Met Asp Glu Pro His Glu Leu His Ile Ala Met Phe Pro Trp Leu Ala
1 5 10 15
Phe Gly His Ile Ile Pro Phe Leu Glu Leu Ala Lys Leu Ile Ala Gln
20 25 30
Arg Gly His Lys Ile Ser Phe Ile Ser Thr Pro Arg Asn Ile Gln Arg
35 40 45
Leu Pro Thr Ile Pro Pro Asn Leu Thr Pro Arg Ile Asn Leu Val Ser
50 55 60
Leu Ala Leu Pro His Val Glu Asn Leu Pro Asn Asn Ala Glu Ala Thr
65 70 75 80
Ala Asp Leu Pro Phe Asp Lys Ile Pro Tyr Leu Lys Ile Ala Tyr Asp
85 90 95
Arg Leu Gln Asp Ser Leu Phe His Phe Leu His Ser Ser Ser Pro Asp
100 105 110
Trp Ile Ile Phe Asp Phe Ala Ser Tyr Trp Leu Pro Glu Ile Ala Thr
115 120 125
Lys Leu Gly Ile Ser Gly Val Leu Phe Ser Ile Phe Gly Ala Trp Thr
130 135 140
Leu Ser Phe Ala Gly Pro Ser Tyr Ser Ala Ile Leu Asn Gly Asp Asp
145 150 155 160
Pro Arg Thr Glu Pro Gln His Phe Thr Val Pro Pro Lys Trp Val Thr
165 170 175
Phe Pro Ser Lys Val Ala Phe Arg Ile His Glu Ala Lys Arg Phe Leu
180 185 190
Val Gln Ile Glu Ala Asn Ser Ser Gly Val Thr Asp Ile Phe Arg Trp
195 200 205
Gly Ser Val Leu Ala Gly Cys Asp Val Ile Ala Val Arg Ser Cys Leu
210 215 220
Glu Leu Glu Ala Asp Phe Leu Arg Leu Val Glu Asp Leu His Cys Lys
225 230 235 240
Pro Val Ile Pro Val Gly Leu Leu Pro Pro Pro Ala Gln Cys Ser Glu
245 250 255
Gly Gly Ser Arg Glu Gly Gly Val Asp Glu Lys Trp Val Thr Ile Ser
260 265 270
Glu Trp Leu Asp Lys Gln Thr Gln Gly Ser Val Val Tyr Ile Ala Phe
275 280 285
Gly Ser Glu Leu Thr Ile Asn Gln Asn Glu Ile Thr Glu Leu Ala Leu
290 295 300
Gly Leu Glu Leu Ser Gly Leu Pro Phe Phe Trp Ala Phe Arg Asn Arg
305 310 315 320
Asp Asp Ser Val Arg Leu Pro Asp Gly Phe Asp Glu Arg Val Lys Gly
325 330 335
Arg Gly Val Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Met Ala
340 345 350
His Glu Ser Val Gly Gly Phe Leu Thr His Cys Gly Tyr Ser Ser Val
355 360 365
Ile Glu Ala Leu Ser Phe Gly Leu Ala Leu Ile Met Leu Pro Phe Ala
370 375 380
Ile Asp Gln Gly Leu Ile Ala Arg Val Phe Glu Gly Lys Lys Val Gly
385 390 395 400
Ile Glu Val Pro Arg Asp Glu Gln Asp Gly Ser Phe Thr Arg Asn Ser
405 410 415
Val Ala Glu Ser Leu Arg Leu Val Ile Val Asp Lys Glu Gly Ser Ala
420 425 430
Tyr Arg Glu Asn Ala Lys Gln Gln Met Val Thr Leu Phe Gly Leu Thr
435 440 445
Tyr Thr Met Thr
450
<210> 113
<211> 1362
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1362)
<400> 113
atg tcc ttg aaa ggt aac gac aag gaa tta cat ttg gtc atg ttc cca 48
Met Ser Leu Lys Gly Asn Asp Lys Glu Leu His Leu Val Met Phe Pro
1 5 10 15
ttc ttt gct ttc ggt cac atc act cca ttt gtc caa ttg tcc aac aag 96
Phe Phe Ala Phe Gly His Ile Thr Pro Phe Val Gln Leu Ser Asn Lys
20 25 30
atc tcc tct ttg tac cca ggt gtc aag atc act ttc ttg gct gct tct 144
Ile Ser Ser Leu Tyr Pro Gly Val Lys Ile Thr Phe Leu Ala Ala Ser
35 40 45
gct tct gtt tcc aga atc gaa acc atg ttg aac cca tct acc aac acc 192
Ala Ser Val Ser Arg Ile Glu Thr Met Leu Asn Pro Ser Thr Asn Thr
50 55 60
aag gtc att cca ttg act ttg cca aga gtc gat ggt cta cca gaa ggt 240
Lys Val Ile Pro Leu Thr Leu Pro Arg Val Asp Gly Leu Pro Glu Gly
65 70 75 80
gtt gaa aac act gct gac gct tct cca gct acc att ggt cta ttg gtt 288
Val Glu Asn Thr Ala Asp Ala Ser Pro Ala Thr Ile Gly Leu Leu Val
85 90 95
gtt gcc att gat ttg atg caa cct caa atc aag act tta ttg gct aac 336
Val Ala Ile Asp Leu Met Gln Pro Gln Ile Lys Thr Leu Leu Ala Asn
100 105 110
ttg aag cca gac ttc gtt atc ttc gac ttc gtc cac tgg tgg tta cca 384
Leu Lys Pro Asp Phe Val Ile Phe Asp Phe Val His Trp Trp Leu Pro
115 120 125
gaa att gct tct gaa ttg ggt atc aag acc atc tac ttc tcc gtt tac 432
Glu Ile Ala Ser Glu Leu Gly Ile Lys Thr Ile Tyr Phe Ser Val Tyr
130 135 140
atg gcc aac att gtc atg cca tcc act tcc aag ttg acc ggt aac aag 480
Met Ala Asn Ile Val Met Pro Ser Thr Ser Lys Leu Thr Gly Asn Lys
145 150 155 160
cca tct acc gtc gaa gat atc aag gct ttg caa caa tct tac ggt att 528
Pro Ser Thr Val Glu Asp Ile Lys Ala Leu Gln Gln Ser Tyr Gly Ile
165 170 175
cca gtt aag act ttc gaa gcc atc tct ttg atg aac gtt ttc aag tct 576
Pro Val Lys Thr Phe Glu Ala Ile Ser Leu Met Asn Val Phe Lys Ser
180 185 190
ttc cac gac tgg atg gac aaa tgt atc aac ggt tgt aac ttg atg ttg 624
Phe His Asp Trp Met Asp Lys Cys Ile Asn Gly Cys Asn Leu Met Leu
195 200 205
atc aag tct tgt cgt gaa atg gaa ggt tcc aga att gac gat gtc acc 672
Ile Lys Ser Cys Arg Glu Met Glu Gly Ser Arg Ile Asp Asp Val Thr
210 215 220
aag caa tct acc aga cca gtt ttc ttg att ggt cca gtt gtt cca gaa 720
Lys Gln Ser Thr Arg Pro Val Phe Leu Ile Gly Pro Val Val Pro Glu
225 230 235 240
cct cac tct ggt gaa ttg gac gaa acc tgg gct aac tgg ttg aac aga 768
Pro His Ser Gly Glu Leu Asp Glu Thr Trp Ala Asn Trp Leu Asn Arg
245 250 255
ttc cca gct aag tct gtc atc tac tgt tcc ttt ggt tct gaa act ttc 816
Phe Pro Ala Lys Ser Val Ile Tyr Cys Ser Phe Gly Ser Glu Thr Phe
260 265 270
ttg acc gat gac caa atc aga gaa ttg gcc ttg ggt tta gaa ttg act 864
Leu Thr Asp Asp Gln Ile Arg Glu Leu Ala Leu Gly Leu Glu Leu Thr
275 280 285
ggt ttg cca ttc ttc ttg gtc ttg aac ttc cca gcc aat gtc gac aaa 912
Gly Leu Pro Phe Phe Leu Val Leu Asn Phe Pro Ala Asn Val Asp Lys
290 295 300
tct gct gaa ttg aag aga act tta cca gat ggt ttc ttg gaa aga gtt 960
Ser Ala Glu Leu Lys Arg Thr Leu Pro Asp Gly Phe Leu Glu Arg Val
305 310 315 320
aag gac aag ggt att gtc cac tct ggt tgg gtt caa caa aga cac atc 1008
Lys Asp Lys Gly Ile Val His Ser Gly Trp Val Gln Gln Arg His Ile
325 330 335
ttg gct cat gac tct gtt ggt tgt tac gtt ttc cac gct ggt tac ggt 1056
Leu Ala His Asp Ser Val Gly Cys Tyr Val Phe His Ala Gly Tyr Gly
340 345 350
tcc gtt atc gaa ggt ttg gtc aat gac tgt caa tta gtc atg ttg cca 1104
Ser Val Ile Glu Gly Leu Val Asn Asp Cys Gln Leu Val Met Leu Pro
355 360 365
atg aag gtt gac caa ttc acc aac tcc aag gtt att gct ttg gaa ttg 1152
Met Lys Val Asp Gln Phe Thr Asn Ser Lys Val Ile Ala Leu Glu Leu
370 375 380
aag gct ggt gtt gaa gtt aac aga aga gat gaa gat ggt tac ttc ggt 1200
Lys Ala Gly Val Glu Val Asn Arg Arg Asp Glu Asp Gly Tyr Phe Gly
385 390 395 400
aag gac gat gtc ttc gaa gct gtc gaa tct gtc atg atg gac act gaa 1248
Lys Asp Asp Val Phe Glu Ala Val Glu Ser Val Met Met Asp Thr Glu
405 410 415
aac gaa cca gcc aag tcc atc aga gaa aac cac cgt aaa ttg aag gaa 1296
Asn Glu Pro Ala Lys Ser Ile Arg Glu Asn His Arg Lys Leu Lys Glu
420 425 430
ttt ttg caa aac gat gaa atc caa aag aaa tac att gct gat ttc gtt 1344
Phe Leu Gln Asn Asp Glu Ile Gln Lys Lys Tyr Ile Ala Asp Phe Val
435 440 445
gaa aac ttg aaa gcg tta 1362
Glu Asn Leu Lys Ala Leu
450
<210> 114
<211> 454
<212> PRT
<213> Stevia rebaudiana
<400> 114
Met Ser Leu Lys Gly Asn Asp Lys Glu Leu His Leu Val Met Phe Pro
1 5 10 15
Phe Phe Ala Phe Gly His Ile Thr Pro Phe Val Gln Leu Ser Asn Lys
20 25 30
Ile Ser Ser Leu Tyr Pro Gly Val Lys Ile Thr Phe Leu Ala Ala Ser
35 40 45
Ala Ser Val Ser Arg Ile Glu Thr Met Leu Asn Pro Ser Thr Asn Thr
50 55 60
Lys Val Ile Pro Leu Thr Leu Pro Arg Val Asp Gly Leu Pro Glu Gly
65 70 75 80
Val Glu Asn Thr Ala Asp Ala Ser Pro Ala Thr Ile Gly Leu Leu Val
85 90 95
Val Ala Ile Asp Leu Met Gln Pro Gln Ile Lys Thr Leu Leu Ala Asn
100 105 110
Leu Lys Pro Asp Phe Val Ile Phe Asp Phe Val His Trp Trp Leu Pro
115 120 125
Glu Ile Ala Ser Glu Leu Gly Ile Lys Thr Ile Tyr Phe Ser Val Tyr
130 135 140
Met Ala Asn Ile Val Met Pro Ser Thr Ser Lys Leu Thr Gly Asn Lys
145 150 155 160
Pro Ser Thr Val Glu Asp Ile Lys Ala Leu Gln Gln Ser Tyr Gly Ile
165 170 175
Pro Val Lys Thr Phe Glu Ala Ile Ser Leu Met Asn Val Phe Lys Ser
180 185 190
Phe His Asp Trp Met Asp Lys Cys Ile Asn Gly Cys Asn Leu Met Leu
195 200 205
Ile Lys Ser Cys Arg Glu Met Glu Gly Ser Arg Ile Asp Asp Val Thr
210 215 220
Lys Gln Ser Thr Arg Pro Val Phe Leu Ile Gly Pro Val Val Pro Glu
225 230 235 240
Pro His Ser Gly Glu Leu Asp Glu Thr Trp Ala Asn Trp Leu Asn Arg
245 250 255
Phe Pro Ala Lys Ser Val Ile Tyr Cys Ser Phe Gly Ser Glu Thr Phe
260 265 270
Leu Thr Asp Asp Gln Ile Arg Glu Leu Ala Leu Gly Leu Glu Leu Thr
275 280 285
Gly Leu Pro Phe Phe Leu Val Leu Asn Phe Pro Ala Asn Val Asp Lys
290 295 300
Ser Ala Glu Leu Lys Arg Thr Leu Pro Asp Gly Phe Leu Glu Arg Val
305 310 315 320
Lys Asp Lys Gly Ile Val His Ser Gly Trp Val Gln Gln Arg His Ile
325 330 335
Leu Ala His Asp Ser Val Gly Cys Tyr Val Phe His Ala Gly Tyr Gly
340 345 350
Ser Val Ile Glu Gly Leu Val Asn Asp Cys Gln Leu Val Met Leu Pro
355 360 365
Met Lys Val Asp Gln Phe Thr Asn Ser Lys Val Ile Ala Leu Glu Leu
370 375 380
Lys Ala Gly Val Glu Val Asn Arg Arg Asp Glu Asp Gly Tyr Phe Gly
385 390 395 400
Lys Asp Asp Val Phe Glu Ala Val Glu Ser Val Met Met Asp Thr Glu
405 410 415
Asn Glu Pro Ala Lys Ser Ile Arg Glu Asn His Arg Lys Leu Lys Glu
420 425 430
Phe Leu Gln Asn Asp Glu Ile Gln Lys Lys Tyr Ile Ala Asp Phe Val
435 440 445
Glu Asn Leu Lys Ala Leu
450
<210> 115
<211> 1362
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1362)
<400> 115
atg tcc ttg aaa ggt aac gac aag gaa tta cat ttg gtc atg ttc cca 48
Met Ser Leu Lys Gly Asn Asp Lys Glu Leu His Leu Val Met Phe Pro
1 5 10 15
ttc ttt gct ttc ggt cac atc act cca ttt gtc caa ttg tcc aac aag 96
Phe Phe Ala Phe Gly His Ile Thr Pro Phe Val Gln Leu Ser Asn Lys
20 25 30
atc tcc tct ttg tac cca ggt gtc aag atc act ttc ttg gct gct tct 144
Ile Ser Ser Leu Tyr Pro Gly Val Lys Ile Thr Phe Leu Ala Ala Ser
35 40 45
gct tct gtt tcc aga atc gaa acc atg ttg aac cca tct acc aac acc 192
Ala Ser Val Ser Arg Ile Glu Thr Met Leu Asn Pro Ser Thr Asn Thr
50 55 60
aag gtc att cca ttg act ttg cca aga gtc gat ggt cta cca gaa ggt 240
Lys Val Ile Pro Leu Thr Leu Pro Arg Val Asp Gly Leu Pro Glu Gly
65 70 75 80
gtt gaa aac act gct gac gct tct cca gct acc att ggt cta ttg gtt 288
Val Glu Asn Thr Ala Asp Ala Ser Pro Ala Thr Ile Gly Leu Leu Val
85 90 95
gtt gcc att gat ttg atg caa cct caa atc aag act tta ttg gct aac 336
Val Ala Ile Asp Leu Met Gln Pro Gln Ile Lys Thr Leu Leu Ala Asn
100 105 110
ttg aag cca gac ttc gtt atc ttc gac ttc gtc cac tgg tgg tta cca 384
Leu Lys Pro Asp Phe Val Ile Phe Asp Phe Val His Trp Trp Leu Pro
115 120 125
gaa att gct tct gaa ttg ggt atc aag acc atc tac ttc tcc gtt tac 432
Glu Ile Ala Ser Glu Leu Gly Ile Lys Thr Ile Tyr Phe Ser Val Tyr
130 135 140
atg gcc aac att gtc atg cca tcc act tcc aag ttg acc ggt aac aag 480
Met Ala Asn Ile Val Met Pro Ser Thr Ser Lys Leu Thr Gly Asn Lys
145 150 155 160
cca tct acc gtc gaa gat atc aag gct ttg caa caa tct gac ggt att 528
Pro Ser Thr Val Glu Asp Ile Lys Ala Leu Gln Gln Ser Asp Gly Ile
165 170 175
cca gtt aag act ttc gaa gcc atc tct ttg atg aac gtt ttc aag tct 576
Pro Val Lys Thr Phe Glu Ala Ile Ser Leu Met Asn Val Phe Lys Ser
180 185 190
ttc cac gac tgg atg gac aaa tgt atc aac ggt tgt aac ttg atg ttg 624
Phe His Asp Trp Met Asp Lys Cys Ile Asn Gly Cys Asn Leu Met Leu
195 200 205
atc aag tct tgt cgt gaa atg gaa ggt tcc aga att gac gat gtc acc 672
Ile Lys Ser Cys Arg Glu Met Glu Gly Ser Arg Ile Asp Asp Val Thr
210 215 220
aag caa tct acc aga cca gtt ttc ttg att ggt cca gtt gtt cca gaa 720
Lys Gln Ser Thr Arg Pro Val Phe Leu Ile Gly Pro Val Val Pro Glu
225 230 235 240
cct cac tct ggt gaa ttg gac gaa acc tgg gct aac tgg ttg aac aga 768
Pro His Ser Gly Glu Leu Asp Glu Thr Trp Ala Asn Trp Leu Asn Arg
245 250 255
ttc cca gct aag tct gtc atc tac tgt tcc ttt ggt tct gaa act ttc 816
Phe Pro Ala Lys Ser Val Ile Tyr Cys Ser Phe Gly Ser Glu Thr Phe
260 265 270
ttg acc gat gac caa atc aga gaa ttg gcc ttg ggt tta gaa ttg act 864
Leu Thr Asp Asp Gln Ile Arg Glu Leu Ala Leu Gly Leu Glu Leu Thr
275 280 285
ggt ttg cca ttc ttc ttg gtc ttg aac ttc cca gcc aat gtc gac aaa 912
Gly Leu Pro Phe Phe Leu Val Leu Asn Phe Pro Ala Asn Val Asp Lys
290 295 300
tct gct gaa ttg aag aga act tta cca gat ggt ttc ttg gaa aga gtt 960
Ser Ala Glu Leu Lys Arg Thr Leu Pro Asp Gly Phe Leu Glu Arg Val
305 310 315 320
aag gac aag ggt att gtc cac tct ggt tgg gtt caa caa aga cac atc 1008
Lys Asp Lys Gly Ile Val His Ser Gly Trp Val Gln Gln Arg His Ile
325 330 335
ttg gct cat gac tct gtt ggt tgt tac gtt ttc cac gct ggt tac ggt 1056
Leu Ala His Asp Ser Val Gly Cys Tyr Val Phe His Ala Gly Tyr Gly
340 345 350
tcc gtt atc gaa ggt ttg gtc aat gac tgt caa tta gtc atg ttg cca 1104
Ser Val Ile Glu Gly Leu Val Asn Asp Cys Gln Leu Val Met Leu Pro
355 360 365
atg aag gtt gac caa ttc acc aac tcc aag gtt att gct ttg gaa ttg 1152
Met Lys Val Asp Gln Phe Thr Asn Ser Lys Val Ile Ala Leu Glu Leu
370 375 380
aag gct ggt gtt gaa gtt aac aga aga gat gaa gat ggt tac ttc ggt 1200
Lys Ala Gly Val Glu Val Asn Arg Arg Asp Glu Asp Gly Tyr Phe Gly
385 390 395 400
aag gac gat gtc ttc gaa gct gtc gaa tct gtc atg atg gac act gaa 1248
Lys Asp Asp Val Phe Glu Ala Val Glu Ser Val Met Met Asp Thr Glu
405 410 415
aac gaa cca gcc aag tcc atc aga gaa aac cac cgt aaa ttg aag gaa 1296
Asn Glu Pro Ala Lys Ser Ile Arg Glu Asn His Arg Lys Leu Lys Glu
420 425 430
ttt ttg caa aac gat gaa atc caa aag aaa tac att gct gat ttc gtt 1344
Phe Leu Gln Asn Asp Glu Ile Gln Lys Lys Tyr Ile Ala Asp Phe Val
435 440 445
gaa aac ttg aaa gcg tta 1362
Glu Asn Leu Lys Ala Leu
450
<210> 116
<211> 454
<212> PRT
<213> Stevia rebaudiana
<400> 116
Met Ser Leu Lys Gly Asn Asp Lys Glu Leu His Leu Val Met Phe Pro
1 5 10 15
Phe Phe Ala Phe Gly His Ile Thr Pro Phe Val Gln Leu Ser Asn Lys
20 25 30
Ile Ser Ser Leu Tyr Pro Gly Val Lys Ile Thr Phe Leu Ala Ala Ser
35 40 45
Ala Ser Val Ser Arg Ile Glu Thr Met Leu Asn Pro Ser Thr Asn Thr
50 55 60
Lys Val Ile Pro Leu Thr Leu Pro Arg Val Asp Gly Leu Pro Glu Gly
65 70 75 80
Val Glu Asn Thr Ala Asp Ala Ser Pro Ala Thr Ile Gly Leu Leu Val
85 90 95
Val Ala Ile Asp Leu Met Gln Pro Gln Ile Lys Thr Leu Leu Ala Asn
100 105 110
Leu Lys Pro Asp Phe Val Ile Phe Asp Phe Val His Trp Trp Leu Pro
115 120 125
Glu Ile Ala Ser Glu Leu Gly Ile Lys Thr Ile Tyr Phe Ser Val Tyr
130 135 140
Met Ala Asn Ile Val Met Pro Ser Thr Ser Lys Leu Thr Gly Asn Lys
145 150 155 160
Pro Ser Thr Val Glu Asp Ile Lys Ala Leu Gln Gln Ser Asp Gly Ile
165 170 175
Pro Val Lys Thr Phe Glu Ala Ile Ser Leu Met Asn Val Phe Lys Ser
180 185 190
Phe His Asp Trp Met Asp Lys Cys Ile Asn Gly Cys Asn Leu Met Leu
195 200 205
Ile Lys Ser Cys Arg Glu Met Glu Gly Ser Arg Ile Asp Asp Val Thr
210 215 220
Lys Gln Ser Thr Arg Pro Val Phe Leu Ile Gly Pro Val Val Pro Glu
225 230 235 240
Pro His Ser Gly Glu Leu Asp Glu Thr Trp Ala Asn Trp Leu Asn Arg
245 250 255
Phe Pro Ala Lys Ser Val Ile Tyr Cys Ser Phe Gly Ser Glu Thr Phe
260 265 270
Leu Thr Asp Asp Gln Ile Arg Glu Leu Ala Leu Gly Leu Glu Leu Thr
275 280 285
Gly Leu Pro Phe Phe Leu Val Leu Asn Phe Pro Ala Asn Val Asp Lys
290 295 300
Ser Ala Glu Leu Lys Arg Thr Leu Pro Asp Gly Phe Leu Glu Arg Val
305 310 315 320
Lys Asp Lys Gly Ile Val His Ser Gly Trp Val Gln Gln Arg His Ile
325 330 335
Leu Ala His Asp Ser Val Gly Cys Tyr Val Phe His Ala Gly Tyr Gly
340 345 350
Ser Val Ile Glu Gly Leu Val Asn Asp Cys Gln Leu Val Met Leu Pro
355 360 365
Met Lys Val Asp Gln Phe Thr Asn Ser Lys Val Ile Ala Leu Glu Leu
370 375 380
Lys Ala Gly Val Glu Val Asn Arg Arg Asp Glu Asp Gly Tyr Phe Gly
385 390 395 400
Lys Asp Asp Val Phe Glu Ala Val Glu Ser Val Met Met Asp Thr Glu
405 410 415
Asn Glu Pro Ala Lys Ser Ile Arg Glu Asn His Arg Lys Leu Lys Glu
420 425 430
Phe Leu Gln Asn Asp Glu Ile Gln Lys Lys Tyr Ile Ala Asp Phe Val
435 440 445
Glu Asn Leu Lys Ala Leu
450
<210> 117
<211> 1449
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1449)
<400> 117
atg gac caa atg gcc aag att gac gaa aag aag cct cac gtt gtt ttc 48
Met Asp Gln Met Ala Lys Ile Asp Glu Lys Lys Pro His Val Val Phe
1 5 10 15
att cca ttc cca gct caa tct cac atc aag tgt atg ttg aaa ttg gcc 96
Ile Pro Phe Pro Ala Gln Ser His Ile Lys Cys Met Leu Lys Leu Ala
20 25 30
aga att ttg cac caa aag ggt ttg tac atc act ttc atc aac act gac 144
Arg Ile Leu His Gln Lys Gly Leu Tyr Ile Thr Phe Ile Asn Thr Asp
35 40 45
acc aac cac gaa aga ttg gtt gcc tcc ggt ggt act caa tgg ttg gaa 192
Thr Asn His Glu Arg Leu Val Ala Ser Gly Gly Thr Gln Trp Leu Glu
50 55 60
aat gct cca ggt ttc tgg ttc aag acc gtc cca gat ggt ttc ggt tct 240
Asn Ala Pro Gly Phe Trp Phe Lys Thr Val Pro Asp Gly Phe Gly Ser
65 70 75 80
gcc aag gat gac ggt gtc aag cca act gat gct ttg aga gaa ttg atg 288
Ala Lys Asp Asp Gly Val Lys Pro Thr Asp Ala Leu Arg Glu Leu Met
85 90 95
gac tac ttg aag acc aac ttc ttc gat ttg ttc ttg gac ttg gtc ttg 336
Asp Tyr Leu Lys Thr Asn Phe Phe Asp Leu Phe Leu Asp Leu Val Leu
100 105 110
aag ttg gaa gtt cca gct acc tgt atc atc tgt gat ggt tgt atg acc 384
Lys Leu Glu Val Pro Ala Thr Cys Ile Ile Cys Asp Gly Cys Met Thr
115 120 125
ttt gct aac acc att aga gct gct gaa aag ttg aac att cca gtc atc 432
Phe Ala Asn Thr Ile Arg Ala Ala Glu Lys Leu Asn Ile Pro Val Ile
130 135 140
tta ttc tgg acc atg gct gct tgt ggt ttc atg gct ttc tac caa gct 480
Leu Phe Trp Thr Met Ala Ala Cys Gly Phe Met Ala Phe Tyr Gln Ala
145 150 155 160
aag gtc tta aag gaa aag gaa att gtt cca gtc aag gac gaa acc tac 528
Lys Val Leu Lys Glu Lys Glu Ile Val Pro Val Lys Asp Glu Thr Tyr
165 170 175
ttg acc aat ggt tac ttg gac atg gaa att gac tgg atc cca ggt atg 576
Leu Thr Asn Gly Tyr Leu Asp Met Glu Ile Asp Trp Ile Pro Gly Met
180 185 190
aag aga atc aga tta cgt gac ttg cct gaa ttc atc ttg gcc acc aag 624
Lys Arg Ile Arg Leu Arg Asp Leu Pro Glu Phe Ile Leu Ala Thr Lys
195 200 205
caa aac tac ttt gct ttc gaa ttc ttg ttt gaa act gct caa tta gct 672
Gln Asn Tyr Phe Ala Phe Glu Phe Leu Phe Glu Thr Ala Gln Leu Ala
210 215 220
gac aag gtt tct cac atg atc att cac act ttc gaa gaa ttg gaa gct 720
Asp Lys Val Ser His Met Ile Ile His Thr Phe Glu Glu Leu Glu Ala
225 230 235 240
tct tta gtc tct gaa atc aaa tct att ttc cca aac gtt tac act atc 768
Ser Leu Val Ser Glu Ile Lys Ser Ile Phe Pro Asn Val Tyr Thr Ile
245 250 255
ggt cca tta caa ttg ttg ttg aac aag atc act caa aag gaa acc aac 816
Gly Pro Leu Gln Leu Leu Leu Asn Lys Ile Thr Gln Lys Glu Thr Asn
260 265 270
aac gat tct tac tct tta tgg aag gaa gaa cca gaa tgt gtc gaa tgg 864
Asn Asp Ser Tyr Ser Leu Trp Lys Glu Glu Pro Glu Cys Val Glu Trp
275 280 285
ttg aac tcc aag gaa cca aac tcc gtt gtt tac gtc aac ttt ggt tct 912
Leu Asn Ser Lys Glu Pro Asn Ser Val Val Tyr Val Asn Phe Gly Ser
290 295 300
ttg gct gtc atg tct ttg caa gat ttg gtc gaa ttc ggt tgg ggt ttg 960
Leu Ala Val Met Ser Leu Gln Asp Leu Val Glu Phe Gly Trp Gly Leu
305 310 315 320
gtt aac tcc aac cat tac ttc tta tgg atc atc aga gct aac ttg att 1008
Val Asn Ser Asn His Tyr Phe Leu Trp Ile Ile Arg Ala Asn Leu Ile
325 330 335
gac ggt aag cca gct gtc atg cca caa gaa ttg aag gaa gcc atg aac 1056
Asp Gly Lys Pro Ala Val Met Pro Gln Glu Leu Lys Glu Ala Met Asn
340 345 350
gaa aaa ggt ttc gtt ggt tct tgg tgt tcc caa gaa gaa gtt ttg aac 1104
Glu Lys Gly Phe Val Gly Ser Trp Cys Ser Gln Glu Glu Val Leu Asn
355 360 365
cat cca gcc gtt ggt ggt ttc ttg act cac tgt ggc tgg ggt tcc atc 1152
His Pro Ala Val Gly Gly Phe Leu Thr His Cys Gly Trp Gly Ser Ile
370 375 380
att gaa tct cta tcc gct ggt gtt cca atg ttg ggt tgg cca tcc att 1200
Ile Glu Ser Leu Ser Ala Gly Val Pro Met Leu Gly Trp Pro Ser Ile
385 390 395 400
ggt gac caa aga gcc aac tgt cgt caa atg tgt aag gaa tgg gaa gtt 1248
Gly Asp Gln Arg Ala Asn Cys Arg Gln Met Cys Lys Glu Trp Glu Val
405 410 415
ggt atg gaa atc ggt aag aac gtt aag aga gat gaa gtc gaa aaa tta 1296
Gly Met Glu Ile Gly Lys Asn Val Lys Arg Asp Glu Val Glu Lys Leu
420 425 430
gtc aga atg ttg atg gaa ggt ttg gaa ggt gaa aga atg aga aag aag 1344
Val Arg Met Leu Met Glu Gly Leu Glu Gly Glu Arg Met Arg Lys Lys
435 440 445
gct ttg gaa tgg aag aaa tct gct act ttg gct act tgt tgt aac ggt 1392
Ala Leu Glu Trp Lys Lys Ser Ala Thr Leu Ala Thr Cys Cys Asn Gly
450 455 460
tcc tct tct ttg gat gtt gaa aaa ttg gct aac gaa atc aag aaa ttg 1440
Ser Ser Ser Leu Asp Val Glu Lys Leu Ala Asn Glu Ile Lys Lys Leu
465 470 475 480
tcc agg aac 1449
Ser Arg Asn
<210> 118
<211> 483
<212> PRT
<213> Stevia rebaudiana
<400> 118
Met Asp Gln Met Ala Lys Ile Asp Glu Lys Lys Pro His Val Val Phe
1 5 10 15
Ile Pro Phe Pro Ala Gln Ser His Ile Lys Cys Met Leu Lys Leu Ala
20 25 30
Arg Ile Leu His Gln Lys Gly Leu Tyr Ile Thr Phe Ile Asn Thr Asp
35 40 45
Thr Asn His Glu Arg Leu Val Ala Ser Gly Gly Thr Gln Trp Leu Glu
50 55 60
Asn Ala Pro Gly Phe Trp Phe Lys Thr Val Pro Asp Gly Phe Gly Ser
65 70 75 80
Ala Lys Asp Asp Gly Val Lys Pro Thr Asp Ala Leu Arg Glu Leu Met
85 90 95
Asp Tyr Leu Lys Thr Asn Phe Phe Asp Leu Phe Leu Asp Leu Val Leu
100 105 110
Lys Leu Glu Val Pro Ala Thr Cys Ile Ile Cys Asp Gly Cys Met Thr
115 120 125
Phe Ala Asn Thr Ile Arg Ala Ala Glu Lys Leu Asn Ile Pro Val Ile
130 135 140
Leu Phe Trp Thr Met Ala Ala Cys Gly Phe Met Ala Phe Tyr Gln Ala
145 150 155 160
Lys Val Leu Lys Glu Lys Glu Ile Val Pro Val Lys Asp Glu Thr Tyr
165 170 175
Leu Thr Asn Gly Tyr Leu Asp Met Glu Ile Asp Trp Ile Pro Gly Met
180 185 190
Lys Arg Ile Arg Leu Arg Asp Leu Pro Glu Phe Ile Leu Ala Thr Lys
195 200 205
Gln Asn Tyr Phe Ala Phe Glu Phe Leu Phe Glu Thr Ala Gln Leu Ala
210 215 220
Asp Lys Val Ser His Met Ile Ile His Thr Phe Glu Glu Leu Glu Ala
225 230 235 240
Ser Leu Val Ser Glu Ile Lys Ser Ile Phe Pro Asn Val Tyr Thr Ile
245 250 255
Gly Pro Leu Gln Leu Leu Leu Asn Lys Ile Thr Gln Lys Glu Thr Asn
260 265 270
Asn Asp Ser Tyr Ser Leu Trp Lys Glu Glu Pro Glu Cys Val Glu Trp
275 280 285
Leu Asn Ser Lys Glu Pro Asn Ser Val Val Tyr Val Asn Phe Gly Ser
290 295 300
Leu Ala Val Met Ser Leu Gln Asp Leu Val Glu Phe Gly Trp Gly Leu
305 310 315 320
Val Asn Ser Asn His Tyr Phe Leu Trp Ile Ile Arg Ala Asn Leu Ile
325 330 335
Asp Gly Lys Pro Ala Val Met Pro Gln Glu Leu Lys Glu Ala Met Asn
340 345 350
Glu Lys Gly Phe Val Gly Ser Trp Cys Ser Gln Glu Glu Val Leu Asn
355 360 365
His Pro Ala Val Gly Gly Phe Leu Thr His Cys Gly Trp Gly Ser Ile
370 375 380
Ile Glu Ser Leu Ser Ala Gly Val Pro Met Leu Gly Trp Pro Ser Ile
385 390 395 400
Gly Asp Gln Arg Ala Asn Cys Arg Gln Met Cys Lys Glu Trp Glu Val
405 410 415
Gly Met Glu Ile Gly Lys Asn Val Lys Arg Asp Glu Val Glu Lys Leu
420 425 430
Val Arg Met Leu Met Glu Gly Leu Glu Gly Glu Arg Met Arg Lys Lys
435 440 445
Ala Leu Glu Trp Lys Lys Ser Ala Thr Leu Ala Thr Cys Cys Asn Gly
450 455 460
Ser Ser Ser Leu Asp Val Glu Lys Leu Ala Asn Glu Ile Lys Lys Leu
465 470 475 480
Ser Arg Asn
<210> 119
<211> 1404
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1404)
<400> 119
atg cca atc tcc gat atc aat gct ggt tcc cac att ttg gtt ttc cca 48
Met Pro Ile Ser Asp Ile Asn Ala Gly Ser His Ile Leu Val Phe Pro
1 5 10 15
tac cca gct caa ggt cac atg ttg act cta tta gat ttg acc cac caa 96
Tyr Pro Ala Gln Gly His Met Leu Thr Leu Leu Asp Leu Thr His Gln
20 25 30
ttg gcc atc aga aac ttg act atc act atc tta gtc acc cca aag aac 144
Leu Ala Ile Arg Asn Leu Thr Ile Thr Ile Leu Val Thr Pro Lys Asn
35 40 45
ttg cca acc att tct cca ttg ttg gct gct cat cca acc act gtt tct 192
Leu Pro Thr Ile Ser Pro Leu Leu Ala Ala His Pro Thr Thr Val Ser
50 55 60
gct ttg ttg ttg cca tta cca cct cac cca gcc atc cca tct ggt att 240
Ala Leu Leu Leu Pro Leu Pro Pro His Pro Ala Ile Pro Ser Gly Ile
65 70 75 80
gaa aac gtc aag gac ttg cca aac gat gct ttc aag gct atg atg gtt 288
Glu Asn Val Lys Asp Leu Pro Asn Asp Ala Phe Lys Ala Met Met Val
85 90 95
gct tta ggt gac ttg tac aac cca ttg aga gac tgg ttc aga aac caa 336
Ala Leu Gly Asp Leu Tyr Asn Pro Leu Arg Asp Trp Phe Arg Asn Gln
100 105 110
cct aac cca cca gtt gcc atc att tct gac ttc ttc ttg ggt tgg act 384
Pro Asn Pro Pro Val Ala Ile Ile Ser Asp Phe Phe Leu Gly Trp Thr
115 120 125
cac cac ttg gct gtc gaa tta ggt atc aga aga tac act ttc tct cca 432
His His Leu Ala Val Glu Leu Gly Ile Arg Arg Tyr Thr Phe Ser Pro
130 135 140
tct ggt gct ttg gct ttg tcc gtc att ttc tct tta tgg aga tac caa 480
Ser Gly Ala Leu Ala Leu Ser Val Ile Phe Ser Leu Trp Arg Tyr Gln
145 150 155 160
cca aag aga atc gat gtc gaa aac gaa aag gaa gct atc aaa ttc cca 528
Pro Lys Arg Ile Asp Val Glu Asn Glu Lys Glu Ala Ile Lys Phe Pro
165 170 175
aag att cca aac tct cca gaa tac cca tgg tgg caa tta tct cca atc 576
Lys Ile Pro Asn Ser Pro Glu Tyr Pro Trp Trp Gln Leu Ser Pro Ile
180 185 190
tac aga tct tac gtt gaa ggt gac cca gat tct gaa ttc atc aag gat 624
Tyr Arg Ser Tyr Val Glu Gly Asp Pro Asp Ser Glu Phe Ile Lys Asp
195 200 205
ggt ttc ttg gct gat att gct tcc tgg ggt att gtc atc aac tct ttc 672
Gly Phe Leu Ala Asp Ile Ala Ser Trp Gly Ile Val Ile Asn Ser Phe
210 215 220
acc gaa ttg gaa caa gtc tac gtt gac cat ttg aag cat gaa ttg ggt 720
Thr Glu Leu Glu Gln Val Tyr Val Asp His Leu Lys His Glu Leu Gly
225 230 235 240
cac gac caa gtc ttt gct gtc ggt cca tta ttg cct cca ggt gac aag 768
His Asp Gln Val Phe Ala Val Gly Pro Leu Leu Pro Pro Gly Asp Lys
245 250 255
act tct ggt aga ggt ggt tct tct tcc aac gat gtc ttg tcc tgg ttg 816
Thr Ser Gly Arg Gly Gly Ser Ser Ser Asn Asp Val Leu Ser Trp Leu
260 265 270
gac acc tgt gct gac aga acc gtt gtc tac gtt tgt ttc ggt tct caa 864
Asp Thr Cys Ala Asp Arg Thr Val Val Tyr Val Cys Phe Gly Ser Gln
275 280 285
atg gtt ttg acc aac ggt caa atg gaa gtt gtt gct ttg ggt ttg gaa 912
Met Val Leu Thr Asn Gly Gln Met Glu Val Val Ala Leu Gly Leu Glu
290 295 300
aag tcc cgt gtc aaa ttt gtc tgg tcc gtc aag gaa cca act gtt ggt 960
Lys Ser Arg Val Lys Phe Val Trp Ser Val Lys Glu Pro Thr Val Gly
305 310 315 320
cac gaa gct gct aac tac ggt cgt gtt cca cca ggt ttc gaa gac aga 1008
His Glu Ala Ala Asn Tyr Gly Arg Val Pro Pro Gly Phe Glu Asp Arg
325 330 335
gtt tcc ggt cgt ggt ttg gtt atc aga ggt tgg gtt cca caa gtt gcc 1056
Val Ser Gly Arg Gly Leu Val Ile Arg Gly Trp Val Pro Gln Val Ala
340 345 350
att ttg tct cac gac tcc gtt ggt gtt ttc ttg acc cac tgt ggt tgg 1104
Ile Leu Ser His Asp Ser Val Gly Val Phe Leu Thr His Cys Gly Trp
355 360 365
aac tct gtc atg gaa gct gtt gct gct gaa gtt ttg atg ttg acc tgg 1152
Asn Ser Val Met Glu Ala Val Ala Ala Glu Val Leu Met Leu Thr Trp
370 375 380
cca atg tcc gct gac caa ttc tcc aat gcc act ttg ttg cac gaa ttg 1200
Pro Met Ser Ala Asp Gln Phe Ser Asn Ala Thr Leu Leu His Glu Leu
385 390 395 400
aag gtt ggt atc aag gtt tgt gaa ggt tcc aac att gtc cca aac tct 1248
Lys Val Gly Ile Lys Val Cys Glu Gly Ser Asn Ile Val Pro Asn Ser
405 410 415
gac gaa ttg gct gaa ttg ttc tcc aaa tct cta tcc gat gaa act aga 1296
Asp Glu Leu Ala Glu Leu Phe Ser Lys Ser Leu Ser Asp Glu Thr Arg
420 425 430
ttg gaa aga aag aga gtc aag gaa ttt gcc aaa tct gcc aag gaa gcc 1344
Leu Glu Arg Lys Arg Val Lys Glu Phe Ala Lys Ser Ala Lys Glu Ala
435 440 445
gtc ggt cca aag ggt tcc tct gtc ggt gaa tta gaa aga ttg gtt gac 1392
Val Gly Pro Lys Gly Ser Ser Val Gly Glu Leu Glu Arg Leu Val Asp
450 455 460
aac ttg tct ttg 1404
Asn Leu Ser Leu
465
<210> 120
<211> 468
<212> PRT
<213> Stevia rebaudiana
<400> 120
Met Pro Ile Ser Asp Ile Asn Ala Gly Ser His Ile Leu Val Phe Pro
1 5 10 15
Tyr Pro Ala Gln Gly His Met Leu Thr Leu Leu Asp Leu Thr His Gln
20 25 30
Leu Ala Ile Arg Asn Leu Thr Ile Thr Ile Leu Val Thr Pro Lys Asn
35 40 45
Leu Pro Thr Ile Ser Pro Leu Leu Ala Ala His Pro Thr Thr Val Ser
50 55 60
Ala Leu Leu Leu Pro Leu Pro Pro His Pro Ala Ile Pro Ser Gly Ile
65 70 75 80
Glu Asn Val Lys Asp Leu Pro Asn Asp Ala Phe Lys Ala Met Met Val
85 90 95
Ala Leu Gly Asp Leu Tyr Asn Pro Leu Arg Asp Trp Phe Arg Asn Gln
100 105 110
Pro Asn Pro Pro Val Ala Ile Ile Ser Asp Phe Phe Leu Gly Trp Thr
115 120 125
His His Leu Ala Val Glu Leu Gly Ile Arg Arg Tyr Thr Phe Ser Pro
130 135 140
Ser Gly Ala Leu Ala Leu Ser Val Ile Phe Ser Leu Trp Arg Tyr Gln
145 150 155 160
Pro Lys Arg Ile Asp Val Glu Asn Glu Lys Glu Ala Ile Lys Phe Pro
165 170 175
Lys Ile Pro Asn Ser Pro Glu Tyr Pro Trp Trp Gln Leu Ser Pro Ile
180 185 190
Tyr Arg Ser Tyr Val Glu Gly Asp Pro Asp Ser Glu Phe Ile Lys Asp
195 200 205
Gly Phe Leu Ala Asp Ile Ala Ser Trp Gly Ile Val Ile Asn Ser Phe
210 215 220
Thr Glu Leu Glu Gln Val Tyr Val Asp His Leu Lys His Glu Leu Gly
225 230 235 240
His Asp Gln Val Phe Ala Val Gly Pro Leu Leu Pro Pro Gly Asp Lys
245 250 255
Thr Ser Gly Arg Gly Gly Ser Ser Ser Asn Asp Val Leu Ser Trp Leu
260 265 270
Asp Thr Cys Ala Asp Arg Thr Val Val Tyr Val Cys Phe Gly Ser Gln
275 280 285
Met Val Leu Thr Asn Gly Gln Met Glu Val Val Ala Leu Gly Leu Glu
290 295 300
Lys Ser Arg Val Lys Phe Val Trp Ser Val Lys Glu Pro Thr Val Gly
305 310 315 320
His Glu Ala Ala Asn Tyr Gly Arg Val Pro Pro Gly Phe Glu Asp Arg
325 330 335
Val Ser Gly Arg Gly Leu Val Ile Arg Gly Trp Val Pro Gln Val Ala
340 345 350
Ile Leu Ser His Asp Ser Val Gly Val Phe Leu Thr His Cys Gly Trp
355 360 365
Asn Ser Val Met Glu Ala Val Ala Ala Glu Val Leu Met Leu Thr Trp
370 375 380
Pro Met Ser Ala Asp Gln Phe Ser Asn Ala Thr Leu Leu His Glu Leu
385 390 395 400
Lys Val Gly Ile Lys Val Cys Glu Gly Ser Asn Ile Val Pro Asn Ser
405 410 415
Asp Glu Leu Ala Glu Leu Phe Ser Lys Ser Leu Ser Asp Glu Thr Arg
420 425 430
Leu Glu Arg Lys Arg Val Lys Glu Phe Ala Lys Ser Ala Lys Glu Ala
435 440 445
Val Gly Pro Lys Gly Ser Ser Val Gly Glu Leu Glu Arg Leu Val Asp
450 455 460
Asn Leu Ser Leu
465
<210> 121
<211> 1383
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1383)
<400> 121
atg gaa tct tcc aag gtc att ttg tac cca tct cca ggt att ggt cac 48
Met Glu Ser Ser Lys Val Ile Leu Tyr Pro Ser Pro Gly Ile Gly His
1 5 10 15
ttg gtt tcc atg gtc gaa ttg ggt aaa ttg att cac acc cac cac cca 96
Leu Val Ser Met Val Glu Leu Gly Lys Leu Ile His Thr His His Pro
20 25 30
tct cta tcc gtc atc atc ttg gtt ttg cca gct acc tac gaa act ggt 144
Ser Leu Ser Val Ile Ile Leu Val Leu Pro Ala Thr Tyr Glu Thr Gly
35 40 45
tcc acc acc acc tac atc aac act gtt tct acc acc act cca ttc atc 192
Ser Thr Thr Thr Tyr Ile Asn Thr Val Ser Thr Thr Thr Pro Phe Ile
50 55 60
act ttc cac cac ttg cca gtc att cca ttg cca cca gac tcc tct tct 240
Thr Phe His His Leu Pro Val Ile Pro Leu Pro Pro Asp Ser Ser Ser
65 70 75 80
gaa ttt atc gat ttg gct ttc gac att cca caa tta tac aac cca gtt 288
Glu Phe Ile Asp Leu Ala Phe Asp Ile Pro Gln Leu Tyr Asn Pro Val
85 90 95
gtc tac aac act ttg gtt gcc atc tct gaa act tcc acc atc aag gct 336
Val Tyr Asn Thr Leu Val Ala Ile Ser Glu Thr Ser Thr Ile Lys Ala
100 105 110
gtt atc ttg gac ttc ttt gtc aac gcc gct ttc caa atc tcc aag tct 384
Val Ile Leu Asp Phe Phe Val Asn Ala Ala Phe Gln Ile Ser Lys Ser
115 120 125
ttg gac ttg cca acc tac tac ttc ttc act tct ggt gct tct ggt cta 432
Leu Asp Leu Pro Thr Tyr Tyr Phe Phe Thr Ser Gly Ala Ser Gly Leu
130 135 140
tgt gct ttc ttg cat ttg cca act atc tac aag acc tac tcc ggt aac 480
Cys Ala Phe Leu His Leu Pro Thr Ile Tyr Lys Thr Tyr Ser Gly Asn
145 150 155 160
ttc aag gac ttg gac act ttc atc aac att cca ggt gtt cct cct atc 528
Phe Lys Asp Leu Asp Thr Phe Ile Asn Ile Pro Gly Val Pro Pro Ile
165 170 175
cac tct tct gac atg cca acc gtc tta ttc gac aag gaa tcc aac tct 576
His Ser Ser Asp Met Pro Thr Val Leu Phe Asp Lys Glu Ser Asn Ser
180 185 190
tac aag aac ttt gtc aag act tcc aac aac atg gcc aaa tct tct ggt 624
Tyr Lys Asn Phe Val Lys Thr Ser Asn Asn Met Ala Lys Ser Ser Gly
195 200 205
gtc att gct aac tcc ttc ttg caa ttg gaa gaa aga gct gct caa act 672
Val Ile Ala Asn Ser Phe Leu Gln Leu Glu Glu Arg Ala Ala Gln Thr
210 215 220
ttg aga gat ggt aag tcc atc acc gat ggt cca tct cca cca atc tac 720
Leu Arg Asp Gly Lys Ser Ile Thr Asp Gly Pro Ser Pro Pro Ile Tyr
225 230 235 240
ttg att ggt cca ttg atc gct tct ggt aac caa gtt gac cat aat gaa 768
Leu Ile Gly Pro Leu Ile Ala Ser Gly Asn Gln Val Asp His Asn Glu
245 250 255
aac gaa tgt ttg aaa tgg tta aac act caa cca tcc aaa tct gtt gtt 816
Asn Glu Cys Leu Lys Trp Leu Asn Thr Gln Pro Ser Lys Ser Val Val
260 265 270
ttc ttg tgt ttc ggt tct caa ggt gtt ttc aag aag gaa caa ttg aag 864
Phe Leu Cys Phe Gly Ser Gln Gly Val Phe Lys Lys Glu Gln Leu Lys
275 280 285
gaa att gct gtt ggt ttg gaa aga tct ggt caa aga ttc tta tgg gtt 912
Glu Ile Ala Val Gly Leu Glu Arg Ser Gly Gln Arg Phe Leu Trp Val
290 295 300
gtc aga aag cct cca tct gat ggt ggt aag gaa ttc ggt ttg gat gac 960
Val Arg Lys Pro Pro Ser Asp Gly Gly Lys Glu Phe Gly Leu Asp Asp
305 310 315 320
gtc ttg cca gaa ggt ttc gtt gcc aga acc aag gaa aag ggt ttg gtt 1008
Val Leu Pro Glu Gly Phe Val Ala Arg Thr Lys Glu Lys Gly Leu Val
325 330 335
gtc aag aac tgg gct cca caa cca gct att ttg ggt cac gaa tct gtc 1056
Val Lys Asn Trp Ala Pro Gln Pro Ala Ile Leu Gly His Glu Ser Val
340 345 350
ggt ggt ttc gtt tcc cac tgt ggt tgg aac tct tct ttg gaa gcc gtt 1104
Gly Gly Phe Val Ser His Cys Gly Trp Asn Ser Ser Leu Glu Ala Val
355 360 365
gtt ttc ggt gtc cca atg gtt gct tgg cca tta tac gct gaa caa aag 1152
Val Phe Gly Val Pro Met Val Ala Trp Pro Leu Tyr Ala Glu Gln Lys
370 375 380
atg aac cgt gtc tac ttg gtc gaa gaa atc aag gtt gct tta tgg tta 1200
Met Asn Arg Val Tyr Leu Val Glu Glu Ile Lys Val Ala Leu Trp Leu
385 390 395 400
aga atg tcc gct gat ggt ttt gtc tct gct gaa gct gtt gaa gaa act 1248
Arg Met Ser Ala Asp Gly Phe Val Ser Ala Glu Ala Val Glu Glu Thr
405 410 415
gtc aga caa ttg atg gac ggt aga aga gtt cgt gaa aga att ttg gaa 1296
Val Arg Gln Leu Met Asp Gly Arg Arg Val Arg Glu Arg Ile Leu Glu
420 425 430
atg tcc acc aag gcc aag gcc gct gtc gaa gat ggt ggt tct tcc aga 1344
Met Ser Thr Lys Ala Lys Ala Ala Val Glu Asp Gly Gly Ser Ser Arg
435 440 445
gtt gac ttc ttc aaa ttg act gaa tcc tgg acc cac aag 1383
Val Asp Phe Phe Lys Leu Thr Glu Ser Trp Thr His Lys
450 455 460
<210> 122
<211> 461
<212> PRT
<213> Stevia rebaudiana
<400> 122
Met Glu Ser Ser Lys Val Ile Leu Tyr Pro Ser Pro Gly Ile Gly His
1 5 10 15
Leu Val Ser Met Val Glu Leu Gly Lys Leu Ile His Thr His His Pro
20 25 30
Ser Leu Ser Val Ile Ile Leu Val Leu Pro Ala Thr Tyr Glu Thr Gly
35 40 45
Ser Thr Thr Thr Tyr Ile Asn Thr Val Ser Thr Thr Thr Pro Phe Ile
50 55 60
Thr Phe His His Leu Pro Val Ile Pro Leu Pro Pro Asp Ser Ser Ser
65 70 75 80
Glu Phe Ile Asp Leu Ala Phe Asp Ile Pro Gln Leu Tyr Asn Pro Val
85 90 95
Val Tyr Asn Thr Leu Val Ala Ile Ser Glu Thr Ser Thr Ile Lys Ala
100 105 110
Val Ile Leu Asp Phe Phe Val Asn Ala Ala Phe Gln Ile Ser Lys Ser
115 120 125
Leu Asp Leu Pro Thr Tyr Tyr Phe Phe Thr Ser Gly Ala Ser Gly Leu
130 135 140
Cys Ala Phe Leu His Leu Pro Thr Ile Tyr Lys Thr Tyr Ser Gly Asn
145 150 155 160
Phe Lys Asp Leu Asp Thr Phe Ile Asn Ile Pro Gly Val Pro Pro Ile
165 170 175
His Ser Ser Asp Met Pro Thr Val Leu Phe Asp Lys Glu Ser Asn Ser
180 185 190
Tyr Lys Asn Phe Val Lys Thr Ser Asn Asn Met Ala Lys Ser Ser Gly
195 200 205
Val Ile Ala Asn Ser Phe Leu Gln Leu Glu Glu Arg Ala Ala Gln Thr
210 215 220
Leu Arg Asp Gly Lys Ser Ile Thr Asp Gly Pro Ser Pro Pro Ile Tyr
225 230 235 240
Leu Ile Gly Pro Leu Ile Ala Ser Gly Asn Gln Val Asp His Asn Glu
245 250 255
Asn Glu Cys Leu Lys Trp Leu Asn Thr Gln Pro Ser Lys Ser Val Val
260 265 270
Phe Leu Cys Phe Gly Ser Gln Gly Val Phe Lys Lys Glu Gln Leu Lys
275 280 285
Glu Ile Ala Val Gly Leu Glu Arg Ser Gly Gln Arg Phe Leu Trp Val
290 295 300
Val Arg Lys Pro Pro Ser Asp Gly Gly Lys Glu Phe Gly Leu Asp Asp
305 310 315 320
Val Leu Pro Glu Gly Phe Val Ala Arg Thr Lys Glu Lys Gly Leu Val
325 330 335
Val Lys Asn Trp Ala Pro Gln Pro Ala Ile Leu Gly His Glu Ser Val
340 345 350
Gly Gly Phe Val Ser His Cys Gly Trp Asn Ser Ser Leu Glu Ala Val
355 360 365
Val Phe Gly Val Pro Met Val Ala Trp Pro Leu Tyr Ala Glu Gln Lys
370 375 380
Met Asn Arg Val Tyr Leu Val Glu Glu Ile Lys Val Ala Leu Trp Leu
385 390 395 400
Arg Met Ser Ala Asp Gly Phe Val Ser Ala Glu Ala Val Glu Glu Thr
405 410 415
Val Arg Gln Leu Met Asp Gly Arg Arg Val Arg Glu Arg Ile Leu Glu
420 425 430
Met Ser Thr Lys Ala Lys Ala Ala Val Glu Asp Gly Gly Ser Ser Arg
435 440 445
Val Asp Phe Phe Lys Leu Thr Glu Ser Trp Thr His Lys
450 455 460
<210> 123
<211> 1455
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1455)
<400> 123
atg tac aac gtt act tac cat caa aac tcc aag gcc atg gct acc tct 48
Met Tyr Asn Val Thr Tyr His Gln Asn Ser Lys Ala Met Ala Thr Ser
1 5 10 15
gac tcc att gtc gac gac aga aag caa ttg cac gtt gcc act ttc cca 96
Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val Ala Thr Phe Pro
20 25 30
tgg ttg gct ttc ggt cac atc ttg cca ttc ttg caa ttg tcc aaa ttg 144
Trp Leu Ala Phe Gly His Ile Leu Pro Phe Leu Gln Leu Ser Lys Leu
35 40 45
att gct gaa aag ggt cac aaa gtc tct ttc ttg tct acc acc aga aac 192
Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser Thr Thr Arg Asn
50 55 60
atc caa aga ttg tcc tct cac atc tct cca ttg att aac gtt gtc caa 240
Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile Asn Val Val Gln
65 70 75 80
ttg act tta cca aga gtc caa gaa ttg cca gaa gat gct gaa gct act 288
Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp Ala Glu Ala Thr
85 90 95
act gat gtt cac cca gaa gat atc caa tac ttg aag aag gct gtc gat 336
Thr Asp Val His Pro Glu Asp Ile Gln Tyr Leu Lys Lys Ala Val Asp
100 105 110
ggt ttg caa cca gaa gtt acc aga ttc ttg gaa caa cac tct cca gac 384
Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln His Ser Pro Asp
115 120 125
tgg atc atc tac gac ttc act cac tac tgg tta cca tcc att gct gct 432
Trp Ile Ile Tyr Asp Phe Thr His Tyr Trp Leu Pro Ser Ile Ala Ala
130 135 140
tct ttg ggt atc tcc aga gct tac ttc tgt gtt atc act cca tgg acc 480
Ser Leu Gly Ile Ser Arg Ala Tyr Phe Cys Val Ile Thr Pro Trp Thr
145 150 155 160
att gct tac ttg gct cca tct tct gat gcc atg atc aac gac tct gat 528
Ile Ala Tyr Leu Ala Pro Ser Ser Asp Ala Met Ile Asn Asp Ser Asp
165 170 175
ggt aga acc act gtt gaa gat ttg acc acc cca cct aaa tgg ttc cca 576
Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro Lys Trp Phe Pro
180 185 190
ttc cca acc aag gtc tgt tgg aga aag cat gat ttg gcc aga atg gaa 624
Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu Ala Arg Met Glu
195 200 205
cca tac gaa gct cca ggt atc tct gac ggt tac aga atg ggt atg gtt 672
Pro Tyr Glu Ala Pro Gly Ile Ser Asp Gly Tyr Arg Met Gly Met Val
210 215 220
ttc aag ggt tct gac tgt ttg ttg ttc aaa tgt tac cat gaa ttc ggt 720
Phe Lys Gly Ser Asp Cys Leu Leu Phe Lys Cys Tyr His Glu Phe Gly
225 230 235 240
act caa tgg tta cct cta tta gaa act ttg cac caa gtt cca gtt gtt 768
Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln Val Pro Val Val
245 250 255
cca gtc ggt ttg ttg cct cca gaa atc cca ggt gac gaa aag gac gaa 816
Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp Glu Lys Asp Glu
260 265 270
acc tgg gtt tcc atc aag aaa tgg ttg gat ggt aag caa aag ggt tcc 864
Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys Gln Lys Gly Ser
275 280 285
gtt gtc tac gtt gct tta ggt tct gaa gct tta gtc tct caa act gaa 912
Val Val Tyr Val Ala Leu Gly Ser Glu Ala Leu Val Ser Gln Thr Glu
290 295 300
gtt gtc gaa ttg gct ttg ggt ttg gaa ttg tcc ggt ttg cca ttt gtc 960
Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly Leu Pro Phe Val
305 310 315 320
tgg gct tac aga aag cca aag ggt cca gcc aag tct gat tct gtc gaa 1008
Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser Asp Ser Val Glu
325 330 335
tta cca gat ggt ttc gtc gaa aga acc cgt gac cgt ggt ttg gtc tgg 1056
Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg Gly Leu Val Trp
340 345 350
act tcc tgg gct cca caa ttg aga att ttg tcc cac gaa tct gtc tgt 1104
Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His Glu Ser Val Cys
355 360 365
ggt ttc ttg act cac tgt ggt tct ggt tcc att gtt gaa ggt ttg atg 1152
Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val Glu Gly Leu Met
370 375 380
ttc ggt cac cca ttg atc atg ttg cca atc ttc tgt gac caa cca tta 1200
Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Cys Asp Gln Pro Leu
385 390 395 400
aac gct aga ttg ttg gaa gac aag caa gtc ggt att gaa att cca aga 1248
Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile Glu Ile Pro Arg
405 410 415
aac gaa gaa gac ggt tgt ttg acc aag gaa tcc gtt gcc aga tct ttg 1296
Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val Ala Arg Ser Leu
420 425 430
aga tct gtt gtt gtc gaa aac gaa ggt gaa atc tac aag gct aac gct 1344
Arg Ser Val Val Val Glu Asn Glu Gly Glu Ile Tyr Lys Ala Asn Ala
435 440 445
cgt gct cta tcc aag atc tac aac gac acc aag gtc gaa aag gaa tac 1392
Arg Ala Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val Glu Lys Glu Tyr
450 455 460
gtt tct caa ttt gtt gac tac ttg gaa aag aac gcc aga gct gtt gcc 1440
Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala Arg Ala Val Ala
465 470 475 480
att gac cac gaa agt 1455
Ile Asp His Glu Ser
485
<210> 124
<211> 485
<212> PRT
<213> Stevia rebaudiana
<400> 124
Met Tyr Asn Val Thr Tyr His Gln Asn Ser Lys Ala Met Ala Thr Ser
1 5 10 15
Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val Ala Thr Phe Pro
20 25 30
Trp Leu Ala Phe Gly His Ile Leu Pro Phe Leu Gln Leu Ser Lys Leu
35 40 45
Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser Thr Thr Arg Asn
50 55 60
Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile Asn Val Val Gln
65 70 75 80
Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp Ala Glu Ala Thr
85 90 95
Thr Asp Val His Pro Glu Asp Ile Gln Tyr Leu Lys Lys Ala Val Asp
100 105 110
Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln His Ser Pro Asp
115 120 125
Trp Ile Ile Tyr Asp Phe Thr His Tyr Trp Leu Pro Ser Ile Ala Ala
130 135 140
Ser Leu Gly Ile Ser Arg Ala Tyr Phe Cys Val Ile Thr Pro Trp Thr
145 150 155 160
Ile Ala Tyr Leu Ala Pro Ser Ser Asp Ala Met Ile Asn Asp Ser Asp
165 170 175
Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro Lys Trp Phe Pro
180 185 190
Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu Ala Arg Met Glu
195 200 205
Pro Tyr Glu Ala Pro Gly Ile Ser Asp Gly Tyr Arg Met Gly Met Val
210 215 220
Phe Lys Gly Ser Asp Cys Leu Leu Phe Lys Cys Tyr His Glu Phe Gly
225 230 235 240
Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln Val Pro Val Val
245 250 255
Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp Glu Lys Asp Glu
260 265 270
Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys Gln Lys Gly Ser
275 280 285
Val Val Tyr Val Ala Leu Gly Ser Glu Ala Leu Val Ser Gln Thr Glu
290 295 300
Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly Leu Pro Phe Val
305 310 315 320
Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser Asp Ser Val Glu
325 330 335
Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg Gly Leu Val Trp
340 345 350
Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His Glu Ser Val Cys
355 360 365
Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val Glu Gly Leu Met
370 375 380
Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Cys Asp Gln Pro Leu
385 390 395 400
Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile Glu Ile Pro Arg
405 410 415
Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val Ala Arg Ser Leu
420 425 430
Arg Ser Val Val Val Glu Asn Glu Gly Glu Ile Tyr Lys Ala Asn Ala
435 440 445
Arg Ala Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val Glu Lys Glu Tyr
450 455 460
Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala Arg Ala Val Ala
465 470 475 480
Ile Asp His Glu Ser
485
<210> 125
<211> 1485
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1485)
<400> 125
atg tct cca aag atg gtt gct cca cca acc aac ttg cac ttc gtc tta 48
Met Ser Pro Lys Met Val Ala Pro Pro Thr Asn Leu His Phe Val Leu
1 5 10 15
ttc cca ttg atg gct caa ggt cac ttg gtt cca atg gtc gac att gct 96
Phe Pro Leu Met Ala Gln Gly His Leu Val Pro Met Val Asp Ile Ala
20 25 30
aga att ttg gct caa aga ggt gct acc gtc acc atc atc acc act cca 144
Arg Ile Leu Ala Gln Arg Gly Ala Thr Val Thr Ile Ile Thr Thr Pro
35 40 45
tac cat gcc aac aga gtc aga cca gtt atc tcc aga gcc att gct acc 192
Tyr His Ala Asn Arg Val Arg Pro Val Ile Ser Arg Ala Ile Ala Thr
50 55 60
aac ttg aaa atc caa ttg ttg gaa ttg caa ttg aga tcc act gaa gct 240
Asn Leu Lys Ile Gln Leu Leu Glu Leu Gln Leu Arg Ser Thr Glu Ala
65 70 75 80
ggt ttg cca gaa ggt tgt gaa tct ttc gac caa ttg cca tct ttc gaa 288
Gly Leu Pro Glu Gly Cys Glu Ser Phe Asp Gln Leu Pro Ser Phe Glu
85 90 95
tac tgg aag aac atc tcc act gcc att gac ttg ttg caa caa cca gct 336
Tyr Trp Lys Asn Ile Ser Thr Ala Ile Asp Leu Leu Gln Gln Pro Ala
100 105 110
gaa gat cta ttg aga gaa ttg tct cca cct cca gac tgt atc atc tcc 384
Glu Asp Leu Leu Arg Glu Leu Ser Pro Pro Pro Asp Cys Ile Ile Ser
115 120 125
gat ttc ttg ttc cca tgg acc act gat gtc gcc aga aga ttg aac att 432
Asp Phe Leu Phe Pro Trp Thr Thr Asp Val Ala Arg Arg Leu Asn Ile
130 135 140
cca aga ttg gtt ttc aac ggt cca ggt tgt ttc tac ttg ttg tgt atc 480
Pro Arg Leu Val Phe Asn Gly Pro Gly Cys Phe Tyr Leu Leu Cys Ile
145 150 155 160
cac gtt gcc atc act tct aac atc tta ggt gaa aac gaa cct gtt tcc 528
His Val Ala Ile Thr Ser Asn Ile Leu Gly Glu Asn Glu Pro Val Ser
165 170 175
tcc aac act gaa aga gtt gtt ttg cca ggt tta cca gat cgt att gaa 576
Ser Asn Thr Glu Arg Val Val Leu Pro Gly Leu Pro Asp Arg Ile Glu
180 185 190
gtt acc aaa ttg caa atc gtc ggt tct tcc aga cca gct aac gtt gac 624
Val Thr Lys Leu Gln Ile Val Gly Ser Ser Arg Pro Ala Asn Val Asp
195 200 205
gaa atg ggt tcc tgg tta aga gct gtc gaa gct gaa aag gct tct ttc 672
Glu Met Gly Ser Trp Leu Arg Ala Val Glu Ala Glu Lys Ala Ser Phe
210 215 220
ggt att gtc gtc aac acc ttt gaa gaa ttg gaa cca gaa tac gtc gaa 720
Gly Ile Val Val Asn Thr Phe Glu Glu Leu Glu Pro Glu Tyr Val Glu
225 230 235 240
gaa tac aag acc gtc aag gac aag aag atg tgg tgt atc ggt cca gtt 768
Glu Tyr Lys Thr Val Lys Asp Lys Lys Met Trp Cys Ile Gly Pro Val
245 250 255
tct cta tgt aac aag act ggt cca gac ttg gct gaa aga ggt aac aag 816
Ser Leu Cys Asn Lys Thr Gly Pro Asp Leu Ala Glu Arg Gly Asn Lys
260 265 270
gct gcc atc acc gaa cac aac tgt ttg aaa tgg ttg gat gaa aga aag 864
Ala Ala Ile Thr Glu His Asn Cys Leu Lys Trp Leu Asp Glu Arg Lys
275 280 285
ttg ggt tct gtc ttg tac gtt tgt ttg ggt tct ttg gcc aga atc tcc 912
Leu Gly Ser Val Leu Tyr Val Cys Leu Gly Ser Leu Ala Arg Ile Ser
290 295 300
gct gct caa gcc att gaa tta ggt tta ggt ttg gaa tct atc aac aga 960
Ala Ala Gln Ala Ile Glu Leu Gly Leu Gly Leu Glu Ser Ile Asn Arg
305 310 315 320
cca ttt atc tgg tgt gtt aga aac gaa act gat gaa ttg aag act tgg 1008
Pro Phe Ile Trp Cys Val Arg Asn Glu Thr Asp Glu Leu Lys Thr Trp
325 330 335
ttc ttg gac ggt ttc gaa gaa aga gtt cgt gac aga ggt ttg att gtt 1056
Phe Leu Asp Gly Phe Glu Glu Arg Val Arg Asp Arg Gly Leu Ile Val
340 345 350
cac ggt tgg gct cca caa gtt ttg atc tta tct cac cca acc att ggt 1104
His Gly Trp Ala Pro Gln Val Leu Ile Leu Ser His Pro Thr Ile Gly
355 360 365
ggt ttc ttg act cac tgt ggt tgg aac tct acc att gaa tcc atc act 1152
Gly Phe Leu Thr His Cys Gly Trp Asn Ser Thr Ile Glu Ser Ile Thr
370 375 380
gct ggt gtt cca atg atc acc tgg cca ttc ttt gct gac caa ttc ttg 1200
Ala Gly Val Pro Met Ile Thr Trp Pro Phe Phe Ala Asp Gln Phe Leu
385 390 395 400
aat gaa gct ttc atc gtt gaa gtt ttg aaa att ggt gtc aga atc ggt 1248
Asn Glu Ala Phe Ile Val Glu Val Leu Lys Ile Gly Val Arg Ile Gly
405 410 415
gtt gaa aga gct tgt ttg ttc ggt gaa gaa gac aag gtc ggt gtc tta 1296
Val Glu Arg Ala Cys Leu Phe Gly Glu Glu Asp Lys Val Gly Val Leu
420 425 430
gtc aag aag gaa gat gtt aag aag gct gtt gaa tgt ttg atg gac gaa 1344
Val Lys Lys Glu Asp Val Lys Lys Ala Val Glu Cys Leu Met Asp Glu
435 440 445
gat gaa gac ggt gac caa aga aga aag cgt gtc att gaa ttg gcc aag 1392
Asp Glu Asp Gly Asp Gln Arg Arg Lys Arg Val Ile Glu Leu Ala Lys
450 455 460
atg gct aag att gct atg gct gaa ggt ggt tct tct tac gaa aac gtt 1440
Met Ala Lys Ile Ala Met Ala Glu Gly Gly Ser Ser Tyr Glu Asn Val
465 470 475 480
tcc tct ttg atc aga gat gtc act gaa act gtc cgt gct cct cat 1485
Ser Ser Leu Ile Arg Asp Val Thr Glu Thr Val Arg Ala Pro His
485 490 495
<210> 126
<211> 495
<212> PRT
<213> Stevia rebaudiana
<400> 126
Met Ser Pro Lys Met Val Ala Pro Pro Thr Asn Leu His Phe Val Leu
1 5 10 15
Phe Pro Leu Met Ala Gln Gly His Leu Val Pro Met Val Asp Ile Ala
20 25 30
Arg Ile Leu Ala Gln Arg Gly Ala Thr Val Thr Ile Ile Thr Thr Pro
35 40 45
Tyr His Ala Asn Arg Val Arg Pro Val Ile Ser Arg Ala Ile Ala Thr
50 55 60
Asn Leu Lys Ile Gln Leu Leu Glu Leu Gln Leu Arg Ser Thr Glu Ala
65 70 75 80
Gly Leu Pro Glu Gly Cys Glu Ser Phe Asp Gln Leu Pro Ser Phe Glu
85 90 95
Tyr Trp Lys Asn Ile Ser Thr Ala Ile Asp Leu Leu Gln Gln Pro Ala
100 105 110
Glu Asp Leu Leu Arg Glu Leu Ser Pro Pro Pro Asp Cys Ile Ile Ser
115 120 125
Asp Phe Leu Phe Pro Trp Thr Thr Asp Val Ala Arg Arg Leu Asn Ile
130 135 140
Pro Arg Leu Val Phe Asn Gly Pro Gly Cys Phe Tyr Leu Leu Cys Ile
145 150 155 160
His Val Ala Ile Thr Ser Asn Ile Leu Gly Glu Asn Glu Pro Val Ser
165 170 175
Ser Asn Thr Glu Arg Val Val Leu Pro Gly Leu Pro Asp Arg Ile Glu
180 185 190
Val Thr Lys Leu Gln Ile Val Gly Ser Ser Arg Pro Ala Asn Val Asp
195 200 205
Glu Met Gly Ser Trp Leu Arg Ala Val Glu Ala Glu Lys Ala Ser Phe
210 215 220
Gly Ile Val Val Asn Thr Phe Glu Glu Leu Glu Pro Glu Tyr Val Glu
225 230 235 240
Glu Tyr Lys Thr Val Lys Asp Lys Lys Met Trp Cys Ile Gly Pro Val
245 250 255
Ser Leu Cys Asn Lys Thr Gly Pro Asp Leu Ala Glu Arg Gly Asn Lys
260 265 270
Ala Ala Ile Thr Glu His Asn Cys Leu Lys Trp Leu Asp Glu Arg Lys
275 280 285
Leu Gly Ser Val Leu Tyr Val Cys Leu Gly Ser Leu Ala Arg Ile Ser
290 295 300
Ala Ala Gln Ala Ile Glu Leu Gly Leu Gly Leu Glu Ser Ile Asn Arg
305 310 315 320
Pro Phe Ile Trp Cys Val Arg Asn Glu Thr Asp Glu Leu Lys Thr Trp
325 330 335
Phe Leu Asp Gly Phe Glu Glu Arg Val Arg Asp Arg Gly Leu Ile Val
340 345 350
His Gly Trp Ala Pro Gln Val Leu Ile Leu Ser His Pro Thr Ile Gly
355 360 365
Gly Phe Leu Thr His Cys Gly Trp Asn Ser Thr Ile Glu Ser Ile Thr
370 375 380
Ala Gly Val Pro Met Ile Thr Trp Pro Phe Phe Ala Asp Gln Phe Leu
385 390 395 400
Asn Glu Ala Phe Ile Val Glu Val Leu Lys Ile Gly Val Arg Ile Gly
405 410 415
Val Glu Arg Ala Cys Leu Phe Gly Glu Glu Asp Lys Val Gly Val Leu
420 425 430
Val Lys Lys Glu Asp Val Lys Lys Ala Val Glu Cys Leu Met Asp Glu
435 440 445
Asp Glu Asp Gly Asp Gln Arg Arg Lys Arg Val Ile Glu Leu Ala Lys
450 455 460
Met Ala Lys Ile Ala Met Ala Glu Gly Gly Ser Ser Tyr Glu Asn Val
465 470 475 480
Ser Ser Leu Ile Arg Asp Val Thr Glu Thr Val Arg Ala Pro His
485 490 495
<210> 127
<211> 1272
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1272)
<400> 127
atg ttg caa ttg gct acc tac ttg cac tct caa ggt atc tcc atc act 48
Met Leu Gln Leu Ala Thr Tyr Leu His Ser Gln Gly Ile Ser Ile Thr
1 5 10 15
att gct caa tac cca aac ttc aac tct cca gac tcc tcc aac cac cca 96
Ile Ala Gln Tyr Pro Asn Phe Asn Ser Pro Asp Ser Ser Asn His Pro
20 25 30
gaa ttg act ttc tta cca tta tct tcc ggt aac ttg tcc gtt gct gat 144
Glu Leu Thr Phe Leu Pro Leu Ser Ser Gly Asn Leu Ser Val Ala Asp
35 40 45
atc tct ggt ggt ttc ttc aaa ttc att caa acc ttg aac cac aac tgt 192
Ile Ser Gly Gly Phe Phe Lys Phe Ile Gln Thr Leu Asn His Asn Cys
50 55 60
aag cct cac ttc aga gaa tac ttg gtt caa aac atg tcc tct gat gac 240
Lys Pro His Phe Arg Glu Tyr Leu Val Gln Asn Met Ser Ser Asp Asp
65 70 75 80
aag gaa tcc att gtc atc atc aga gac aac ttg atg ttc ttt gct ggt 288
Lys Glu Ser Ile Val Ile Ile Arg Asp Asn Leu Met Phe Phe Ala Gly
85 90 95
gaa atc gct ggt gaa tta ggt tta cca tct atc atc ttg aga ggt tct 336
Glu Ile Ala Gly Glu Leu Gly Leu Pro Ser Ile Ile Leu Arg Gly Ser
100 105 110
aac gct gtc atg ttg act gct tcc gat atc att cca caa ttg cac caa 384
Asn Ala Val Met Leu Thr Ala Ser Asp Ile Ile Pro Gln Leu His Gln
115 120 125
gaa ggt aga ttc cca cca cca gac tct cta ttg caa gaa acc att cca 432
Glu Gly Arg Phe Pro Pro Pro Asp Ser Leu Leu Gln Glu Thr Ile Pro
130 135 140
gaa ttg gtt cca ttc aga tac aag gac ttg cca ttc atc ggt tac cca 480
Glu Leu Val Pro Phe Arg Tyr Lys Asp Leu Pro Phe Ile Gly Tyr Pro
145 150 155 160
att cac caa acc ttg gaa ttc tcc att acc atg atg acc cca aag tct 528
Ile His Gln Thr Leu Glu Phe Ser Ile Thr Met Met Thr Pro Lys Ser
165 170 175
cca gct tct gcc atc ttg atc aac act tta gaa ttc ttg gaa caa tct 576
Pro Ala Ser Ala Ile Leu Ile Asn Thr Leu Glu Phe Leu Glu Gln Ser
180 185 190
gct ttg act caa atc cgt gac cac tac aag gtc cca gtt ttc act atc 624
Ala Leu Thr Gln Ile Arg Asp His Tyr Lys Val Pro Val Phe Thr Ile
195 200 205
ggt cca ttg cac aag att gtc act acc aga tcc act tcc atc ttg gaa 672
Gly Pro Leu His Lys Ile Val Thr Thr Arg Ser Thr Ser Ile Leu Glu
210 215 220
gaa gac acc tct tgt atc aac tgg ttg gac aag caa tct cca aag tct 720
Glu Asp Thr Ser Cys Ile Asn Trp Leu Asp Lys Gln Ser Pro Lys Ser
225 230 235 240
gtt gtc tac gtt tct cta ggt tct ttg gcc aaa ttg gac gaa aag gtt 768
Val Val Tyr Val Ser Leu Gly Ser Leu Ala Lys Leu Asp Glu Lys Val
245 250 255
gct tct gaa atg gct tgt ggt ttg gcc atg tcc aac cat aaa ttc tta 816
Ala Ser Glu Met Ala Cys Gly Leu Ala Met Ser Asn His Lys Phe Leu
260 265 270
tgg gtt gtc aga cca ggt atg gtc cat ggt ttc gaa tgg gtt gaa ttt 864
Trp Val Val Arg Pro Gly Met Val His Gly Phe Glu Trp Val Glu Phe
275 280 285
ttg cca gac tct ttg gtt ggt gaa atg aag gcc aga ggt ttg att gtc 912
Leu Pro Asp Ser Leu Val Gly Glu Met Lys Ala Arg Gly Leu Ile Val
290 295 300
aaa tgg gct cca caa acc act gtt ttg gct cac aac gct gtc ggt ggt 960
Lys Trp Ala Pro Gln Thr Thr Val Leu Ala His Asn Ala Val Gly Gly
305 310 315 320
ttt tgg tcc cac tgt ggt tgg aac tct act atc gaa tgt ttg gct gaa 1008
Phe Trp Ser His Cys Gly Trp Asn Ser Thr Ile Glu Cys Leu Ala Glu
325 330 335
ggt gtt cca atg atg tgt caa cct ttc ttc gct gac caa ttg ttg aat 1056
Gly Val Pro Met Met Cys Gln Pro Phe Phe Ala Asp Gln Leu Leu Asn
340 345 350
gct cgt tac gtt tcc gat gtc tgg aag acc ggt ttc gaa att gtc att 1104
Ala Arg Tyr Val Ser Asp Val Trp Lys Thr Gly Phe Glu Ile Val Ile
355 360 365
gaa aag ggt gaa att gct tgt gcc atc aag aga gtt ttg gtc gat gaa 1152
Glu Lys Gly Glu Ile Ala Cys Ala Ile Lys Arg Val Leu Val Asp Glu
370 375 380
gaa ggt gaa gaa atg aga caa aga gct atg gaa atc aag gaa aag gtc 1200
Glu Gly Glu Glu Met Arg Gln Arg Ala Met Glu Ile Lys Glu Lys Val
385 390 395 400
aag att gcc atc aac gat ggt ggt tct tct tac gac tct ttc aag gat 1248
Lys Ile Ala Ile Asn Asp Gly Gly Ser Ser Tyr Asp Ser Phe Lys Asp
405 410 415
ttg gtt gct ttc atc tcc tct ttg 1272
Leu Val Ala Phe Ile Ser Ser Leu
420
<210> 128
<211> 424
<212> PRT
<213> Stevia rebaudiana
<400> 128
Met Leu Gln Leu Ala Thr Tyr Leu His Ser Gln Gly Ile Ser Ile Thr
1 5 10 15
Ile Ala Gln Tyr Pro Asn Phe Asn Ser Pro Asp Ser Ser Asn His Pro
20 25 30
Glu Leu Thr Phe Leu Pro Leu Ser Ser Gly Asn Leu Ser Val Ala Asp
35 40 45
Ile Ser Gly Gly Phe Phe Lys Phe Ile Gln Thr Leu Asn His Asn Cys
50 55 60
Lys Pro His Phe Arg Glu Tyr Leu Val Gln Asn Met Ser Ser Asp Asp
65 70 75 80
Lys Glu Ser Ile Val Ile Ile Arg Asp Asn Leu Met Phe Phe Ala Gly
85 90 95
Glu Ile Ala Gly Glu Leu Gly Leu Pro Ser Ile Ile Leu Arg Gly Ser
100 105 110
Asn Ala Val Met Leu Thr Ala Ser Asp Ile Ile Pro Gln Leu His Gln
115 120 125
Glu Gly Arg Phe Pro Pro Pro Asp Ser Leu Leu Gln Glu Thr Ile Pro
130 135 140
Glu Leu Val Pro Phe Arg Tyr Lys Asp Leu Pro Phe Ile Gly Tyr Pro
145 150 155 160
Ile His Gln Thr Leu Glu Phe Ser Ile Thr Met Met Thr Pro Lys Ser
165 170 175
Pro Ala Ser Ala Ile Leu Ile Asn Thr Leu Glu Phe Leu Glu Gln Ser
180 185 190
Ala Leu Thr Gln Ile Arg Asp His Tyr Lys Val Pro Val Phe Thr Ile
195 200 205
Gly Pro Leu His Lys Ile Val Thr Thr Arg Ser Thr Ser Ile Leu Glu
210 215 220
Glu Asp Thr Ser Cys Ile Asn Trp Leu Asp Lys Gln Ser Pro Lys Ser
225 230 235 240
Val Val Tyr Val Ser Leu Gly Ser Leu Ala Lys Leu Asp Glu Lys Val
245 250 255
Ala Ser Glu Met Ala Cys Gly Leu Ala Met Ser Asn His Lys Phe Leu
260 265 270
Trp Val Val Arg Pro Gly Met Val His Gly Phe Glu Trp Val Glu Phe
275 280 285
Leu Pro Asp Ser Leu Val Gly Glu Met Lys Ala Arg Gly Leu Ile Val
290 295 300
Lys Trp Ala Pro Gln Thr Thr Val Leu Ala His Asn Ala Val Gly Gly
305 310 315 320
Phe Trp Ser His Cys Gly Trp Asn Ser Thr Ile Glu Cys Leu Ala Glu
325 330 335
Gly Val Pro Met Met Cys Gln Pro Phe Phe Ala Asp Gln Leu Leu Asn
340 345 350
Ala Arg Tyr Val Ser Asp Val Trp Lys Thr Gly Phe Glu Ile Val Ile
355 360 365
Glu Lys Gly Glu Ile Ala Cys Ala Ile Lys Arg Val Leu Val Asp Glu
370 375 380
Glu Gly Glu Glu Met Arg Gln Arg Ala Met Glu Ile Lys Glu Lys Val
385 390 395 400
Lys Ile Ala Ile Asn Asp Gly Gly Ser Ser Tyr Asp Ser Phe Lys Asp
405 410 415
Leu Val Ala Phe Ile Ser Ser Leu
420
<210> 129
<211> 1422
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1422)
<400> 129
atg tct acc tct gaa ttg gtt ttc att cca tct cca ggt gct ggt cat 48
Met Ser Thr Ser Glu Leu Val Phe Ile Pro Ser Pro Gly Ala Gly His
1 5 10 15
ttg cca cca act gtt gaa ttg gcc aaa ttg ttg ttg cac cgt gac caa 96
Leu Pro Pro Thr Val Glu Leu Ala Lys Leu Leu Leu His Arg Asp Gln
20 25 30
aga ttg tct gtc acc atc att gtc atg aac tta tgg tta ggt cca aag 144
Arg Leu Ser Val Thr Ile Ile Val Met Asn Leu Trp Leu Gly Pro Lys
35 40 45
cac aac act gaa gcc aga cca tgt gtt cct tcc ttg aga ttc gtc gat 192
His Asn Thr Glu Ala Arg Pro Cys Val Pro Ser Leu Arg Phe Val Asp
50 55 60
atc cca tgt gac gaa tct acc atg gct ttg atc tct cca aac act ttc 240
Ile Pro Cys Asp Glu Ser Thr Met Ala Leu Ile Ser Pro Asn Thr Phe
65 70 75 80
atc tct gcc ttt gtc gaa cac cac aag cca aga gtt cgt gac atc gtt 288
Ile Ser Ala Phe Val Glu His His Lys Pro Arg Val Arg Asp Ile Val
85 90 95
cgt ggt atc att gaa tct gac tct gtc aga ttg gct ggt ttc gtc ttg 336
Arg Gly Ile Ile Glu Ser Asp Ser Val Arg Leu Ala Gly Phe Val Leu
100 105 110
gac atg ttc tgt atg cca atg tcc gat gtt gct aac gaa ttt ggt gtt 384
Asp Met Phe Cys Met Pro Met Ser Asp Val Ala Asn Glu Phe Gly Val
115 120 125
cca tct tac aac tac ttc acc tct ggt gct gcc act tta ggt ttg atg 432
Pro Ser Tyr Asn Tyr Phe Thr Ser Gly Ala Ala Thr Leu Gly Leu Met
130 135 140
ttc cat ttg caa tgg aag aga gat cac gaa ggt tac gac gct act gaa 480
Phe His Leu Gln Trp Lys Arg Asp His Glu Gly Tyr Asp Ala Thr Glu
145 150 155 160
ttg aag aac tct gac act gaa tta tct gtc cca tct tac gtt aac cct 528
Leu Lys Asn Ser Asp Thr Glu Leu Ser Val Pro Ser Tyr Val Asn Pro
165 170 175
gtt cca gcc aag gtc ttg cca gaa gtt gtt ttg gac aag gaa ggt ggt 576
Val Pro Ala Lys Val Leu Pro Glu Val Val Leu Asp Lys Glu Gly Gly
180 185 190
tcc aag atg ttc ttg gat ttg gct gaa aga atc aga gaa tcc aag ggt 624
Ser Lys Met Phe Leu Asp Leu Ala Glu Arg Ile Arg Glu Ser Lys Gly
195 200 205
atc atc gtt aac tct tgt caa gct att gaa aga cac gct ttg gaa tac 672
Ile Ile Val Asn Ser Cys Gln Ala Ile Glu Arg His Ala Leu Glu Tyr
210 215 220
cta tcc tcc aac aac aac ggt att cca cct gtt ttc cca gtc ggt cca 720
Leu Ser Ser Asn Asn Asn Gly Ile Pro Pro Val Phe Pro Val Gly Pro
225 230 235 240
atc ttg aac ttg gaa aac aag aag gac gat gct aag act gac gaa atc 768
Ile Leu Asn Leu Glu Asn Lys Lys Asp Asp Ala Lys Thr Asp Glu Ile
245 250 255
atg aga tgg ttg aac gaa caa cca gaa tcc tct gtt gtt ttc ttg tgt 816
Met Arg Trp Leu Asn Glu Gln Pro Glu Ser Ser Val Val Phe Leu Cys
260 265 270
ttc ggt tcc atg ggt tct ttc aac gaa aag caa gtc aaa gaa att gct 864
Phe Gly Ser Met Gly Ser Phe Asn Glu Lys Gln Val Lys Glu Ile Ala
275 280 285
gtt gcc att gaa aga tct ggt cac aga ttt tta tgg tct ttg aga aga 912
Val Ala Ile Glu Arg Ser Gly His Arg Phe Leu Trp Ser Leu Arg Arg
290 295 300
cca act cca aag gaa aag att gaa ttc cca aag gaa tac gaa aac ttg 960
Pro Thr Pro Lys Glu Lys Ile Glu Phe Pro Lys Glu Tyr Glu Asn Leu
305 310 315 320
gaa gaa gtc tta cca gaa ggt ttc ttg aag aga acc tct tct atc ggt 1008
Glu Glu Val Leu Pro Glu Gly Phe Leu Lys Arg Thr Ser Ser Ile Gly
325 330 335
aag gtt atc ggt tgg gct cca caa atg gct gtc ttg tcc cac cca tcc 1056
Lys Val Ile Gly Trp Ala Pro Gln Met Ala Val Leu Ser His Pro Ser
340 345 350
gtc ggt ggt ttc gtt tcc cac tgt ggt tgg aac tcc act ttg gaa tcc 1104
Val Gly Gly Phe Val Ser His Cys Gly Trp Asn Ser Thr Leu Glu Ser
355 360 365
atg tgg tgt ggt gtc cca atg gct gct tgg cca tta tac gct gaa caa 1152
Met Trp Cys Gly Val Pro Met Ala Ala Trp Pro Leu Tyr Ala Glu Gln
370 375 380
act ttg aat gct ttc ttg ttg gtt gtc gaa ttg ggt ttg gct gct gaa 1200
Thr Leu Asn Ala Phe Leu Leu Val Val Glu Leu Gly Leu Ala Ala Glu
385 390 395 400
att aga atg gac tac aga acc gat acc aag gct ggt tac gat ggt ggt 1248
Ile Arg Met Asp Tyr Arg Thr Asp Thr Lys Ala Gly Tyr Asp Gly Gly
405 410 415
atg gaa gtc acc gtt gaa gaa att gaa gac ggt atc aga aag ttg atg 1296
Met Glu Val Thr Val Glu Glu Ile Glu Asp Gly Ile Arg Lys Leu Met
420 425 430
tct gac ggt gaa atc aga aac aag gtc aag gat gtc aag gaa aaa tcc 1344
Ser Asp Gly Glu Ile Arg Asn Lys Val Lys Asp Val Lys Glu Lys Ser
435 440 445
aga gct gcc gtt gtt gag ggt ggt tct tct tac gct tcc att ggt aaa 1392
Arg Ala Ala Val Val Glu Gly Gly Ser Ser Tyr Ala Ser Ile Gly Lys
450 455 460
ttc atc gaa cac gtt tcc aac gtc acc ata 1422
Phe Ile Glu His Val Ser Asn Val Thr Ile
465 470
<210> 130
<211> 474
<212> PRT
<213> Stevia rebaudiana
<400> 130
Met Ser Thr Ser Glu Leu Val Phe Ile Pro Ser Pro Gly Ala Gly His
1 5 10 15
Leu Pro Pro Thr Val Glu Leu Ala Lys Leu Leu Leu His Arg Asp Gln
20 25 30
Arg Leu Ser Val Thr Ile Ile Val Met Asn Leu Trp Leu Gly Pro Lys
35 40 45
His Asn Thr Glu Ala Arg Pro Cys Val Pro Ser Leu Arg Phe Val Asp
50 55 60
Ile Pro Cys Asp Glu Ser Thr Met Ala Leu Ile Ser Pro Asn Thr Phe
65 70 75 80
Ile Ser Ala Phe Val Glu His His Lys Pro Arg Val Arg Asp Ile Val
85 90 95
Arg Gly Ile Ile Glu Ser Asp Ser Val Arg Leu Ala Gly Phe Val Leu
100 105 110
Asp Met Phe Cys Met Pro Met Ser Asp Val Ala Asn Glu Phe Gly Val
115 120 125
Pro Ser Tyr Asn Tyr Phe Thr Ser Gly Ala Ala Thr Leu Gly Leu Met
130 135 140
Phe His Leu Gln Trp Lys Arg Asp His Glu Gly Tyr Asp Ala Thr Glu
145 150 155 160
Leu Lys Asn Ser Asp Thr Glu Leu Ser Val Pro Ser Tyr Val Asn Pro
165 170 175
Val Pro Ala Lys Val Leu Pro Glu Val Val Leu Asp Lys Glu Gly Gly
180 185 190
Ser Lys Met Phe Leu Asp Leu Ala Glu Arg Ile Arg Glu Ser Lys Gly
195 200 205
Ile Ile Val Asn Ser Cys Gln Ala Ile Glu Arg His Ala Leu Glu Tyr
210 215 220
Leu Ser Ser Asn Asn Asn Gly Ile Pro Pro Val Phe Pro Val Gly Pro
225 230 235 240
Ile Leu Asn Leu Glu Asn Lys Lys Asp Asp Ala Lys Thr Asp Glu Ile
245 250 255
Met Arg Trp Leu Asn Glu Gln Pro Glu Ser Ser Val Val Phe Leu Cys
260 265 270
Phe Gly Ser Met Gly Ser Phe Asn Glu Lys Gln Val Lys Glu Ile Ala
275 280 285
Val Ala Ile Glu Arg Ser Gly His Arg Phe Leu Trp Ser Leu Arg Arg
290 295 300
Pro Thr Pro Lys Glu Lys Ile Glu Phe Pro Lys Glu Tyr Glu Asn Leu
305 310 315 320
Glu Glu Val Leu Pro Glu Gly Phe Leu Lys Arg Thr Ser Ser Ile Gly
325 330 335
Lys Val Ile Gly Trp Ala Pro Gln Met Ala Val Leu Ser His Pro Ser
340 345 350
Val Gly Gly Phe Val Ser His Cys Gly Trp Asn Ser Thr Leu Glu Ser
355 360 365
Met Trp Cys Gly Val Pro Met Ala Ala Trp Pro Leu Tyr Ala Glu Gln
370 375 380
Thr Leu Asn Ala Phe Leu Leu Val Val Glu Leu Gly Leu Ala Ala Glu
385 390 395 400
Ile Arg Met Asp Tyr Arg Thr Asp Thr Lys Ala Gly Tyr Asp Gly Gly
405 410 415
Met Glu Val Thr Val Glu Glu Ile Glu Asp Gly Ile Arg Lys Leu Met
420 425 430
Ser Asp Gly Glu Ile Arg Asn Lys Val Lys Asp Val Lys Glu Lys Ser
435 440 445
Arg Ala Ala Val Val Glu Gly Gly Ser Ser Tyr Ala Ser Ile Gly Lys
450 455 460
Phe Ile Glu His Val Ser Asn Val Thr Ile
465 470
<210> 131
<211> 1437
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1437)
<400> 131
atg gcc tcc att gct gaa atg caa aag cct cat gcc atc tgt atc cca 48
Met Ala Ser Ile Ala Glu Met Gln Lys Pro His Ala Ile Cys Ile Pro
1 5 10 15
tac cca gct caa ggt cac atc aac cca atg atg caa ttt gcc aaa ttg 96
Tyr Pro Ala Gln Gly His Ile Asn Pro Met Met Gln Phe Ala Lys Leu
20 25 30
ttg cac ttc aag ggt ttc cac atc tct ttc gtt aac aac cat tac aac 144
Leu His Phe Lys Gly Phe His Ile Ser Phe Val Asn Asn His Tyr Asn
35 40 45
cac aag aga tta caa aga tct aga ggt cta tcc gct ttg gaa ggt ttg 192
His Lys Arg Leu Gln Arg Ser Arg Gly Leu Ser Ala Leu Glu Gly Leu
50 55 60
cca gac ttc cac ttc tac tcc atc cca gat ggt ttg cct cca tct aac 240
Pro Asp Phe His Phe Tyr Ser Ile Pro Asp Gly Leu Pro Pro Ser Asn
65 70 75 80
gct gaa gct act caa tcc att cca ggt cta tgt gaa tct att cca aag 288
Ala Glu Ala Thr Gln Ser Ile Pro Gly Leu Cys Glu Ser Ile Pro Lys
85 90 95
cac tct ttg gaa cca ttc tgt gac ttg att gcc act ttg aat ggt tct 336
His Ser Leu Glu Pro Phe Cys Asp Leu Ile Ala Thr Leu Asn Gly Ser
100 105 110
gat gtt cca cca gtc tcc tgt atc atc tct gac ggt gtc atg tct ttc 384
Asp Val Pro Pro Val Ser Cys Ile Ile Ser Asp Gly Val Met Ser Phe
115 120 125
act ttg caa gct gct gaa aga ttc ggt tta cca gaa gtc tta ttc tgg 432
Thr Leu Gln Ala Ala Glu Arg Phe Gly Leu Pro Glu Val Leu Phe Trp
130 135 140
act cca tct gct tgt ggt ttc ttg gct tac act cac tac aga gat ttg 480
Thr Pro Ser Ala Cys Gly Phe Leu Ala Tyr Thr His Tyr Arg Asp Leu
145 150 155 160
gtt gac aaa gaa tac att cca ttg aag gac acc aat gat ttg acc aac 528
Val Asp Lys Glu Tyr Ile Pro Leu Lys Asp Thr Asn Asp Leu Thr Asn
165 170 175
ggt tac ttg gaa act tct ttg gac tgg atc cca ggt atg aag aac atc 576
Gly Tyr Leu Glu Thr Ser Leu Asp Trp Ile Pro Gly Met Lys Asn Ile
180 185 190
aga tta aag gac ttc cca tct ttc atc aga acc acc gat atc aac gat 624
Arg Leu Lys Asp Phe Pro Ser Phe Ile Arg Thr Thr Asp Ile Asn Asp
195 200 205
atc atg ttg aac tac ttc ttg att gaa act gaa gcc att cca aag ggt 672
Ile Met Leu Asn Tyr Phe Leu Ile Glu Thr Glu Ala Ile Pro Lys Gly
210 215 220
gtt gcc atc atc ttg aac act ttc gat gct ttg gaa aag gac tct atc 720
Val Ala Ile Ile Leu Asn Thr Phe Asp Ala Leu Glu Lys Asp Ser Ile
225 230 235 240
act cca gtc ttg gct ttg aac cct caa atc tac acc atc ggt cca ttg 768
Thr Pro Val Leu Ala Leu Asn Pro Gln Ile Tyr Thr Ile Gly Pro Leu
245 250 255
cac atg atg caa caa tac gtc gac cat gat gaa aga ttg aag cac att 816
His Met Met Gln Gln Tyr Val Asp His Asp Glu Arg Leu Lys His Ile
260 265 270
ggt tcc aac tta tgg aag gaa gac gtt tct tgt atc aac tgg ttg gac 864
Gly Ser Asn Leu Trp Lys Glu Asp Val Ser Cys Ile Asn Trp Leu Asp
275 280 285
acc aag aag cca aac tcc gtt gtc tac gtc aac ttt ggt tcc atc act 912
Thr Lys Lys Pro Asn Ser Val Val Tyr Val Asn Phe Gly Ser Ile Thr
290 295 300
gtt atg acc aag gaa caa ttg att gaa ttc ggt tgg ggt ttg gct aac 960
Val Met Thr Lys Glu Gln Leu Ile Glu Phe Gly Trp Gly Leu Ala Asn
305 310 315 320
tct aag aag gac ttc tta tgg atc acc aga cca gac att gtt ggt ggt 1008
Ser Lys Lys Asp Phe Leu Trp Ile Thr Arg Pro Asp Ile Val Gly Gly
325 330 335
aac gaa gct atg att cca gct gaa ttc att gaa gaa acc aag gaa aga 1056
Asn Glu Ala Met Ile Pro Ala Glu Phe Ile Glu Glu Thr Lys Glu Arg
340 345 350
ggt atg gtt act tcc tgg tgt tcc caa gaa gaa gtt ttg aag cac cca 1104
Gly Met Val Thr Ser Trp Cys Ser Gln Glu Glu Val Leu Lys His Pro
355 360 365
tcc att ggt gtt ttc ttg acc cac tct ggt tgg aac tcc acc att gaa 1152
Ser Ile Gly Val Phe Leu Thr His Ser Gly Trp Asn Ser Thr Ile Glu
370 375 380
tcc atc tct aac ggt gtt cca atg atc tgt tgg cca ttc ttc gct gaa 1200
Ser Ile Ser Asn Gly Val Pro Met Ile Cys Trp Pro Phe Phe Ala Glu
385 390 395 400
caa caa acc aac tgt cgt tac tgt tgt gtc gaa tgg gaa atc ggt ttg 1248
Gln Gln Thr Asn Cys Arg Tyr Cys Cys Val Glu Trp Glu Ile Gly Leu
405 410 415
gaa att gac act gac gtc aag cgt gaa gaa gtt gaa gct caa gtt aga 1296
Glu Ile Asp Thr Asp Val Lys Arg Glu Glu Val Glu Ala Gln Val Arg
420 425 430
gaa atg atg gat ggt tct aag ggt aag atg atg aaa aac aag gct ttg 1344
Glu Met Met Asp Gly Ser Lys Gly Lys Met Met Lys Asn Lys Ala Leu
435 440 445
gaa tgg aag aag aag gct gaa gaa gct gtc tct att ggt ggt tct tct 1392
Glu Trp Lys Lys Lys Ala Glu Glu Ala Val Ser Ile Gly Gly Ser Ser
450 455 460
tac ttg aac ttt gaa aaa ttg gtc acc gat gtc ttg ttg cgc aaa 1437
Tyr Leu Asn Phe Glu Lys Leu Val Thr Asp Val Leu Leu Arg Lys
465 470 475
<210> 132
<211> 479
<212> PRT
<213> Stevia rebaudiana
<400> 132
Met Ala Ser Ile Ala Glu Met Gln Lys Pro His Ala Ile Cys Ile Pro
1 5 10 15
Tyr Pro Ala Gln Gly His Ile Asn Pro Met Met Gln Phe Ala Lys Leu
20 25 30
Leu His Phe Lys Gly Phe His Ile Ser Phe Val Asn Asn His Tyr Asn
35 40 45
His Lys Arg Leu Gln Arg Ser Arg Gly Leu Ser Ala Leu Glu Gly Leu
50 55 60
Pro Asp Phe His Phe Tyr Ser Ile Pro Asp Gly Leu Pro Pro Ser Asn
65 70 75 80
Ala Glu Ala Thr Gln Ser Ile Pro Gly Leu Cys Glu Ser Ile Pro Lys
85 90 95
His Ser Leu Glu Pro Phe Cys Asp Leu Ile Ala Thr Leu Asn Gly Ser
100 105 110
Asp Val Pro Pro Val Ser Cys Ile Ile Ser Asp Gly Val Met Ser Phe
115 120 125
Thr Leu Gln Ala Ala Glu Arg Phe Gly Leu Pro Glu Val Leu Phe Trp
130 135 140
Thr Pro Ser Ala Cys Gly Phe Leu Ala Tyr Thr His Tyr Arg Asp Leu
145 150 155 160
Val Asp Lys Glu Tyr Ile Pro Leu Lys Asp Thr Asn Asp Leu Thr Asn
165 170 175
Gly Tyr Leu Glu Thr Ser Leu Asp Trp Ile Pro Gly Met Lys Asn Ile
180 185 190
Arg Leu Lys Asp Phe Pro Ser Phe Ile Arg Thr Thr Asp Ile Asn Asp
195 200 205
Ile Met Leu Asn Tyr Phe Leu Ile Glu Thr Glu Ala Ile Pro Lys Gly
210 215 220
Val Ala Ile Ile Leu Asn Thr Phe Asp Ala Leu Glu Lys Asp Ser Ile
225 230 235 240
Thr Pro Val Leu Ala Leu Asn Pro Gln Ile Tyr Thr Ile Gly Pro Leu
245 250 255
His Met Met Gln Gln Tyr Val Asp His Asp Glu Arg Leu Lys His Ile
260 265 270
Gly Ser Asn Leu Trp Lys Glu Asp Val Ser Cys Ile Asn Trp Leu Asp
275 280 285
Thr Lys Lys Pro Asn Ser Val Val Tyr Val Asn Phe Gly Ser Ile Thr
290 295 300
Val Met Thr Lys Glu Gln Leu Ile Glu Phe Gly Trp Gly Leu Ala Asn
305 310 315 320
Ser Lys Lys Asp Phe Leu Trp Ile Thr Arg Pro Asp Ile Val Gly Gly
325 330 335
Asn Glu Ala Met Ile Pro Ala Glu Phe Ile Glu Glu Thr Lys Glu Arg
340 345 350
Gly Met Val Thr Ser Trp Cys Ser Gln Glu Glu Val Leu Lys His Pro
355 360 365
Ser Ile Gly Val Phe Leu Thr His Ser Gly Trp Asn Ser Thr Ile Glu
370 375 380
Ser Ile Ser Asn Gly Val Pro Met Ile Cys Trp Pro Phe Phe Ala Glu
385 390 395 400
Gln Gln Thr Asn Cys Arg Tyr Cys Cys Val Glu Trp Glu Ile Gly Leu
405 410 415
Glu Ile Asp Thr Asp Val Lys Arg Glu Glu Val Glu Ala Gln Val Arg
420 425 430
Glu Met Met Asp Gly Ser Lys Gly Lys Met Met Lys Asn Lys Ala Leu
435 440 445
Glu Trp Lys Lys Lys Ala Glu Glu Ala Val Ser Ile Gly Gly Ser Ser
450 455 460
Tyr Leu Asn Phe Glu Lys Leu Val Thr Asp Val Leu Leu Arg Lys
465 470 475
<210> 133
<211> 1374
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1374)
<400> 133
atg gaa aac aag act gaa act acc gtt cgt cgt aga aga aga atc atc 48
Met Glu Asn Lys Thr Glu Thr Thr Val Arg Arg Arg Arg Arg Ile Ile
1 5 10 15
tta ttc cca gtt cca gtt caa ggt cac atc aac cca att ttg caa ttg 96
Leu Phe Pro Val Pro Val Gln Gly His Ile Asn Pro Ile Leu Gln Leu
20 25 30
gct aac gtc tta tac tcc aag ggt ttc tcc atc act att ttc cac act 144
Ala Asn Val Leu Tyr Ser Lys Gly Phe Ser Ile Thr Ile Phe His Thr
35 40 45
aac ttc aac aag cca aag acc tcc aac tac cca cac ttc acc ttc aga 192
Asn Phe Asn Lys Pro Lys Thr Ser Asn Tyr Pro His Phe Thr Phe Arg
50 55 60
ttt att ttg gac aat gac cct caa gat gtc aga atc tcc aac ttg cca 240
Phe Ile Leu Asp Asn Asp Pro Gln Asp Val Arg Ile Ser Asn Leu Pro
65 70 75 80
act cac ggt cca ttg acc gtc atg aga att ttg atc atc aac gaa cac 288
Thr His Gly Pro Leu Thr Val Met Arg Ile Leu Ile Ile Asn Glu His
85 90 95
ggt gct gat gaa ttg caa aga gaa ttg gaa ttg ttg atg ttg gcc tcc 336
Gly Ala Asp Glu Leu Gln Arg Glu Leu Glu Leu Leu Met Leu Ala Ser
100 105 110
gaa gaa gac ggt gaa gtc tcc tgt ttg att acc gac caa atc tgg tac 384
Glu Glu Asp Gly Glu Val Ser Cys Leu Ile Thr Asp Gln Ile Trp Tyr
115 120 125
ttc acc caa tct gtt gct gac tct ttg aac ttg aga aga ttg gtt ttg 432
Phe Thr Gln Ser Val Ala Asp Ser Leu Asn Leu Arg Arg Leu Val Leu
130 135 140
atg act tct tct ttg ttc aac ttc cac gct cat gtt tct ttg cct caa 480
Met Thr Ser Ser Leu Phe Asn Phe His Ala His Val Ser Leu Pro Gln
145 150 155 160
ttc gat gaa ttg ggt tac ttg gac cca gac gac aag act aga tta gaa 528
Phe Asp Glu Leu Gly Tyr Leu Asp Pro Asp Asp Lys Thr Arg Leu Glu
165 170 175
gaa caa gct tct ggt ttc cca atg ttg aag gtc aag gat atc aaa tgt 576
Glu Gln Ala Ser Gly Phe Pro Met Leu Lys Val Lys Asp Ile Lys Cys
180 185 190
ggt ttc tct atg tgg aag caa ggt aag gaa atc ttt gaa aac atc acc 624
Gly Phe Ser Met Trp Lys Gln Gly Lys Glu Ile Phe Glu Asn Ile Thr
195 200 205
aag caa acc aag gct tct tct ggt gtc atc tgg aac tct ttc aag gaa 672
Lys Gln Thr Lys Ala Ser Ser Gly Val Ile Trp Asn Ser Phe Lys Glu
210 215 220
ttg gaa gaa tct gaa ttg gaa acc gtc atc aga gaa atc cca gct cca 720
Leu Glu Glu Ser Glu Leu Glu Thr Val Ile Arg Glu Ile Pro Ala Pro
225 230 235 240
tcc ttc ttg att cca ttg cca aag cat ttg act gct tcc tct tct tct 768
Ser Phe Leu Ile Pro Leu Pro Lys His Leu Thr Ala Ser Ser Ser Ser
245 250 255
cta tta gac cac gac aga act gtt ttc cca tgg tta gac caa caa cca 816
Leu Leu Asp His Asp Arg Thr Val Phe Pro Trp Leu Asp Gln Gln Pro
260 265 270
tct cgt tct gtc ttg tac gtt tct ttc ggt tct gct act gaa gtc gat 864
Ser Arg Ser Val Leu Tyr Val Ser Phe Gly Ser Ala Thr Glu Val Asp
275 280 285
gct aag gac ttc ttg gaa att gct cgt ggt ttg gtc gac tcc aag caa 912
Ala Lys Asp Phe Leu Glu Ile Ala Arg Gly Leu Val Asp Ser Lys Gln
290 295 300
tct ttc tta tgg gtt gtt aga cca ggt ttc gtc aag ggt tcc acc tgg 960
Ser Phe Leu Trp Val Val Arg Pro Gly Phe Val Lys Gly Ser Thr Trp
305 310 315 320
gtt gaa cca ttg cca gat ggt ttc ttg ggt gaa aga ggt aga att gtc 1008
Val Glu Pro Leu Pro Asp Gly Phe Leu Gly Glu Arg Gly Arg Ile Val
325 330 335
aaa tgg gtt cca caa caa gaa gtt ttg gct cac ggt gcc att ggt gct 1056
Lys Trp Val Pro Gln Gln Glu Val Leu Ala His Gly Ala Ile Gly Ala
340 345 350
ttc tgg acc cac tct ggt tgg aac tcc act ttg gaa tcc gtt tgt gaa 1104
Phe Trp Thr His Ser Gly Trp Asn Ser Thr Leu Glu Ser Val Cys Glu
355 360 365
ggt gtt cca atg att ttc tct gcc ttt gcc ttt gac caa cca tta aac 1152
Gly Val Pro Met Ile Phe Ser Ala Phe Ala Phe Asp Gln Pro Leu Asn
370 375 380
gct aga tac atg tcc gat gtc ttg aag gtt ggt gtc tac ttg gaa aac 1200
Ala Arg Tyr Met Ser Asp Val Leu Lys Val Gly Val Tyr Leu Glu Asn
385 390 395 400
ggt tgg gaa aga ggt gaa att gcc aac gcc atc aga aga gtt atg gtc 1248
Gly Trp Glu Arg Gly Glu Ile Ala Asn Ala Ile Arg Arg Val Met Val
405 410 415
gat gaa gaa ggt ggt tac atc aga caa aac gct tcc gtc ttg aag caa 1296
Asp Glu Glu Gly Gly Tyr Ile Arg Gln Asn Ala Ser Val Leu Lys Gln
420 425 430
aag gct gat gtt tcc ttg atg aag ggt ggt tct tct tac gaa tct cta 1344
Lys Ala Asp Val Ser Leu Met Lys Gly Gly Ser Ser Tyr Glu Ser Leu
435 440 445
gaa tct ttg gtt gct tac atc tcc agc tta 1374
Glu Ser Leu Val Ala Tyr Ile Ser Ser Leu
450 455
<210> 134
<211> 458
<212> PRT
<213> Stevia rebaudiana
<400> 134
Met Glu Asn Lys Thr Glu Thr Thr Val Arg Arg Arg Arg Arg Ile Ile
1 5 10 15
Leu Phe Pro Val Pro Val Gln Gly His Ile Asn Pro Ile Leu Gln Leu
20 25 30
Ala Asn Val Leu Tyr Ser Lys Gly Phe Ser Ile Thr Ile Phe His Thr
35 40 45
Asn Phe Asn Lys Pro Lys Thr Ser Asn Tyr Pro His Phe Thr Phe Arg
50 55 60
Phe Ile Leu Asp Asn Asp Pro Gln Asp Val Arg Ile Ser Asn Leu Pro
65 70 75 80
Thr His Gly Pro Leu Thr Val Met Arg Ile Leu Ile Ile Asn Glu His
85 90 95
Gly Ala Asp Glu Leu Gln Arg Glu Leu Glu Leu Leu Met Leu Ala Ser
100 105 110
Glu Glu Asp Gly Glu Val Ser Cys Leu Ile Thr Asp Gln Ile Trp Tyr
115 120 125
Phe Thr Gln Ser Val Ala Asp Ser Leu Asn Leu Arg Arg Leu Val Leu
130 135 140
Met Thr Ser Ser Leu Phe Asn Phe His Ala His Val Ser Leu Pro Gln
145 150 155 160
Phe Asp Glu Leu Gly Tyr Leu Asp Pro Asp Asp Lys Thr Arg Leu Glu
165 170 175
Glu Gln Ala Ser Gly Phe Pro Met Leu Lys Val Lys Asp Ile Lys Cys
180 185 190
Gly Phe Ser Met Trp Lys Gln Gly Lys Glu Ile Phe Glu Asn Ile Thr
195 200 205
Lys Gln Thr Lys Ala Ser Ser Gly Val Ile Trp Asn Ser Phe Lys Glu
210 215 220
Leu Glu Glu Ser Glu Leu Glu Thr Val Ile Arg Glu Ile Pro Ala Pro
225 230 235 240
Ser Phe Leu Ile Pro Leu Pro Lys His Leu Thr Ala Ser Ser Ser Ser
245 250 255
Leu Leu Asp His Asp Arg Thr Val Phe Pro Trp Leu Asp Gln Gln Pro
260 265 270
Ser Arg Ser Val Leu Tyr Val Ser Phe Gly Ser Ala Thr Glu Val Asp
275 280 285
Ala Lys Asp Phe Leu Glu Ile Ala Arg Gly Leu Val Asp Ser Lys Gln
290 295 300
Ser Phe Leu Trp Val Val Arg Pro Gly Phe Val Lys Gly Ser Thr Trp
305 310 315 320
Val Glu Pro Leu Pro Asp Gly Phe Leu Gly Glu Arg Gly Arg Ile Val
325 330 335
Lys Trp Val Pro Gln Gln Glu Val Leu Ala His Gly Ala Ile Gly Ala
340 345 350
Phe Trp Thr His Ser Gly Trp Asn Ser Thr Leu Glu Ser Val Cys Glu
355 360 365
Gly Val Pro Met Ile Phe Ser Ala Phe Ala Phe Asp Gln Pro Leu Asn
370 375 380
Ala Arg Tyr Met Ser Asp Val Leu Lys Val Gly Val Tyr Leu Glu Asn
385 390 395 400
Gly Trp Glu Arg Gly Glu Ile Ala Asn Ala Ile Arg Arg Val Met Val
405 410 415
Asp Glu Glu Gly Gly Tyr Ile Arg Gln Asn Ala Ser Val Leu Lys Gln
420 425 430
Lys Ala Asp Val Ser Leu Met Lys Gly Gly Ser Ser Tyr Glu Ser Leu
435 440 445
Glu Ser Leu Val Ala Tyr Ile Ser Ser Leu
450 455
<210> 135
<211> 1377
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1377)
<400> 135
atg gaa aac aag act gaa act act gtc aga aga aga cgt cgt atc atc 48
Met Glu Asn Lys Thr Glu Thr Thr Val Arg Arg Arg Arg Arg Ile Ile
1 5 10 15
ttg ttc cca gtt cca ttc caa ggt cac atc aac cca atg tta caa tta 96
Leu Phe Pro Val Pro Phe Gln Gly His Ile Asn Pro Met Leu Gln Leu
20 25 30
gct aac gtc ttg tac tcc aag ggt ttc tcc atc acc att ttc cac acc 144
Ala Asn Val Leu Tyr Ser Lys Gly Phe Ser Ile Thr Ile Phe His Thr
35 40 45
aac ttc aac aag cca aag act tct aac tac cca cac ttc act ttc aga 192
Asn Phe Asn Lys Pro Lys Thr Ser Asn Tyr Pro His Phe Thr Phe Arg
50 55 60
ttt atc ttg gac aac gac cct caa gat gtc aga att tcc aac ttg cca 240
Phe Ile Leu Asp Asn Asp Pro Gln Asp Val Arg Ile Ser Asn Leu Pro
65 70 75 80
acc cac ggt cca ttg gct gtc atg aga att ttg atc atc aac gaa cac 288
Thr His Gly Pro Leu Ala Val Met Arg Ile Leu Ile Ile Asn Glu His
85 90 95
ggt gcc gac gaa ttg aga aga gaa ttg gaa ttg ttg atg ttg gct tct 336
Gly Ala Asp Glu Leu Arg Arg Glu Leu Glu Leu Leu Met Leu Ala Ser
100 105 110
gaa gaa gat ggt gaa gtt tcc tgt ttg att gct gac caa atc tgg tac 384
Glu Glu Asp Gly Glu Val Ser Cys Leu Ile Ala Asp Gln Ile Trp Tyr
115 120 125
ttc acc caa tct gtt gct gac tct ttg aac ttg aga aga ttg gtc ttg 432
Phe Thr Gln Ser Val Ala Asp Ser Leu Asn Leu Arg Arg Leu Val Leu
130 135 140
gtc act tct tct ttg ttc aac ttc cac gct cat gtc tct ttg cca caa 480
Val Thr Ser Ser Leu Phe Asn Phe His Ala His Val Ser Leu Pro Gln
145 150 155 160
ttt gac gaa ttg ggt tac ttg gac cca gat gac aag acc aga ttg gaa 528
Phe Asp Glu Leu Gly Tyr Leu Asp Pro Asp Asp Lys Thr Arg Leu Glu
165 170 175
gaa caa gct tct ggt ttc cca atg ttg aag gtt aag gat atc aaa tgt 576
Glu Gln Ala Ser Gly Phe Pro Met Leu Lys Val Lys Asp Ile Lys Cys
180 185 190
tct ttc tcc atg tgg aag aaa tac aag gaa tac ttt gaa aac atc acc 624
Ser Phe Ser Met Trp Lys Lys Tyr Lys Glu Tyr Phe Glu Asn Ile Thr
195 200 205
aag caa acc aag gcc tcc tcc ggt gtt atc tgg aac tct ttc aag gaa 672
Lys Gln Thr Lys Ala Ser Ser Gly Val Ile Trp Asn Ser Phe Lys Glu
210 215 220
tta gaa gaa tct gaa ttg gaa act gtt atc aga gaa att cca gct cca 720
Leu Glu Glu Ser Glu Leu Glu Thr Val Ile Arg Glu Ile Pro Ala Pro
225 230 235 240
tct ttc tta atc cca tta cca aag cat ttg act gct tct tct tcc tct 768
Ser Phe Leu Ile Pro Leu Pro Lys His Leu Thr Ala Ser Ser Ser Ser
245 250 255
ttg ttg gac cac gac aga acc gtt ttc cca tgg ttg gac caa caa cca 816
Leu Leu Asp His Asp Arg Thr Val Phe Pro Trp Leu Asp Gln Gln Pro
260 265 270
tcc aga tct gtc ttg tac gtt tct ttc ggt tct ggt act gaa gtc tta 864
Ser Arg Ser Val Leu Tyr Val Ser Phe Gly Ser Gly Thr Glu Val Leu
275 280 285
gat gaa aag gac ttc ttg gaa att gcc aga ggt tta gtc gac tcc aag 912
Asp Glu Lys Asp Phe Leu Glu Ile Ala Arg Gly Leu Val Asp Ser Lys
290 295 300
caa tct ttc ttg tgg gtt gtt aga cca ggt ttc gtc aag ggt tcc acc 960
Gln Ser Phe Leu Trp Val Val Arg Pro Gly Phe Val Lys Gly Ser Thr
305 310 315 320
tgg gtt gaa cca ttg cca gat ggt ttc ttg ggt gaa aga ggt aga att 1008
Trp Val Glu Pro Leu Pro Asp Gly Phe Leu Gly Glu Arg Gly Arg Ile
325 330 335
gtc aaa tgg gtt cct caa caa gaa gtt ttg gct cac ggt gcc att ggt 1056
Val Lys Trp Val Pro Gln Gln Glu Val Leu Ala His Gly Ala Ile Gly
340 345 350
gct ttc tgg acc cac tct ggt tgg aac tcc act ttg gaa tct gtt tgt 1104
Ala Phe Trp Thr His Ser Gly Trp Asn Ser Thr Leu Glu Ser Val Cys
355 360 365
gaa ggt gtt cca atg att ttc tcc gat ttc ggt cta gac caa cca ttg 1152
Glu Gly Val Pro Met Ile Phe Ser Asp Phe Gly Leu Asp Gln Pro Leu
370 375 380
aac gct cgt tac atg tcc gat gtt ttg aag gtt ggt gtc tac ttg gaa 1200
Asn Ala Arg Tyr Met Ser Asp Val Leu Lys Val Gly Val Tyr Leu Glu
385 390 395 400
aac ggt tgg gaa aga ggt gaa att gct aac gcc atc aga aga gtc atg 1248
Asn Gly Trp Glu Arg Gly Glu Ile Ala Asn Ala Ile Arg Arg Val Met
405 410 415
gtt gac gaa gaa ggt gaa tac atc aga caa aat gct cgt gtc ttg aaa 1296
Val Asp Glu Glu Gly Glu Tyr Ile Arg Gln Asn Ala Arg Val Leu Lys
420 425 430
caa aag gct gat gtt tct ttg atg aag ggt ggt tct tct tac gaa tcc 1344
Gln Lys Ala Asp Val Ser Leu Met Lys Gly Gly Ser Ser Tyr Glu Ser
435 440 445
ttg gaa tct tta gtc tct tac atc tcc tcc tta 1377
Leu Glu Ser Leu Val Ser Tyr Ile Ser Ser Leu
450 455
<210> 136
<211> 459
<212> PRT
<213> Stevia rebaudiana
<400> 136
Met Glu Asn Lys Thr Glu Thr Thr Val Arg Arg Arg Arg Arg Ile Ile
1 5 10 15
Leu Phe Pro Val Pro Phe Gln Gly His Ile Asn Pro Met Leu Gln Leu
20 25 30
Ala Asn Val Leu Tyr Ser Lys Gly Phe Ser Ile Thr Ile Phe His Thr
35 40 45
Asn Phe Asn Lys Pro Lys Thr Ser Asn Tyr Pro His Phe Thr Phe Arg
50 55 60
Phe Ile Leu Asp Asn Asp Pro Gln Asp Val Arg Ile Ser Asn Leu Pro
65 70 75 80
Thr His Gly Pro Leu Ala Val Met Arg Ile Leu Ile Ile Asn Glu His
85 90 95
Gly Ala Asp Glu Leu Arg Arg Glu Leu Glu Leu Leu Met Leu Ala Ser
100 105 110
Glu Glu Asp Gly Glu Val Ser Cys Leu Ile Ala Asp Gln Ile Trp Tyr
115 120 125
Phe Thr Gln Ser Val Ala Asp Ser Leu Asn Leu Arg Arg Leu Val Leu
130 135 140
Val Thr Ser Ser Leu Phe Asn Phe His Ala His Val Ser Leu Pro Gln
145 150 155 160
Phe Asp Glu Leu Gly Tyr Leu Asp Pro Asp Asp Lys Thr Arg Leu Glu
165 170 175
Glu Gln Ala Ser Gly Phe Pro Met Leu Lys Val Lys Asp Ile Lys Cys
180 185 190
Ser Phe Ser Met Trp Lys Lys Tyr Lys Glu Tyr Phe Glu Asn Ile Thr
195 200 205
Lys Gln Thr Lys Ala Ser Ser Gly Val Ile Trp Asn Ser Phe Lys Glu
210 215 220
Leu Glu Glu Ser Glu Leu Glu Thr Val Ile Arg Glu Ile Pro Ala Pro
225 230 235 240
Ser Phe Leu Ile Pro Leu Pro Lys His Leu Thr Ala Ser Ser Ser Ser
245 250 255
Leu Leu Asp His Asp Arg Thr Val Phe Pro Trp Leu Asp Gln Gln Pro
260 265 270
Ser Arg Ser Val Leu Tyr Val Ser Phe Gly Ser Gly Thr Glu Val Leu
275 280 285
Asp Glu Lys Asp Phe Leu Glu Ile Ala Arg Gly Leu Val Asp Ser Lys
290 295 300
Gln Ser Phe Leu Trp Val Val Arg Pro Gly Phe Val Lys Gly Ser Thr
305 310 315 320
Trp Val Glu Pro Leu Pro Asp Gly Phe Leu Gly Glu Arg Gly Arg Ile
325 330 335
Val Lys Trp Val Pro Gln Gln Glu Val Leu Ala His Gly Ala Ile Gly
340 345 350
Ala Phe Trp Thr His Ser Gly Trp Asn Ser Thr Leu Glu Ser Val Cys
355 360 365
Glu Gly Val Pro Met Ile Phe Ser Asp Phe Gly Leu Asp Gln Pro Leu
370 375 380
Asn Ala Arg Tyr Met Ser Asp Val Leu Lys Val Gly Val Tyr Leu Glu
385 390 395 400
Asn Gly Trp Glu Arg Gly Glu Ile Ala Asn Ala Ile Arg Arg Val Met
405 410 415
Val Asp Glu Glu Gly Glu Tyr Ile Arg Gln Asn Ala Arg Val Leu Lys
420 425 430
Gln Lys Ala Asp Val Ser Leu Met Lys Gly Gly Ser Ser Tyr Glu Ser
435 440 445
Leu Glu Ser Leu Val Ser Tyr Ile Ser Ser Leu
450 455
<210> 137
<211> 1419
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1419)
<400> 137
atg gct act tct gac tcc att gtc gat gac aga aag caa ttg cac gtt 48
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
gcc act ttc cca tgg ttg gct ttc ggt cac att ttg cca ttc ttg caa 96
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Phe Leu Gln
20 25 30
ttg tcc aaa ttg att gct gaa aag ggt cac aag gtt tct ttc ttg tcc 144
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
act acc aga aac atc caa aga tta tct tct cac atc tct cca ttg atc 192
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
aac gtt gtc caa ttg act ttg cca aga gtc caa gaa ttg cca gaa gat 240
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
gct gaa gct acc acc gat gtc cac cca gaa gat atc caa tac ttg aag 288
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Gln Tyr Leu Lys
85 90 95
aag gct gtc gat ggt ttg caa cca gaa gtc acc aga ttc ttg gaa caa 336
Lys Ala Val Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
cac tct cca gac tgg atc atc tac gac ttc acc cac tac tgg tta cca 384
His Ser Pro Asp Trp Ile Ile Tyr Asp Phe Thr His Tyr Trp Leu Pro
115 120 125
tcc att gct gct tct ttg ggt atc tcc aga gct tac ttc tgt gtc atc 432
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala Tyr Phe Cys Val Ile
130 135 140
act cca tgg acc att gct tac ttg gct cca tct tct gat gcc atg atc 480
Thr Pro Trp Thr Ile Ala Tyr Leu Ala Pro Ser Ser Asp Ala Met Ile
145 150 155 160
aac gac tct gac ggt aga acc act gtt gaa gat ttg acc act cca cct 528
Asn Asp Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
aaa tgg ttc cca ttc cca acc aag gtt tgt tgg aga aag cat gac ttg 576
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
gcc aga atg gaa cct tac gaa gct cca ggt att tct gat ggt tac aga 624
Ala Arg Met Glu Pro Tyr Glu Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
atg ggt atg gtt ttc aag ggt tct gac tgt cta tta ttc aaa tgt tac 672
Met Gly Met Val Phe Lys Gly Ser Asp Cys Leu Leu Phe Lys Cys Tyr
210 215 220
cat gaa ttc ggt act caa tgg ttg cca ttg ttg gaa act ttg cac caa 720
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
gtt cca gtt gtt cca gtc ggt ttg ttg cca cca gaa att cca ggt gat 768
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp
245 250 255
gaa aag gat gaa acc tgg gtt tcc atc aag aaa tgg ttg gac ggt aag 816
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
caa aag ggt tcc gtt gtc tac gtt gct ttg ggt tct gaa gct ttg gtt 864
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Ala Leu Val
275 280 285
tct caa act gaa gtt gtc gaa ttg gct ttg ggt cta gaa ttg tcc ggt 912
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
tta cca ttt gtc tgg gct tac aga aag cca aag ggt cca gcc aaa tct 960
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
gat tcc gtc gaa tta cca gac ggt ttc gtc gaa aga acc cgt gac cgt 1008
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
ggt tta gtc tgg act tcc tgg gct cca caa ttg aga atc ttg tcc cac 1056
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
gaa tcc gtt tgt ggt ttc ttg act cac tgt ggt tct ggt tct att gtc 1104
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
gaa ggt ttg atg ttc ggt cac cca ttg att atg ttg cct ttg ttt ggt 1152
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Leu Phe Gly
370 375 380
gac caa cca tta aac gct aga tta ttg gaa gac aag caa gtc ggt atc 1200
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
gaa att cca aga aac gaa gaa gac ggt tgt ttg acc aag gaa tcc gtt 1248
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
gct cgt tct ttg aga tcc gtt gtt gtc gaa aac gaa ggt gaa atc tac 1296
Ala Arg Ser Leu Arg Ser Val Val Val Glu Asn Glu Gly Glu Ile Tyr
420 425 430
aag gcc aac gcc aga gaa ttg tct aag atc tac aac gac acc aag gtt 1344
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
gaa aag gaa tac gtt tct caa ttt gtc gac tac ttg gaa aag aac gct 1392
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala
450 455 460
aga gct gtt gcc atc gac cac gaa agc 1419
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 138
<211> 473
<212> PRT
<213> Stevia rebaudiana
<400> 138
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Phe Leu Gln
20 25 30
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Gln Tyr Leu Lys
85 90 95
Lys Ala Val Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
His Ser Pro Asp Trp Ile Ile Tyr Asp Phe Thr His Tyr Trp Leu Pro
115 120 125
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala Tyr Phe Cys Val Ile
130 135 140
Thr Pro Trp Thr Ile Ala Tyr Leu Ala Pro Ser Ser Asp Ala Met Ile
145 150 155 160
Asn Asp Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
Ala Arg Met Glu Pro Tyr Glu Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
Met Gly Met Val Phe Lys Gly Ser Asp Cys Leu Leu Phe Lys Cys Tyr
210 215 220
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Ile Pro Gly Asp
245 250 255
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Ala Leu Val
275 280 285
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Leu Phe Gly
370 375 380
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
Ala Arg Ser Leu Arg Ser Val Val Val Glu Asn Glu Gly Glu Ile Tyr
420 425 430
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Ala
450 455 460
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 139
<211> 1419
<212> DNA
<213> Stevia rebaudiana
<220>
<221> CDS
<222> (1)..(1419)
<400> 139
atg gct acc tct gac tcc att gtc gat gac cgt aaa caa ttg cac gtt 48
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
gct act ttc cca tgg ttg gct ttc ggt cac atc ttg cct tac ttg caa 96
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Tyr Leu Gln
20 25 30
ttg tcc aag ttg att gct gaa aag ggt cac aag gtt tct ttc ttg tcc 144
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
act act aga aac atc caa aga tta tct tct cac atc tct cca ttg atc 192
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
aac gtt gtc caa ttg act ttg cca aga gtt caa gaa tta cca gaa gat 240
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
gct gaa gct acc acc gat gtc cac cca gaa gac att cca tac ttg aag 288
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Pro Tyr Leu Lys
85 90 95
aag gcc tct gac ggt ttg caa cca gaa gtc act aga ttc ttg gaa caa 336
Lys Ala Ser Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
cac tct cca gac tgg atc atc tac gac tac act cac tac tgg ttg cca 384
His Ser Pro Asp Trp Ile Ile Tyr Asp Tyr Thr His Tyr Trp Leu Pro
115 120 125
tcc att gct gct tct ttg ggt atc tcc aga gcc cat ttc tcc gtc acc 432
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala His Phe Ser Val Thr
130 135 140
act cca tgg gct att gct tac atg ggt cca tct gct gat gct atg atc 480
Thr Pro Trp Ala Ile Ala Tyr Met Gly Pro Ser Ala Asp Ala Met Ile
145 150 155 160
aac ggt tct gac ggt aga acc acc gtc gaa gat ttg act acc cca cct 528
Asn Gly Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
aaa tgg ttc cca ttc cca acc aag gtt tgt tgg aga aag cat gat ttg 576
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
gcc aga tta gtc cca tac aag gct cca ggt att tct gat ggt tac aga 624
Ala Arg Leu Val Pro Tyr Lys Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
atg ggt ttg gtc ttg aaa ggt tct gac tgt ttg ttg tcc aaa tgt tac 672
Met Gly Leu Val Leu Lys Gly Ser Asp Cys Leu Leu Ser Lys Cys Tyr
210 215 220
cac gaa ttc ggt act caa tgg tta cca ttg ttg gaa act ttg cac caa 720
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
gtt cca gtt gtt cca gtc ggt ttg ttg cct cca gaa gtt cca ggt gac 768
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Val Pro Gly Asp
245 250 255
gaa aag gat gaa acc tgg gtt tcc atc aag aaa tgg ttg gat ggt aag 816
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
caa aag ggt tcc gtt gtc tac gtt gct ttg ggt tct gaa gtc ttg gtc 864
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Val Leu Val
275 280 285
agc caa act gaa gtt gtc gaa ttg gct tta ggt ttg gaa ttg tcc ggt 912
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
cta cca ttt gtc tgg gct tac aga aag cca aag ggt cca gct aag tct 960
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
gac tcc gtt gaa ttg cca gat ggt ttc gtc gaa aga acc aga gac aga 1008
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
ggt ttg gtt tgg act tct tgg gct cca caa ttg aga atc tta tct cac 1056
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
gaa tct gtt tgt ggt ttc ttg acc cac tgt ggt tct ggt tcc atc gtc 1104
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
gaa ggt ttg atg ttc ggt cac cca tta atc atg ttg cca att ttc ggt 1152
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Gly
370 375 380
gac caa cca ttg aac gcc aga tta ttg gaa gac aag caa gtc ggt att 1200
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
gaa att cca aga aac gaa gaa gat ggt tgt ttg acc aag gaa tcc gtt 1248
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
gcc aga tct cta cgt tct gtt gtt gtc gaa aag gaa ggt gaa atc tac 1296
Ala Arg Ser Leu Arg Ser Val Val Val Glu Lys Glu Gly Glu Ile Tyr
420 425 430
aag gcc aac gct cgt gaa tta tcc aag atc tac aac gac acc aag gtt 1344
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
gaa aag gaa tac gtt tct caa ttt gtt gac tac ttg gaa aag aac act 1392
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Thr
450 455 460
aga gct gtt gcc att gac cat gag tct 1419
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 140
<211> 473
<212> PRT
<213> Stevia rebaudiana
<400> 140
Met Ala Thr Ser Asp Ser Ile Val Asp Asp Arg Lys Gln Leu His Val
1 5 10 15
Ala Thr Phe Pro Trp Leu Ala Phe Gly His Ile Leu Pro Tyr Leu Gln
20 25 30
Leu Ser Lys Leu Ile Ala Glu Lys Gly His Lys Val Ser Phe Leu Ser
35 40 45
Thr Thr Arg Asn Ile Gln Arg Leu Ser Ser His Ile Ser Pro Leu Ile
50 55 60
Asn Val Val Gln Leu Thr Leu Pro Arg Val Gln Glu Leu Pro Glu Asp
65 70 75 80
Ala Glu Ala Thr Thr Asp Val His Pro Glu Asp Ile Pro Tyr Leu Lys
85 90 95
Lys Ala Ser Asp Gly Leu Gln Pro Glu Val Thr Arg Phe Leu Glu Gln
100 105 110
His Ser Pro Asp Trp Ile Ile Tyr Asp Tyr Thr His Tyr Trp Leu Pro
115 120 125
Ser Ile Ala Ala Ser Leu Gly Ile Ser Arg Ala His Phe Ser Val Thr
130 135 140
Thr Pro Trp Ala Ile Ala Tyr Met Gly Pro Ser Ala Asp Ala Met Ile
145 150 155 160
Asn Gly Ser Asp Gly Arg Thr Thr Val Glu Asp Leu Thr Thr Pro Pro
165 170 175
Lys Trp Phe Pro Phe Pro Thr Lys Val Cys Trp Arg Lys His Asp Leu
180 185 190
Ala Arg Leu Val Pro Tyr Lys Ala Pro Gly Ile Ser Asp Gly Tyr Arg
195 200 205
Met Gly Leu Val Leu Lys Gly Ser Asp Cys Leu Leu Ser Lys Cys Tyr
210 215 220
His Glu Phe Gly Thr Gln Trp Leu Pro Leu Leu Glu Thr Leu His Gln
225 230 235 240
Val Pro Val Val Pro Val Gly Leu Leu Pro Pro Glu Val Pro Gly Asp
245 250 255
Glu Lys Asp Glu Thr Trp Val Ser Ile Lys Lys Trp Leu Asp Gly Lys
260 265 270
Gln Lys Gly Ser Val Val Tyr Val Ala Leu Gly Ser Glu Val Leu Val
275 280 285
Ser Gln Thr Glu Val Val Glu Leu Ala Leu Gly Leu Glu Leu Ser Gly
290 295 300
Leu Pro Phe Val Trp Ala Tyr Arg Lys Pro Lys Gly Pro Ala Lys Ser
305 310 315 320
Asp Ser Val Glu Leu Pro Asp Gly Phe Val Glu Arg Thr Arg Asp Arg
325 330 335
Gly Leu Val Trp Thr Ser Trp Ala Pro Gln Leu Arg Ile Leu Ser His
340 345 350
Glu Ser Val Cys Gly Phe Leu Thr His Cys Gly Ser Gly Ser Ile Val
355 360 365
Glu Gly Leu Met Phe Gly His Pro Leu Ile Met Leu Pro Ile Phe Gly
370 375 380
Asp Gln Pro Leu Asn Ala Arg Leu Leu Glu Asp Lys Gln Val Gly Ile
385 390 395 400
Glu Ile Pro Arg Asn Glu Glu Asp Gly Cys Leu Thr Lys Glu Ser Val
405 410 415
Ala Arg Ser Leu Arg Ser Val Val Val Glu Lys Glu Gly Glu Ile Tyr
420 425 430
Lys Ala Asn Ala Arg Glu Leu Ser Lys Ile Tyr Asn Asp Thr Lys Val
435 440 445
Glu Lys Glu Tyr Val Ser Gln Phe Val Asp Tyr Leu Glu Lys Asn Thr
450 455 460
Arg Ala Val Ala Ile Asp His Glu Ser
465 470
<210> 141
<211> 2361
<212> DNA
<213> Stevia rebaudiana
<400> 141
atgaaaaccg gtttcatctc tcccgccact gttttccacc accgaatctc ccccgccacc 60
accttccgac accacctctc ccctgccacc accaactcca ctggtattgt tgctctgcga 120
gacatcaact tccgatgcaa ggctgtttcc aaggagtact ccgatctgct ccagaaggac 180
gaggcctctt tcaccaagtg ggacgacgac aaggtcaagg accacctcga caccaacaag 240
aacctctacc ccaacgacga gatcaaggag tttgtcgagt ccgtcaaggc catgttcggc 300
tccatgaacg acggcgagat taatgtctct gcttacgaca ccgcctgggt tgctctggtc 360
caggatgtcg acggttccgg ctctcctcag ttcccttcct ctctcgagtg gatcgccaac 420
aaccagctgt ccgacggttc ttggggtgac cacctgctct tctctgctca cgaccgaatc 480
atcaacaccc tggcctgtgt cattgctctg acctcttgga acgtccaccc ctccaagtgc 540
gagaagggtc tgaacttcct ccgagagaac atctgcaagc tcgaggacga gaacgccgag 600
cacatgccca ttggcttcga ggtcaccttc ccctctctga ttgacattgc caagaagctc 660
aacattgagg tccccgagga cacccccgct ctcaaggaga tctacgctcg acgagacatc 720
aagctcacca agatccccat ggaggttctc cacaaggtcc ccaccactct cctccactct 780
ctcgagggta tgcccgatct cgagtgggag aagctgctca agctgcagtg caaggacggc 840
tctttcctct tctccccctc ttccactgcc ttcgccctca tgcagaccaa ggacgagaag 900
tgtctccagt acctcaccaa cattgtcacc aagttcaacg gtggtgtccc caacgtctac 960
cccgttgacc tctttgagca catctgggtt gttgaccgac tccagcgact cggtatcgcc 1020
cgatacttca agtccgagat caaggactgt gtcgagtaca tcaacaagta ctggaccaag 1080
aacggtatct gctgggcccg aaacacccac gtccaggaca ttgacgacac cgccatgggc 1140
ttccgagttc tgcgagccca cggctacgat gtcacccccg atgtctttcg acagtttgag 1200
aaggacggca agtttgtctg tttcgccggt cagtccaccc aggccgtcac cggtatgttc 1260
aacgtctacc gagcttctca gatgctcttc cccggtgagc gaatcctcga ggacgccaag 1320
aagttctcct acaactacct caaggagaag cagtccacca acgagctgct cgacaagtgg 1380
atcattgcca aggatctgcc cggtgaggtt ggctacgccc tcgacatccc ctggtacgcc 1440
tctctgcccc gactggagac tcgatactac ctcgagcagt acggtggtga ggacgatgtc 1500
tggatcggta agaccctgta ccgaatgggc tacgtttcca acaacaccta cctcgagatg 1560
gccaagctcg actacaacaa ctacgttgcc gtcctccagc tcgagtggta caccatccag 1620
cagtggtacg tcgacattgg tatcgagaag ttcgagtccg acaacatcaa gtccgtcctt 1680
gtctcctact acctcgctgc tgcctccatc ttcgagcccg agcgatccaa ggagcgaatt 1740
gcctgggcca agaccaccat cctcgtcgac aagatcacct ccatcttcga ctcctcccag 1800
tcctccaagg aagatatcac cgccttcatt gacaagttcc gaaacaagtc ctcctccaag 1860
aagcactcca tcaacggcga gccctggcac gaggtcatgg ttgctctcaa gaaaactctc 1920
cacggctttg ccctcgacgc tctgatgacc cactctcagg acatccaccc ccagctccac 1980
caggcctggg agatgtggct caccaagctc caggacggtg ttgatgtcac tgctgagctc 2040
atggtccaga tgatcaacat gaccgccggc cgatgggttt ccaaggagct cctcacccac 2100
ccccagtacc agcgactctc cactgtcacc aactctgtct gccacgacat caccaagctc 2160
cacaacttca aggagaactc caccaccgtc gactccaagg tccaggagct ggtccagctc 2220
gttttctccg acacccccga tgatctcgac caggacatga agcagacctt cctgactgtc 2280
atgaaaactt tctactacaa ggcctggtgc gaccccaaca ccatcaacga ccacatctcc 2340
aaggtctttg agattgtgat t 2361
<210> 142
<211> 2229
<212> DNA
<213> Stevia rebaudiana
<400> 142
atgtgcaagg ctgtttccaa ggagtactcc gatctgctcc agaaggacga ggcctctttc 60
accaagtggg acgacgacaa ggtcaaggac cacctcgaca ccaacaagaa cctctacccc 120
aacgacgaga tcaaggagtt tgtcgagtcc gtcaaggcca tgttcggctc catgaacgac 180
ggcgagatta atgtctctgc ttacgacacc gcctgggttg ctctggtcca ggatgtcgac 240
ggttccggct ctcctcagtt cccttcctct ctcgagtgga tcgccaacaa ccagctgtcc 300
gacggttctt ggggtgacca cctgctcttc tctgctcacg accgaatcat caacaccctg 360
gcctgtgtca ttgctctgac ctcttggaac gtccacccct ccaagtgcga gaagggtctg 420
aacttcctcc gagagaacat ctgcaagctc gaggacgaga acgccgagca catgcccatt 480
ggcttcgagg tcaccttccc ctctctgatt gacattgcca agaagctcaa cattgaggtc 540
cccgaggaca cccccgctct caaggagatc tacgctcgac gagacatcaa gctcaccaag 600
atccccatgg aggttctcca caaggtcccc accactctcc tccactctct cgagggtatg 660
cccgatctcg agtgggagaa gctgctcaag ctgcagtgca aggacggctc tttcctcttc 720
tccccctctt ccactgcctt cgccctcatg cagaccaagg acgagaagtg tctccagtac 780
ctcaccaaca ttgtcaccaa gttcaacggt ggtgtcccca acgtctaccc cgttgacctc 840
tttgagcaca tctgggttgt tgaccgactc cagcgactcg gtatcgcccg atacttcaag 900
tccgagatca aggactgtgt cgagtacatc aacaagtact ggaccaagaa cggtatctgc 960
tgggcccgaa acacccacgt ccaggacatt gacgacaccg ccatgggctt ccgagttctg 1020
cgagcccacg gctacgatgt cacccccgat gtctttcgac agtttgagaa ggacggcaag 1080
tttgtctgtt tcgccggtca gtccacccag gccgtcaccg gtatgttcaa cgtctaccga 1140
gcttctcaga tgctcttccc cggtgagcga atcctcgagg acgccaagaa gttctcctac 1200
aactacctca aggagaagca gtccaccaac gagctgctcg acaagtggat cattgccaag 1260
gatctgcccg gtgaggttgg ctacgccctc gacatcccct ggtacgcctc tctgccccga 1320
ctggagactc gatactacct cgagcagtac ggtggtgagg acgatgtctg gatcggtaag 1380
accctgtacc gaatgggcta cgtttccaac aacacctacc tcgagatggc caagctcgac 1440
tacaacaact acgttgccgt cctccagctc gagtggtaca ccatccagca gtggtacgtc 1500
gacattggta tcgagaagtt cgagtccgac aacatcaagt ccgtccttgt ctcctactac 1560
ctcgctgctg cctccatctt cgagcccgag cgatccaagg agcgaattgc ctgggccaag 1620
accaccatcc tcgtcgacaa gatcacctcc atcttcgact cctcccagtc ctccaaggaa 1680
gatatcaccg ccttcattga caagttccga aacaagtcct cctccaagaa gcactccatc 1740
aacggcgagc cctggcacga ggtcatggtt gctctcaaga aaactctcca cggctttgcc 1800
ctcgacgctc tgatgaccca ctctcaggac atccaccccc agctccacca ggcctgggag 1860
atgtggctca ccaagctcca ggacggtgtt gatgtcactg ctgagctcat ggtccagatg 1920
atcaacatga ccgccggccg atgggtttcc aaggagctcc tcacccaccc ccagtaccag 1980
cgactctcca ctgtcaccaa ctctgtctgc cacgacatca ccaagctcca caacttcaag 2040
gagaactcca ccaccgtcga ctccaaggtc caggagctgg tccagctcgt tttctccgac 2100
acccccgatg atctcgacca ggacatgaag cagaccttcc tgactgtcat gaaaactttc 2160
tactacaagg cctggtgcga ccccaacacc atcaacgacc acatctccaa ggtctttgag 2220
attgtgatt 2229
<210> 143
<211> 2352
<212> DNA
<213> Stevia rebaudiana
<400> 143
atgaacctgt ctctctgtat cgcctctcct ctcctcacca agtccaaccg acccgctgct 60
ctgtctgcca tccacaccgc ctccacctcc cacggcggcc agaccaaccc caccaacctc 120
atcattgaca ccaccaagga gcgaatccag aagcagttca agaacgtcga gatctccgtt 180
tcctcctacg acaccgcctg ggtcgccatg gtcccctctc ccaactcccc caagtctccc 240
tgcttccccg agtgtctcaa ctggctcatc aacaaccagc tcaacgacgg ctcttggggt 300
ctggtcaacc acacccacaa ccacaaccac cccctcctca aggactctct ctcttccact 360
ctcgcctgca ttgttgctct caagcgatgg aacgttggcg aggaccagat caacaagggt 420
ctgtctttca ttgagtccaa cctcgcctcc gccaccgaga agtcccagcc ctcccccatt 480
ggctttgata tcatcttccc cggtctgctc gagtacgcca agaacctcga tatcaacctg 540
ctctccaagc agaccgactt ctctctcatg ctgcacaagc gagagctcga gcagaagcga 600
tgccactcca acgagatgga cggctacctg gcctacattt ccgagggtct gggtaacctc 660
tacgactgga acatggtcaa gaagtaccag atgaagaacg gttccgtttt caactccccc 720
tctgccaccg ctgctgcctt catcaaccac cagaaccccg gctgtctcaa ctacctcaac 780
tctctgctcg acaagtttgg taacgccgtc cccactgtct acccccacga tctcttcatc 840
cgactctcca tggtcgacac cattgagcga ctcggtattt cccaccactt ccgagtcgag 900
atcaagaacg ttctcgatga gacttaccga tgctgggttg agcgagatga gcagatcttc 960
atggacgttg tcacctgtgc tctggccttc cgactcctcc gaatcaacgg ttacgaggtt 1020
tcccccgacc ccctcgccga gatcaccaac gagctggctc tcaaggacga gtacgccgcc 1080
ctcgagactt accacgcttc tcacattctg taccaagagg atctgtcctc cggcaagcag 1140
attctcaagt ccgccgactt cctcaaggag atcatctcca ctgactccaa ccgactctcc 1200
aagctcatcc acaaggaagt cgagaacgct ctcaagttcc ccatcaacac cggtctggag 1260
cgaatcaaca cccgacgaaa catccagctc tacaacgtcg acaacacccg aattctcaag 1320
accacctacc actcttccaa catctccaac accgactacc tgcgactcgc cgtcgaggac 1380
ttctacacct gccagtccat ctaccgagag gagctcaagg gtctggagcg atgggttgtc 1440
gagaacaagc tcgaccagct caagtttgcc cgacaaaaga ctgcctactg ctacttctcc 1500
gttgctgcca ccctctcttc tcccgagctc tccgacgccc gaatctcttg ggccaagaac 1560
ggtatcctga ccactgttgt cgacgacttc tttgacattg gtggcaccat tgacgagctg 1620
accaacctca tccagtgcgt cgagaagtgg aacgtcgacg ttgacaagga ctgttgttcc 1680
gagcacgtcc gaatcctctt cctggctctc aaggacgcca tctgctggat cggtgacgag 1740
gccttcaagt ggcaggctcg agatgtcact tcccacgtca tccagacctg gctcgagctc 1800
atgaactcca tgctgcgaga ggccatctgg acccgagatg cctacgtccc caccctcaac 1860
gagtacatgg agaacgccta cgtcagcttt gctctcggtc ccattgtcaa gcccgccatc 1920
tactttgtcg gtcccaagct gtccgaggag attgtcgagt cctccgagta ccacaacctc 1980
ttcaagctca tgtccaccca gggccgactc ctcaacgata tccactcctt caagcgagag 2040
ttcaaggaag gtaagctcaa cgccgttgct ctgcacctgt ccaacggtga gtccggcaag 2100
gtcgaggaag aggtcgtcga ggagatgatg atgatgatca agaacaagcg aaaggagctc 2160
atgaagctca tcttcgagga gaacggctcc attgtccccc gagcctgcaa ggacgccttc 2220
tggaacatgt gccacgtcct caacttcttc tacgccaacg acgacggttt caccggcaac 2280
accattctcg acaccgtcaa ggacatcatc tacaaccctc tggttctggt caacgagaac 2340
gaggagcaga gg 2352
<210> 144
<211> 2271
<212> DNA
<213> Stevia rebaudiana
<400> 144
atgacctccc acggcggcca gaccaacccc accaacctca tcattgacac caccaaggag 60
cgaatccaga agcagttcaa gaacgtcgag atctccgttt cctcctacga caccgcctgg 120
gtcgccatgg tcccctctcc caactccccc aagtctccct gcttccccga gtgtctcaac 180
tggctcatca acaaccagct caacgacggc tcttggggtc tggtcaacca cacccacaac 240
cacaaccacc ccctcctcaa ggactctctc tcttccactc tcgcctgcat tgttgctctc 300
aagcgatgga acgttggcga ggaccagatc aacaagggtc tgtctttcat tgagtccaac 360
ctcgcctccg ccaccgagaa gtcccagccc tcccccattg gctttgatat catcttcccc 420
ggtctgctcg agtacgccaa gaacctcgat atcaacctgc tctccaagca gaccgacttc 480
tctctcatgc tgcacaagcg agagctcgag cagaagcgat gccactccaa cgagatggac 540
ggctacctgg cctacatttc cgagggtctg ggtaacctct acgactggaa catggtcaag 600
aagtaccaga tgaagaacgg ttccgttttc aactccccct ctgccaccgc tgctgccttc 660
atcaaccacc agaaccccgg ctgtctcaac tacctcaact ctctgctcga caagtttggt 720
aacgccgtcc ccactgtcta cccccacgat ctcttcatcc gactctccat ggtcgacacc 780
attgagcgac tcggtatttc ccaccacttc cgagtcgaga tcaagaacgt tctcgatgag 840
acttaccgat gctgggttga gcgagatgag cagatcttca tggacgttgt cacctgtgct 900
ctggccttcc gactcctccg aatcaacggt tacgaggttt cccccgaccc cctcgccgag 960
atcaccaacg agctggctct caaggacgag tacgccgccc tcgagactta ccacgcttct 1020
cacattctgt accaagagga tctgtcctcc ggcaagcaga ttctcaagtc cgccgacttc 1080
ctcaaggaga tcatctccac tgactccaac cgactctcca agctcatcca caaggaagtc 1140
gagaacgctc tcaagttccc catcaacacc ggtctggagc gaatcaacac ccgacgaaac 1200
atccagctct acaacgtcga caacacccga attctcaaga ccacctacca ctcttccaac 1260
atctccaaca ccgactacct gcgactcgcc gtcgaggact tctacacctg ccagtccatc 1320
taccgagagg agctcaaggg tctggagcga tgggttgtcg agaacaagct cgaccagctc 1380
aagtttgccc gacaaaagac tgcctactgc tacttctccg ttgctgccac cctctcttct 1440
cccgagctct ccgacgcccg aatctcttgg gccaagaacg gtatcctgac cactgttgtc 1500
gacgacttct ttgacattgg tggcaccatt gacgagctga ccaacctcat ccagtgcgtc 1560
gagaagtgga acgtcgacgt tgacaaggac tgttgttccg agcacgtccg aatcctcttc 1620
ctggctctca aggacgccat ctgctggatc ggtgacgagg ccttcaagtg gcaggctcga 1680
gatgtcactt cccacgtcat ccagacctgg ctcgagctca tgaactccat gctgcgagag 1740
gccatctgga cccgagatgc ctacgtcccc accctcaacg agtacatgga gaacgcctac 1800
gtcagctttg ctctcggtcc cattgtcaag cccgccatct actttgtcgg tcccaagctg 1860
tccgaggaga ttgtcgagtc ctccgagtac cacaacctct tcaagctcat gtccacccag 1920
ggccgactcc tcaacgatat ccactccttc aagcgagagt tcaaggaagg taagctcaac 1980
gccgttgctc tgcacctgtc caacggtgag tccggcaagg tcgaggaaga ggtcgtcgag 2040
gagatgatga tgatgatcaa gaacaagcga aaggagctca tgaagctcat cttcgaggag 2100
aacggctcca ttgtcccccg agcctgcaag gacgccttct ggaacatgtg ccacgtcctc 2160
aacttcttct acgccaacga cgacggtttc accggcaaca ccattctcga caccgtcaag 2220
gacatcatct acaaccctct ggttctggtc aacgagaacg aggagcagag g 2271
<210> 145
<211> 1539
<212> DNA
<213> Stevia rebaudiana
<400> 145
atggacgccg tcactggtct gctcaccgtc cccgccactg ccatcaccat cggtggtact 60
gccgttgctc tggccgttgc tctcatcttc tggtacctca agtcctacac ctctgcccga 120
cgatcccagt ccaaccacct cccccgagtc cccgaggtcc ccggtgtccc cctgctcggt 180
aacctgctcc agctcaagga gaagaagccc tacatgacct tcacccgatg ggctgccacc 240
tacggcccca tctactccat caagactggt gccacctcca tggttgttgt ctcttccaac 300
gagattgcta aagaggctct cgtcacccga ttccagtcca tctccacccg aaacctctcc 360
aaggctctca aggttctgac cgccgacaag accatggttg ccatgtccga ctacgacgac 420
taccacaaga ccgtcaagcg acacattctc accgccgttc tcggccccaa cgcccagaag 480
aagcaccgaa tccaccgaga catcatgatg gacaacatct ccacccagct ccacgagttt 540
gtcaagaaca accccgagca ggaagaggtt gatctccgaa agatcttcca gtccgagctc 600
ttcggtctgg ccatgcgaca ggctctgggc aaggatgtcg agtctctcta cgtcgaggat 660
ctcaagatca ccatgaaccg agatgagatc ttccaggttc tggttgtcga ccccatgatg 720
ggtgccattg atgtcgactg gcgagacttc ttcccctacc tcaagtgggt ccccaacaag 780
aagtttgaga acaccatcca gcagatgtac atccgacgag aggctgtcat gaagtctctc 840
atcaaggaga acaagaagcg aattgcctcc ggtgagaagc tcaactccta cattgactac 900
ctgctctccg aggcccagac cctcaccgac cagcagctgc tcatgtctct gtgggagccc 960
atcattgagt cctccgacac caccatggtc accaccgagt gggccatgta cgagctcgcc 1020
aagaacccca agctccagga ccgactctac cgagacatca agtctgtctg tggctccgag 1080
aagatcaccg aggagcacct gtcccagctc ccctacatca ctgccatctt ccacgagact 1140
ctccgacgac actctcccgt ccccatcatc cctctccgac acgtccacga ggacaccgtt 1200
ctgggtggct accacgtccc cgccggtacc gagctcgctg tcaacatcta cggctgcaac 1260
atggacaaga acgtctggga gaaccccgag gagtggaacc ccgagcgatt catgaaggag 1320
aacgagacta ttgacttcca gaaaaccatg gcctttggtg gtggcaagcg agtctgcgcc 1380
ggctctctcc aggctctgct gaccgcctcc attggtattg gccgaatggt ccaggagttc 1440
gagtggaagc tcaaggacat gacccaggaa gaggtcaaca ccattggcct caccacccag 1500
atgctccgac ctctccgagc catcatcaag ccccgaata 1539
<210> 146
<211> 1566
<212> DNA
<213> Stevia rebaudiana
<400> 146
atgggtctgt tccccctcga ggactcctac gctctggtct ttgagggcct cgccatcacc 60
ctcgccctct actacctgct ctccttcatc tacaagacct ccaagaaaac ctgcactcct 120
cccaaggcct ccggtgagca ccccatcacc ggccacctca acctgctctc cggctcttcc 180
ggcctccccc acctcgccct cgcctctctg gctgaccgat gtggtcccat cttcaccatc 240
cgactcggta tccgacgagt cctcgttgtc tccaactggg agattgccaa ggagatcttc 300
accacccacg atctcattgt ctccaaccga cccaagtacc tggctgccaa gattctcggt 360
ttcaactacg tttccttctc ctttgccccc tacggcccct actgggtcgg tatccgaaag 420
atcattgcca ccaagctcat gtcctcttct cgactccaga agctccagtt cgtccgagtc 480
tttgagctcg agaactccat gaagtccatc cgagagtctt ggaaggagaa gaaggacgag 540
gaaggcaagg ttctggtcga gatgaagaag tggttctggg agctcaacat gaacattgtt 600
ctgcgaactg ttgctggtaa gcagtacacc ggtaccgtcg acgacgccga tgccaagcga 660
atctccgagc tcttccgaga gtggttccac tacaccggcc gatttgttgt cggtgacgcc 720
ttccccttcc tcggctggct cgatctcggt ggctacaaga aaaccatgga gctcgtcgcc 780
tcccgactcg actccatggt cagcaagtgg ctcgacgagc accgaaagaa gcaggccaac 840
gacgacaaga aagaggacat ggacttcatg gacatcatga tctccatgac cgaggccaac 900
tctcccctcg agggttacgg taccgacacc atcatcaaga ccacctgcat gaccctcatt 960
gtctccggtg tcgacaccac ctccattgtc ctgacctggg ctctgtctct gctgctcaac 1020
aaccgagaca ctctcaagaa ggcccaggaa gagctcgaca tgtgtgtcgg caagggccga 1080
caggtcaacg agtccgatct ggtcaacctc atctacctcg aggctgttct caaggaagct 1140
ctgcgactct accccgctgc cttcctcggt ggtccccgag ccttcctcga ggactgcact 1200
gttgccggct accgaatccc caagggtacc tgtctgctca tcaacatgtg gaagctccac 1260
cgagatccca acatctggtc cgacccctgc gagttcaagc ccgagcgatt cctgaccccc 1320
aaccagaagg acgttgatgt cattggtatg gacttcgagc tcatcccctt cggtgccggc 1380
cgacgatact gccccggtac ccgactcgcc ctccagatgc tgcacattgt tctcgccact 1440
ctgctccaga actttgagat gtccaccccc aacgacgccc ccgtcgacat gactgcttcc 1500
gtcggtatga ccaacgccaa ggcttctcct ctcgaggtcc tcctgtctcc ccgagtcaag 1560
tggtca 1566
<210> 147
<211> 1443
<212> DNA
<213> Stevia rebaudiana
<400> 147
atggacgcca tggccaccac cgagaagaag ccccacgtca tcttcatccc cttccccgcc 60
cagtcccaca tcaaggccat gctcaagctc gcccagctcc tccaccacaa gggcctccag 120
atcacctttg tcaacaccga cttcatccac aaccagttcc tcgagtcctc cggcccccac 180
tgtctggacg gtgctcccgg tttccgattt gagactatcc ccgatggtgt ctcccactcc 240
cccgaggcct ccatccccat ccgagagtct ctgctccgat ccattgagac taacttcctc 300
gaccgattca ttgatctcgt caccaagctc cccgatcctc ccacctgtat catctccgac 360
ggtttcctgt ccgttttcac cattgatgct gccaagaagc tcggtatccc cgtcatgatg 420
tactggactc tggctgcctg tggtttcatg ggtttctacc acatccactc tctgatcgag 480
aagggctttg ctcctctcaa ggacgcctcc tacctcacca acggttacct cgacaccgtc 540
attgactggg tccccggtat ggagggtatc cgactcaagg acttccccct cgactggtcc 600
accgacctca acgacaaggt tctcatgttc accaccgagg ctccccagcg atcccacaag 660
gtttcccacc acatcttcca caccttcgac gagctcgagc cctccatcat caagactctg 720
tctctgcgat acaaccacat ctacaccatt ggccccctcc agctcctcct cgaccagatc 780
cccgaggaga agaagcagac cggtatcacc tctctgcacg gctactctct cgtcaaggaa 840
gagcccgagt gcttccagtg gctccagtcc aaggagccca actccgttgt ctacgtcaac 900
tttggctcca ccaccgtcat gtctctcgag gacatgaccg agtttggctg gggtctggcc 960
aactccaacc actacttcct gtggatcatc cgatccaacc tcgtcattgg cgagaacgcc 1020
gttctgcctc ccgagctcga ggagcacatc aagaagcgag gcttcattgc ctcttggtgc 1080
tcccaggaga aggttctcaa gcacccctcc gtcggtggtt tcctgaccca ctgcggctgg 1140
ggctccacca ttgagtctct gtccgctggt gtccccatga tctgctggcc ctactcctgg 1200
gaccagctca ccaactgccg atacatctgc aaggagtggg aggttggtct ggagatgggt 1260
accaaggtca agcgagatga ggtcaagcga ctcgtccagg agctcatggg cgagggtggt 1320
cacaagatgc gaaacaaggc caaggactgg aaggagaagg cccgaattgc cattgccccc 1380
aacggctctt cttctctcaa cattgacaag atggtcaagg agatcactgt tctcgctcga 1440
aac 1443
<210> 148
<211> 1380
<212> DNA
<213> Stevia rebaudiana
<400> 148
atggccgagc agcagaagat caagaagtct ccccacgttc tgctcatccc cttccctctg 60
cagggccaca tcaacccctt catccagttc ggcaagcgac tcatctccaa gggtgtcaag 120
accactctgg tcaccaccat ccacaccctc aactccactc tcaaccactc caacaccacc 180
accacctcca tcgagatcca ggccatctcc gacggctgtg acgagggtgg tttcatgtct 240
gctggtgagt cttacctcga gactttcaag caggtcggtt ccaagtctct ggctgacctc 300
atcaagaagc tccagtccga gggtaccacc attgacgcca tcatctacga ctccatgacc 360
gagtgggttc tcgatgtcgc catcgagttt ggtattgacg gtggctcctt cttcacccag 420
gcctgtgtcg tcaactctct ctactaccac gtccacaagg gtctgatctc tctgcccctc 480
ggcgagactg tctccgtccc cggtttcccc gttctgcagc gatgggagac tcctctcatt 540
ctccagaacc acgagcagat ccagtccccc tggtcccaga tgctcttcgg ccagttcgcc 600
aacattgacc aggcccgatg ggttttcacc aactccttct acaagctcga ggaagaggtc 660
attgagtgga cccgaaagat ctggaacctc aaggtcattg gccccaccct cccctccatg 720
tacctcgaca agcgactcga tgacgacaag gacaacggtt tcaacctcta caaggccaac 780
caccacgagt gcatgaactg gctcgacgac aagcccaagg agtccgttgt ctacgttgcc 840
tttggctctc tggtcaagca cggccccgag caggttgagg agatcacccg agctctgatt 900
gactccgatg tcaacttcct gtgggtcatc aagcacaagg aagagggtaa gctccccgag 960
aacctgtccg aggtcatcaa gaccggcaag ggcctcattg ttgcctggtg caagcagctc 1020
gacgttctcg cccacgagtc cgtcggctgc tttgtcaccc actgcggttt caactccacc 1080
ctcgaggcta tctctctcgg tgtccccgtt gttgccatgc cccagttctc cgaccagacc 1140
accaacgcca agctcctcga tgagattctc ggtgtcggtg tccgagtcaa ggctgacgag 1200
aacggtattg tccgacgagg taacctggct tcttgtatca agatgatcat ggaggaagag 1260
cgaggtgtca tcatccgaaa gaacgccgtc aagtggaagg atctggccaa ggttgctgtc 1320
cacgagggtg gctcttccga caacgacatt gtcgagtttg tctccgagct catcaaggcc 1380
<210> 149
<211> 1374
<212> DNA
<213> Stevia rebaudiana
<400> 149
atggagaaca agaccgagac taccgtccga cgacgacgac gaatcattct cttccccgtc 60
cccttccagg gccacatcaa ccccattctg cagctcgcca acgttctgta ctccaagggc 120
ttctccatca ccatcttcca caccaacttc aacaagccca agacctccaa ctacccccac 180
ttcactttcc gattcatcct cgacaacgac ccccaggacg agcgaatctc caacctgccc 240
acccacggtc ctctggctgg tatgcgaatc cccatcatca acgagcacgg tgctgacgag 300
ctccgacgag agctcgagct gctcatgctc gcctccgaag aggacgagga agtctcctgt 360
ctgatcaccg atgctctgtg gtactttgcc cagtccgtcg ccgactctct caacctgcga 420
cgactcgttc tcatgacctc ctctctgttc aacttccacg cccacgtttc tctgccccag 480
tttgacgagc tcggttacct cgaccccgat gacaagaccc gactcgagga gcaggcttcc 540
ggtttcccca tgctcaaggt caaggacatc aagtccgcct actccaactg gcagattctc 600
aaggagattc tcggcaagat gatcaagcag accaaggcct cctccggtgt catctggaac 660
tccttcaagg agctcgagga gtccgagctc gagactgtca tccgagagat ccccgctccc 720
tctttcctca tccccctgcc caagcacctc accgcttcct cctcttctct gctcgaccac 780
gaccgaaccg tctttcagtg gctcgaccag cagccccctt cctccgtcct ctacgtttcc 840
ttcggctcca cctccgaggt cgacgagaag gacttcctcg agattgctcg aggcctcgtt 900
gactccaagc agtccttcct gtgggttgtc cgacccggct ttgtcaaggg ctccacctgg 960
gttgagcccc tgcccgatgg tttcctcggt gagcgaggcc gaattgtcaa gtgggtcccc 1020
cagcaggaag ttctggccca cggtgccatt ggtgccttct ggacccactc cggctggaac 1080
tccactctcg agtccgtctg cgagggtgtc cccatgatct tctccgactt tggcctcgac 1140
cagcccctca acgcccgata catgtccgat gttctcaagg tcggtgtcta cctcgagaac 1200
ggctgggagc gaggtgagat tgccaacgcc atccgacgag tcatggtcga cgaggaaggt 1260
gagtacatcc gacagaacgc ccgagtcctc aagcagaagg ccgatgtctc tctcatgaag 1320
ggtggttctt cttacgagtc tctcgagtct ctcgtttcct acatctcttc tttg 1374
<210> 150
<211> 2130
<212> DNA
<213> Stevia rebaudiana
<400> 150
atgcagtccg actccgtcaa ggtcagcccc ttcgatcttg tctccgccgc catgaacggc 60
aaggccatgg agaagctcaa cgcctccgag tccgaggacc ccaccaccct ccccgctctc 120
aagatgctcg tcgagaaccg agagctcctc accctcttca ccacctcctt cgccgttctc 180
attggctgtc tggtctttct catgtggcga cgatcttcct ccaagaagct cgtccaggac 240
cccgtccccc aggtcattgt tgtcaagaag aaggagaagg agtccgaggt cgacgacggc 300
aagaagaagg tttccatctt ctacggtact cagaccggta ctgccgaggg tttcgccaag 360
gctctcgtcg aggaagccaa ggtccgatac gaaaagacct ccttcaaggt catcgatctc 420
gatgactacg ctgctgacga cgacgagtac gaggagaagc tcaagaagga gtctctcgcc 480
ttcttcttcc tggccaccta cggtgacggt gagcccactg acaacgccgc caacttctac 540
aagtggttca ccgagggtga cgacaagggc gagtggctca agaagctcca gtacggtgtc 600
tttggcctcg gtaaccgaca gtacgagcac ttcaacaaga ttgccattgt tgtcgacgac 660
aagctgaccg agatgggcgc caagcgactc gttcccgtcg gcctcggtga tgatgaccag 720
tgtatcgagg acgacttcac cgcctggaag gagctcgtct ggcccgagct cgaccagctg 780
ctgcgagatg aggacgacac ttccgtcacc actccctaca ccgctgctgt tctcgagtac 840
cgagttgtct accacgacaa gcccgctgac tcctacgccg aggaccagac ccacaccaac 900
ggccacgttg tccacgacgc ccagcacccc tctcgatcca acgttgcctt caagaaggag 960
ctccacacct cccagtccga ccgatcttgc acccacctcg agtttgacat ctcccacact 1020
ggtctgtctt acgaaaccgg cgaccacgtc ggtgtctact ccgagaacct ctccgaggtt 1080
gtcgacgagg ctctcaagct gctcggtctg tctcccgaca cctacttctc cgtccacgcc 1140
gataaagagg acggcacccc catcggtggt gcttctctgc ctcctccctt tcccccttgc 1200
actctgcgag atgctctgac ccgatacgcc gacgttctct cttctcccaa gaaggttgct 1260
ctgctggccc tcgccgccca cgcctccgac ccctccgagg ccgaccgact caagttcctc 1320
gcctctcccg ctggtaagga cgagtacgcc cagtggatcg tcgccaacca gcgatctctg 1380
ctcgaggtca tgcagtcctt cccctctgcc aagccccctc tgggtgtttt cttcgccgcc 1440
gttgctcccc gactccagcc ccgatactac tccatctctt cttctcccaa gatgtccccc 1500
aaccgaatcc acgtcacctg tgctctggtc tacgaaacca cccccgccgg ccgaatccac 1560
cgaggcctct gttccacctg gatgaagaac gccgtccccc tgaccgagtc ccccgactgc 1620
tcccaggctt ccatcttcgt ccgaacctcc aacttccgac tccccgttga ccccaaggtc 1680
cccgtcatca tgatcggtcc cggtaccggt ctggccccct tccgaggctt cctgcaggag 1740
cgactcgctc tcaaggagtc cggcaccgag ctcggttcct ccatcttctt ctttggctgc 1800
cgaaaccgaa aggtcgactt catctacgag gacgagctca acaactttgt cgagactggt 1860
gctctctccg agctcattgt tgccttctcc cgagagggta ccgccaagga gtacgtccag 1920
cacaagatgt cccagaaggc ctccgacatc tggaagctcc tctccgaggg tgcctacctc 1980
tacgtctgcg gtgacgccaa gggtatggcc aaggatgtcc accgaaccct gcacaccatt 2040
gtccaggagc agggctctct cgactcttcc aaggccgagc tctacgtcaa gaacctccag 2100
atgtccggcc gatacctccg agatgtgtgg 2130
<210> 151
<211> 2397
<212> DNA
<213> Lactuca sativa
<400> 151
atgaaaacca tgatctcttc tcccatcccc gctttccacc cccgattctc tcctgctgct 60
ggctctcgac gactctcccc catcctcccc tcttccggct ccgtcgttct gaccggctcc 120
aagacccagt gcaaggctgt ctccaagtcc cccacccagg agtacttcga cgttctgcag 180
aagaacggtc tgcccttcat caactggcag aacgacgttg tcgaggacga gctcgacaag 240
gagaagaaga ttctctaccc caacgacgag atcaagggct ttgtcgagcg aatcaaggtc 300
atgctcggct ccatggacga gggtgagatc actgtctccg cctacgacac cgcctgggtt 360
gctctcgtcc aggacattga cggcaacggt cgacccgagt tcccctcctc tctcgagtgg 420
atcgtcaaga accagctgtc cgacggctct tggggtgacc acctcatctt ctccgcccac 480
gaccgaatca tcaacaccct cgcctgtgtc attgctctga cctcttggaa cgtccacccc 540
ggcaagtgcc agaagggtct gaagttcctc aacgacaaca tctccaagct cgaagaggag 600
aaccccgagc acatgcccat tggtttcgag gttgccttcc cctctctcat tgacattgcc 660
cgaaagctcg atatccaggt ccccgaggac tcccctgctc tcaaggagat ctacgcccga 720
cgaaacctca agctcaccaa gatccccaag tctctcatgc acaaggtccc caccactctg 780
ctgcactctc tcgagggtat gcccgatctc gagtgggaga agctgctcaa gctccagtgc 840
aaggacggtt ccttcctctt ctccccctcc tccaccgcct tcgccctcat gcagaccaag 900
gaccagaagt gtctgcagta cctcaccgac gccgtcacca agttcaacgg tggtgtcccc 960
aacgtctacc ccgtcgacct ctttgagcac atctgggttg tcgaccgact gcagcgactc 1020
ggtatctctc gatactttga ctccgagatc aaggactgtg tcgactacat ctaccgatac 1080
tggaccaagg acggtatctg ctgggccaag aactccaacg tccaggacat tgacgacacc 1140
gccatgggct tccgagttct gcgaatgcac ggctacaagg tcaccactga cgttttccga 1200
cagttcgaga aggacggcaa gttcgtctgt ttccccggcc agaccaccca ggccgtcacc 1260
ggtatgttca acctcttccg agcttcccag gttctcttcc ccgacgagaa gattctcgag 1320
gacgccaaga agttctccta caactacctc aaggagaagc agtccaccaa cgagctcctc 1380
gacaagtgga tcattgccaa ggatctgccc ggtgaggtcg agtacgccct cgatgtcccc 1440
tggtacgcct ctctgcctcg actcgagact cgattctacc tcgagcagta cggtggcgag 1500
gacgatgtct ggatcggtaa gactctctac cgaatgggta acgtttccaa caacacctac 1560
ctcgagatgg ccaagctcga ctacaacaac tgtctggcca tccaccacct cgagtggaac 1620
accatgcagc agtggtacgt cgactttggt atggagcgat tcggtacctc cgatatcacc 1680
tctctgcttg tctcttacta cctcgccgct gcctccgtct ttgagcccga gcgatccaag 1740
gagcgaattg cctgggccaa gaccaccact ctggtcgaca ccatctcctc cttcttccac 1800
tctctcaaga tctccaacga gcaccgacga gagtttgtcg aggagttccg aaacatctcc 1860
aactccatcc accacgccaa gtacggcaag ccctggcacg gtctgatggt tgctctcaag 1920
ggtactctgc acgagattgc tctcgatgtt ctcatgaccc accgacgaga catccacccc 1980
cagctccacc acgcctggga gatgtggctc atgcgatggc agcagggtgt tgacgccacc 2040
gagggccagg ctgagctcat tgtccagacc atcaacatga ccgccggtcg atgggtttcc 2100
aacgagctgc tcgcccaccc ccagtaccga ctcctctcct ccgtcatcaa caacatctgc 2160
cacgagatct accacaaccg aacctgcatg gaggtcaact ccaccaccat ctccacctcc 2220
attgactcca agatgcagga gctcgtccag ctggttctct ccgactctct cgatgatctc 2280
gaccaggatc tcaagcagac cttcctgacc gttgccaaga ccttctacta caaggcctac 2340
tgcgaccccg agactatcaa cgtccacatc tccaaggtca tgtttgagac tattatt 2397
<210> 152
<211> 2271
<212> DNA
<213> Lactuca sativa
<400> 152
atgtgcaagg ctgtctccaa gtcccccacc caggagtact tcgacgttct gcagaagaac 60
ggtctgccct tcatcaactg gcagaacgac gttgtcgagg acgagctcga caaggagaag 120
aagattctct accccaacga cgagatcaag ggctttgtcg agcgaatcaa ggtcatgctc 180
ggctccatgg acgagggtga gatcactgtc tccgcctacg acaccgcctg ggttgctctc 240
gtccaggaca ttgacggcaa cggtcgaccc gagttcccct cctctctcga gtggatcgtc 300
aagaaccagc tgtccgacgg ctcttggggt gaccacctca tcttctccgc ccacgaccga 360
atcatcaaca ccctcgcctg tgtcattgct ctgacctctt ggaacgtcca ccccggcaag 420
tgccagaagg gtctgaagtt cctcaacgac aacatctcca agctcgaaga ggagaacccc 480
gagcacatgc ccattggttt cgaggttgcc ttcccctctc tcattgacat tgcccgaaag 540
ctcgatatcc aggtccccga ggactcccct gctctcaagg agatctacgc ccgacgaaac 600
ctcaagctca ccaagatccc caagtctctc atgcacaagg tccccaccac tctgctgcac 660
tctctcgagg gtatgcccga tctcgagtgg gagaagctgc tcaagctcca gtgcaaggac 720
ggttccttcc tcttctcccc ctcctccacc gccttcgccc tcatgcagac caaggaccag 780
aagtgtctgc agtacctcac cgacgccgtc accaagttca acggtggtgt ccccaacgtc 840
taccccgtcg acctctttga gcacatctgg gttgtcgacc gactgcagcg actcggtatc 900
tctcgatact ttgactccga gatcaaggac tgtgtcgact acatctaccg atactggacc 960
aaggacggta tctgctgggc caagaactcc aacgtccagg acattgacga caccgccatg 1020
ggcttccgag ttctgcgaat gcacggctac aaggtcacca ctgacgtttt ccgacagttc 1080
gagaaggacg gcaagttcgt ctgtttcccc ggccagacca cccaggccgt caccggtatg 1140
ttcaacctct tccgagcttc ccaggttctc ttccccgacg agaagattct cgaggacgcc 1200
aagaagttct cctacaacta cctcaaggag aagcagtcca ccaacgagct cctcgacaag 1260
tggatcattg ccaaggatct gcccggtgag gtcgagtacg ccctcgatgt cccctggtac 1320
gcctctctgc ctcgactcga gactcgattc tacctcgagc agtacggtgg cgaggacgat 1380
gtctggatcg gtaagactct ctaccgaatg ggtaacgttt ccaacaacac ctacctcgag 1440
atggccaagc tcgactacaa caactgtctg gccatccacc acctcgagtg gaacaccatg 1500
cagcagtggt acgtcgactt tggtatggag cgattcggta cctccgatat cacctctctg 1560
cttgtctctt actacctcgc cgctgcctcc gtctttgagc ccgagcgatc caaggagcga 1620
attgcctggg ccaagaccac cactctggtc gacaccatct cctccttctt ccactctctc 1680
aagatctcca acgagcaccg acgagagttt gtcgaggagt tccgaaacat ctccaactcc 1740
atccaccacg ccaagtacgg caagccctgg cacggtctga tggttgctct caagggtact 1800
ctgcacgaga ttgctctcga tgttctcatg acccaccgac gagacatcca cccccagctc 1860
caccacgcct gggagatgtg gctcatgcga tggcagcagg gtgttgacgc caccgagggc 1920
caggctgagc tcattgtcca gaccatcaac atgaccgccg gtcgatgggt ttccaacgag 1980
ctgctcgccc acccccagta ccgactcctc tcctccgtca tcaacaacat ctgccacgag 2040
atctaccaca accgaacctg catggaggtc aactccacca ccatctccac ctccattgac 2100
tccaagatgc aggagctcgt ccagctggtt ctctccgact ctctcgatga tctcgaccag 2160
gatctcaagc agaccttcct gaccgttgcc aagaccttct actacaaggc ctactgcgac 2220
cccgagacta tcaacgtcca catctccaag gtcatgtttg agactattat t 2271
<210> 153
<211> 2283
<212> DNA
<213> Picea glauca
<400> 153
atgaagatgt ccaagtccgt cgaggtccag cactgcgccg tccagttcct gtcctccacc 60
accgaccaga ttgagatccg agagcgaaac ctccagatct ccaccgaggc catgaagatg 120
aagtcttgga tcgagactgt caagtacatt ctgcagtcca tggaggacgg tgagatcacc 180
atctctgctt acgacactgc ctggatcgcc ctcgtccccg ctctcaacgg ctcttccgag 240
ccccagttcc cctcttctct gcagtggctc atcaacaacc agctccagga cggctcttgg 300
ggtgaccccc tcatgttcct catccgagat cgaatcatca acaccctcgc ctgtgtcctt 360
gctctcaaga cctggaacat ccactctctc ggtgtcaaca agggtctgtc cttcctccag 420
acctacatcc ccaagatgaa cgacgagcac gacgcccaca cccccgttgg ctttgagatt 480
gtctttcctg ctctcatgga ggacgccaag atcatggagc tcgatctgcc ctacgacgcc 540
gagttcctcc agaagatcta cgacgagcga gatctcaaga tgaagcgaat ccccatgaag 600
gttctgcacg agttcccctc cactctgctc cactctctcg agggcctccg agacaaggtc 660
aactgggagg agctcctcaa gctccagtcc aagaacggtt ccttcctctt ctcccccgcc 720
tccaccgcct gtgctctggc ccagacctcc gacaccaact gtctgcgata cctcaacgag 780
atcaccaaga agtacgacgg tggtgctccc aacgtctacc ccgttgacct ctttgagcga 840
ctctggaccg tcgaccgaat tgagcgactc ggtattgccc gatactttga gtccgagatc 900
accgactctc tcgagtacgt ctaccgatac tggaccaacc agggtattgg ctgggcccga 960
gactcccccg tcaaggacgt tgacgacacc tccatggcct tccgactgct gcgatcccac 1020
ggcttcgatg tcactgccga ggccttcaac cacttcaagc aggacgacca gttcttctgc 1080
ttctttggcc agaccaagca gaccgtcacc ggtatgtaca acctctaccg agcctcccag 1140
ttctccttcc ccggtgagtc catcctcgag gaagctcgag tctttaccaa gaacttcctc 1200
gaggagaagc gagctgagaa gcagctgcga gacaagtgga tcattgccaa gggcctcaag 1260
gaagaggtcg agtacgctct caagttcccc tggtacgcct ctcagcctcg aattgacacc 1320
cgaatgtaca tcaaccagta ccgagttgat gatgtctgga tcggtaaggc tctctaccga 1380
atgcccattg tcaacaacaa gacctacatt gagctcgcca aggctgactt caacatctgc 1440
cagtccatcc accgaaccga gctccacggt atcatccgat ggtaccgaga gtccggtctg 1500
gacgagctcg gtctgcgaca ggaccagatt gtcaagtctt acttcctcgc cgccattgcc 1560
atctacgagc ccgacatggc ctccgcccga ctcgcctggg ccaagtctgc tgttctcatg 1620
gctgccatcc gaatcttctt ctccggtgag aactgctttg cccaccaccg acgacagttc 1680
ctcgacgcct tcacccgatg ggacggccga gccatgcgag actcccccaa ctccgccaag 1740
cgactcttct cttgtctctt ccgaatggtc aacctcttct ccgtcgacgg tgttgttgcc 1800
cagggccgag acatctccgg tgatctgcga caccgatggg agcactggct cgcctccgag 1860
gccgaggatc tgaccgacgc ccaggaccac gagaagctcg gtaccgaggc tgagattgtt 1920
gttctgaccg ctgctttcct cggccgagag actatctccc ccgatctcat ctcccacccc 1980
gacttctcct ccatcatgaa ggtcaccaac accgtctgct ctctgctgcg acgaattgcc 2040
acctacaagg aagagggctg cgactccccc tccggtactg aagaggacga ccgactcaag 2100
cgacgagctg aagagggtat gggccacctt gtccgagccg tctaccgaca ccagtactcc 2160
cccgtcccct ctggtgtcaa gcgactctgt ctggttgtcg gcaagtcttt ctactacgcc 2220
gcccactgca acaacgagga agtcggcaac cacgttgaga ctgttctctt ccagcccgtg 2280
tat 2283
<210> 154
<211> 1548
<212> DNA
<213> Bradyrhizobium japonicum
<400> 154
atgaacgccc tctccgagca cattctgtcc gagctccgac gactcctctc cgagatgtcc 60
gacggtggtt ccgtcggccc ctccgtctac gacaccgccc aggctctgcg attccacggt 120
aacgtcaccg gccgacagga cgcctacgcc tggctcattg cccagcagca ggctgacggt 180
ggctggggtt ccgccgactt ccctctcttc cgacacgccc ccacctgggc tgctctgctc 240
gctctgcagc gagccgaccc cctccccggt gctgccgacg ccgtccagac cgccacccga 300
ttcctccagc gacagcccga cccctacgcc cacgccgtcc ccgaggacgc ccccattggt 360
gccgagctca tcctccccca gttctgcggt gaggctgctt ctctgctcgg tggtgttgcc 420
ttcccccgac accccgctct gctccccctc cgacaggcct gtctggtcaa gctcggtgcc 480
gtcgccatgc tcccctccgg ccaccctctg ctccactctt gggaggcctg gggtacttct 540
cccaccactg cctgccccga cgacgacggc tccatcggta tctctcccgc tgccaccgcc 600
gcctggcgag cccaggctgt cacccgaggc tccactcccc aggtcggccg agccgatgcc 660
tacctccaga tggcctctcg agccacccga tccggtatcg agggtgtttt ccccaacgtc 720
tggcccatca acgtctttga gccctgctgg tccctctaca ccctccacct cgccggcctc 780
ttcgcccacc ccgctctcgc cgaggctgtc cgagtcatcg tcgcccagct cgacgcccga 840
ctcggtgtcc acggtcttgg ccccgctctg cactttgctg ccgacgccga cgacaccgcc 900
gttgctctct gtgttctgca cctcgccggc cgagatcccg ccgtcgatgc tctgcgacac 960
tttgagattg gtgagctctt cgtcaccttc cccggtgagc gaaacgcctc cgtttccacc 1020
aacatccacg ccctccacgc cctccgactc ctcggcaagc ccgctgccgg tgcctccgcc 1080
tacgtcgagg ccaaccgaaa cccccacggt ctgtgggaca acgagaagtg gcacgtcagc 1140
tggctctacc ccaccgccca cgccgttgct gctctcgccc agggcaagcc ccagtggcga 1200
gatgagcgag ctctggctgc tctgctccag gcccagcgag atgacggtgg ctggggtgct 1260
ggccgaggct ccaccttcga ggagactgcc tacgccctct tcgccctcca cgtcatggac 1320
ggctctgaag aggccaccgg ccgacgacga atcgcccagg ttgttgcccg agctctcgag 1380
tggatgctcg cccgacacgc cgcccacggt ctgccccaga cccctctgtg gatcggtaag 1440
gagctctact gccccacccg agttgtccga gttgccgagc tcgccggtct gtggctcgct 1500
ctgcgatggg gccgacgagt cctcgccgag ggtgccggtg ctgcccca 1548
<210> 155
<211> 2364
<212> DNA
<213> Lactuca sativa
<400> 155
atgaacattg cccagatcac ctcttctgcc atgctcgtcc cctcttccca cattccccac 60
cgatcttggg tcgtcaactg ctgcatggtc cagtacaacc cctccggtct gcgaaccgct 120
tcttcccagg ctggccaggt caaccccact gtcatgaccc tcgatgtcac caaggagcga 180
atccgaaagc tcttcaacaa cgttgaggtt tccgtttcct cttacgacac cgcctgggtt 240
gccatggtcc cctctcccaa ctcccccaag tctccttgtt tccccgactg tctcaactgg 300
ctcctcgaca accagctcga cgacggctct tggggtctgc tcccccacca gtctcctctg 360
atcaaggaca ccctctcttc cactctggcc tgtgtccttg ctctcaagcg atggaacgtc 420
ggtaaggacc agatcaacaa gggtctgcac tacattgagt ccaactttgc ctccgtcacc 480
gacaagaacc aggcttctcc ctttggcttc gacatcatct tccccggtat gctcgagtac 540
gccaaggacc tcgatatcaa gctgcccctc aaccagaccc acctgtccgt catgctccac 600
gagcgagagc tcgagctccg acgatgccac tccaacggcc gagaggccta cctcgcctac 660
atctccgagg gtcttggtaa cctcaacgac tggaacatgg tcatgaagta ccagatgaag 720
aacggctctc tgttcaactc cccctctgcc accgcctccg ttctcatcca ccaccagaac 780
gccggctgtc tgcactacct gacctctctg ctcgacaagt ttggtaacgc tgtccccacc 840
gtctacccca ttgatctgta cgtccgactc tccatggtcg acaccctcga gcgactcggt 900
atcaagcgac acttcatggt cgagatccag aacgttctcg acgagactta ccgatgctgg 960
gtccagggtg atgtccagat cttcatggac gttgtcacct gtgctctggc cttccgagtt 1020
ctgcgatcca acggttacga ggtttcttcc gaccctctgg ccaagatcac caaggaaggt 1080
gactacatga actcccccga gaagcccttc aaggacgtct acacctctct cgaggtctac 1140
aaggcctccc agatcatcta ccaagaggag ctcgccttcc gagagcagaa cctgacctcc 1200
tacctgccct cttccaacaa gctctccaac tacattctca aagaggtcga cgacgctctc 1260
aagttcccct tcaacggctc tctcgagcga atgtccaccc gacgaaacat tgagcactac 1320
aacctcaacc acacccgaat tctcaagacc acctactcct cttccaacat ctccaacaag 1380
gactacctca agctcgccgt ccaggacttc aacgagtgcc agtccatcta ctgtgaggag 1440
ctcaaggacc tcgagcgatg ggttgtcgag aaccgactcg acaagctcaa gtttgcccga 1500
caaaagactg cttactgcta cttctccgct gcctctttcc tctcctcccc cgatctgtcc 1560
gacgcccgaa tctcttgggc caagtcctcc attctgacca ccgtcattga cgacttcttc 1620
gatgtcggtg gctccatgga cgagctcgtc aactttgtcc acatcattga gaagtggaac 1680
gtcaacgtcg agaacgactg ctgctccgaa gaggtcggtg ttctcttcct ggctctcaag 1740
gacgccgtct gctggatcgg tgacaaggcc ttcaagatcc aggagcgaaa catcacctcc 1800
cacgtcattg agatctggct cgatctcgtc aagtccatgc tgcgagaggc catctgggcc 1860
aaggacggct ccatccccac catcaacgag tacatggaga acggctacgt ttcctttgct 1920
ctcggtccca ttgttctgcc cactctctac ttcctcggtg tcaagctgtc cgaggaagtt 1980
gtccagtcct ccgagtacca caagctctac gaggtcatgt ccacccaggg ccgactcatg 2040
aacgacatcc actccttcaa gcgagagaag aaggctggta agctcaacgc cgtcgccctc 2100
tacatgtccg acggcaagtc cggctccgtc gaagaggaag ttgtcgagga gatgaagatc 2160
ctcaccaagt cccagcgaaa ggagatgatg aagctcgttc tcgagactaa gggctccgtc 2220
gtcccccgag tctgcaagga tgttttctgg aacatgtgca acgttctcaa cctcttctac 2280
gccaccgacg acggtttcac tggtaacgcc attctcgatg ttgtcaagga gatcatctac 2340
gagcccgttt cccacgagct catc 2364
<210> 156
<211> 2250
<212> DNA
<213> Lactuca sativa
<400> 156
atggcttctt cccaggctgg ccaggtcaac cccactgtca tgaccctcga tgtcaccaag 60
gagcgaatcc gaaagctctt caacaacgtt gaggtttccg tttcctctta cgacaccgcc 120
tgggttgcca tggtcccctc tcccaactcc cccaagtctc cttgtttccc cgactgtctc 180
aactggctcc tcgacaacca gctcgacgac ggctcttggg gtctgctccc ccaccagtct 240
cctctgatca aggacaccct ctcttccact ctggcctgtg tccttgctct caagcgatgg 300
aacgtcggta aggaccagat caacaagggt ctgcactaca ttgagtccaa ctttgcctcc 360
gtcaccgaca agaaccaggc ttctcccttt ggcttcgaca tcatcttccc cggtatgctc 420
gagtacgcca aggacctcga tatcaagctg cccctcaacc agacccacct gtccgtcatg 480
ctccacgagc gagagctcga gctccgacga tgccactcca acggccgaga ggcctacctc 540
gcctacatct ccgagggtct tggtaacctc aacgactgga acatggtcat gaagtaccag 600
atgaagaacg gctctctgtt caactccccc tctgccaccg cctccgttct catccaccac 660
cagaacgccg gctgtctgca ctacctgacc tctctgctcg acaagtttgg taacgctgtc 720
cccaccgtct accccattga tctgtacgtc cgactctcca tggtcgacac cctcgagcga 780
ctcggtatca agcgacactt catggtcgag atccagaacg ttctcgacga gacttaccga 840
tgctgggtcc agggtgatgt ccagatcttc atggacgttg tcacctgtgc tctggccttc 900
cgagttctgc gatccaacgg ttacgaggtt tcttccgacc ctctggccaa gatcaccaag 960
gaaggtgact acatgaactc ccccgagaag cccttcaagg acgtctacac ctctctcgag 1020
gtctacaagg cctcccagat catctaccaa gaggagctcg ccttccgaga gcagaacctg 1080
acctcctacc tgccctcttc caacaagctc tccaactaca ttctcaaaga ggtcgacgac 1140
gctctcaagt tccccttcaa cggctctctc gagcgaatgt ccacccgacg aaacattgag 1200
cactacaacc tcaaccacac ccgaattctc aagaccacct actcctcttc caacatctcc 1260
aacaaggact acctcaagct cgccgtccag gacttcaacg agtgccagtc catctactgt 1320
gaggagctca aggacctcga gcgatgggtt gtcgagaacc gactcgacaa gctcaagttt 1380
gcccgacaaa agactgctta ctgctacttc tccgctgcct ctttcctctc ctcccccgat 1440
ctgtccgacg cccgaatctc ttgggccaag tcctccattc tgaccaccgt cattgacgac 1500
ttcttcgatg tcggtggctc catggacgag ctcgtcaact ttgtccacat cattgagaag 1560
tggaacgtca acgtcgagaa cgactgctgc tccgaagagg tcggtgttct cttcctggct 1620
ctcaaggacg ccgtctgctg gatcggtgac aaggccttca agatccagga gcgaaacatc 1680
acctcccacg tcattgagat ctggctcgat ctcgtcaagt ccatgctgcg agaggccatc 1740
tgggccaagg acggctccat ccccaccatc aacgagtaca tggagaacgg ctacgtttcc 1800
tttgctctcg gtcccattgt tctgcccact ctctacttcc tcggtgtcaa gctgtccgag 1860
gaagttgtcc agtcctccga gtaccacaag ctctacgagg tcatgtccac ccagggccga 1920
ctcatgaacg acatccactc cttcaagcga gagaagaagg ctggtaagct caacgccgtc 1980
gccctctaca tgtccgacgg caagtccggc tccgtcgaag aggaagttgt cgaggagatg 2040
aagatcctca ccaagtccca gcgaaaggag atgatgaagc tcgttctcga gactaagggc 2100
tccgtcgtcc cccgagtctg caaggatgtt ttctggaaca tgtgcaacgt tctcaacctc 2160
ttctacgcca ccgacgacgg tttcactggt aacgccattc tcgatgttgt caaggagatc 2220
atctacgagc ccgtttccca cgagctcatc 2250
<210> 157
<211> 2271
<212> DNA
<213> Picea glauca
<400> 157
atgaagcgag agcagtacac catcctcaac gagaaggagt ccatggccga ggagctcatc 60
ctccgaatca agcgaatgtt ctccgagatt gagaacaccc agacctccgc ctccgcctac 120
gacaccgcct gggttgccat ggtcccctct ctcgactcct cccagcagcc ccagttcccc 180
cagtgcctct cttggatcat cgacaaccag ctcctcgacg gctcttgggg tatcccctac 240
ctcatcatca aggaccgact ctgccacacc ctcgcctgtg tcattgctct gcgaaagtgg 300
aacgccggta accagaacgt cgagactggt ctgcgattcc tgcgagagaa cattgagggt 360
attgtccacg aggacgagta cacccccatt ggtttccaga tcatcttccc cgccatgctc 420
gaggaagctc gaggtctggg tctggagctc ccctacgatc tgacccccat caagctcatg 480
ctgacccacc gagagaagat catgaagggc aaggccattg accacatgca cgagtacgac 540
tcctctctca tctacaccgt cgagggtatc cacaagattg tcgactggaa caaggttctc 600
aagcaccaga acaaggacgg ctctctcttc aactccccct ctgccaccgc ctgtgctctc 660
atgcacaccc gaaagtccaa ctgtctggag tacctctctt ccatgctcca gaagctcggt 720
aacggtgtcc cctccgtcta ccccatcaac ctctacgccc gaatctccat gattgaccga 780
ctccagcgac tcggtctggc ccgacacttc cgaaacgaga tcatccacgc cctcgacgac 840
atctaccgat actggatgca gaaggagact tctcgagagg gcaagtctct cacccccgac 900
attgtttcca cctccattgc cttcatgctg ctgcgactcc acggctacga tgtccccgct 960
gacgttttct gctgctacca cctccactcc attgagcagt ccggtgaggc tgtcactgcc 1020
atgctgtctc tgtaccgagc ctcccagatc atgttccccg gtgagactat cctcgaggag 1080
atcaagaccg tttcccgaaa gtacctcgac aagcgaaagg agaacggtcg aatctactac 1140
cacaacattg tcatgaagga cctccgaggt gaggttgagt acgccctctc cgtcccctgg 1200
tacgcctctc tcgagcgaat cgagaaccga cgatacattg accagtacgg tgtcaacgac 1260
acctggatcg ccaagacctc ttacaagatc ccctgtatct ccaacgatct gttcctggct 1320
ctggccaagc aggactacaa catctgtcag gccatccagc agaaggagct ccgagagctc 1380
gagcgatggt tcgccgacaa caagttctcc cacctcaact ttgcccgaca gaagctcatc 1440
tactgctact tctccgctgc tgccactctc ttctctcccg agctctctgc tgctcgagtt 1500
gtctgggcca agaacggtgt catcaccacc gttgttgacg acttcttcga tgttggtggc 1560
tcctccgagg agatccactc ttttgtcgag gctgtccgag tctgggatga ggctgccacc 1620
gacggtctgt ccgagaacgt ccagatcctc ttctccgctc tgtacaacac cgttgacgag 1680
attgtccagc aggcctttgt ctttcagggc cgagacatct ccatccacct gcgagagatc 1740
tggtaccgac tcgtcaactc catgatgacc gaggcccagt gggcccgaac ccactgcatc 1800
ccctccatgc acgagtacat ggagaacgcc gagccttcca ttgctctcga gcccattgtt 1860
ctctcctctc tctactttgt cggccccaag ctgtccgagg agatcatctg ccaccccgag 1920
tactacaacc tcatgcacct gctcaacatc tgtggccgac tcctcaacga tatccagggc 1980
tgcaagcgag aggcccacca gggcaagctc aactccgtca ctctctacat ggaggagaac 2040
tccggcacca ccatggagga tgccattgtc tacctgcgaa agaccatcga cgagtcccga 2100
cagctcctcc tcaaagaggt tctgcgacct tccattgtcc cccgagagtg caagcagctc 2160
cactggaaca tgatgcgaat cctccagctc ttctacctca agaacgacgg tttcacctcc 2220
cccaccgaga tgctcggcta cgtcaacgcc gtcattgtcg accccatatt g 2271
<210> 158
<211> 900
<212> DNA
<213> Bradyrhizobium japonicum
<400> 158
atgatccaga ccgagcgagc cgtccagcag gttctcgagt ggggtcgatc tctgaccggc 60
tttgccgacg agcacgccgt cgaggccgtc cgaggtggtc agtacatcct ccagcgaatc 120
cacccctctc tgcgaggtac ctccgcccga accggccgag atccccagga cgagactctc 180
attgtcacct tctaccgaga gctcgctctg ctcttctggc tcgacgactg caacgatctc 240
ggcctcatct cccccgagca gctcgccgcc gtcgagcagg ctctcggcca gggtgtcccc 300
tgtgctctgc ccggtttcga gggctgtgcc gtcctccgag cttctctcgc caccctcgcc 360
tacgaccgac gagactacgc ccagctcctc gacgacaccc gatgctactc cgctgctctg 420
cgagctggtc acgcccaggc tgttgctgcc gagcgatggt cctacgccga gtacctccac 480
aacggtatcg actccattgc ctacgccaac gttttctgct gtctgtctct gctctggggc 540
ctcgacatgg ccactctgcg agcccgaccc gccttccgac aggttctgcg actcatctcc 600
gccatcggtc gactccagaa cgatctccac ggctgcgaca aggaccgatc cgccggtgag 660
gctgacaacg ccgtcattct gctcctccag cgataccccg ccatgcccgt tgtcgagttc 720
ctcaacgacg agctcgccgg ccacacccga atgctgcacc gagtcatggc cgaggagcga 780
ttccccgctc cctggggtcc tctcattgag gccatggctg ccatccgagt ccagtactac 840
cgaacctcca cctcccgata ccgatccgat gccgtccgag gtggtcagcg agctcccgcc 900
<210> 159
<211> 2838
<212> DNA
<213> Phaeosphaeria sp
<400> 159
atgttcgcca agttcgacat gctcgaggaa gaggcccgag ctctggtccg aaaggtcggt 60
aacgccgtcg accccatcta cggtttctcc accacctcct gccagatcta cgacaccgcc 120
tgggccgcca tgatctccaa agaggagcac ggtgacaagg tctggctctt ccccgagtcc 180
ttcaagtacc tcctcgagaa gcagggtgag gacggctctt gggagcgaca cccccgatcc 240
aagaccgtcg gtgttctcaa caccgccgcc gcctgtctgg ctctcctccg acacgtcaag 300
aaccccctgc agctgcagga cattgctgcc caggacatcg agctccgaat ccagcgaggt 360
ctgcgatctc tcgaggagca gctcatcgcc tgggatgacg ttctggacac caaccacatt 420
ggtgtcgaga tgattgtccc cgctctgctc gactacctcc aggccgagga cgagaacgtc 480
gacttcgagt ttgagtccca ctctctgctc atgcagatgt acaaggagaa gatggcccga 540
ttctcccccg agtctctcta ccgagctcga ccctcttctg ctctgcacaa cctcgaggct 600
ctcatcggta agctcgattt cgacaaggtc ggccaccacc tctacaacgg ctccatgatg 660
gcctccccct cttccactgc cgccttcctc atgcacgcct ctccctggtc ccacgaggcc 720
gaggcctacc tccgacacgt ctttgaggct ggtaccggca agggctccgg tggtttcccc 780
ggtacctacc ccaccaccta ctttgagctc aactgggtcc tctccaccct catgaagtcc 840
ggcttcactc tctccgatct cgagtgcgac gagctctcct ccattgccaa caccatcgcc 900
gagggtttcg agtgcgacca cggtgtcatc ggtttcgctc cccgagccgt cgacgttgat 960
gacactgcca agggtctgct caccctcacc ctcctcggta tggacgaggg tgtctccccc 1020
gctcccatga ttgccatgtt cgaggccaag gaccacttcc tgaccttcct cggtgagcga 1080
gatccctcct tcacctccaa ctgccacgtt ctgctctctc tgctccaccg aaccgacctc 1140
ctccagtacc tcccccagat ccgaaagacc accaccttcc tctgtgaggc ctggtgggcc 1200
tgcgacggcc agatcaagga caagtggcac ctctcccacc tctaccccac catgctcatg 1260
gtccaggcct ttgccgagat cctcctcaag tccgccgagg gtgagcctct gcacgacgcc 1320
tttgacgccg ccactctgtc tcgagtctcc atctgtgttt tccaggcctg tctgcgaact 1380
ctgctcgccc agtcccagga cggctcttgg cacggccagc ccgaggcttc ttgctacgcc 1440
gtcctcaccc tcgccgagtc cggccgactt gttctgctcc aggctctgca gccccagatt 1500
gctgctgcca tggagaaggc tgctgatgtc atgcaggctg gtcgatggtc ttgttccgac 1560
cacgactgtg actggacctc caagaccgcc taccgagtcg acctcgttgc tgctgcttac 1620
cgactcgccg ccatgaaggc ctcttccaac ctcaccttca ccgtcgacga caacgtttcc 1680
aagcgatcca acggtttcca gcagctcgtc ggccgaaccg acctcttctc cggtgtcccc 1740
gcctgggagc tccaggcctc tttcctcgag tccgctctgt ttgtccctct gctccgaaac 1800
caccgactcg atgtctttga ccgagatgat atcaaggttt ccaaggacca ctacctcgat 1860
atgatcccct tcacctgggt cggctgcaac aaccgatctc gaacctacgt ttccacctcc 1920
ttcctcttcg acatgatgat catctccatg ctcggctacc agattgacga gttcttcgag 1980
gctgaggctg cccccgcctt cgcccagtgc atcggtcagc tccaccaggt tgttgacaag 2040
gtcgtcgacg aggtcattga cgaggttgtc gacaaggttg tcggtaaggt cgtcggtaag 2100
gtcgtcggca aggtcgtcga cgagcgagtc gactccccca cccacgaggc catcgccatc 2160
tgcaacatcg aggcttctct gcgacgattc gtcgaccacg ttctccacca ccagcacgtt 2220
ctgcacgcct cccagcagga gcaggatatc ctctggcgag agctccgagc cttcctgcac 2280
gcccacgttg tccagatggc cgacaactcc actctcgctc ctcccggccg aaccttcttt 2340
gactgggtcc gaaccactgc tgccgaccac gttgcctgtg cctactcctt cgccttcgcc 2400
tgctgcatca cctccgccac cattggccag ggccagtcca tgtttgccac cgtcaacgag 2460
ctctacctgg tccaggctgc tgcccgacac atgaccacca tgtgccgaat gtgcaacgac 2520
attggctccg tcgaccgaga cttcattgag gccaacatca actccgtcca cttccccgag 2580
ttctccactc tgtctctcgt tgctgacaag aagaaggctc tggcccgact cgccgcctac 2640
gagaagtctt gtctgaccca caccctcgac cagtttgaga acgaggttct ccagtctccc 2700
cgagtctcct ccgccgcctc cggtgacttc cgaacccgaa aggtcgccgt tgtccgattc 2760
ttcgccgatg tcactgactt ctacgaccag ctctacatcc tccgagatct gtcctcctct 2820
ctcaagcacg tcgggaca 2838
<210> 160
<211> 2856
<212> DNA
<213> Gibberella fujikuroi
<400> 160
atgcccggca agattgagaa cggtaccccc aaggacctca agaccggcaa cgacttcgtt 60
tccgctgcca agtctctgct cgaccgagcc ttcaagtccc accactccta ctacggtctg 120
tgctccactt cctgccaggt ctacgacacc gcctgggttg ccatgatccc caagacccga 180
gacaacgtca agcagtggct cttccccgag tgtttccact acctgctcaa gacccaggct 240
gctgacggct cttggggctc tctgcccacc acccagaccg ccggtatcct cgacactgcc 300
tccgccgttc tggctctgct ctgccacgcc caggagcctc tgcagatcct cgatgtctcc 360
cccgatgaga tgggtctgcg aatcgagcac ggtgtcactt ctctcaagcg acagctcgcc 420
gtctggaacg acgttgagga caccaaccac attggtgtcg agttcatcat ccccgctctg 480
ctctccatgc tcgagaagga gctcgatgtc ccctccttcg agttcccttg ccgatccatt 540
ctggagcgaa tgcacggcga gaagctcggt cactttgatc tcgagcaggt ctacggcaag 600
ccctcttctc tgctgcactc tctcgaggcc ttcctcggca agctcgattt cgaccgactc 660
tcccaccacc tctaccacgg ctccatgatg gcctccccct cttccaccgc cgcctacctc 720
attggtgcca ccaagtggga cgacgaggcc gaggactacc tccgacacgt catgcgaaac 780
ggtgccggcc acggtaacgg tggtatctct ggtaccttcc ccaccaccca ctttgagtgc 840
tcctggatca ttgccactct cctcaaggtc ggtttcactc tcaagcagat tgacggtgac 900
ggtctgcgag gcctctccac cattctcctc gaggctctgc gtgacgagaa cggtgtcatt 960
ggcttcgctc cccgaaccgc cgacgttgat gacactgcca aggctctgct ggctctgtct 1020
ctggtcaacc agcccgtttc ccccgacatc atgatcaagg ttttcgaggg taaggaccac 1080
ttcaccacct ttggctccga gcgagatccc tctctgacct ccaacctcca cgttctcctc 1140
tctctgctca agcagtccaa cctgtctcag taccaccccc agatcctcaa gaccactctc 1200
ttcacctgtc gatggtggtg gggctccgac cactgtgtca aggacaagtg gaacctctcc 1260
cacctctacc ccaccatgct cctcgtcgag gccttcaccg aggttctgca cctcatcgac 1320
ggtggtgagc tctcctctct gtttgacgag tctttcaagt gcaagatcgg tctgtccatc 1380
ttccaggccg ttctccgaat catcctcacc caggacaacg acggctcttg gcgaggctac 1440
cgagagcaga cctgttacgc cattctcgcc ctcgtccagg cccgacacgt ctgtttcttc 1500
acccacatgg tcgaccgact ccagtcctgt gtcgaccgag gtttctcttg gctcaagtct 1560
tgctccttcc actctcagga cctcacctgg acctccaaga ccgcttacga ggttggtttc 1620
gttgccgagg cctacaagct cgctgctctc cagtccgcct ctctcgaggt ccccgctgcc 1680
accatcggcc actccgtcac ctctgctgtc ccctcctccg atctcgagaa gtacatgcga 1740
ctcgtccgaa agaccgctct gttctctccc ctcgatgagt ggggtctgat ggcctccatc 1800
atcgagtcct ccttctttgt ccccctcctc caggcccagc gagttgagat ctacccccga 1860
gacaacatca aggtcgacga ggacaagtac ctctccatca tccccttcac ctgggttggc 1920
tgcaacaacc gatctcgaac ctttgcctcc aaccgatggc tctacgacat gatgtacctg 1980
tctctgctcg gctaccagac cgacgagtac atggaggccg ttgccggccc cgtctttggt 2040
gatgtctctc tgctgcacca gaccatcgac aaggtcattg acaacaccat gggtaacctg 2100
gcccgagcca acggtaccgt ccactccggc aacggtcacc agcacgagtc ccccaacatt 2160
ggccaggtcg aggacaccct cactcgattt accaactccg tcctcaacca caaggacgtc 2220
ctcaactcct cctcctccga ccaggacacc ctccgacgag agttccgaac cttcatgcac 2280
gcccacatca cccagatcga ggacaactcc cgattctcca agcaggcttc ttccgacgcc 2340
ttctcctccc ccgagcagtc ctacttccag tgggtcaact ccactggtgg ttcccacgtt 2400
gcctgcgcct actctttcgc cttctccaac tgcctcatgt ccgccaacct cctccagggt 2460
aaggacgcct tcccttccgg tacccagaag tacctcatct cctccgtcat gcgacacgcc 2520
accaacatgt gccgaatgta caacgatttc ggttccatcg cccgagacaa cgccgagcga 2580
aacgtcaact ccatccactt tcccgagttc actctctgta acggtacctc ccagaacctc 2640
gacgagcgaa aggagcgact cctcaagatt gctacttacg agcagggcta cctcgaccga 2700
gctctcgagg ctctggagcg acagtcccga gatgatgctg gtgaccgagc cggctccaag 2760
gacatgcgaa agctcaagat tgtcaagctc ttctgtgacg tcaccgacct ctacgaccag 2820
ctctacgtca tcaaggacct ctcctcctcc atgaaa 2856
<210> 161
<211> 1533
<212> DNA
<213> Lactuca sativa
<400> 161
atggatctgc agaccatggc ccccatgggc tctgctgcca ttgccatcgg tggtcccgct 60
gttgccgttg ctggtggtat ctctctgctc ttcctcaagt ctttcctgtc ccagcagccc 120
ggcaacccca accacctccc ctccgtcccc gctgtccccg gtgtccctct gctcggtaac 180
ctgctcgagc tcaaggagaa gaagccctac aagaccttca ccaagtgggc cgagacttac 240
ggccccatct actccatcaa gactggtgcc acctccatgg tcgttgtcaa ctccaaccag 300
ctcgccaaag aggccatggt cacccgattc gactccatct ccacccgaaa gctctccaag 360
gctctccaga ttctcactgc tgacaagacc atggttgcca tgtccgacta cgacgactac 420
cacaagaccg tcaagcgaaa cctgctcacc tccatcctcg gtcccgctgc ccagaagcga 480
caccgagccc accgagatgc catgggtgac aacctgtctc gacagctcca cgctcttgct 540
ctcaactctc ctcaagaggc catcaacttc cgacagatct tccagtccga gctcttcact 600
ctcgccttca agcagacctt tggccgagac attgagtcca tcttcgtcgg tgatctcggt 660
accaccatga cccgagagga gatgttccag atcctcgttg tcgaccccat gatgggtgcc 720
attgacgttg actggcgaga cttcttcccc tacctcaagt ggatccccaa cgccaagctc 780
gaggagaaga ttgagcagat gtacatccga cgaaaggccg tcatgaaggc cgtcatccag 840
gagcaccgaa agcgaattga ctccggcgag aacctcgact cttacattga cttcctgctc 900
gccgaggccc agcctctgac cgagaagcag ctcctcatgt ctctgtggga gcccatcatt 960
gagacttccg acaccaccat ggtcaccacc gagtgggcca tgtacgagct ctccaagcac 1020
cccaacaagc agcagcgact ctacaacgag atccgaaaca tctgtggttc cgagaagatc 1080
accgaggaga agctctgcaa gatgccctac ctgtctgctg ttttccacga gactctgcga 1140
gtccactccc ccgtttccat catccccctg cgatacgtcc acgagaacac cgagctcggc 1200
ggctaccacg tccccgccgg caccgagctc gccgtcaaca tctacggctg caacatggag 1260
cgagagatct gggagaaccc cgaggagtgg tcccccgagc gattcctcgc cgagaacgag 1320
cccgtcaacc tgcagaaaac catggccttt ggtgctggta agcgagtctg cgctggtgcc 1380
atgcaggcca tgctgctcgc ctgtgtcggt attggccgaa tggtccagga gtttgagtgg 1440
cgactcaagg acgacgttga agaggatgtc aacaccctcg gcctcaccac ccagcgactc 1500
aaccccatgc tcgccgtcat caagccccgg aat 1533
<210> 162
<211> 1536
<212> DNA
<213> Lactuca sativa
<400> 162
atggacggtg tcattgacat gcagaccatc cccctgcgaa ctgccattgc cattggtggt 60
actgctgttg ctctcgttgt tgctctctac ttctggttcc tgcgatctta cgcctccccc 120
tctcaccact ccaaccacct tcctcccgtc cccgaggtcc ccggtgtccc cgtcctcggt 180
aacctgctcc agctcaagga gaagaagccc tacatgacct tcaccaagtg ggccgagatg 240
tacggcccca tctactccat ccgaaccggt gccacctcca tggttgttgt ctcttccaac 300
gagattgcca aggaagttgt tgtcacccga ttcccctcca tctccacccg aaagctctct 360
tacgctctca aggtcctcac tgaggacaag tccatggttg ccatgtccga ctaccacgac 420
taccacaaga ccgtcaagcg acacattctc accgccgtcc tcggccccaa cgcccagaag 480
aagttccgag cccaccgaga caccatgatg gagaacgttt ccaacgagct ccacgccttc 540
ttcgagaaga accccaacca ggaagtcaac ctgcgaaaga tcttccagtc tcagctcttt 600
ggtctggcca tgaagcaggc tctcggcaag gatgtcgagt ccatctacgt caaggatctc 660
gagactacca tgaagcgaga ggagatcttc gaggttctgg tcgtcgaccc catgatgggt 720
gccattgagg tcgactggcg agacttcttc ccctacctca agtgggtccc caacaagtct 780
ttcgagaaca tcatccaccg aatgtacacc cgacgagagg ctgtcatgaa ggctctcatc 840
caggagcaca agaagcgaat tgcctccggt gagaacctca actcctacat tgactacctg 900
ctctccgagg cccagaccct caccgacaag cagctcctca tgtctctgtg ggagcccatc 960
atcgagtcct ccgacaccac catggtcacc accgagtggg ccatgtacga gctcgccaag 1020
aaccccaaca tgcaggaccg actctacgag gagatccagt ctgtctgtgg ctccgagaag 1080
atcactgagg agaacctgtc ccagctcccc tacctctacg ccgttttcca ggagactctg 1140
cgaaagcact gccccgtccc catcatgccc ctgcgatacg tccacgagaa caccgtcctc 1200
ggtggctacc acgtccccgc tggtaccgag gttgccatca acatctacgg ctgcaacatg 1260
gacaagaagg tctgggagaa ccccgaggag tggaaccccg agcgattcct gtccgagaag 1320
gagtccatgg acctctacaa gaccatggcc tttggtggtg gcaagcgagt ctgtgccggc 1380
tctctgcagg ccatggtcat ctcctgcatt ggtatcggtc gactcgtcca ggactttgag 1440
tggaagctca aggacgatgc tgaggaagat gtcaacaccc tcggcctcac cacccagaag 1500
ctccaccctc tgctcgccct catcaacccc cggaaa 1536
<210> 163
<211> 1575
<212> DNA
<213> Sphaceloma manihoticola
<400> 163
atgatggacg acaccacctc cccctactcc acctaccact ccgtccgatc catccgaaac 60
cagtccgcct gggctctggc ccccattgcc gtctttatct gctacgttgt tctgcgacac 120
aaccgaaagt ccgtccccgc tgcctctgct ggttcccact ccatcctcga gcccctgtgg 180
ctcgcccgac tccgattcat ccgagactct cgattcatca tcggtcaggg ctactccaag 240
ttcaaggaca ccatcttcaa ggtcaccaag gtcggtgccg atatcattgt tgttgctccc 300
aagtacgtcg aggagatccg acgactctcc cgagacaccg gccgatccgt cgagcccttc 360
atccacgact ttgccggtga gctcctcggt ggcctcaact tcctcgagtc cgatctccag 420
acccgagttg tccagcagaa gctcaccccc aacctcaaga ccattgtccc cgtcatggag 480
gacgagatgc actacgctct cgtttccgag ctcgactctt gtctcgatgg ttccgagcac 540
tggacccgag tcgacatgat ccacatgctc tcccgaattg tctctcgaat ctccgcccga 600
atcttcctcg gccccaagta ctgccgaaac gatctgtggc tcaagaccac tgccgagtac 660
accgagaacc tcttcctgac cggtactctg ctccgattcg tcccccgaat gctccagaag 720
tggatcgctc ctctgctccc ctctttccga cagctccagg agaaccgaca ggccgcccga 780
aagatcatct ccgagattct gaccgaccac cagcccgaga agcacgacga gacttccgac 840
aacggtgacc cctaccccga cattctcacc ctcatgttcc aggctgcccg aggtaaggag 900
aaggacattg aggacattgc ccagcacact ctgctcctct ctctgtcctc catccacacc 960
actgctctga ccatgaccca ggctctctac gatctctgtg cctaccccca gtacctcgac 1020
cccgtcaagc acgagattgc cgacactctg cagtccgagg gctcttggtc caaggccatg 1080
ctcgacaagc tccacatgat ggactctctg ctccgagagt cccagcgact ctctcccgtc 1140
tttctgctca ccttcaaccg aatcctccac acccctctga ccctctccaa cggtatccac 1200
ctccccaagg gtacccgaat tgccgccccc tccgacgcca tcctcaacga cccctctctg 1260
gtccccggcc cccagcccgc tgacaccttc gaccccttcc gatacatcaa ccactccacc 1320
ggtgatgcca agaaaaccaa gaccaacttc cagaccacct ctctgcagaa catggccttt 1380
ggctacggta agtacgcctg ccccggccga ttctacgttg ccaacgagat caagctcgtt 1440
ctcggccacc tgctcatgca ctacgagttc aagttccctc ccggtatggg ccgacccgtc 1500
aactccactg tcgacaccga catgtacccc gatctcggtg cccgactcct cgtccgaaag 1560
cgaaagatgg aggag 1575
<210> 164
<211> 1440
<212> DNA
<213> Artemisia annua
<400> 164
atgcccatga ccgtcatgct gctctttgtt ttcctgctct tcattgccat ctgtttcttc 60
ctcgtccacc gacacaactc caccaccacc aagaacctgc ctcccggctc cttcggctgg 120
cccttcattg gtgagactct cgcctacatc cgatccaagc gaggtggtga ccccgagcga 180
ttcaccaagg agcgaatcga gaagtacggc tccactctgg tctttaagac ctccgttgcc 240
ggtgagcgaa tggccgtttt ctgcggtccc gagggtaaca agttcctctt tggcaacgag 300
aacaagctcg ttgcttcttg gtggcccaac tccgtccgaa ttctcttcga gaagtgtctg 360
atcaccatcc gaggtgacga ggccaagtgg ctccgaaaga tgatgttcgc ctacctcggt 420
cccgatgctc tgtccaaccg atacaccggt accatggagg ttgtcacccg actccacatc 480
cagaaccact ggcagggcaa gtccgagctc aaggtctttg agactgtccg accttacctc 540
ttcgagctcg cctgccgact cttcctgtct ctcgacgacc ccaagcacgt tgccgagctc 600
ggtaccctct tcaacacctt cctcaagggt ctgaccgagc tccccatcaa catccccggt 660
acccgattct accgagccaa gcgagctgcc aacgccatca agaagcagct cattgtcatc 720
atcaagcagc gacgacaggc tctcaagcaa gaggaccagt cctcctcctt cgaggacctc 780
ctctcccacc tccttgtctc ctccgacgag aacggccgat tcctgtccga ggctgagatt 840
gccaacaacg ttctgctcct cctcttcgcc ggccacgaca cttctgctgt ctccatcact 900
ctgctcatga agtctctggc cgagcacccc caggtctacg acaacgtcct caaggagcag 960
ctcggtattc tcgaggccaa ggctcccggt gagatgctca actgggagga tatccagaag 1020
atgcgatact cttggtacgt tgtctgcgag gtcatgcgac tcattcctcc cgttgtcggc 1080
tctttccgag aggctctggt cgactttgag tacgccggct acaccatccc caagggctgg 1140
aagatcatct ggtccgccgt catgacccac aaagaggaga acaacttccc caacgccacc 1200
aagttcgacc cctctcgatt cgagggtgct ggtcccaccc ccttcaccta cgtccccttc 1260
ggtggtggcc cccgaatgtg cctcggcaag gagctcgccc gagtccgaat cctcgtcttt 1320
ctgcacaaca tcatgaccaa gttcaagtgg gatctgctca tccccgacga gaagattggc 1380
tacgaccccc tcgccacccc cgtcaagggt ctgcccgtcc gactccaccc ccaccaggtt 1440
<210> 165
<211> 1365
<212> DNA
<213> Ricinus communis
<400> 165
atggagctcg tcatgttccc cgttctggct ctcgtttcca ctctcttcct gctcgccctc 60
cacttcatca tccgaaccct caaggagcga ctctttggct ctcccaacct gcctcccggc 120
cgactcggct ggcccctcat tggtgagact cccgccttct tccgagctgg ctttgaggcc 180
aagcccgaga agttcatcgg tgagcgaatg gagaagtacg actcccgagt ctttaagacc 240
tctctgctcg gcaagccctt cgccgtcatc tccggtaccg ccggccacaa gttcctcttc 300
tccaacgaga acaagctcgt caacctgtgg tggcccgagt ccgtccgaat gctcttcaag 360
tctgctctcg tttccgttgt cggtgacgag gccaagcgaa tccgaaagat gctcatgacc 420
ttcctcggcc tcgatgctct caagaactac accgagcgaa ttgacatggt cacccagcag 480
cacatccgaa cctactggga gggtaaagag gaagtcaccg tctactccac tctcaagctc 540
tacaccttca ctctggcctg caacctcttc gcctccatca acgaccccga gcgactctcc 600
aagctcggtg cccacttcga cgtctttgtc aagggtgtca tctctctgcc catctccatc 660
cccggtaccc gactctacaa gtccatgaag gctgccaacg ccatccgaga ggagctcaag 720
ctcattgtcc gagatcgaaa agaggctctc gagcgaaaga tggcctcccc cacccaggat 780
ctgctctcct acctgctcgt cgactccgac accaacggcc gattcctgtc cgagatggag 840
attctcgaca acatcatgct gctcctctac gctgagcagc tcgagattgc caactccaag 900
aagcccggtg agctgctcca gtgggaggac gtccagaaga tgcgatactc ttggaacgtc 960
atctccgagg ttctgcgact gtctcctccc gtttcttctg cctaccgaca cgccattgtt 1020
gacttcacct acgagggtta caccatcccc aagggctggc agctcttcac ctccttcggt 1080
accacccacc gagatcccgc tctcttcccc aaccccgagc gattcgacgc ctctcgattc 1140
gagggtaacg gtcctccctc ctactcctac atccccttcg gtggtggtcc ccgaatgtgc 1200
attggctacg agtttgcccg actcgagatg ctcatcttcc tgcacaacat catcaagcga 1260
ttcaagtggg acattctcat ccccgacgag cagtttggct acaaccccct gctcgccccc 1320
tcccagggtt tccccgtccg actgcgaccc caccactccc atctc 1365
<210> 166
<211> 1428
<212> DNA
<213> Stevia rebaudiana
<400> 166
atgatccagg ttctgacccc catcctcctc ttcctcatct tcttcgtttt ctggaaggtc 60
tacaagcacc aaaagaccaa gatcaacctg ccccctggct ccttcggctg gcccttcctc 120
ggtgagactc ttgctctgct ccgagccggc tgggactccg agcccgagcg attcgtccga 180
gagcgaatca agaagcacgg ctctcctctc gttttcaaga cctctctctt cggtgaccga 240
ttcgccgttc tctgtggtcc cgctggtaac aagttcctct tctgcaacga gaacaagctc 300
gttgcctctt ggtggcccgt ccccgtccga aagctcttcg gcaagtctct gctcaccatc 360
cgaggtgacg aggccaagtg gatgcgaaag atgctgctct cctacctcgg tcccgatgcc 420
tttgccaccc actacgctgt caccatggac gttgtcaccc gacgacacat tgacgtccac 480
tggcgaggca aggaagaggt caacgttttc cagaccgtca agctctacgc ctttgagctc 540
gcctgccgac tcttcatgaa cctcgacgac cccaaccaca ttgccaagct cggctctctc 600
ttcaacatct tcctcaaggg tatcatcgag ctccccattg acgttcccgg tacccgattc 660
tactcctcca agaaggctgc tgctgccatc cgaatcgagc tcaagaagct catcaaggcc 720
cgaaagctcg agctcaagga aggcaaggcc tcctcctccc aggatctgct gtcccacctg 780
ctcacctccc ccgacgagaa cggtatgttc ctgaccgagg aagagattgt cgacaacatt 840
ctgctcctcc tctttgccgg ccacgacacc tctgctctgt ccatcactct gctcatgaaa 900
actctcggtg agcactccga tgtctacgac aaggtcctca aggagcagct cgagatctcc 960
aagactaaag aggcctggga gtctctcaag tgggaggaca tccagaagat gaagtactcc 1020
tggtccgtca tctgcgaggt catgcgactc aaccctcccg tcatcggtac ctaccgagag 1080
gctctggtcg acattgacta cgccggctac accatcccca agggctggaa gctccactgg 1140
tccgctgtct ccacccagcg agatgaggcc aactttgagg acgtcacccg attcgacccc 1200
tctcgattcg agggtgccgg tcccactccc ttcacctttg tccccttcgg tggtggtccc 1260
cgaatgtgtc tcggcaagga gtttgcccga ctcgaggttc tggccttcct gcacaacatt 1320
gtcaccaact tcaagtggga tctgctcatc cccgacgaga agattgagta cgaccccatg 1380
gccacccccg ccaagggtct gcccatccga ctccaccccc accaggtg 1428
<210> 167
<211> 1575
<212> DNA
<213> Arabidopsis thaliana
<400> 167
atggagtctc tggttgtcca caccgtcaac gccatctggt gcattgtcat tgtcggtatc 60
ttctccgtcg gctaccacgt ctacggccga gctgttgtcg agcagtggcg aatgcgacga 120
tctctcaagc tccagggtgt caagggtcct cctccctcca tcttcaacgg taacgtttcc 180
gagatgcagc gaatccagtc cgaggccaag cactgctccg gtgacaacat catctcccac 240
gactactctt cttctctgtt cccccacttt gaccactggc gaaagcagta cggccgaatc 300
tacacctact ccactggcct caagcagcac ctctacatca accaccccga gatggtcaag 360
gagctctccc agaccaacac cctcaacctc ggccgaatca cccacatcac caagcgactc 420
aaccccattc tcggtaacgg tatcatcacc tccaacggcc cccactgggc ccaccagcga 480
cgaatcattg cctacgagtt cacccacgac aagatcaagg gtatggtcgg tctgatggtc 540
gagtccgcca tgcccatgct caacaagtgg gaggagatgg tcaagcgagg tggtgagatg 600
ggctgtgaca tccgagtcga cgaggacctc aaggatgtct ccgctgacgt cattgccaag 660
gcctgtttcg gctcttcctt ctccaagggc aaggccatct tctccatgat ccgagatctg 720
ctcaccgcca tcaccaagcg atccgtcctc ttccgattca acggtttcac cgacatggtt 780
ttcggctcca agaagcacgg tgacgttgac attgacgctc tcgagatgga gctcgagtcc 840
tccatctggg agactgtcaa ggagcgagag attgagtgca aggacaccca caagaaggac 900
ctcatgcagc tcattctcga gggtgccatg cgatcttgtg acggtaacct gtgggacaag 960
tctgcttacc gacgattcgt tgtcgacaac tgcaagtcca tctactttgc cggccacgac 1020
tccaccgccg tttccgtttc ttggtgcctc atgctgctcg ctctcaaccc ctcttggcag 1080
gtcaagatcc gagatgagat tctgtcctcc tgcaagaacg gtatccccga cgccgagtcc 1140
atccccaacc tcaagaccgt caccatggtc atccaggaga ctatgcgact ctaccctccc 1200
gctcccattg tcggccgaga ggcctccaag gacattcgac tcggtgatct ggttgtcccc 1260
aagggtgtct gtatctggac cctcatcccc gctctgcacc gagatcccga gatctggggt 1320
cccgacgcca acgacttcaa gcccgagcga ttctccgagg gtatctccaa ggcctgcaag 1380
tacccccagt cctacatccc ctttggcctc ggcccccgaa cctgtgtcgg caagaacttt 1440
ggtatgatgg aggtcaaggt cctcgtttct ctgattgtct ccaagttctc cttcactctg 1500
tctcccacct accagcactc tccctcccac aagctgctcg tcgagcccca gcacggtgtt 1560
gtcatccgag ttgta 1575
<210> 168
<211> 1437
<212> DNA
<213> Ixeris dentata
<400> 168
atggacgccg ttgccgtcaa ctccgagact atgtcccacg ttgtctttat ccccttcccc 60
gcccagtccc acatcaagtg catgctcaag ctcgcccgac tgctgcacca caagggtctg 120
cacatcacct ttgtcaacac cgagctcaac cacaaccagc tgctgtcctc cggtggtccc 180
aactctctcg acggtgagcc cggtttccga ttcaagacca tccccgatgg tgtccccgag 240
ggtgctcccg acttcatgta cgctctgtgt gactccgttc tcaacaagat gctcgacccc 300
ttcgtcgatc tcattggccg actcgagtcc cccgccacct gtatcatcgg tgacggtatg 360
atgcccttca ctgttgctgc tgccgagaag ctcaagctcc ccatcatgca cttctggacc 420
ttccccgctg ctgccttcct cggctactac caggccccca acctcattga gaagggcttc 480
atccctccca aggacgagtc ttggtccacc aacggctacc tcgagactgt tgtcgactcc 540
atctccggtc tggagggttt ccgaatccga gacatccccg cctacttccg aaccaccgac 600
cccaacgact ccgacttcaa ctacatcatt gagtgtgtca aggccatccg aaaggtttcc 660
aacattgttc tccacacctt tgaggagctc gagtccacca tcatcaaggc tctgcagccc 720
atgatccccc acgtctacac cattggcccc ctcgagctcc tcctcaaccc catcaagctc 780
gaagaggaga ctgagaagct cgatatcaag ggctactctc tgtggaagga agatgacgag 840
tgcctcaagt ggctcgactc caaggagccc aactccgtca tctacgtcaa cttcggctct 900
ctcatctcca tgtccaagga gcagctcgcc gagtttggct ggggtctggt caactccaac 960
cactgcttcc tctgggtcat ccgacgagat ctggttgtcg gtgactctgc tcctctgcct 1020
cccgagctca aggagcgaat caacgagcga ggtttcattg cctcttggtg cccccaggag 1080
aaggtcctca agcactcctc cgtcggtggt ttcctcaccc actgtggctg gggctccatc 1140
atcgagtctc tgtccgccgg tgtccccatg ctctgctggc cctacctctg ggaccagccc 1200
accaactgcc gacaggcctg caaggagtgg gaggtcggtc tggagattga gggtaacgtc 1260
aacaaggacg aggtcgagcg actcacccga gagctcattg gtggtgagaa gggcaagcag 1320
atgcgatcca aggccctcga gtggaagaag aagattgaga ttgccaccgg ccccaagggc 1380
tcttcttctc tcaacgtcga gcgactggcc aacgacatca acatgttctc ccgaaat 1437
<210> 169
<211> 1446
<212> DNA
<213> Ricinus communis
<400> 169
atgggctcca ttgtccgaga tcacgacaag ccccacgttg tctgtgtccc ctaccccgcc 60
cagggccacg tcaaccccat ggtcaagctc gccaagctcc tccactacaa cgacttccac 120
gtcacctttg tcaacaccga gtacaaccac cgacgactcc tcaactcccg aggcccctct 180
tctctcgacg gcctccccga cttccgattc gaggccatct ccgacggtct gcccccttcc 240
gacgccaacg ccacccagga tatcccctct ctctgtgact ccacctccaa gaactctctg 300
gcccccttcc gaaacctgct gctcaagctc aagtcctccg actctctgcc tcccgtcacc 360
tgtatcatct ccgacgcctg catgtccttc accctcgatg ctgctgagga gttcggtatc 420
cccgagatcc tcttctggac cccctcttct tgtggtgttc tcggctactc ccagtaccac 480
actctcattg agaagggtct gactcccctc aaggacgcct cctacctgac caacggctac 540
ctcgagacta ccctcgactg gatccccggt atgaaggaca tccgattccg agatctgccc 600
tctttcatcc gaaccaccga ccgaaacgac atcatgctca acttcgttgt ccgagagctc 660
gagcgaacct cccgagcctc tgctgttgtt ttcaacacct tctacgcctt tgagaaggac 720
gttctcgacg ttctgtccac catgttccct cccatctact ccatcggtcc cctccagctc 780
ctcgtcgacc agatccccat tgaccgaaac ctcggtaaca ttggctccaa cctgtggaag 840
gagcagcccg agtgcattga ctggctcgac accaaggagc ccaactccgt tgtctacgtc 900
aactttggct ccatcaccgt catcactccc cagcagatga ttgagtttgc ctggggtctg 960
gcctcttcca agaagccctt cctgtggatc atccgacccg atctggtcat tggtgagaac 1020
gccatgctcc ccgctgagtt tgtctccgag actaaggacc gaggtatgct cgcctcttgg 1080
ggtccccagg agcagatcct caagcacccc gctgtcggtg gtttcctgtc ccacatgggc 1140
tggaactcca ccctcgactc catgtccggt ggtgtcccca tggtctgctg gcccttcttc 1200
gccgagcagc agaccaactg ccgattcgcc tgcaccgagt ggggtgtcgg tatggagatt 1260
gacaacaacg tcaagcgaga tgaggtcaag aagctcgttg aggtcctcat ggacggcaag 1320
aagggcaagg agatgaagtc caaggccatg gagtggaaaa ccaaggctga ggaagctgcc 1380
aagcccggtg gttcctccca caacaacctc gaccgactcg tcaagttcat caagggccag 1440
aagaat 1446
<210> 170
<211> 1374
<212> DNA
<213> Ixeris dentata
<400> 170
atggccgagg agcacaacaa gaccaacaac tcctctcccc acgttgtcat cttccccttc 60
ccctcccagg gccacatcaa ccccctcatc cagtttgcca agcgactctc ctccaagggt 120
gtcaagccca ctctcatcac caccatctac attgccaaga cctctcctta ccccaactcc 180
tccattgttg tcgagcccat ctccgacggt ttcgacgacg gtggtttcaa gtccgccacc 240
tccgctgagt cctacattga caccttccac caggtcggct ccaagtctct ggccaacctc 300
atccgaaagc tcgtcaacga gggcaaccac gttgatgcca tcatctacga ctccttcgtc 360
acctgggctc tcgatgttgc catggagtac ggtattgacg gtggctgctt cttcacccag 420
gcctgtgccg tcaacaacat ctactaccac gtctacaagg gtgtcctcga gatccccctg 480
caggctgctg ctcctcccac cgtcaccatt ctgctccccg agctccccca gctccagctg 540
tgggagactc cctcttttgt ccacaacccc ggcccctacc ccggctgggc ccacattgtc 600
tttaaccagt tccccaacat ccacaacgcc cgatgggttt tctccaacac cttcttcaag 660
ctcgaggagc aggtcatcaa gtggatgcga ctcatgtggc ctctcatggt cgtcggtccc 720
accgtcccct ccatgtacct cgacaagcga ctcgaggacg acgacgacta cggtatgtct 780
ctgctcaagc ccaaccacat tgagtgcatg ggctggctca acaacaagcc caagggctcc 840
gtcgtctacg tttctttcgg ctcttacggt gagctcggtg ttgcccagat ggaggagatt 900
gcctggggtc tgaacgagtc ctccgtcaac tacctgtggg ttgtccgaga gactgagaag 960
gagaagctcc ccaagtcctt cctggccaac ggtctgattg tcgagtggtg ccgacagctc 1020
gaggtcctcg cccacgaggc tgttggctgt ttcgtcaccc actgcggttt caactcctct 1080
ctcgagacta tctctctcgg tgtccccgtt gttgccatcc cccagtggac cgaccagacc 1140
accaacgcca agtgcctcga ggatatctgg ggtgtcggta tccgagccaa gacccccgtc 1200
acccgaacca acctggtctg gtgcatcaag gagatcatgg agggtgagcg aggtgctgtt 1260
gctcgaaaga acgccatcaa gtggaaggat ctggccattg aggctgtctc ccccggcggc 1320
tcttccgaca aggacatcaa cgagtttgtc tcccagctct cccccatcaa gtgt 1374
<210> 171
<211> 1362
<212> DNA
<213> Populus trichocarpa
<400> 171
atggacaaca agaagtccca cgtcattgtt ctgacctacc ccgcccaggg ccacatcaac 60
cccctgctcc agtttgccaa gcgactcgcc tccaagggcc tcaaggccac tctggccacc 120
acctactaca ccgtcaactc catcgacgcc cccaccgtcg gtgttgagcc catctccgac 180
ggtttcgacg agggtggttt caagcaggct tcttctctcg atgtctacct cgagtccttc 240
aagaccgtcg gctcccgaac tctgaccgag ctcgttttca agttcaaggc ttccggctct 300
cccgtcaact gtgttgtcta cgactccatg ctcccctggg ccctcgacgt tgctcgagat 360
ctcggtatct acgctgctgc cttcatgacc acctctgctt ccgtctgctc catgtactgg 420
cgaatcgatc tcggtctgct gtctctgccc ctcaagcagc agaccgccac tgtctctctg 480
cccggtctgc cccctctcgg ctgctgtgac ctcccctctt tcctcgccga gcccacctcc 540
cagaccgcct acctcgaggt catcatggag aagttccact ctctcaacga ggacgactgg 600
gttttctgca actccttcga ggacctcgag attgagctcg tcaaggccat gcgaggcaag 660
tggcccctcg tcatggtcgg ccccatggtc ccctccgcct acctcgacca gcagatcgac 720
ggtgaccgag cctacggtgc ttctctgtgg aagcccacct cctcccagtg cttcacctgg 780
ctcgacacca agccccctcg atccgtcatc tacgtttcct tcggctccat gggcaacatc 840
tccgccgagc aggttgagga gattgcctgg ggcctcaagg cttccaaccg acccttcctg 900
tgggtcatga aggagtccga gaagaagctc cccaccggtt tcctcaactc cgtcggtgag 960
actggtatgg ttgtctcttg gtgcaaccag ctcgaggttc tcgcccacca ggccattggc 1020
tgctttgtca cccactgtgg ctggaactcc actctcgagg gtctgggcct cggtgtcccc 1080
atggtctgtg tcaccgagcg atccgaccag cccatgaacg ccaagttcgt cgaggatgtc 1140
tggaaggtcg gtgtccgagc caagaaggac gaggttggta ttgtcacccg agaggagctc 1200
gagaagtgca tccgaggtgt catggacggt gagaacggtg aggagatcaa gcgaaacgcc 1260
aacaagtggc gagagctcgc ccgatctgct gtttccgtcg gtggttcctc cgacatgaac 1320
atcaacgagt ttgttgtcaa gctgctcgag ggcaagaagg ga 1362
<210> 172
<211> 1377
<212> DNA
<213> Nicotiana tabacum
<400> 172
atgaccaccc agaaggccca ctgtctgatc ctcccctacc ccgcccaggg ccacatcaac 60
cccatgctcc agttctccaa gcgactccag tccaagggtg tcaagatcac cattgctgcc 120
accaagtctt tcctcaagac catgcaggag ctgtccacct ctgtctccgt cgaggccatc 180
tccgacggct acgacgacgg tggccgagag caggctggta cctttgtcgc ctacatcacc 240
cgattcaaag aggttggctc cgacactctg tcccagctca tcggtaagct caccaactgt 300
ggctgccctg tctcttgcat tgtctacgac cccttcctgc cctgggctgt cgaggtcggt 360
aacaacttcg gtgttgccac tgctgccttc ttcacccagt cttgtgccgt cgacaacatc 420
tactaccacg tccacaaggg tgttctcaag ctccctccca ccgacgttga caaggagatc 480
tccatccccg gtctgctcac cattgaggct tccgatgtcc cctcttttgt ctccaacccc 540
gagtcctccc gaatcctcga gatgctcgtc aaccagttct ccaacctcga gaacaccgac 600
tgggttctca tcaactcctt ctacgagctc gagaaggaag tcattgactg gatggccaag 660
atctacccca tcaagaccat tggccccacc atcccctcca tgtacctcga caagcgactc 720
cccgatgaca aggagtacgg tctgtccgtc tttaagccca tgaccaacgc ctgcctcaac 780
tggctcaacc accagcccgt ttcttccgtc gtctacgttt ctttcggctc tctggccaag 840
ctcgaggctg agcagatgga ggagctcgcc tggggtctgt ccaactccaa caagaacttc 900
ctgtgggttg tccgatccac tgaggagtcc aagctcccca acaacttcct cgaggagctc 960
gcctccgaga agggtctggt tgtctcttgg tgcccccagc tccaggttct cgagcacaag 1020
tccatcggtt gcttcctgac ccactgcggc tggaactcca ctctcgaggc catctctctc 1080
ggtgtcccca tgattgccat gccccactgg tccgaccagc ccaccaacgc caagctcgtc 1140
gaggatgtct gggagatggg tatccgaccc aagcaggatg agaagggtct ggtccgacga 1200
gaggtcattg aggagtgtat caagattgtc atggaggaga agaagggcaa gaagatccga 1260
gagaacgcca agaagtggaa ggagctcgcc cgaaaggctg ttgacgaggg tggctcttcc 1320
gaccgaaaca ttgaggagtt cgtttccaag ctcgtcacca tcgcttccgt cgaatcg 1377
<210> 173
<211> 1434
<212> DNA
<213> Vaccaria hispanica
<400> 173
atgtccaaca acgagaacaa cgccacccag gtcattgttc tgccctacca cggccagggc 60
cacatgaaca ccatggtcca gtttgccaag cgactcgcct ggaagggtgt ccacgtcacc 120
atcgccacca ccttcaacac catccagcag atgaagctca acatctcctc ttacaactcc 180
atcactctcg agcccatcta cgacgacacc gacgactcca ctctgcacat caaggaccga 240
atggcccgat tcgaggctga ggctgcctcc aacctgaccc gagttctcga ggccaagaag 300
cagcagcagg ctctcaacaa gaagtgtctg ctcgtctacc acggctctct caactgggcc 360
ctcgttgtcg cccaccagca gaacgttgcc ggtgctgcct tcttcaccgc cgcctctgct 420
tctttcgcct gctactacta cctgcacctc gagtcccagg gcaagggtgt cgatctcgag 480
gagctcccct ccattctgcc ccctcccaag gtcatcgtcc agaagctccc caagtccttc 540
ctggcctacg gtgacaacaa ctcccacaac aacaacaata acaacaataa caacaacaat 600
aacaacaaca tgggtctgca ccctctggtt ctgtggctcc tcaaggacta cggcaactcc 660
gtcaaggccg actttgtcct cctcaactcc ttcgacaagc tcgaggaaga ggccatcaag 720
tggatctcca acatctgctc cgtcaagacc atcggtccca ccatcccctc cacctacctc 780
gacaagcaga tcgagaacga tgtcgactac ggtttcaacc agtacaagcc caccaacgag 840
gactgcatga agtggctcga caccaaggaa gccaactccg ttgtctacat tgccttcggc 900
tctgttgctc gactgtccgt cgagcagatg gccgagattg ccaaggctct cgaccactcc 960
tccaagtctt tcatctgggt tgtccgagag actgagaagg agaagctccc cgtcgacctc 1020
gtcgagaaga tctccggcca gggtatggtt gtcccctggg ctccccagct cgaggtcctc 1080
gcccacgatg ctgtcggctg tttcgtttcc cactgtggct ggaactccac cattgaggct 1140
ctgtctttcg gtgtccccat cctcgccatg ccccagttcc tcgaccagct cgttgatgcc 1200
cactttgtcg accgagtctg gggtgtcggt attgccccca ctgtcgacga gaacgatctc 1260
gtcacccagg aagagatctc ccgatgcctc gacgagatga tgggtggtgg tcccgagggt 1320
gagaagatca agaagaacgt tgccatgtgg aaggagctga ccaaagaggc tctcgacaag 1380
ggtggctctt ccgacaagca cattgacgag atcattgagt ggctctcttc tagt 1434
<210> 174
<211> 1443
<212> DNA
<213> Streptococcus mutans
<400> 174
atgcccatca tcaacaagac catgctcatc acctacgccg actctctcgg caagaacctc 60
aaggagctca acgagaacat tgagaactac ttcggtgatg ctgttggtgg tgtccacctg 120
ctccccttct tcccctccac tggtgaccga ggcttcgccc ccattgacta ccacgaggtt 180
gactctgcct tcggtgactg ggatgatgtc aagtgcctcg gtgagaagta ctacctcatg 240
ttcgacttca tgatcaacca catctcccga cagtccaagt actacaagga ctaccaggag 300
aagcacgagg cttctgccta caaggacctc ttcctcaact gggacaagtt ctggcccaag 360
aaccgaccta cccaggaaga tgtcgatctc atctacaagc gaaaggaccg agctcccaag 420
caggagatcc agtttgccga cggctccgtc gagcacctgt ggaacacctt cggtgaggag 480
cagattgacc tcgatgtcac caaggaagtc accatggact tcatccgatc caccattgag 540
aacctggctg ccaacggctg tgacctcatc cgactcgacg cctttgccta cgccgtcaag 600
aagctcgaca ccaacgactt cttcgtcgag cccgagatct ggactctgct cgacaaggtc 660
cgagacattg ctgccgtttc cggtgccgag atcctccccg agatccacga gcactacacc 720
atccagttca agattgctga ccacgactac tacgtctacg actttgctct gcccatggtc 780
actctgtact ctctgtactc ctccaaggtc gaccgactcg ccaagtggct caagatgtct 840
cccatgaagc agttcaccac cctcgacacc cacgacggta tcggtgttgt tgatgtcaag 900
gacattctga ccgacgagga gatcacctac acctccaacg agctctacaa ggtcggtgcc 960
aacgtcaacc gaaagtactc cactgccgag tacaacaacc tcgatatcta ccagatcaac 1020
tccacctact actctgctct cggtgacgac gaccagaagt acttcctggc ccgactcatc 1080
caggcctttg cccccggtat cccccaggtc tactacgttg gcttcctcgc cggcaagaac 1140
gatctcgagc tgctcgagtc caccaaagag ggccgaaaca tcaaccgaca ctactactcc 1200
tccgaggaga ttgctaaaga ggtcaagcga cccgttgtca aggctctgct caacctcttc 1260
acctaccgaa accagtccgc cgcctttgac ctcgacggcc gaatcgaggt tgagactccc 1320
aacgaggcca ccattgtcat cgagcgacag aacaaggacg gctctcacat tgccaaggcc 1380
gagatcaacc tgcaggacat gacctaccga gtcaccgaga acgaccagac catctccttc 1440
gag 1443
<210> 175
<211> 1392
<212> DNA
<213> Lobelia erinus
<400> 175
atggacaaca accacctcgg tgagactctg ctccccctcg ctcccaagaa cggccgacga 60
gttctcttct tcccctaccc cctccagggc cacatctccc ccatgctcaa cctcgccaac 120
ctgctccact ccaagggttt caccatcacc atcatccaca ccaacctcaa ctcccccaac 180
cagtccgact acccccactt caccttccga ccctttgacg acggtttccc tccttactcc 240
aagggctggc agctcgccac cctctgctcc cgatgtgtcg agcccttccg agagtgtctg 300
gcccagatct tcctgtccga ccacaccgcc cccgagggtg agcgagagtc cattgcctgc 360
ctcattgccg atggtctgtg gaacttcctc ggtgctgccg tctacaactt caagctcccc 420
atgattgttc tgcgaaccgg taacatgtcc aacattgttg ccaacgtcaa gctcccctgc 480
ttcatcgaga agggctactt tgaccacacc aaggaaggct ccaagctcga ggctgccgtc 540
cccgagttcc ccaccatcaa gttcaaggac attctcaaga cctacggctc caaccccaag 600
gccatctgtg agactctgac cgctctgctc aaggagatgc gagcttcttc cggtgtcatc 660
tggaactcct gcaaggagct cgagcagtcc gagctccaga tgatctgcaa ggagttcccc 720
gtcccccact tcctcatcgg tcctctgcac aagtacttcc ccgcctcttc ttcttctctc 780
gttgcccacg acccctcttc catctcttgg ctcaactcca aggctcccaa ctccgtcctc 840
tacgtttctt ttggctccat ctcctccatg gacgaggctg agttcctcga gactgcctgg 900
ggtctggcca actccatgca gcagttcctg tgggttgtcc gacccggctc cgtccgaggt 960
tcccagtggc tcgagtctct gcccgatggt ttcattgaca agctcgacgg ccgaggccac 1020
attgtcaagt gggctcccca gcaagaggtt ctcgcccacc aggccaccgg tggtttctgg 1080
acccactgtg gctggaactc cactctcgag tccatgtgcg agggtgtccc catgatctgc 1140
tcccacggta tcatggacca gcccatcaac gcccgatacg tcaccgatgt ctggaaggtc 1200
ggtatcgagc tcgagaaggg ctttgactcc gaggagatca agatggccat ccgacgactc 1260
atggtcgata aagagggcca ggagatccga gagcgatctt ctcgactcaa ggagtctctg 1320
tccaactgtc tcaagcaggg tggttcctcc cacgactccg tcgagtctct ggtcgaccac 1380
attctcagct tc 1392
<210> 176
<211> 1380
<212> DNA
<213> Arabidopsis thaliana
<400> 176
atggaggagc gaaagggccg acgaatcatc atgttccctc tgcccttccc cggccacttc 60
aaccccatga tcgagctcgc tggtatcttc caccaccgag gcttctccgt caccatcctc 120
cacacctcct acaacttccc cgacccctct cgacaccccc acttcacctt ccgaaccatc 180
tcccacaaca aggaaggcga agaggacccc ctgtcccagt ccgagacttc ttccatggac 240
ctcattgttc tcgtccgacg actcaagcag cgatacgctg agcccttccg aaagtccgtt 300
gctgctgagg tcggtggtgg tgagactgtc tgctgcctcg tttccgatgc catctggggt 360
aagaacaccg aggttgttgc cgaggagatt ggtgtccgac gagttgtcct ccgaaccggt 420
ggtgcctctt ctttctgcgc ctttgctgct ttccctctgc tccgagacaa gggctacctc 480
cccatccagg actcccgact cgacgagccc gtcaccgagc tgccccctct caaggtcaag 540
gatctgcccg tcatggagac taacgagccc gaggagctct accgagttgt caacgacatg 600
gtcgagggtg ccaagtcctc ttccggtgtc atctggaaca cctttgagga tctcgagcga 660
ctctctctca tgaactgctc ctccaagctc caggtcccct tcttccccat cggtcccttc 720
cacaagtact ccgaggaccc cacccccaag accgagaaca aggaagatac cgactggctc 780
gacaagcagg acccccagtc cgttgtctac gcctcttttg gctctctcgc cgccatcgag 840
gagaaggagt tcctcgagat tgcctggggt ctgcgaaact ccgagcgacc cttcctgtgg 900
gttgtccgac ccggctccgt ccgaggcacc gagtggctcg agtctctgcc cctcggcttc 960
atggagaaca ttggtgacaa gggcaagatc gtcaagtggg ccaaccagct cgaggttctg 1020
gcccaccccg ccatcggtgc cttctggacc cactgtggct ggaactccac tctcgagtcc 1080
atctgtgagg gtgtccccat gatctgcacc tcttgtttca ccgaccagca cgtcaacgcc 1140
cgatacattg tcgatgtctg gcgagtcggt atgctgctcg agcgatccaa gatggagaag 1200
aaggagattg agaaggtcct ccgatccgtc atgatggaga agggtgacgg tctgcgagag 1260
cgatctctca agctcaagga gcgagccgac ttctgcctct ccaaggacgg ctcctcctcc 1320
aagtacctcg acaagcttgt ctcccacgtt ctgtcttttg actcctacgc ctttgcatcc 1380
<210> 177
<211> 2139
<212> DNA
<213> Gibberella fujikuroi
<400> 177
atggctgagc tcgacactct cgacattgtt gttctcggtg tcatcttcct cggtaccgtt 60
gcctacttca ccaagggtaa gctgtggggt gtcaccaagg acccctacgc caacggcttt 120
gctgctggtg gtgcctccaa gcccggccga acccgaaaca ttgttgaggc catggaggag 180
tccggcaaga actgtgttgt tttctacggc tcccagaccg gcaccgccga ggactacgcc 240
tctcgactgg ccaaggaagg caagtctcga ttcggtctga acaccatgat cgctgatctc 300
gaggactacg actttgacaa cctggacacc gtcccctccg acaacatcgt catgtttgtc 360
ctggccacct acggtgaggg tgagcccacc gacaacgccg tcgacttcta cgagttcatc 420
accggtgagg acgcttcttt caacgagggt aacgaccctc ctctcggcaa cctcaactac 480
gttgccttcg gtctgggcaa caacacctac gagcactaca actccatggt ccgaaacgtc 540
aacaaggctc tcgagaagct cggcgcccac cgaatcggtg aggctggtga gggtgacgac 600
ggtgccggta ccatggagga agatttcctg gcctggaagg accccatgtg ggaggctctc 660
gccaagaaga tgggcctcga ggagcgagag gccgtctacg agcccatctt tgccatcaac 720
gagcgagatg atctcacccc cgaggccaac gaggtctacc tcggtgagcc caacaagctc 780
cacctcgagg gtaccgccaa gggccccttc aactcccaca acccctacat tgcccccatt 840
gctgagtcct acgagctgtt ctccgccaag gaccgaaact gtctgcacat ggagattgac 900
atctccggct ccaacctcaa gtacgagact ggtgaccaca tcgccatctg gcccaccaac 960
cccggtgaag aggtcaacaa gttcctcgac atcctcgatc tctccggcaa gcagcactct 1020
gttgtcaccg tcaaggctct cgagcccacc gccaaggtcc ccttccccaa ccccaccacc 1080
tacgacgcca tcctgcgata ccacctcgag atctgtgccc ccgtttctcg acagttcgtt 1140
tccactctgg ctgcctttgc ccccaacgac gacatcaagg ccgagatgaa ccgactcggc 1200
tccgacaagg actacttcca cgagaaaacc ggtcctcact actacaacat tgctcgattc 1260
ctggcttctg tctccaaggg tgagaagtgg accaagatcc ccttctccgc cttcattgag 1320
ggtctgacca agctccagcc ccgatactac tccatctctt cttcttctct cgtccagccc 1380
aagaagatct ccatcaccgc tgttgtcgag tcccagcaga tccccggccg agatgacccc 1440
ttccgaggtg ttgccaccaa ctacctcttc gccctcaagc agaagcagaa cggtgacccc 1500
aaccccgctc ccttcggtca gtcttacgag ctcaccggcc cccgaaacaa gtacgacggt 1560
atccacgtcc ccgtccacgt ccgacactcc aacttcaagc tcccctccga ccccggcaag 1620
cccatcatca tgattggccc cggtactggt gttgctccct tccgaggctt tgtccaggag 1680
cgagccaagc aggctcgaga tggtgtcgag gtcggtaaga ctctgctctt cttcggctgc 1740
cgaaagtcca ccgaggactt catgtaccag aaggagtggc aggagtacaa agaggctctc 1800
ggtgacaagt tcgagatgat caccgccttc tcccgagagg gctccaagaa ggtctacgtc 1860
cagcaccgac tcaaggagcg atccaaggaa gtctccgacc tcctctccca gaaggcctac 1920
ttctacgtct gcggtgacgc cgcccacatg gcccgagagg tcaacaccgt tctggcccag 1980
atcattgccg agggccgagg tgtctccgag gccaagggcg aggagattgt caagaacatg 2040
cgatctgcca accagtacca ggtctgctcc gactttgtca ccctccactg caaggagact 2100
acttacgcca actccgagct ccaggaagat gtctggtct 2139
<210> 178
<211> 2076
<212> DNA
<213> Arabidopsis thaliana
<400> 178
atgacctctg ctctctacgc ctccgatctc ttcaagcagc tcaagtccat catgggtacc 60
gactccctgt ccgacgatgt cgttctggtc attgccacca cctctctggc tctggtcgcc 120
ggtttcgttg tcctcctctg gaagaaaacc accgctgacc gatccggcga gctcaagccc 180
ctcatgatcc ccaagtctct catggccaag gacgaggacg acgatctcga tctcggttcc 240
ggcaagaccc gagtctccat cttctttggc acccagaccg gtaccgccga gggtttcgcc 300
aaggctctct ccgaggagat caaggcccga tacgagaagg ccgccgtcaa ggtcatcgac 360
ctcgacgact acgctgctga cgacgaccag tacgaggaga agctcaagaa ggagactctc 420
gccttcttct gtgtcgccac ctacggtgac ggcgagccca ccgacaacgc cgcccgattc 480
tacaagtggt tcaccgagga gaacgagcga gacatcaagc tccagcagct ggcctacggt 540
gttttcgccc tcggcaaccg acagtacgag cacttcaaca agatcggtat tgtcctcgac 600
gaggagctct gcaagaaggg tgccaagcga ctcattgagg ttggtctggg tgatgatgac 660
cagtccattg aggacgactt caacgcctgg aaggagtctc tgtggtccga gctcgacaag 720
ctgctcaagg acgaggacga caagtccgtt gccaccccct acaccgccgt catccccgag 780
taccgagtcg tcacccacga cccccgattc accacccaga agtccatgga gtccaacgtt 840
gccaacggta acaccaccat tgacatccac cacccctgcc gagtcgacgt tgccgtccag 900
aaggagctcc acacccacga gtccgaccga tcttgtatcc acctcgagtt tgacatctcc 960
cgaaccggta tcacctacga gactggtgac cacgttggtg tctacgccga gaaccacgtt 1020
gagattgtcg aggaagctgg taagctgctc ggccactctc tcgacctcgt tttctccatc 1080
cacgccgaca aagaggacgg ctctcccctc gagtccgctg ttccccctcc cttccccggc 1140
ccctgcactc tcggcaccgg tctggcccga tacgctgacc tcctcaaccc tccccgaaag 1200
tccgctctgg ttgctctcgc cgcctacgcc accgagccct ccgaggccga gaagctcaag 1260
cacctgacct cccccgacgg caaggacgag tactcccagt ggatcgttgc ttcccagcga 1320
tctctgctcg aggtcatggc cgccttcccc tctgccaagc cccctctggg tgttttcttt 1380
gccgccattg ctccccgact ccagccccga tactactcca tctcctcctc tccccgactc 1440
gccccctctc gagtccacgt cacctctgct ctcgtctacg gtcccactcc cactggccga 1500
atccacaagg gtgtctgctc cacctggatg aagaacgccg tccccgctga gaagtcccac 1560
gagtgctccg gtgctcccat cttcatccga gcctccaact tcaagctccc ctccaacccc 1620
tccaccccca ttgtcatggt cggccccggt actggtctgg cccccttccg aggcttcctc 1680
caggagcgaa tggctctcaa ggaagatggt gaggagctcg gctcttctct gctcttcttt 1740
ggctgccgaa accgacagat ggacttcatc tacgaggacg agctcaacaa ctttgtcgac 1800
cagggtgtca tctccgagct catcatggcc ttctcccgag agggtgccca gaaggagtac 1860
gtccagcaca agatgatgga gaaggccgcc caggtctggg atctcatcaa ggaagagggc 1920
tacctctacg tctgtggtga cgccaagggc atggcccgag atgtccaccg aactctgcac 1980
accattgtcc aggagcagga aggtgtctct tcctccgagg ctgaggccat tgtcaagaag 2040
ctgcagaccg agggccgata cctgcgagat gtatgg 2076
<210> 179
<211> 2133
<212> DNA
<213> Arabidopsis thaliana
<400> 179
atgtcctcct cttcttcttc ttccacctcc atgattgatc tcatggctgc catcatcaag 60
ggtgagcccg tcattgtctc cgaccccgcc aacgcctccg cctacgagtc cgttgctgcc 120
gagctgtcct ccatgctcat cgagaaccga cagtttgcca tgatcgtcac cacctccatt 180
gctgttctca ttggctgcat tgtcatgctc gtctggcgac gatctggctc cggtaactcc 240
aagcgagtcg agcccctcaa gcccctggtc atcaagcccc gagaagagga gatcgacgac 300
ggccgaaaga aggtcaccat cttctttggc acccagaccg gtactgctga gggcttcgcc 360
aaggctctcg gtgaggaagc caaggctcga tacgaaaaga cccgattcaa gattgtcgac 420
ctcgatgatt acgctgccga tgacgacgag tacgaggaga agctcaagaa agaggacgtt 480
gccttcttct tcctcgccac ctacggtgac ggtgagccca ccgacaacgc tgcccgattc 540
tacaagtggt tcaccgaggg taacgaccga ggcgagtggc tcaagaacct caagtacggt 600
gttttcggtc tgggcaaccg acagtacgag cacttcaaca aggttgccaa ggttgtcgac 660
gacatcctcg tcgagcaggg tgcccagcga ctcgtccagg tcggcctcgg tgatgatgac 720
cagtgcatcg aggacgactt cactgcctgg cgagaggctc tgtggcccga gctcgacacc 780
attctgcgag aggaaggtga caccgccgtt gccaccccct acaccgccgc cgtcctcgag 840
taccgagtct ccatccacga ctccgaggat gccaagttca acgacatcaa catggccaac 900
ggtaacggct acaccgtctt tgacgcccag cacccctaca aggccaacgt cgccgtcaag 960
cgagagctcc acacccccga gtccgaccga tcttgtatcc acctcgagtt tgacattgct 1020
ggttccggtc tgacctacga gactggtgac cacgttggtg tcctctgtga caacctgtcc 1080
gagactgtcg acgaggctct gcgactcctc gacatgtccc ccgacactta cttctctctg 1140
cacgccgaga aagaggacgg tactcccatc tcttcttctc tgccccctcc cttccctccc 1200
tgcaacctgc gaaccgctct gacccgatac gcctgcctcc tctcttctcc caagaagtct 1260
gctctcgttg ctctggccgc ccacgcctcc gaccccaccg aggctgagcg actcaagcac 1320
ctcgcctctc ccgctggcaa ggacgagtac tccaagtggg ttgtcgagtc ccagcgatct 1380
ctgctcgagg tcatggccga gttcccctcc gccaagcccc ctctcggtgt tttcttcgcc 1440
ggtgttgctc cccgactcca gccccgattc tactccatct cctcttcccc caagatcgcc 1500
gagactcgaa tccacgttac ctgtgctctg gtctacgaga agatgcccac cggccgaatc 1560
cacaagggtg tctgctccac ctggatgaag aacgccgttc cctacgagaa gtccgagaac 1620
tgttcctctg ctcccatctt tgtccgacag tccaacttca agctcccctc cgactccaag 1680
gtccccatca tcatgattgg ccccggtacc ggcctcgccc ccttccgagg cttcctgcag 1740
gagcgactcg ccctcgtcga gtccggtgtc gagctcggcc cctccgtcct cttctttggc 1800
tgccgaaacc gacgaatgga cttcatctac gaagaggagc tccagcgatt cgtcgagtcc 1860
ggtgctctcg ccgagctctc cgttgccttc tcccgagagg gtcccaccaa ggagtacgtc 1920
cagcacaaga tgatggacaa ggcctccgac atctggaaca tgatctccca gggcgcctac 1980
ctctacgtct gcggtgacgc caagggtatg gcccgagatg tccaccgatc tctgcacacc 2040
attgcccagg agcagggctc catggactcc accaaggccg agggtttcgt caagaacctc 2100
cagacctccg gccgatacct ccgagatgtc tgg 2133
<210> 180
<211> 1575
<212> DNA
<213> Gibberella fujikuroi
<400> 180
atgtccaagt ccaactccat gaactccacc tcccacgaga ctctcttcca gcagctcgtt 60
ctcggcctcg accgaatgcc cctcatggac gtccactggc tcatctacgt tgcctttggt 120
gcctggctct gctcctacgt catccacgtt ctgtcctctt cctccactgt caaggtcccc 180
gtcgtcggtt accgatccgt tttcgagccc acctggctcc tccgactgcg attcgtctgg 240
gagggtggtt ccatcattgg ccagggctac aacaagttca aggactccat cttccaggtc 300
cgaaagctcg gtaccgacat tgtcatcatc cctcccaact acattgacga ggtccgaaag 360
ctctcccagg acaagacccg atccgtcgag cccttcatca acgactttgc cggccagtac 420
acccgaggta tggtctttct gcagtccgat ctccagaacc gagtcatcca gcagcgactc 480
acccccaagc ttgtctctct caccaaggtc atgaaggaag agctcgacta cgctctgacc 540
aaggagatgc ccgacatgaa gaacgacgag tgggttgagg tcgacatctc ttccatcatg 600
gtccgactca tctctcgaat ctccgcccga gttttcctcg gccccgagca ctgccgaaac 660
caggagtggc tcaccaccac cgccgagtac tccgagtctc tcttcatcac cggcttcatc 720
ctccgagttg tcccccacat tctccgaccc ttcattgctc ctctgctgcc ctcttaccga 780
accctgctgc gaaacgtttc ttccggccga cgagtcattg gtgatatcat ccgatcccag 840
cagggtgacg gtaacgagga catcctctct tggatgcgag atgctgccac tggtgaggag 900
aagcagatcg acaacattgc ccagcgaatg ctcattctgt ctctcgcctc catccacacc 960
accgccatga ccatgaccca cgccatgtac gatctgtgtg cctgccccga gtacattgag 1020
cccctccgag atgaggtcaa gtccgtcgtt ggtgcttctg gctgggacaa gaccgctctc 1080
aaccgattcc acaagctcga ctctttcctc aaggagtccc agcgattcaa ccccgttttc 1140
ctgctcacct tcaaccgaat ctaccaccag tccatgaccc tctccgatgg taccaacatc 1200
ccctccggta cccgaattgc tgtcccctct cacgccatgc tccaggactc cgcccacgtc 1260
cccggtccca ctcctcccac tgagttcgac ggtttccgat actccaagat ccgatccgac 1320
tccaactacg cccagaagta cctcttctcc atgaccgact cttccaacat ggcctttggc 1380
tacggtaagt acgcctgccc cggccgattc tacgcctcca acgagatgaa gctgactctg 1440
gccattctgc tcctccagtt tgagttcaag ctccccgacg gtaagggccg accccgaaac 1500
atcaccatcg actccgacat gatccccgac ccccgagctc gactctgtgt ccgaaagcga 1560
tctctgcgtg acgag 1575
<210> 181
<211> 1419
<212> DNA
<213> Stevia rebaudiana
<400> 181
atggccacct ccgactccat tgtcgacgac cgaaagcagc tgcacgttgc caccttcccc 60
tggctcgcct ttggccacat tctgccctac ctccagctct ccaagctcat tgctgagaag 120
ggccacaagg tttctttcct gtccaccacc cgaaacatcc agcgactctc ctcccacatc 180
tctcctctca tcaacgttgt ccagctcacc ctcccccgag tccaggagct ccccgaggat 240
gccgaggcca ccactgatgt ccaccccgag gacatcccct acctcaagaa ggcctccgac 300
ggtctgcagc ccgaggtcac ccgattcctc gagcagcact ctcccgactg gatcatctac 360
gactacaccc actactggct cccctccatt gctgcttctc tcggtatctc tcgagcccac 420
ttctccgtca ccaccccctg ggccattgct tacatgggcc cctctgctga cgccatgatc 480
aacggttccg acggccgaac caccgtcgag gatctcacca cccctcccaa gtggttcccc 540
ttccccacca aggtctgctg gcgaaagcac gatctcgccc gactcgtccc ctacaaggcc 600
cccggtatct ccgacggtta ccgaatgggt ctggttctca agggctccga ctgtctgctc 660
tccaagtgct accacgagtt tggtacccag tggctccccc tgctcgagac tctgcaccag 720
gtccccgttg tccccgtcgg tctgctccct cccgagatcc ccggtgacga gaaggacgag 780
acttgggttt ccatcaagaa gtggctcgac ggcaagcaga agggctccgt cgtctacgtt 840
gctctcggct ccgaggttct tgtctcccag actgaggtcg tcgagctcgc cctcggtctg 900
gagctctccg gtctgccctt cgtctgggcc taccgaaagc ccaagggtcc cgccaagtcc 960
gactccgtcg agctccccga cggtttcgtc gagcgaactc gagatcgagg tctggtctgg 1020
acctcttggg ctccccagct ccgaatcctc tcccacgagt ccgtctgcgg tttcctgacc 1080
cactgtggtt ccggctccat tgtcgagggc ctcatgttcg gccaccccct catcatgctg 1140
cccatcttcg gtgaccagcc cctcaacgcc cgactcctcg aggacaagca ggtcggtatc 1200
gagatccccc gaaacgaaga ggacggctgc ctcaccaagg agtctgttgc ccgatctctg 1260
cgatctgttg ttgtcgagaa agagggtgag atctacaagg ccaacgcccg agagctctcc 1320
aagatctaca acgacaccaa ggtcgagaag gagtacgttt cccagtttgt cgactacctc 1380
gagaagaacg cccgagctgt cgccattgac cacgagagt 1419
<210> 182
<211> 2552
<212> DNA
<213> 人工序列
<220>
<223> tCPS-cwpT
<400> 182
cccactagtt ataaagtcac aagtatctca gtatacccgt ctaaccacac atttatcacc 60
atgtgcaagg ctgtttccaa ggagtactcc gatctgctcc agaaggacga ggcctctttc 120
accaagtggg acgacgacaa ggtcaaggac cacctcgaca ccaacaagaa cctctacccc 180
aacgacgaga tcaaggagtt tgtcgagtcc gtcaaggcca tgttcggctc catgaacgac 240
ggcgagatta atgtctctgc ttacgacacc gcctgggttg ctctggtcca ggatgtcgac 300
ggttccggct ctcctcagtt cccttcctct ctcgagtgga tcgccaacaa ccagctgtcc 360
gacggttctt ggggtgacca cctgctcttc tctgctcacg accgaatcat caacaccctg 420
gcctgtgtca ttgctctgac ctcttggaac gtccacccct ccaagtgcga gaagggtctg 480
aacttcctcc gagagaacat ctgcaagctc gaggacgaga acgccgagca catgcccatt 540
ggcttcgagg tcaccttccc ctctctgatt gacattgcca agaagctcaa cattgaggtc 600
cccgaggaca cccccgctct caaggagatc tacgctcgac gagacatcaa gctcaccaag 660
atccccatgg aggttctcca caaggtcccc accactctcc tccactctct cgagggtatg 720
cccgatctcg agtgggagaa gctgctcaag ctgcagtgca aggacggctc tttcctcttc 780
tccccctctt ccactgcctt cgccctcatg cagaccaagg acgagaagtg tctccagtac 840
ctcaccaaca ttgtcaccaa gttcaacggt ggtgtcccca acgtctaccc cgttgacctc 900
tttgagcaca tctgggttgt tgaccgactc cagcgactcg gtatcgcccg atacttcaag 960
tccgagatca aggactgtgt cgagtacatc aacaagtact ggaccaagaa cggtatctgc 1020
tgggcccgaa acacccacgt ccaggacatt gacgacaccg ccatgggctt ccgagttctg 1080
cgagcccacg gctacgatgt cacccccgat gtctttcgac agtttgagaa ggacggcaag 1140
tttgtctgtt tcgccggtca gtccacccag gccgtcaccg gtatgttcaa cgtctaccga 1200
gcttctcaga tgctcttccc cggtgagcga atcctcgagg acgccaagaa gttctcctac 1260
aactacctca aggagaagca gtccaccaac gagctgctcg acaagtggat cattgccaag 1320
gatctgcccg gtgaggttgg ctacgccctc gacatcccct ggtacgcctc tctgccccga 1380
ctggagactc gatactacct cgagcagtac ggtggtgagg acgatgtctg gatcggtaag 1440
accctgtacc gaatgggcta cgtttccaac aacacctacc tcgagatggc caagctcgac 1500
tacaacaact acgttgccgt cctccagctc gagtggtaca ccatccagca gtggtacgtc 1560
gacattggta tcgagaagtt cgagtccgac aacatcaagt ccgtccttgt ctcctactac 1620
ctcgctgctg cctccatctt cgagcccgag cgatccaagg agcgaattgc ctgggccaag 1680
accaccatcc tcgtcgacaa gatcacctcc atcttcgact cctcccagtc ctccaaggaa 1740
gatatcaccg ccttcattga caagttccga aacaagtcct cctccaagaa gcactccatc 1800
aacggcgagc cctggcacga ggtcatggtt gctctcaaga aaactctcca cggctttgcc 1860
ctcgacgctc tgatgaccca ctctcaggac atccaccccc agctccacca ggcctgggag 1920
atgtggctca ccaagctcca ggacggtgtt gatgtcactg ctgagctcat ggtccagatg 1980
atcaacatga ccgccggccg atgggtttcc aaggagctcc tcacccaccc ccagtaccag 2040
cgactctcca ctgtcaccaa ctctgtctgc cacgacatca ccaagctcca caacttcaag 2100
gagaactcca ccaccgtcga ctccaaggtc caggagctgg tccagctcgt tttctccgac 2160
acccccgatg atctcgacca ggacatgaag cagaccttcc tgactgtcat gaaaactttc 2220
tactacaagg cctggtgcga ccccaacacc atcaacgacc acatctccaa ggtctttgag 2280
attgtgattt aagttttttg atcaatgatc caatggcttt cacatacccc cccacgccta 2340
taattaaaac acagagaaat ataatctaac ttaataaata ttacggagaa tctttcgagt 2400
gttcagcaga aatatagcca ttgtaacaaa agccggctat cgaccgcttt atcgaagaat 2460
atttcccgcc ccccagtggc caaacgatat cgaaacaaaa gagctgaaat catatccttc 2520
agtagtagta tagtcctgtt atcacagcat ca 2552
<210> 183
<211> 2394
<212> DNA
<213> 人工序列
<220>
<223> tKS
<400> 183
agatatacaa gcaatgtcac tctccttcgt actcgtacat acaacacaac tacattcaaa 60
atgacctccc acggcggcca gaccaacccc accaacctca tcattgacac caccaaggag 120
cgaatccaga agcagttcaa gaacgtcgag atctccgttt cctcctacga caccgcctgg 180
gtcgccatgg tcccctctcc caactccccc aagtctccct gcttccccga gtgtctcaac 240
tggctcatca acaaccagct caacgacggc tcttggggtc tggtcaacca cacccacaac 300
cacaaccacc ccctcctcaa ggactctctc tcttccactc tcgcctgcat tgttgctctc 360
aagcgatgga acgttggcga ggaccagatc aacaagggtc tgtctttcat tgagtccaac 420
ctcgcctccg ccaccgagaa gtcccagccc tcccccattg gctttgatat catcttcccc 480
ggtctgctcg agtacgccaa gaacctcgat atcaacctgc tctccaagca gaccgacttc 540
tctctcatgc tgcacaagcg agagctcgag cagaagcgat gccactccaa cgagatggac 600
ggctacctgg cctacatttc cgagggtctg ggtaacctct acgactggaa catggtcaag 660
aagtaccaga tgaagaacgg ttccgttttc aactccccct ctgccaccgc tgctgccttc 720
atcaaccacc agaaccccgg ctgtctcaac tacctcaact ctctgctcga caagtttggt 780
aacgccgtcc ccactgtcta cccccacgat ctcttcatcc gactctccat ggtcgacacc 840
attgagcgac tcggtatttc ccaccacttc cgagtcgaga tcaagaacgt tctcgatgag 900
acttaccgat gctgggttga gcgagatgag cagatcttca tggacgttgt cacctgtgct 960
ctggccttcc gactcctccg aatcaacggt tacgaggttt cccccgaccc cctcgccgag 1020
atcaccaacg agctggctct caaggacgag tacgccgccc tcgagactta ccacgcttct 1080
cacattctgt accaagagga tctgtcctcc ggcaagcaga ttctcaagtc cgccgacttc 1140
ctcaaggaga tcatctccac tgactccaac cgactctcca agctcatcca caaggaagtc 1200
gagaacgctc tcaagttccc catcaacacc ggtctggagc gaatcaacac ccgacgaaac 1260
atccagctct acaacgtcga caacacccga attctcaaga ccacctacca ctcttccaac 1320
atctccaaca ccgactacct gcgactcgcc gtcgaggact tctacacctg ccagtccatc 1380
taccgagagg agctcaaggg tctggagcga tgggttgtcg agaacaagct cgaccagctc 1440
aagtttgccc gacaaaagac tgcctactgc tacttctccg ttgctgccac cctctcttct 1500
cccgagctct ccgacgcccg aatctcttgg gccaagaacg gtatcctgac cactgttgtc 1560
gacgacttct ttgacattgg tggcaccatt gacgagctga ccaacctcat ccagtgcgtc 1620
gagaagtgga acgtcgacgt tgacaaggac tgttgttccg agcacgtccg aatcctcttc 1680
ctggctctca aggacgccat ctgctggatc ggtgacgagg ccttcaagtg gcaggctcga 1740
gatgtcactt cccacgtcat ccagacctgg ctcgagctca tgaactccat gctgcgagag 1800
gccatctgga cccgagatgc ctacgtcccc accctcaacg agtacatgga gaacgcctac 1860
gtcagctttg ctctcggtcc cattgtcaag cccgccatct actttgtcgg tcccaagctg 1920
tccgaggaga ttgtcgagtc ctccgagtac cacaacctct tcaagctcat gtccacccag 1980
ggccgactcc tcaacgatat ccactccttc aagcgagagt tcaaggaagg taagctcaac 2040
gccgttgctc tgcacctgtc caacggtgag tccggcaagg tcgaggaaga ggtcgtcgag 2100
gagatgatga tgatgatcaa gaacaagcga aaggagctca tgaagctcat cttcgaggag 2160
aacggctcca ttgtcccccg agcctgcaag gacgccttct ggaacatgtg ccacgtcctc 2220
aacttcttct acgccaacga cgacggtttc accggcaaca ccattctcga caccgtcaag 2280
gacatcatct acaaccctct ggttctggtc aacgagaacg aggagcagag gtaactatcc 2340
gaagatcaag agcgaagcaa gttgtaagtc caggacatgt ttcccgccca cgcg 2394
<210> 184
<211> 2979
<212> DNA
<213> 人工序列
<220>
<223> CPSKS
<400> 184
cccactagtt ataaagtcac aagtatctca gtatacccgt ctaaccacac atttatcacc 60
atgcccggca agattgagaa cggtaccccc aaggacctca agaccggcaa cgacttcgtt 120
tccgctgcca agtctctgct cgaccgagcc ttcaagtccc accactccta ctacggtctg 180
tgctccactt cctgccaggt ctacgacacc gcctgggttg ccatgatccc caagacccga 240
gacaacgtca agcagtggct cttccccgag tgtttccact acctgctcaa gacccaggct 300
gctgacggct cttggggctc tctgcccacc acccagaccg ccggtatcct cgacactgcc 360
tccgccgttc tggctctgct ctgccacgcc caggagcctc tgcagatcct cgatgtctcc 420
cccgatgaga tgggtctgcg aatcgagcac ggtgtcactt ctctcaagcg acagctcgcc 480
gtctggaacg acgttgagga caccaaccac attggtgtcg agttcatcat ccccgctctg 540
ctctccatgc tcgagaagga gctcgatgtc ccctccttcg agttcccttg ccgatccatt 600
ctggagcgaa tgcacggcga gaagctcggt cactttgatc tcgagcaggt ctacggcaag 660
ccctcttctc tgctgcactc tctcgaggcc ttcctcggca agctcgattt cgaccgactc 720
tcccaccacc tctaccacgg ctccatgatg gcctccccct cttccaccgc cgcctacctc 780
attggtgcca ccaagtggga cgacgaggcc gaggactacc tccgacacgt catgcgaaac 840
ggtgccggcc acggtaacgg tggtatctct ggtaccttcc ccaccaccca ctttgagtgc 900
tcctggatca ttgccactct cctcaaggtc ggtttcactc tcaagcagat tgacggtgac 960
ggtctgcgag gcctctccac cattctcctc gaggctctgc gtgacgagaa cggtgtcatt 1020
ggcttcgctc cccgaaccgc cgacgttgat gacactgcca aggctctgct ggctctgtct 1080
ctggtcaacc agcccgtttc ccccgacatc atgatcaagg ttttcgaggg taaggaccac 1140
ttcaccacct ttggctccga gcgagatccc tctctgacct ccaacctcca cgttctcctc 1200
tctctgctca agcagtccaa cctgtctcag taccaccccc agatcctcaa gaccactctc 1260
ttcacctgtc gatggtggtg gggctccgac cactgtgtca aggacaagtg gaacctctcc 1320
cacctctacc ccaccatgct cctcgtcgag gccttcaccg aggttctgca cctcatcgac 1380
ggtggtgagc tctcctctct gtttgacgag tctttcaagt gcaagatcgg tctgtccatc 1440
ttccaggccg ttctccgaat catcctcacc caggacaacg acggctcttg gcgaggctac 1500
cgagagcaga cctgttacgc cattctcgcc ctcgtccagg cccgacacgt ctgtttcttc 1560
acccacatgg tcgaccgact ccagtcctgt gtcgaccgag gtttctcttg gctcaagtct 1620
tgctccttcc actctcagga cctcacctgg acctccaaga ccgcttacga ggttggtttc 1680
gttgccgagg cctacaagct cgctgctctc cagtccgcct ctctcgaggt ccccgctgcc 1740
accatcggcc actccgtcac ctctgctgtc ccctcctccg atctcgagaa gtacatgcga 1800
ctcgtccgaa agaccgctct gttctctccc ctcgatgagt ggggtctgat ggcctccatc 1860
atcgagtcct ccttctttgt ccccctcctc caggcccagc gagttgagat ctacccccga 1920
gacaacatca aggtcgacga ggacaagtac ctctccatca tccccttcac ctgggttggc 1980
tgcaacaacc gatctcgaac ctttgcctcc aaccgatggc tctacgacat gatgtacctg 2040
tctctgctcg gctaccagac cgacgagtac atggaggccg ttgccggccc cgtctttggt 2100
gatgtctctc tgctgcacca gaccatcgac aaggtcattg acaacaccat gggtaacctg 2160
gcccgagcca acggtaccgt ccactccggc aacggtcacc agcacgagtc ccccaacatt 2220
ggccaggtcg aggacaccct cactcgattt accaactccg tcctcaacca caaggacgtc 2280
ctcaactcct cctcctccga ccaggacacc ctccgacgag agttccgaac cttcatgcac 2340
gcccacatca cccagatcga ggacaactcc cgattctcca agcaggcttc ttccgacgcc 2400
ttctcctccc ccgagcagtc ctacttccag tgggtcaact ccactggtgg ttcccacgtt 2460
gcctgcgcct actctttcgc cttctccaac tgcctcatgt ccgccaacct cctccagggt 2520
aaggacgcct tcccttccgg tacccagaag tacctcatct cctccgtcat gcgacacgcc 2580
accaacatgt gccgaatgta caacgatttc ggttccatcg cccgagacaa cgccgagcga 2640
aacgtcaact ccatccactt tcccgagttc actctctgta acggtacctc ccagaacctc 2700
gacgagcgaa aggagcgact cctcaagatt gctacttacg agcagggcta cctcgaccga 2760
gctctcgagg ctctggagcg acagtcccga gatgatgctg gtgaccgagc cggctccaag 2820
gacatgcgaa agctcaagat tgtcaagctc ttctgtgacg tcaccgacct ctacgaccag 2880
ctctacgtca tcaaggacct ctcctcctcc atgaaataac tatccgaaga tcaagagcga 2940
agcaagttgt aagtccagga catgtttccc gcccacgcg 2979
<210> 185
<211> 1698
<212> DNA
<213> 人工序列
<220>
<223> KAH4
<400> 185
ttgaatcttt tttcttcttc tcttctctat attcattctt gaattaaaca cacatcaaca 60
atggagtctc tggttgtcca caccgtcaac gccatctggt gcattgtcat tgtcggtatc 120
ttctccgtcg gctaccacgt ctacggccga gctgttgtcg agcagtggcg aatgcgacga 180
tctctcaagc tccagggtgt caagggtcct cctccctcca tcttcaacgg taacgtttcc 240
gagatgcagc gaatccagtc cgaggccaag cactgctccg gtgacaacat catctcccac 300
gactactctt cttctctgtt cccccacttt gaccactggc gaaagcagta cggccgaatc 360
tacacctact ccactggcct caagcagcac ctctacatca accaccccga gatggtcaag 420
gagctctccc agaccaacac cctcaacctc ggccgaatca cccacatcac caagcgactc 480
aaccccattc tcggtaacgg tatcatcacc tccaacggcc cccactgggc ccaccagcga 540
cgaatcattg cctacgagtt cacccacgac aagatcaagg gtatggtcgg tctgatggtc 600
gagtccgcca tgcccatgct caacaagtgg gaggagatgg tcaagcgagg tggtgagatg 660
ggctgtgaca tccgagtcga cgaggacctc aaggatgtct ccgctgacgt cattgccaag 720
gcctgtttcg gctcttcctt ctccaagggc aaggccatct tctccatgat ccgagatctg 780
ctcaccgcca tcaccaagcg atccgtcctc ttccgattca acggtttcac cgacatggtt 840
ttcggctcca agaagcacgg tgacgttgac attgacgctc tcgagatgga gctcgagtcc 900
tccatctggg agactgtcaa ggagcgagag attgagtgca aggacaccca caagaaggac 960
ctcatgcagc tcattctcga gggtgccatg cgatcttgtg acggtaacct gtgggacaag 1020
tctgcttacc gacgattcgt tgtcgacaac tgcaagtcca tctactttgc cggccacgac 1080
tccaccgccg tttccgtttc ttggtgcctc atgctgctcg ctctcaaccc ctcttggcag 1140
gtcaagatcc gagatgagat tctgtcctcc tgcaagaacg gtatccccga cgccgagtcc 1200
atccccaacc tcaagaccgt caccatggtc atccaggaga ctatgcgact ctaccctccc 1260
gctcccattg tcggccgaga ggcctccaag gacattcgac tcggtgatct ggttgtcccc 1320
aagggtgtct gtatctggac cctcatcccc gctctgcacc gagatcccga gatctggggt 1380
cccgacgcca acgacttcaa gcccgagcga ttctccgagg gtatctccaa ggcctgcaag 1440
tacccccagt cctacatccc ctttggcctc ggcccccgaa cctgtgtcgg caagaacttt 1500
ggtatgatgg aggtcaaggt cctcgtttct ctgattgtct ccaagttctc cttcactctg 1560
tctcccacct accagcactc tccctcccac aagctgctcg tcgagcccca gcacggtgtt 1620
gtcatccgag ttgtataaac ttcgagctaa tccagtagct tacgttaccc aggggcaggt 1680
caactggcta gccacgag 1698
<210> 186
<211> 1698
<212> DNA
<213> 人工序列
<220>
<223> KOgib
<400> 186
cacacccgaa atcgttaagc atttccttct gagtataaga atcattcgct agccacaaaa 60
atgtccaagt ccaactccat gaactccacc tcccacgaga ctctcttcca gcagctcgtt 120
ctcggcctcg accgaatgcc cctcatggac gtccactggc tcatctacgt tgcctttggt 180
gcctggctct gctcctacgt catccacgtt ctgtcctctt cctccactgt caaggtcccc 240
gtcgtcggtt accgatccgt tttcgagccc acctggctcc tccgactgcg attcgtctgg 300
gagggtggtt ccatcattgg ccagggctac aacaagttca aggactccat cttccaggtc 360
cgaaagctcg gtaccgacat tgtcatcatc cctcccaact acattgacga ggtccgaaag 420
ctctcccagg acaagacccg atccgtcgag cccttcatca acgactttgc cggccagtac 480
acccgaggta tggtctttct gcagtccgat ctccagaacc gagtcatcca gcagcgactc 540
acccccaagc ttgtctctct caccaaggtc atgaaggaag agctcgacta cgctctgacc 600
aaggagatgc ccgacatgaa gaacgacgag tgggttgagg tcgacatctc ttccatcatg 660
gtccgactca tctctcgaat ctccgcccga gttttcctcg gccccgagca ctgccgaaac 720
caggagtggc tcaccaccac cgccgagtac tccgagtctc tcttcatcac cggcttcatc 780
ctccgagttg tcccccacat tctccgaccc ttcattgctc ctctgctgcc ctcttaccga 840
accctgctgc gaaacgtttc ttccggccga cgagtcattg gtgatatcat ccgatcccag 900
cagggtgacg gtaacgagga catcctctct tggatgcgag atgctgccac tggtgaggag 960
aagcagatcg acaacattgc ccagcgaatg ctcattctgt ctctcgcctc catccacacc 1020
accgccatga ccatgaccca cgccatgtac gatctgtgtg cctgccccga gtacattgag 1080
cccctccgag atgaggtcaa gtccgtcgtt ggtgcttctg gctgggacaa gaccgctctc 1140
aaccgattcc acaagctcga ctctttcctc aaggagtccc agcgattcaa ccccgttttc 1200
ctgctcacct tcaaccgaat ctaccaccag tccatgaccc tctccgatgg taccaacatc 1260
ccctccggta cccgaattgc tgtcccctct cacgccatgc tccaggactc cgcccacgtc 1320
cccggtccca ctcctcccac tgagttcgac ggtttccgat actccaagat ccgatccgac 1380
tccaactacg cccagaagta cctcttctcc atgaccgact cttccaacat ggcctttggc 1440
tacggtaagt acgcctgccc cggccgattc tacgcctcca acgagatgaa gctgactctg 1500
gccattctgc tcctccagtt tgagttcaag ctccccgacg gtaagggccg accccgaaac 1560
atcaccatcg actccgacat gatccccgac ccccgagctc gactctgtgt ccgaaagcga 1620
tctctgcgtg acgagtaagc tatttacagc atgtgtaatg aggaatataa cgttgattga 1680
attgtttgtg aaaaatgt 1698
<210> 187
<211> 2262
<212> DNA
<213> 人工序列
<220>
<223> CPR1
<400> 187
cgtccatata tatctatgct gcgtcgtcct tttcgtgaca tcaccaaaac acatacaaaa 60
atggctgagc tcgacactct cgacattgtt gttctcggtg tcatcttcct cggtaccgtt 120
gcctacttca ccaagggtaa gctgtggggt gtcaccaagg acccctacgc caacggcttt 180
gctgctggtg gtgcctccaa gcccggccga acccgaaaca ttgttgaggc catggaggag 240
tccggcaaga actgtgttgt tttctacggc tcccagaccg gcaccgccga ggactacgcc 300
tctcgactgg ccaaggaagg caagtctcga ttcggtctga acaccatgat cgctgatctc 360
gaggactacg actttgacaa cctggacacc gtcccctccg acaacatcgt catgtttgtc 420
ctggccacct acggtgaggg tgagcccacc gacaacgccg tcgacttcta cgagttcatc 480
accggtgagg acgcttcttt caacgagggt aacgaccctc ctctcggcaa cctcaactac 540
gttgccttcg gtctgggcaa caacacctac gagcactaca actccatggt ccgaaacgtc 600
aacaaggctc tcgagaagct cggcgcccac cgaatcggtg aggctggtga gggtgacgac 660
ggtgccggta ccatggagga agatttcctg gcctggaagg accccatgtg ggaggctctc 720
gccaagaaga tgggcctcga ggagcgagag gccgtctacg agcccatctt tgccatcaac 780
gagcgagatg atctcacccc cgaggccaac gaggtctacc tcggtgagcc caacaagctc 840
cacctcgagg gtaccgccaa gggccccttc aactcccaca acccctacat tgcccccatt 900
gctgagtcct acgagctgtt ctccgccaag gaccgaaact gtctgcacat ggagattgac 960
atctccggct ccaacctcaa gtacgagact ggtgaccaca tcgccatctg gcccaccaac 1020
cccggtgaag aggtcaacaa gttcctcgac atcctcgatc tctccggcaa gcagcactct 1080
gttgtcaccg tcaaggctct cgagcccacc gccaaggtcc ccttccccaa ccccaccacc 1140
tacgacgcca tcctgcgata ccacctcgag atctgtgccc ccgtttctcg acagttcgtt 1200
tccactctgg ctgcctttgc ccccaacgac gacatcaagg ccgagatgaa ccgactcggc 1260
tccgacaagg actacttcca cgagaaaacc ggtcctcact actacaacat tgctcgattc 1320
ctggcttctg tctccaaggg tgagaagtgg accaagatcc ccttctccgc cttcattgag 1380
ggtctgacca agctccagcc ccgatactac tccatctctt cttcttctct cgtccagccc 1440
aagaagatct ccatcaccgc tgttgtcgag tcccagcaga tccccggccg agatgacccc 1500
ttccgaggtg ttgccaccaa ctacctcttc gccctcaagc agaagcagaa cggtgacccc 1560
aaccccgctc ccttcggtca gtcttacgag ctcaccggcc cccgaaacaa gtacgacggt 1620
atccacgtcc ccgtccacgt ccgacactcc aacttcaagc tcccctccga ccccggcaag 1680
cccatcatca tgattggccc cggtactggt gttgctccct tccgaggctt tgtccaggag 1740
cgagccaagc aggctcgaga tggtgtcgag gtcggtaaga ctctgctctt cttcggctgc 1800
cgaaagtcca ccgaggactt catgtaccag aaggagtggc aggagtacaa agaggctctc 1860
ggtgacaagt tcgagatgat caccgccttc tcccgagagg gctccaagaa ggtctacgtc 1920
cagcaccgac tcaaggagcg atccaaggaa gtctccgacc tcctctccca gaaggcctac 1980
ttctacgtct gcggtgacgc cgcccacatg gcccgagagg tcaacaccgt tctggcccag 2040
atcattgccg agggccgagg tgtctccgag gccaagggcg aggagattgt caagaacatg 2100
cgatctgcca accagtacca ggtctgctcc gactttgtca ccctccactg caaggagact 2160
acttacgcca actccgagct ccaggaagat gtctggtctt aaaattaaca gatagtttgc 2220
cggtgataat tctcttaacc tcccacactc ctttgacata ac 2262
<210> 188
<211> 2256
<212> DNA
<213> 人工序列
<220>
<223> CPR3
<400> 188
cgtccatata tatctatgct gcgtcgtcct tttcgtgaca tcaccaaaac acatacaaaa 60
atgtcctcct cttcttcttc ttccacctcc atgattgatc tcatggctgc catcatcaag 120
ggtgagcccg tcattgtctc cgaccccgcc aacgcctccg cctacgagtc cgttgctgcc 180
gagctgtcct ccatgctcat cgagaaccga cagtttgcca tgatcgtcac cacctccatt 240
gctgttctca ttggctgcat tgtcatgctc gtctggcgac gatctggctc cggtaactcc 300
aagcgagtcg agcccctcaa gcccctggtc atcaagcccc gagaagagga gatcgacgac 360
ggccgaaaga aggtcaccat cttctttggc acccagaccg gtactgctga gggcttcgcc 420
aaggctctcg gtgaggaagc caaggctcga tacgaaaaga cccgattcaa gattgtcgac 480
ctcgatgatt acgctgccga tgacgacgag tacgaggaga agctcaagaa agaggacgtt 540
gccttcttct tcctcgccac ctacggtgac ggtgagccca ccgacaacgc tgcccgattc 600
tacaagtggt tcaccgaggg taacgaccga ggcgagtggc tcaagaacct caagtacggt 660
gttttcggtc tgggcaaccg acagtacgag cacttcaaca aggttgccaa ggttgtcgac 720
gacatcctcg tcgagcaggg tgcccagcga ctcgtccagg tcggcctcgg tgatgatgac 780
cagtgcatcg aggacgactt cactgcctgg cgagaggctc tgtggcccga gctcgacacc 840
attctgcgag aggaaggtga caccgccgtt gccaccccct acaccgccgc cgtcctcgag 900
taccgagtct ccatccacga ctccgaggat gccaagttca acgacatcaa catggccaac 960
ggtaacggct acaccgtctt tgacgcccag cacccctaca aggccaacgt cgccgtcaag 1020
cgagagctcc acacccccga gtccgaccga tcttgtatcc acctcgagtt tgacattgct 1080
ggttccggtc tgacctacga gactggtgac cacgttggtg tcctctgtga caacctgtcc 1140
gagactgtcg acgaggctct gcgactcctc gacatgtccc ccgacactta cttctctctg 1200
cacgccgaga aagaggacgg tactcccatc tcttcttctc tgccccctcc cttccctccc 1260
tgcaacctgc gaaccgctct gacccgatac gcctgcctcc tctcttctcc caagaagtct 1320
gctctcgttg ctctggccgc ccacgcctcc gaccccaccg aggctgagcg actcaagcac 1380
ctcgcctctc ccgctggcaa ggacgagtac tccaagtggg ttgtcgagtc ccagcgatct 1440
ctgctcgagg tcatggccga gttcccctcc gccaagcccc ctctcggtgt tttcttcgcc 1500
ggtgttgctc cccgactcca gccccgattc tactccatct cctcttcccc caagatcgcc 1560
gagactcgaa tccacgttac ctgtgctctg gtctacgaga agatgcccac cggccgaatc 1620
cacaagggtg tctgctccac ctggatgaag aacgccgttc cctacgagaa gtccgagaac 1680
tgttcctctg ctcccatctt tgtccgacag tccaacttca agctcccctc cgactccaag 1740
gtccccatca tcatgattgg ccccggtacc ggcctcgccc ccttccgagg cttcctgcag 1800
gagcgactcg ccctcgtcga gtccggtgtc gagctcggcc cctccgtcct cttctttggc 1860
tgccgaaacc gacgaatgga cttcatctac gaagaggagc tccagcgatt cgtcgagtcc 1920
ggtgctctcg ccgagctctc cgttgccttc tcccgagagg gtcccaccaa ggagtacgtc 1980
cagcacaaga tgatggacaa ggcctccgac atctggaaca tgatctccca gggcgcctac 2040
ctctacgtct gcggtgacgc caagggtatg gcccgagatg tccaccgatc tctgcacacc 2100
attgcccagg agcagggctc catggactcc accaaggccg agggtttcgt caagaacctc 2160
cagacctccg gccgatacct ccgagatgtc tggtaaaatt aacagatagt ttgccggtga 2220
taattctctt aacctcccac actcctttga cataac 2256
<210> 189
<211> 1566
<212> DNA
<213> 人工序列
<220>
<223> UGT1
<400> 189
cccactagtt ataaagtcac aagtatctca gtatacccgt ctaaccacac atttatcacc 60
atggacgcca tggccaccac cgagaagaag ccccacgtca tcttcatccc cttccccgcc 120
cagtcccaca tcaaggccat gctcaagctc gcccagctcc tccaccacaa gggcctccag 180
atcacctttg tcaacaccga cttcatccac aaccagttcc tcgagtcctc cggcccccac 240
tgtctggacg gtgctcccgg tttccgattt gagactatcc ccgatggtgt ctcccactcc 300
cccgaggcct ccatccccat ccgagagtct ctgctccgat ccattgagac taacttcctc 360
gaccgattca ttgatctcgt caccaagctc cccgatcctc ccacctgtat catctccgac 420
ggtttcctgt ccgttttcac cattgatgct gccaagaagc tcggtatccc cgtcatgatg 480
tactggactc tggctgcctg tggtttcatg ggtttctacc acatccactc tctgatcgag 540
aagggctttg ctcctctcaa ggacgcctcc tacctcacca acggttacct cgacaccgtc 600
attgactggg tccccggtat ggagggtatc cgactcaagg acttccccct cgactggtcc 660
accgacctca acgacaaggt tctcatgttc accaccgagg ctccccagcg atcccacaag 720
gtttcccacc acatcttcca caccttcgac gagctcgagc cctccatcat caagactctg 780
tctctgcgat acaaccacat ctacaccatt ggccccctcc agctcctcct cgaccagatc 840
cccgaggaga agaagcagac cggtatcacc tctctgcacg gctactctct cgtcaaggaa 900
gagcccgagt gcttccagtg gctccagtcc aaggagccca actccgttgt ctacgtcaac 960
tttggctcca ccaccgtcat gtctctcgag gacatgaccg agtttggctg gggtctggcc 1020
aactccaacc actacttcct gtggatcatc cgatccaacc tcgtcattgg cgagaacgcc 1080
gttctgcctc ccgagctcga ggagcacatc aagaagcgag gcttcattgc ctcttggtgc 1140
tcccaggaga aggttctcaa gcacccctcc gtcggtggtt tcctgaccca ctgcggctgg 1200
ggctccacca ttgagtctct gtccgctggt gtccccatga tctgctggcc ctactcctgg 1260
gaccagctca ccaactgccg atacatctgc aaggagtggg aggttggtct ggagatgggt 1320
accaaggtca agcgagatga ggtcaagcga ctcgtccagg agctcatggg cgagggtggt 1380
cacaagatgc gaaacaaggc caaggactgg aaggagaagg cccgaattgc cattgccccc 1440
aacggctctt cttctctcaa cattgacaag atggtcaagg agatcactgt tctcgctcga 1500
aactaactat ccgaagatca agagcgaagc aagttgtaag tccaggacat gtttcccgcc 1560
cacgcg 1566
<210> 190
<211> 1503
<212> DNA
<213> 人工序列
<220>
<223> UGT3
<400> 190
ttgaatcttt tttcttcttc tcttctctat attcattctt gaattaaaca cacatcaaca 60
atggccgagc agcagaagat caagaagtct ccccacgttc tgctcatccc cttccctctg 120
cagggccaca tcaacccctt catccagttc ggcaagcgac tcatctccaa gggtgtcaag 180
accactctgg tcaccaccat ccacaccctc aactccactc tcaaccactc caacaccacc 240
accacctcca tcgagatcca ggccatctcc gacggctgtg acgagggtgg tttcatgtct 300
gctggtgagt cttacctcga gactttcaag caggtcggtt ccaagtctct ggctgacctc 360
atcaagaagc tccagtccga gggtaccacc attgacgcca tcatctacga ctccatgacc 420
gagtgggttc tcgatgtcgc catcgagttt ggtattgacg gtggctcctt cttcacccag 480
gcctgtgtcg tcaactctct ctactaccac gtccacaagg gtctgatctc tctgcccctc 540
ggcgagactg tctccgtccc cggtttcccc gttctgcagc gatgggagac tcctctcatt 600
ctccagaacc acgagcagat ccagtccccc tggtcccaga tgctcttcgg ccagttcgcc 660
aacattgacc aggcccgatg ggttttcacc aactccttct acaagctcga ggaagaggtc 720
attgagtgga cccgaaagat ctggaacctc aaggtcattg gccccaccct cccctccatg 780
tacctcgaca agcgactcga tgacgacaag gacaacggtt tcaacctcta caaggccaac 840
caccacgagt gcatgaactg gctcgacgac aagcccaagg agtccgttgt ctacgttgcc 900
tttggctctc tggtcaagca cggccccgag caggttgagg agatcacccg agctctgatt 960
gactccgatg tcaacttcct gtgggtcatc aagcacaagg aagagggtaa gctccccgag 1020
aacctgtccg aggtcatcaa gaccggcaag ggcctcattg ttgcctggtg caagcagctc 1080
gacgttctcg cccacgagtc cgtcggctgc tttgtcaccc actgcggttt caactccacc 1140
ctcgaggcta tctctctcgg tgtccccgtt gttgccatgc cccagttctc cgaccagacc 1200
accaacgcca agctcctcga tgagattctc ggtgtcggtg tccgagtcaa ggctgacgag 1260
aacggtattg tccgacgagg taacctggct tcttgtatca agatgatcat ggaggaagag 1320
cgaggtgtca tcatccgaaa gaacgccgtc aagtggaagg atctggccaa ggttgctgtc 1380
cacgagggtg gctcttccga caacgacatt gtcgagtttg tctccgagct catcaaggcc 1440
taaacttcga gctaatccag tagcttacgt tacccagggg caggtcaact ggctagccac 1500
gag 1503
<210> 191
<211> 1497
<212> DNA
<213> 人工序列
<220>
<223> UGT4
<400> 191
cacacccgaa atcgttaagc atttccttct gagtataaga atcattcgct agccacaaaa 60
atggagaaca agaccgagac taccgtccga cgacgacgac gaatcattct cttccccgtc 120
cccttccagg gccacatcaa ccccattctg cagctcgcca acgttctgta ctccaagggc 180
ttctccatca ccatcttcca caccaacttc aacaagccca agacctccaa ctacccccac 240
ttcactttcc gattcatcct cgacaacgac ccccaggacg agcgaatctc caacctgccc 300
acccacggtc ctctggctgg tatgcgaatc cccatcatca acgagcacgg tgctgacgag 360
ctccgacgag agctcgagct gctcatgctc gcctccgaag aggacgagga agtctcctgt 420
ctgatcaccg atgctctgtg gtactttgcc cagtccgtcg ccgactctct caacctgcga 480
cgactcgttc tcatgacctc ctctctgttc aacttccacg cccacgtttc tctgccccag 540
tttgacgagc tcggttacct cgaccccgat gacaagaccc gactcgagga gcaggcttcc 600
ggtttcccca tgctcaaggt caaggacatc aagtccgcct actccaactg gcagattctc 660
aaggagattc tcggcaagat gatcaagcag accaaggcct cctccggtgt catctggaac 720
tccttcaagg agctcgagga gtccgagctc gagactgtca tccgagagat ccccgctccc 780
tctttcctca tccccctgcc caagcacctc accgcttcct cctcttctct gctcgaccac 840
gaccgaaccg tctttcagtg gctcgaccag cagccccctt cctccgtcct ctacgtttcc 900
ttcggctcca cctccgaggt cgacgagaag gacttcctcg agattgctcg aggcctcgtt 960
gactccaagc agtccttcct gtgggttgtc cgacccggct ttgtcaaggg ctccacctgg 1020
gttgagcccc tgcccgatgg tttcctcggt gagcgaggcc gaattgtcaa gtgggtcccc 1080
cagcaggaag ttctggccca cggtgccatt ggtgccttct ggacccactc cggctggaac 1140
tccactctcg agtccgtctg cgagggtgtc cccatgatct tctccgactt tggcctcgac 1200
cagcccctca acgcccgata catgtccgat gttctcaagg tcggtgtcta cctcgagaac 1260
ggctgggagc gaggtgagat tgccaacgcc atccgacgag tcatggtcga cgaggaaggt 1320
gagtacatcc gacagaacgc ccgagtcctc aagcagaagg ccgatgtctc tctcatgaag 1380
ggtggttctt cttacgagtc tctcgagtct ctcgtttcct acatctcttc tttgtaagct 1440
atttacagca tgtgtaatga ggaatataac gttgattgaa ttgtttgtga aaaatgt 1497
<210> 192
<211> 1542
<212> DNA
<213> 人工序列
<220>
<223> UGT9
<400> 192
cgtccatata tatctatgct gcgtcgtcct tttcgtgaca tcaccaaaac acatacaaaa 60
atggccacct ccgactccat tgtcgacgac cgaaagcagc tgcacgttgc caccttcccc 120
tggctcgcct ttggccacat tctgccctac ctccagctct ccaagctcat tgctgagaag 180
ggccacaagg tttctttcct gtccaccacc cgaaacatcc agcgactctc ctcccacatc 240
tctcctctca tcaacgttgt ccagctcacc ctcccccgag tccaggagct ccccgaggat 300
gccgaggcca ccactgatgt ccaccccgag gacatcccct acctcaagaa ggcctccgac 360
ggtctgcagc ccgaggtcac ccgattcctc gagcagcact ctcccgactg gatcatctac 420
gactacaccc actactggct cccctccatt gctgcttctc tcggtatctc tcgagcccac 480
ttctccgtca ccaccccctg ggccattgct tacatgggcc cctctgctga cgccatgatc 540
aacggttccg acggccgaac caccgtcgag gatctcacca cccctcccaa gtggttcccc 600
ttccccacca aggtctgctg gcgaaagcac gatctcgccc gactcgtccc ctacaaggcc 660
cccggtatct ccgacggtta ccgaatgggt ctggttctca agggctccga ctgtctgctc 720
tccaagtgct accacgagtt tggtacccag tggctccccc tgctcgagac tctgcaccag 780
gtccccgttg tccccgtcgg tctgctccct cccgagatcc ccggtgacga gaaggacgag 840
acttgggttt ccatcaagaa gtggctcgac ggcaagcaga agggctccgt cgtctacgtt 900
gctctcggct ccgaggttct tgtctcccag actgaggtcg tcgagctcgc cctcggtctg 960
gagctctccg gtctgccctt cgtctgggcc taccgaaagc ccaagggtcc cgccaagtcc 1020
gactccgtcg agctccccga cggtttcgtc gagcgaactc gagatcgagg tctggtctgg 1080
acctcttggg ctccccagct ccgaatcctc tcccacgagt ccgtctgcgg tttcctgacc 1140
cactgtggtt ccggctccat tgtcgagggc ctcatgttcg gccaccccct catcatgctg 1200
cccatcttcg gtgaccagcc cctcaacgcc cgactcctcg aggacaagca ggtcggtatc 1260
gagatccccc gaaacgaaga ggacggctgc ctcaccaagg agtctgttgc ccgatctctg 1320
cgatctgttg ttgtcgagaa agagggtgag atctacaagg ccaacgcccg agagctctcc 1380
aagatctaca acgacaccaa ggtcgagaag gagtacgttt cccagtttgt cgactacctc 1440
gagaagaacg cccgagctgt cgccattgac cacgagagtt aaaattaaca gatagtttgc 1500
cggtgataat tctcttaacc tcccacactc ctttgacata ac 1542
<210> 193
<211> 880
<212> DNA
<213> 人工序列
<220>
<223> pTPI
<400> 193
aaacaaaaga gctgaaatca tatccttcag tagtagtata gtcctgttat cacagcatca 60
attacccccg tccaagtaag ttgattggga tttttgttta cagatacagt aatatacttg 120
actatttctt tacaggtgac tcagaaagtg catgttggaa atgagccaca gaccaagaca 180
agatatgaca aaattgcact attcgatgca gaattcgacg gtgtttccat tggtgttatg 240
acattcatct gcattcatac aaaaaagtct tggtagtggt acttttgcgt tattacctcc 300
gatatctacg caccccccaa cccccctgct acagtaaaga gtgtgagtct actgtacatg 360
cttactaaac cacctactgt acagcgaaac ccctcagcaa aatcacacaa tcagctcatt 420
acaacacacc caatgacctc accacaaatt ctatacgcct tttgacgcca ttattacagt 480
agcttgcaac gccgttgtct taggttccat ttttagtgct ctattacctc acttaacccg 540
tataggcaga tcaggccatg gcactaagtg tagagctaga ggttgatatc gccacgagtg 600
ctccatcagg gctagggtgg ggttagaaat acagtccgtg cgcactcaaa aggcgtccgg 660
gttagggcat ccgataatat cgcctggact cggcgccata ttctcgactt ctgggcgcgt 720
tgtattcatc tcctccgctt cccaacactt ccacccgttt ctccatccca accaatagaa 780
tagggtaacc ttattcggga cactttcgtc atacatagtc agatatacaa gcaatgtcac 840
tctccttcgt actcgtacat acaacacaac tacattcaaa 880
<210> 194
<211> 1183
<212> DNA
<213> 人工序列
<220>
<223> gpdT-pGPD
<400> 194
ctatccgaag atcaagagcg aagcaagttg taagtccagg acatgtttcc cgcccacgcg 60
agtgatttat aacacctctc ttttttgaca cccgctcgcc ttgaaattca tgtcacataa 120
attatagtca acgacgtttg aataacttgt cttgtagttc gatgatgatc atatgattac 180
attaatagta attactgtat ggcgcgccgg tagtcggaaa gagccgggac cggccggcga 240
gcataaaccg gacgcagtag gatgtcctgc acgggtcttt ttgtggggtg tggagaaagg 300
ggtgcttgga gatggaagcc ggtagaaccg ggctgcttgg ggggatttgg ggccgctggg 360
ctccaaagag gggtaggcat ttcgttgggg ttacgtaatt gcggcatttg ggtcctgcgc 420
gcatgtccca ttggtcagaa ttagtccgga taggagactt atcagccaat cacagcgccg 480
gatccacctg taggttgggt tgggtgggag cacccctcca cagagtagag tcaaacagca 540
gcagcaacat gatagttggg ggtgtgcgtg ttaaaggaaa aaaaaagaag cttgggttat 600
attcccgctc tatttagagg ttgcgggata gacgccgacg gagggcaatg gcgccatgga 660
accttgcgga tatcgatacg ccgcggcgga ctgcgtccga accagctcca gcagcgtttt 720
ttccgggcca ttgagccgac tgcgaccccg ccaacgtgtc ttggcccacg cactcatgtc 780
atgttggtgt tgggaggcca ctttttaagt agcacaaggc acctagctcg cagcaaggtg 840
tccgaaccaa agaagcggct gcagtggtgc aaacggggcg gaaacggcgg gaaaaagcca 900
cgggggcacg aattgaggca cgccctcgaa tttgagacga gtcacggccc cattcgcccg 960
cgcaatggct cgccaacgcc cggtcttttg caccacatca ggttacccca agccaaacct 1020
ttgtgttaaa aagcttaaca tattataccg aacgtaggtt tgggcgggct tgctccgtct 1080
gtccaaggca acatttatat aagggtctgc atcgccggct caattgaatc ttttttcttc 1140
ttctcttctc tatattcatt cttgaattaa acacacatca aca 1183
<210> 195
<211> 639
<212> DNA
<213> 人工序列
<220>
<223> pgmT-pTEF
<400> 195
acttcgagct aatccagtag cttacgttac ccaggggcag gtcaactggc tagccacgag 60
tctgtcccag gtcgcaattt agtgtaataa acaatatata tattgagtct aaagggaatt 120
gtagctattg tgattgtgtg attttcgtct tgctggttct tattgtgtcc cattcgtttc 180
atcctgatga ggacccctgg cgtacgccgg cgaagcttgg taccagagac gggttggcgg 240
cgtatttgtg tcccaaaaaa cagccccaat tgccccaatt gaccccaaat tgacccagta 300
gcgggcccaa ccccggcgag agcccccttc accccacata tcaaacctcc cccggttccc 360
acacttgccg ttaagggcgt agggtactgc agtctggaat ctacgcttgt tcagactttg 420
tactagtttc tttgtctggc catccgggta acccatgccg gacgcaaaat agactactga 480
aaattttttt gctttgtggt tgggacttta gccaagggta taaaagacca ccgtccccga 540
attacctttc ctcttctttt ctctctctcc ttgtcaactc acacccgaaa tcgttaagca 600
tttccttctg agtataagaa tcattcgcta gccacaaaa 639
<210> 196
<211> 1009
<212> DNA
<213> 人工序列
<220>
<223> pgkT-pPGM
<400> 196
gctatttaca gcatgtgtaa tgaggaatat aacgttgatt gaattgtttg tgaaaaatgt 60
agaaaatttc agtgaagttg tgttttctat atagtaagca cttttggtac aagtatctgc 120
acatccctgc atgttacaag cctgatcatg cagggcaata ttctgactat aaatatacct 180
cgatatttta gcaagctata cgtacgtacc aaccacagat tacgacccat tcgcagtcac 240
agttcactag ggtttgggtt gcatccgttg agagtggttt gtttttaacc ttctccatgt 300
gctcactcag gttttgggtt cagatcaaat caaggcgtga accactgttt gaggacaaat 360
gtgacacaac caaccagtgt caggggcaag tccgtgacaa aggggaagat acaatgcaat 420
tactgacagt tacggactgc ctcgatgccc taaccttgcc ccaaaataag acaactgtcc 480
tcgtttaagc gcaaccctat tcagcgtcac gtcataatag cgtttggata gcactagtct 540
atgaggagcg ttttatgttg cggtgagggc gattggtgct catatgggtt caattgaggt 600
ggtggaacga gcttagtctt caattgaggt gcgagcgaca caattgggtg tcacgtggcc 660
taattgacct cggatcgtgg agtccccagt tatacagcaa ccacgaggtg catgagtagg 720
agacgtcacc agacaatagg gtttttttgg actggagagg gtagggcaaa agcgctcaac 780
gggctgtttg gggagctatg ggggaggaat tggcgatatt tgtgaggttg acggctccga 840
tttgcgtgtt ttgtcgcttc tgcatctccc catacccata tcttccctcc ccacctcttt 900
ccacgataat tttacggatc agcaataagg ttccttctcc tagtttccac gtccatatat 960
atctatgctg cgtcgtcctt ttcgtgacat caccaaaaca catacaaaa 1009
<210> 197
<211> 2472
<212> DNA
<213> 人工序列
<220>
<223> LEU2 and flanking sequences
<400> 197
aaaattaaca gatagtttgc cggtgataat tctcttaacc tcccacactc ctttgacata 60
acgatttatg taacgaaact gaaatttgac cagatattgt tgtaaataga aaatctggct 120
tgtaggtgga aactagtaac ggccgccagt gtgctggaat tcggctccac tgtcctccac 180
tacaaacaca cccaatctgc ttcttctagt caaggttgct acaccggtaa attataaatc 240
atcatttcat tagcagggct gggccctttt tatagagtct tatacactag cggaccctgc 300
cggtagacca acccgcaggc gcgtcagttt gctccttcca tcaatgcgtc gtagaaacga 360
cttactcctt cttgagcagc tccttgacct tgttggcaac aaagtctccg acctcggagg 420
tggaggaaga gcctccgata tcggcggtag tgataccagc ctcgacggac tccttgacgg 480
cagcctcaac agcgtcaccg gcgggcttca tgttaagaga gaacttgagc atcatggcgg 540
cagacagaat ggtggcaatg gggttgacct tctgcttgcc gagatcgggg gcagatccgt 600
gacagggctc gtacagaccg aacgcctcgt tggtgtcggg cagagaagcc agagaggcgg 660
agggcagcag acccagagaa ccggggatga cggaggcctc gtcggagatg atatcgccaa 720
acatgttggt ggtgatgatg ataccattca tcttggaggg ctgcttgatg aggatcatgg 780
cggccgagtc gatcagctgg tggttgagct cgagctgggg gaattcgtcc ttgaggactc 840
gagtgacagt ctttcgccaa agtcgagagg aggccagcac gttggccttg tcaagagacc 900
acacgggaag aggggggttg tgctgaaggg ccaggaaggc ggccattcgg gcaattcgct 960
caacctcagg aacggagtag gtctcggtgt cggaagcgac gccagatccg tcatcctcct 1020
ttcgctctcc aaagtagata cctccgacga gctctcggac aatgatgaag tcggtgccct 1080
caacgtttcg gatgggggag agatcggcga gcttgggcga cagcagctgg cagggtcgca 1140
ggttggcgta caggttcagg tcctttcgca gcttgaggag accctgctcg ggtcgcacgt 1200
cggttcgtcc gtcgggagtg gtccatacgg tgttggcagc gcctccgaca gcaccgagca 1260
taatagagtc agcctttcgg cagatgtcga gagtagcgtc ggtgatgggc tcgccctcct 1320
tctcaatggc agctcctcca atgagtcggt cctcaaacac aaactcggtg ccggaggcct 1380
cagcaacaga cttgagcacc ttgacggcct cggcaatcac ctcggggcca cagaagtcgc 1440
cgccgagaag aacaatcttc ttggagtcag tcttggtctt cttagtttcg ggttccattg 1500
tggatgtgtg tggttgtatg tgtgatgtgg tgtgtggagt gaaaatctgt ggctggcaaa 1560
cgctcttgta tatatacgca cttttgcccg tgctatgtgg aagactaaac ctccgaagat 1620
tgtgactcag gtagtgcggt atcggctagg gacccaaacc ttgtcgatgc cgatagcgct 1680
atcgaacgta ccccagccgg ccgggagtat gtcggagggg acatacgaga tcgtcaaggg 1740
tttgtggcca actggtaaat aaatgatgac tcaggcgacg acggaattcg acagcaacta 1800
ctcctttcac caaccatgtg cattttagct cgaataacat tcacaggctt ggtgatctac 1860
atccatggtg tctggccgat taccgtggtg ttttggcagt aacgagaata ttgagtggac 1920
tcttcccatc accaataaag actcatacta caatcacgag cgcttcagct gccactatag 1980
tgttggtgac acaatacccc tcgatgctgg gcattactgt agcaagagat attatttcat 2040
ggcgcatttt tccagtctac ctgacttttt agtgtgattt cttctccaca ttttatgctc 2100
agtgtgaaaa gttggagtgc acacttaatt atcgccggtt ttcggaaagt actatgtgct 2160
caaggttgca ccccacgtta cgtatgcagc acattgagca gcctttggac cgtggagata 2220
acggtgtgga gatagcaacg ggtagtcttc gtaataagca atgcagccga attctgcaga 2280
tatccatcac actggcggcc gctcgagcat gcatctagat ggcctccttg gccgggtttc 2340
aattcaattc atcatttttt ttttattctt ttttttgatt tcggtttctt tgaaattttt 2400
ttgattcggt aatctccgaa cagaaggaag aacgaaggaa ggagcacaga cttagattgg 2460
tatatatacg ca 2472
<210> 198
<211> 6989
<212> DNA
<213> 人工序列
<220>
<223> 载体序列
<400> 198
ctagatggcc tccttggccg ggtttcaatt caattcatca tttttttttt attctttttt 60
ttgatttcgg tttctttgaa atttttttga ttcggtaatc tccgaacaga aggaagaacg 120
aaggaaggag cacagactta gattggtata tatacgcata tgtagtgttg aagaaacatg 180
aaattgccca gtattcttaa cccaactgca cagaacaaaa acctgcagga aacgaagata 240
aatcatgtcg aaagctacat ataaggaacg tgctgctact catcctagtc ctgttgctgc 300
caagctattt aatatcatgc acgaaaagca aacaaacttg tgtgcttcat tggatgttcg 360
taccaccaag gaattactgg agttagttga agcattaggt cccaaaattt gtttactaaa 420
aacacatgtg gatatcttga ctgatttttc catggagggc acagttaagc cgctaaaggc 480
attatccgcc aagtacaatt ttttactctt cgaagacaga aaatttgctg acattggtaa 540
tacagtcaaa ttgcagtact ctgcgggtgt atacagaata gcagaatggg cagacattac 600
gaatgcacac ggtgtggtgg gcccaggtat tgttagcggt ttgaagcagg cggcagaaga 660
agtaacaaag gaacctagag gccttttgat gttagcagaa ttgtcatgca agggctccct 720
atctactgga gaatatacta agggtactgt tgacattgcg aagagcgaca aagattttgt 780
tatcggcttt attgctcaaa gagacatggg tggaagagat gaaggttacg attggttgat 840
tatgacaccc ggtgtgggtt tagatgacaa gggagacgca ttgggtcaac agtatagaac 900
cgtggatgat gtggtctcta caggatctga cattattatt gttggaagag gactatttgc 960
aaagggaagg gatgctaagg tagagggtga acgttacaga aaagcaggct gggaagcata 1020
tttgagaaga tgcggccagc aaaactaaaa aactgtatta taagtaaatg catgtatact 1080
aaactcacaa attagagctt caatttaatt atatcagtta ttacccggga atctcggtcg 1140
taatgatttt tataatgacg aaaaaaaaaa aattggaaag aaaacccccc ccccgcagcg 1200
ttgggtcctg gccacgggtg cgcatgatcg tgctcctgtc gttgaggacc cggctaggct 1260
ggcggggttg ccttactggt tagcagaatg aatcaccgat acgcgagcga acgtgaagcg 1320
actgctgctg caaaacgtct gcgacctgag caacaacatg aatggtcttc ggtttccgtg 1380
tttcgtaaag tctggaaacg cggaagtcag cgccctgcac cattatgttc cggatctgca 1440
tcgcaggatg ctgctggcta ccctgtggaa cacctacatc tgtattaacg aagcgctggc 1500
attgaccctg agtgattttt ctctggtccc gccgcatcca taccgccagt tgtttaccct 1560
cacaacgttc cagtaaccgg gcatgttcat catcagtaac ccgtatcgtg agcatcctct 1620
ctcgtttcat cggtatcatt acccccatga acagaaattc ccccttacac ggaggcatca 1680
agtgaccaaa caggaaaaaa ccgcccttaa catggcccgc tttatcagaa gccagacatt 1740
aacgcttctg gagaaactca acgagctgga cgcggatgaa caggcagaca tctgtgaatc 1800
gcttcacgac cacgctgatg agctttaccg caggtgggcc attctcatga agaatatctt 1860
gaatttattg tcatattact agttggtgtg gaagtccata tatcggtgat caatatagtg 1920
gttgacatgc tggctagtca acattgagcc ttttgatcat gcaaatatat tacggtattt 1980
tacaatcaaa tatcaaactt aactattgac tttataactt atttaggtgg taacattctt 2040
ataaaaaaga aaaaaattac tgcaaaacag tactagcttt taacttgtat cctaggttat 2100
ctatgctgtc tcaccataga gaatattacc tatttcagaa tgtatgtcca tgattcgccg 2160
ggtaaataca tataatacac aaatctggct taataaagtc tataatatat ctcataaaga 2220
agtgctaaat tggctagtgc tatatatttt taagaaaatt tcttttgact aagtccatat 2280
cgactttgta aaagttcact ttagcataca tatattacac gagccagaaa ttgtaacttt 2340
tgcctaaaat cacaaattgc aaaatttaat tgcttgcaaa aggtcacatg cttataatca 2400
acttttttaa aaatttaaaa tactttttta ttttttattt ttaaacataa atgaaataat 2460
ttatttattg tttatgatta ccgaaacata aaacctgctc aagaaaaaga aactgttttg 2520
tccttggaaa aaaagcacta cctaggagcg gccaaaatgc cgaggctttc atagcttaaa 2580
ctctttacag aaaataggca ttatagatca gttcgagttt tcttattctt ccttccggtt 2640
ttatcgtcac agttttacag taaataagta tcacctctta gagttaacta tgagataagc 2700
aagtatcatc tcatttcatt tacctgaagt cgagtaaaca gaaaatccaa ttgttgatga 2760
acctcaatga cttagaacta tctatcggca gatcatataa agaggattta ggtacctaga 2820
ggactgtacc tggagtatat atatatatat atatatatta tctcaactat agtccataga 2880
ggtttctttc ttgaggcctt aaactgctaa agaatgatat tggtggaatg caagcaccaa 2940
tctctcttct ttcgtaactg ttcatatact tcaaaccaag aatgtaacgg gcattgaccc 3000
atccaaaacc ttcagtagct gcccctttaa agtcagcacc ttgattaccg tattctgctt 3060
caacacgatg aggatctgtt cctcttgtga catcatattt ttcaaccaca ataccattat 3120
aatcgacaaa agcctttgtc atcatgaaaa gccatctata agctagccta ttcgttacag 3180
ttaaataacc ataagaacgg aggccttccc aagcaagaat ttgatggggt gcccaaccaa 3240
atggatagtc ccattgtcta attggtctcg aaatagaaat tgggcctcga gaacgctccg 3300
tacatgcagc taaacctcca agcatctcta acttgggtag tgctttctcc accattttct 3360
gtgcttgctc cttcgtggca agtccagccc ataatgccca gaatgtagtt gcggattcgt 3420
atgacgttct gtgcttgatt tttgtgttgt agtcaaagaa aaaccccgac tcgtcatccc 3480
acatatattt ggtaattgat gaggcaacgc taattatcaa catatagatt gttatctatc 3540
tgcatgaaca cgaaatcttt acttgacgac ttgaggctga tggtgtttat gcaaagaaac 3600
cactgtgttt aatatgtgtc actgtttgat attactgtca gcgtagaaga taatagtaaa 3660
agcggttaat aagtgtattt gagataagtg tgataaagtt tttacagcga aaagacgata 3720
aatacaagaa aatgattacg aggatacgga gagaggtatg tacatgtgta tttatatact 3780
aagctgccgg cggttgtttg caagaccgag aaaaggctag caagaatcgg gtcattgtag 3840
cgtatgcgcc tgtgaacatt ctcttcaaca agtttgattc cattgcggtg aaatggtaaa 3900
agtcaacccc ctgcgatgta tattttcctg tacaatcaat caaaaagcca aatgatttag 3960
cattatcttt acatcttgtt attttacaga ttttatgttt agatctttta tgcttgcttt 4020
tcaaaaggcc tgcaggcaag tgcacaaaca atacttaaat aaatactact cagtaataac 4080
ctatttctta gcatttttga cgaaatttgc tattttgtta gagtctttta caccatttgt 4140
ctccacacct ccgcttacat caacaccaat aacgccattt aatctaagcg catcaccaac 4200
attttctggc gtcagtccac cagctaacat aaaatgtaag ctctgcctcg cgcgtttcgg 4260
tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 4320
agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 4380
gggcgcagcc atgacccagt cacgtagcga tagcggagtg tatactggct taactatgcg 4440
gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc 4500
gtaaggagaa aataccgcat caggcgctct tccgcttcct cgctcactga ctcgctgcgc 4560
tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc 4620
acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg 4680
aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat 4740
cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag 4800
gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga 4860
tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg 4920
tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt 4980
cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac 5040
gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc 5100
ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt 5160
ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc 5220
ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc 5280
agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg 5340
aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag 5400
atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 5460
tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt 5520
tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca 5580
tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca 5640
gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc 5700
tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt 5760
ttgcgcaacg ttgttgccat tgctgcaggc atcgtggtgt cacgctcgtc gtttggtatg 5820
gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc 5880
aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg 5940
ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga 6000
tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga 6060
ccgagttgct cttgcccggc gtcaacacgg gataataccg cgccacatag cagaacttta 6120
aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg 6180
ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact 6240
ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata 6300
agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt 6360
tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 6420
ataggggttt ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat 6480
tatcatgaca ttaacctata aaaataggcg tatcacgagg ccctttcgtc ttggccacct 6540
aggccggcct tcccggaagt atatactaat tcattccttg aatatttatg aaaagcaaca 6600
tgtgtatttc ttgtgtgtgc ggcaaacgta gcaattgcaa ctgcataaac gatgattgta 6660
aaagtatcac actttgctca gacaggttag attcacctgg tacgagggca gtgtcttaaa 6720
ggttccatct acctcggccc ttgtttcttg aagagtggtc aatatgtgtt ttatacagct 6780
gaaatttccc ctgtatgttg agatcgtgta tattggtcat aatctgggct ctttagtcga 6840
tcccagtttt ctcgggcaag tttttttctc cacaaagtac cgctggaaaa ctctatgtga 6900
cttgttgaca gattacttgg gttatctgcg ggatatgtct tggataggca accgggcata 6960
tatcaccggg cggactgttg gttctgtac 6989
<210> 199
<211> 1170
<212> DNA
<213> 人工序列
<220>
<223> pENO
<400> 199
tcgggcaagt ttttttctcc acaaagtacc gctggaaaac tctatgtgac ttgttgacag 60
attacttggg ttatctgcgg gatatgtctt ggataggcaa ccgggcatat atcaccgggc 120
ggactgttgg ttctgtacgt acatacagca ctttgagctc atgtctcaca cgcaaccatg 180
gtgcgtggag gctttggcat cctttctact tgtagtggct atagtacttg cagtccaagc 240
aaacatgagt atgtgcttgt atgtactgaa acccgtctac ggtaatattt tagagtgtgg 300
aactatggga tgagtgctca ttcgatacta tgttgtcacc cgatttgccg tttgcgaggt 360
aagacacatt cggtggttca ggcggctact tgtatgtagc atccacgttc atgttttgtg 420
gatcagatta atggtatgga tatgcacggg gcgtttcccc ggtaacgtgt aggcagtcca 480
gtgcaaccca gacagctgag ctctctatag ccgtgcgtgt gcggtcatat cacgctacac 540
ttagctacag aataaagctc ggtagcgcca acagcgttga caaatagctc aagggcgtgg 600
agcacagggt ttaggaggtt ttaatgggcg agaaggcgcg tagatgtagt cttcctcggt 660
cccatcggta atcacgtgtg tgccgatttg caagacgaaa agccacgaga ataaaccggg 720
agaggggatg gaagtccccg aacagcaacc agcccttgcc ctcgtggaca taacctttca 780
cttgccagaa ctctaagcgt caccacggta tacaagcgca cgtagaagat tgtggaagtc 840
gtgttggaga ctgttgattt gggcggtgga ggggggtatt tgagagcaag tttgagattt 900
gtgccattga gggggaggtt attgtggcca tgcagtcgga tttgccgtca cgggaccgca 960
acatgctttt cattgcagtc cttcaactat ccatctcacc tcccccaatg gcttttaact 1020
ttcgaatgac gaaagcaccc ccctttgtac agatgactat ttgggaccaa tccaatagcg 1080
caattgggtt tgcatcatgt ataaaaggag caatccccca ctagttataa agtcacaagt 1140
atctcagtat acccgtctaa ccacacattt 1170
<210> 200
<211> 2276
<212> DNA
<213> 人工序列
<220>
<223> HPH
<400> 200
aaattaacag atagtttgcc ggtgataatt ctcttaacct cccacactcc tttgacataa 60
cgatttatgt aacgaaactg aaatttgacc agatattgtt gtaaatagaa aatctggctt 120
gtaggtggca aactagtaac ggccgccagt gtgctggaat tgaatattta ccgttcgtat 180
aatgtatgct atacgaagtt ataccggtct cgtagtgttc acgttcagtt cacggtgagc 240
ttaaaactat cttcaagaag agatttgaga cctgatttat acttgcagca atgtttactt 300
cttatcgcga tacacgaatg tgatacggat caaagtaagc aggactacga taagataacg 360
aatgcggtgc agtccatgtc gattaggtat agatacattt attttgtgtt atgttacatt 420
ttggggggat actgtcctac ttgtagtacc tacttgtagt ggcgcgtcta ttcctttgcc 480
ctcggacgag tgctggggcg tcggtttcca ctatcggcga gtacttctac acagccatcg 540
gtccagacgg ccgcgcttct gcgggcgatt tgtgtacgcc cgacagtccc ggctccggat 600
cggacgattg cgtcgcatcg accctgcgcc caagctgcat catcgaaatt gccgtcaacc 660
aagctctgat agagttggtc aagaccaatg cggagcatat acgcccggag ccgcggcgat 720
cctgcaagct ccggatgcct ccgctcgaag tagcgcgtct gctgctccat acaagccaac 780
cacggcctcc agaagaagat gttggcgacc tcgtattggg aatccccgaa catcgcctcg 840
ctccagtcaa tgaccgctgt tatgcggcca ttgtccgtca ggacattgtt ggagccgaaa 900
tccgcgtgca cgaggtgccg gacttcgggg cagtcctcgg cccaaagcat cagctcatcg 960
agagcctgcg cgacggacgc actgacggtg tcgtccatca cagtttgcca gtgatacaca 1020
tggggatcag caatcgcgca tatgaaatca cgccatgtag tgtattgacc gattccttgc 1080
ggtccgaatg ggccgaaccc gctcgtctgg ctaagatcgg ccgcagcgat cgcatccatg 1140
gcctccgcga ccggctgcag aacagcgggc agttcggttt caggcaggtc ttgcaacgtg 1200
acaccctgtg cacggcggga gatgcaatag gtcaggctct cgctgaattc cccaatgtca 1260
agcacttccg gaatcgggag cgcggccgat gcaaagtgcc gataaacata acgatctttg 1320
tagaaaccat cggcgcagct atttacccgc aggacatatc cacgccctcc tacatcgaag 1380
ctgaaagcac gagattcttc gccctccgag agctgcatca ggtcggagac gctgtcgaac 1440
ttttcgatca gaaacttctc gacagacgtc gcggtgagtt caggcttttt catatgggta 1500
cctgagaaca tttttgtgtc taggtgtttg tgtttggact gcgatcagtg aagaaaagaa 1560
gaggaaaaat tgtgcaagaa attttgcttt caagacttgg ctgatgcagc agggtaactc 1620
tgggacacag acctatgttt gtggttaaac tcaatgcacg tggtacgtgc gtggagcgct 1680
tacccatcca agggtgtgga catggaaccg acggtccgtg gagttgtgta atgtcatttt 1740
ggcgactctt gaagcaaggc tataaaaaaa ttgtgtggct tgagtcttat cgagctcggt 1800
cactacaaga gttaatcttc ctgtctcagg cagacaggtc aggcagggtt acttttgggt 1860
gtgctgtaac tcactgtatg gccgttagtg cgcatagacg ttgtacatac tggaccgaat 1920
tgtagcgtgc tcaatagggc caataaagct attgtaggga tccataactt cgtataatgt 1980
atgctatacg aacggtaccc gggcaattct gcagatatcc atcacactgg cggccgctcg 2040
agcatgcatc tagatggcct ccttggccgg gtttcaattc aattcatcat ttttttttta 2100
ttcttttttt tgatttcggt ttctttgaaa tttttttgat tcggtaatct ccgaacagaa 2160
ggaagaacga aggaaggagc acagacttag attggtatat atacgcatat gtagtgttga 2220
agaaacatga aattgcccag tattcttaac ccaactgcac agaacaaaaa cctgca 2276
<210> 201
<211> 600
<212> DNA
<213> 人工序列
<220>
<223> Sc Eno2.启动子
<400> 201
gtgtcgacgc tgcgggtata gaaagggttc tttactctat agtacctcct cgctcagcat 60
ctgcttcttc ccaaagatga acgcggcgtt atgtcactaa cgacgtgcac caacttgcgg 120
aaagtggaat cccgttccaa aactggcatc cactaattga tacatctaca caccgcacgc 180
cttttttctg aagcccactt tcgtggactt tgccatatgc aaaattcatg aagtgtgata 240
ccaagtcagc atacacctca ctagggtagt ttctttggtt gtattgatca tttggttcat 300
cgtggttcat taattttttt tctccattgc tttctggctt tgatcttact atcatttgga 360
tttttgtcga aggttgtaga attgtatgtg acaagtggca ccaagcatat ataaaaaaaa 420
aaagcattat cttcctacca gagttgattg ttaaaaacgt atttatagca aacgcaattg 480
taattaattc ttattttgta tcttttcttc ccttgtctca atcttttatt tttattttat 540
ttttcttttc ttagtttctt tcataacacc aagcaactaa tactataaca tacaataata 600
<210> 202
<211> 600
<212> DNA
<213> 人工序列
<220>
<223> Sc Fba1.启动子
<400> 202
ctacttggct tcacatacgt tgcatacgtc gatatagata ataatgataa tgacagcagg 60
attatcgtaa tacgtaatag ttgaaaatct caaaaatgtg tgggtcatta cgtaaataat 120
gataggaatg ggattcttct atttttcctt tttccattct agcagccgtc gggaaaacgt 180
ggcatcctct ctttcgggct caattggagt cacgctgccg tgagcatcct ctctttccat 240
atctaacaac tgagcacgta accaatggaa aagcatgagc ttagcgttgc tccaaaaaag 300
tattggatgg ttaataccat ttgtctgttc tcttctgact ttgactcctc aaaaaaaaaa 360
aatctacaat caacagatcg cttcaattac gccctcacaa aaactttttt ccttcttctt 420
cgcccacgtt aaattttatc cctcatgttg tctaacggat ttctgcactt gatttattat 480
aaaaagacaa agacataata cttctctatc aatttcagtt attgttcttc cttgcgttat 540
tcttctgttc ttctttttct tttgtcatat ataaccataa ccaagtaata catattcaaa 600
<210> 203
<211> 600
<212> DNA
<213> 人工序列
<220>
<223> Sc Tef1.启动子
<400> 203
ttggctgata atagcgtata aacaatgcat actttgtacg ttcaaaatac aatgcagtag 60
atatatttat gcatattaca tataatacat atcacatagg aagcaacagg cgcgttggac 120
ttttaatttt cgaggaccgc gaatccttac atcacaccca atcccccaca agtgatcccc 180
cacacaccat agcttcaaaa tgtttctact ccttttttac tcttccagat tttctcggac 240
tccgcgcatc gccgtaccac ttcaaaacac ccaagcacag catactaaat ttcccctctt 300
tcttcctcta gggtgtcgtt aattacccgt actaaaggtt tggaaaagaa aaaagacacc 360
gcctcgtttc tttttcttcg tcgaaaaagg caataaaaat ttttatcacg tttctttttc 420
ttgaaaattt ttttttttga tttttttctc tttcgatgac ctcccattga tatttaagtt 480
aataaacggt cttcaatttc tcaagtttca gtttcatttt tcttgttcta ttacaacttt 540
ttttacttct tgctcattag aaagaaagca tagcaatcta atctaagttt taattacaaa 600
<210> 204
<211> 600
<212> DNA
<213> 人工序列
<220>
<223> Sc Pgk1.启动子
<400> 204
gggccagaaa aaggaagtgt ttccctcctt cttgaattga tgttaccctc ataaagcacg 60
tggcctctta tcgagaaaga aattaccgtc gctcgtgatt tgtttgcaaa aagaacaaaa 120
ctgaaaaaac ccagacacgc tcgacttcct gtcttcctat tgattgcagc ttccaatttc 180
gtcacacaac aaggtcctag cgacggctca caggttttgt aacaagcaat cgaaggttct 240
ggaatggcgg gaaagggttt agtaccacat gctatgatgc ccactgtgat ctccagagca 300
aagttcgttc gatcgtactg ttactctctc tctttcaaac agaattgtcc gaatcgtgtg 360
acaacaacag cctgttctca cacactcttt tcttctaacc aagggggtgg tttagtttag 420
tagaacctcg tgaaacttac atttacatat atataaactt gcataaattg gtcaatgcaa 480
gaaatacata tttggtcttt tctaattcgt agtttttcaa gttcttagat gctttctttt 540
tctctttttt acagatcatc aaggaagtaa ttatctactt tttacaacaa atataaaaca 600
<210> 205
<211> 1001
<212> DNA
<213> 人工序列
<220>
<223> Kl prom 12.pro
<400> 205
cgtaaaaact aaaacgagcc cccaccaaag aacaaaaaag aaggtgctgg gcccccactt 60
tcttcccttg cacgtgatag gaagatggct acagaaacaa gaagatggaa atcgaaggaa 120
agagggagac tggaagctgt aaaaactgaa atgaaaaaaa aaaaaaaaaa aaaaaaacaa 180
gaagctgaaa atggaagact gaaatttgaa aaatggtaaa aaaaaaaaag aaacacgaag 240
ctaaaaacct ggattccatt ttgagaagaa gcaagaaagg taagtatggt aacgaccgta 300
caggcaagcg cgaaggcaaa tggaaaagct ggagtccgga agataatcat ttcatcttct 360
tttgttagaa cagaacagtg gatgtccctc atctcggtaa cgtattgtcc atgccctaga 420
actctctgtc cctaaaaaga ggacaaaaac ccaatggttt ccccagcttc cagtggagcc 480
accgatccca ctggaaacca ctggacagga agagaaaatc acggacttcc tctattgaag 540
gataattcaa cactttcacc agatcccaaa tgtcccgccc ctattcccgt gttccatcac 600
gtaccataac ttaccatttc atcacgttct ctatggcaca ctggtactgc ttcgactgct 660
ttgcttcatc ttctctatgg gccaatgagc taatgagcac aatgtgctgc gaaataaagg 720
gatatctaat ttatattatt acattataat atgtactagt gtggttattg gtaattgtac 780
ttaattttga tatataaagg gtggatcttt ttcattttga atcagaattg gaattgcaac 840
ttgtctcttg tcactattac ttaatagtaa ttatatttct tattaacctt ttttttaagt 900
caaaacacca aggacaagaa ctactcttca aaggtatttc aagttatcat acgtgtcaca 960
cacgcttcac agtttcaagt aaaaaaaaag aatattacac a 1001
<210> 206
<211> 403
<212> DNA
<213> 人工序列
<220>
<223> Ag lox_TEF1.启动子
<400> 206
taccgttcgt ataatgtatg ctatacgaag ttatgtcccc gccgggtcac ccggccagcg 60
acatggaggc ccagaatacc ctccttgaca gtcttgacgt gcgcagctca ggggcatgat 120
gtgactgtcg cccgtacatt tagcccatac atccccatgt ataatcattt gcatccatac 180
attttgatgg ccgcacggcg cgaagcaaaa attacggctc ctcgctgcag acctgcgagc 240
agggaaacgc tcccctcaca gacgcgttga attgtcccca cgccgcgccc ctgtagagaa 300
atataaaagg ttaggatttg ccactgaggt tcttctttca tatacttcct tttaaaatct 360
tgctaggata cagttctcac atcacatccg aacataaaca aca 403
<210> 207
<211> 1000
<212> DNA
<213> 人工序列
<220>
<223> Kl prom 6.pro
<400> 207
caaagggggg gcagggacag ggatacgaca agggctgggg aaaaaaaaaa agatagatac 60
gattggccgg gtaagcctgg ggaaatgtag caagtgcggg taagttaaaa ggtaaccacg 120
tgactccgga agagtcacgt ggttacggac ttttttctct agatctcagc tttttatcgg 180
tcttaccctg ccctcctgcc ccctgcccct tccctttgcc ccaaaaagaa aggaaatctg 240
ttggatttcg ctcaggccat ccctttcgtt aatatcggtt atcgctttac acactgcaca 300
tccttctgtc caaaaggaat ccagaagttt agcttttcct tcctttccca cagacattag 360
cctaggccct ctctcatcat ttgcatgcct cagccaatgt accaagaata acgcaacgag 420
gttgggaaat tttaacccaa caatcgatgc agatgtgaca agagattaga cacgttccag 480
ataccagatt acacagcttg tgctagcaga gtgacatatg gtggtgttgt gtctcgttta 540
gtacctgtaa tcgagagtgt tcaaatcagt cgatttgaac acccttactg ccactgaata 600
ttgattgaat accgtttatt gaaggtttta tgagtgatct tctttcggtc caggacaatt 660
tgttgagctt tttctatgta gagttccgtc cctttttttt ttttttttgc tttctcgcac 720
ttactagcac tatttttttt tcacacacta aaacacttta ttttaatcta tatatatata 780
tatatatata tgtaggaatg gaatcacaga catttgatac tcatcctcat ccttattaat 840
tcttgtttta atttgtttga cttagccaaa ccaccaatct caacccatcg tatttcaggt 900
attgtgtgtc tagtgtgtct ctggtatacg gaaataagtg ccagaagtaa ggaagaaaca 960
aagaacaagt gtctgaatac tactagcctc tcttttcata 1000
<210> 208
<211> 600
<212> DNA
<213> 人工序列
<220>
<223> Sc Pma1.pro
<400> 208
ccatcatgaa aaatctctcg acaccgttta tccattgctt ttttgttgtc tttttccctc 60
gttcacagaa agtctgaaga agctatagta gaactatgag ctttttttgt ttctgttttc 120
cttttttttt tttttacctc tgtggaaatt gttactctca cactctttag ttcgtttgtt 180
tgttttgttt attccaatta tgaccggtga cgaaacgtgg tcgatggtgg gtaccgctta 240
tgctcccctc cattagtttc gattatataa aaaggccaaa tattgtatta ttttcaaatg 300
tcctatcatt atcgtctaac atctaatttc tcttaaattt tttctctttc tttcctataa 360
caccaatagt gaaaatcttt ttttcttcta tatctacaaa aacttttttt ttctatcaac 420
ctcgttgata aattttttct ttaacaatcg ttaataatta attaattgga aaataaccat 480
tttttctctc ttttatacac acattcaaaa gaaagaaaaa aaatataccc cagctagtta 540
aagaaaatca ttgaaaagaa taagaagata agaaagattt aattatcaaa caatatcaaa 600
<210> 209
<211> 600
<212> DNA
<213> 人工序列
<220>
<223> Sc Vps68.启动子
<400> 209
gatatagccg ctacgactga ttagtgatgt gataaagata gaagatttaa gtcacagagg 60
cgtgcatcta cgattttggc gtttcacatt ttttacactt aaattttagt gatctagccg 120
tgaccttggc agcagtttcc aaaatcattc catgaccatg tcatgcttaa gaacgttaga 180
cccagaacaa gtggacctgt attctaactc ttcactcttg ggcaaagata atagtattat 240
cttttacccc attttttgta tgttttttcg tttattgagt ttggcgtttc ctatttagaa 300
atagtacaat ccggtcaatc attcgatagt gaaatatata tatttaacta ggaaaattag 360
taaaacctca tttaaagatc attcaccttg atatatacta ctattgacct tttgttaatg 420
accattttcg taaaaatgaa ctgcgattct cttctggaat ttgttaccct accttattca 480
ctaaatcaga aataataatg tgcagcgccc ctttcataaa gaaggcaagt atagggcata 540
tagttaaagg tcagaactct ttatccccaa ctacaagatc aattagaaaa tcacatcata 600
<210> 210
<211> 600
<212> DNA
<213> 人工序列
<220>
<223> Sc Oye2.启动子
<400> 210
actacaattt agcggcttag cacaatacgc gttttcaact tcctacgcta gcgatgacaa 60
aatgtctcca agaggcggaa cttgcgacgg atgcatggaa atatcttacg taatgaactt 120
ccgtaatgaa cttccgtaat tcaagatctc ttagcatctc ttgttcaatc ttcagactct 180
actaagtgtt cttaccaacc attggatgct cattacaaat gaatgaatat attgcacgga 240
acggaagcgg catgcttttt ccgtgtcgtg tgcttagtaa agcaaaacgg agtagaatcg 300
gtaagaactt cctttttggg ttggaaaatc attgccattg tttggacacc tttctttttc 360
cgtattgttc gagcaccgcg tttctttttg ggtacttgat gaggtagcag attcctggaa 420
cgtgctttct ctcgaggtaa cctgccttgt tcctcctggt gactttctaa aatataaaag 480
gaaaagcata tctctagttt cgagtttttt cttcatactt tatttcctta tgttaaacgg 540
tccagatata gaataaatca tcatattaag ctaaatatag acgataatat agtatcgata 600
<210> 211
<211> 810
<212> DNA
<213> 人工序列
<220>
<223> KANMX
<400> 211
atgggtaagg aaaagactca cgtttcgagg ccgcgattaa attccaacat ggatgctgat 60
ttatatgggt ataaatgggc tcgcgataat gtcgggcaat caggtgcgac aatctatcga 120
ttgtatggga agcccgatgc gccagagttg tttctgaaac atggcaaagg tagcgttgcc 180
aatgatgtta cagatgagat ggtcagacta aactggctga cggaatttat gcctcttccg 240
accatcaagc attttatccg tactcctgat gatgcatggt tactcaccac tgcgatcccc 300
ggcaaaacag cattccaggt attagaagaa tatcctgatt caggtgaaaa tattgttgat 360
gcgctggcag tgttcctgcg ccggttgcat tcgattcctg tttgtaattg tccttttaac 420
agcgatcgcg tatttcgttt ggctcaggcg caatcacgaa tgaataacgg tttggttgat 480
gcgagtgatt ttgatgacga gcgtaatggc tggcctgttg aacaagtctg gaaagaaatg 540
cataagcttt tgccattctc accggattca gtcgtcactc atggtgattt ctcacttgat 600
aaccttattt ttgacgaggg gaaattaata ggttgtattg atgttggacg agtcggaatc 660
gcagaccgat accaggatct tgccatccta tggaactgcc tcggtgagtt ttctccttca 720
ttacagaaac ggctttttca aaaatatggt attgataatc ctgatatgaa taaattgcag 780
tttcatttga tgctcgatga gtttttctaa 810
<210> 212
<211> 301
<212> DNA
<213> 人工序列
<220>
<223> Adh1.终止子
<400> 212
agcgaatttc ttatgattta tgatttttat tattaaataa gttataaaaa aaataagtgt 60
atacaaattt taaagtgact cttaggtttt aaaacgaaaa ttcttattct tgagtaactc 120
tttcctgtag gtcaggttgc tttctcaggt atagcatgag gtcgctctta ttgaccacac 180
ctctaccggc atgccgagca aatgcctgca aatcgctccc catttcaccc aattgtagat 240
atgctaactc cagcaatgag ttgatgaatc tcggtgtgta ttttatgtcc tcagaggaca 300
a 301
<210> 213
<211> 301
<212> DNA
<213> 人工序列
<220>
<223> Adh2.终止子
<400> 213
agcggatctc ttatgtcttt acgatttata gttttcatta tcaagtatgc ctatattagt 60
atatagcatc tttagatgac agtgttcgaa gtttcacgaa taaaagataa tattctactt 120
tttgctccca ccgcgtttgc tagcacgagt gaacaccatc cctcgcctgt gagttgtacc 180
cattcctcta aactgtagac atggtagctt cagcagtgtt cgttatgtac ggcatcctcc 240
aacaaacagt cggttatagt ttgtcctgct cctctgaatc gtgtccctcg atatttctca 300
t 301
<210> 214
<211> 301
<212> DNA
<213> 人工序列
<220>
<223> Gmp1.终止子
<400> 214
agtctgaaga atgaatgatt tgatgatttc tttttccctc catttttctt actgaatata 60
tcaatgatat agacttgtat agtttattat ttcaaattaa gtagctatat atagtcaaga 120
taacgtttgt ttgacacgat tacattattc gtcgacatct tttttcagcc tgtcgtggta 180
gcaatttgag gagtattatt aattgaatag gttcattttg cgctcgcata aacagttttc 240
gtcagggaca gtatgttgga atgagtggta attaatggtg acatgacatg ttatagcaat 300
a 301
<210> 215
<211> 301
<212> DNA
<213> 人工序列
<220>
<223> Sc Tal1.终止子
<400> 215
aggaagtatc tcggaaatat taatttaggc catgtcctta tgcacgtttc ttttgatact 60
tacgggtaca tgtacacaag tatatctata tatataaatt aatgaaaatc ccctatttat 120
atatatgact ttaacgagac agaacagttt tttatttttt atcctatttg atgaatgata 180
cagtttctta ttcacgtgtt atacccacac caaatccaat agcaataccg gccatcacaa 240
tcactgtttc ggcagcccct aagatcagac aaaacatccg gaaccacctt aaatcaacgt 300
c 301
<210> 216
<211> 301
<212> DNA
<213> 人工序列
<220>
<223> Sc Tpi1.终止子
<400> 216
agattaatat aattatataa aaatattatc ttcttttctt tatatctagt gttatgtaaa 60
ataaattgat gactacggaa agctttttta tattgtttct ttttcattct gagccactta 120
aatttcgtga atgttcttgt aagggacggt agatttacaa gtgatacaac aaaaagcaag 180
gcgctttttc taataaaaag aagaaaagca tttaacaatt gaacacctct atatcaacga 240
agaatattac tttgtctcta aatccttgta aaatgtgtac gatctctata tgggttactc 300
a 301
<210> 217
<211> 285
<212> DNA
<213> 人工序列
<220>
<223> Ag Tef1_lox.终止子
<400> 217
atcagtactg acaataaaaa gattcttgtt ttcaagaact tgtcatttgt atagtttttt 60
tatattgtag ttgttctatt ttaatcaaat gttagcgtga tttatatttt ttttcgcctc 120
gacatcatct gcccagatgc gaagttaagt gcgcagaaag taatatcatg cgtcaatcgt 180
atgtgaatgc tggtcgctat actgctgtcg attcgatact aacgccgcca tccagtgtcg 240
aaaacgagct cataacttcg tataatgtat gctatacgaa cggta 285
<210> 218
<211> 301
<212> DNA
<213> 人工序列
<220>
<223> Sc Pdc1.终止子
<400> 218
agcgatttaa tctctaatta ttagttaaag ttttataagc atttttatgt aacgaaaaat 60
aaattggttc atattattac tgcactgtca cttaccatgg aaagaccaga caagaagttg 120
ccgacagtct gttgaattgg cctggttagg cttaagtctg ggtccgcttc tttacaaatt 180
tggagaattt ctcttaaacg atatgtatat tcttttcgtt ggaaaagatg tcttccaaaa 240
aaaaaaccga tgaattagtg gaaccaagga aaaaaaaaga ggtatccttg attaaggaac 300
a 301
<210> 219
<211> 301
<212> DNA
<213> 人工序列
<220>
<223> Sc Tdh1.终止子
<400> 219
aataaagcaa tcttgatgag gataatgatt tttttttgaa tatacataaa tactaccgtt 60
tttctgctag attttgtgaa gacgtaaata agtacatatt actttttaag ccaagacaag 120
attaagcatt aactttaccc ttttctcttc taagtttcaa tactagttat cactgtttaa 180
aagttatggc gagaacgtcg gcggttaaaa tatattaccc tgaacgtggt gaattgaagt 240
tctaggatgg tttaaagatt tttccttttt gggaaataag taaacaatat attgctgcct 300
t 301
<210> 220
<211> 301
<212> DNA
<213> 人工序列
<220>
<223> Sc Eno1.终止子
<400> 220
aagcttttga ttaagccttc tagtccaaaa aacacgtttt tttgtcattt atttcatttt 60
cttagaatag tttagtttat tcattttata gtcacgaatg ttttatgatt ctatataggg 120
ttgcaaacaa gcatttttca ttttatgtta aaacaatttc aggtttacct tttattctgc 180
ttgtggtgac gcgtgtatcc gcccgctctt ttggtcaccc atgtatttaa ttgcataaat 240
aattcttaaa agtggagcta gtctatttct atttacatac ctctcatttc tcatttcctc 300
c 301
<210> 221
<211> 1000
<212> DNA
<213> 人工序列
<220>
<223> Kl prom 3.pro
<400> 221
gagcctgtcc aagcaaatgc cttctcataa atggtgccaa agacccgcaa gcccaaagca 60
attacccccc aaaaagaaat gatatagtgc aagatacgta tatgaccatg acttgactag 120
gtgaaacagt gcagaaacag ccgcacaaaa gcagccctaa ccctcagagt cgattttact 180
ctttcaggta ataaagcctc gacatcaatt ttagacagaa gccaggctgg cctcgagatt 240
atagccatag gcaagcaaga ggagagaagg ggaggccccc catggggggc ctcccccccg 300
ctgtcaaggt ttggcagaac ctagcttcat taggccacta gcccagccta aaacgtcaac 360
gggcaggagg aacactccca caagacggcg tagtattctc gattcataac cattttctca 420
atcgaattac acagaacaca ccgtacaaac ctctctatca taactactta atagtcacac 480
acgtactcgt ctaaatacac atcatcgtcc tacaagttca tcaaagtgtt ggacagacaa 540
ctataccagc atggatctct tgtatcggtt cttttctccc gctctctcgc aataacaatg 600
aacactgggt caatcatagc ctacacaggt gaacagagta gcgtttatac agggtttata 660
cggtgattcc tacggcaaaa atttttcatt tctaaaaaaa aaaagaaaaa tttttctttc 720
caacgctaga aggaaaagaa aaatctaatt aaattgattt ggtgattttc tgagagttcc 780
ctttttcata tatcgaattt tgaatataaa aggagatcga aaaaattttt ctattcaatc 840
tgttttctgg ttttatttga tagttttttt gtgtattatt attatggatt agtactggtt 900
tatatgggtt tttctgtata acttcttttt attttagttt gtttaatctt attttgagtt 960
acattatagt tccctaactg caagagaagt aacattaaaa 1000
<210> 222
<211> 1000
<212> DNA
<213> 人工序列
<220>
<223> Kl prom 2.pro
<400> 222
gagcctgtcc aagcaaatgc cttctcataa atggtgccaa agacccgcaa gcccaaagca 60
attacccccc aaaaagaaat gatatagtgc aagatacgta tatgaccatg acttgactag 120
gtgaaacagt gcagaaacag ccgcacaaaa gcagccctaa ccctcagagt cgattttact 180
ctttcaggta ataaagcctc gacatcaatt ttagacagaa gccaggctgg cctcgagatt 240
atagccatag gcaagcaaga ggagagaagg ggaggccccc catggggggc ctcccccccg 300
ctgtcaaggt ttggcagaac ctagcttcat taggccacta gcccagccta aaacgtcaac 360
gggcaggagg aacactccca caagacggcg tagtattctc gattcataac cattttctca 420
atcgaattac acagaacaca ccgtacaaac ctctctatca taactactta atagtcacac 480
acgtactcgt ctaaatacac atcatcgtcc tacaagttca tcaaagtgtt ggacagacaa 540
ctataccagc atggatctct tgtatcggtt cttttctccc gctctctcgc aataacaatg 600
aacactgggt caatcatagc ctacacaggt gaacagagta gcgtttatac agggtttata 660
cggtgattcc tacggcaaaa atttttcatt tctaaaaaaa aaaagaaaaa tttttctttc 720
caacgctaga aggaaaagaa aaatctaatt aaattgattt ggtgattttc tgagagttcc 780
ctttttcata tatcgaattt tgaatataaa aggagatcga aaaaattttt ctattcaatc 840
tgttttctgg ttttatttga tagttttttt gtgtattatt attatggatt agtactggtt 900
tatatgggtt tttctgtata acttcttttt attttagttt gtttaatctt attttgagtt 960
acattatagt tccctaactg caagagaagt aacattaaaa 1000
<210> 223
<211> 600
<212> DNA
<213> 人工序列
<220>
<223> Sc PRE3.启动子
<400> 223
caaacattaa tttgttctgc atactttgaa cctttcagaa aataaaaaac attacgcgca 60
tacttaccct gctcgcgaag aagagtaaca ctaacgcatt ctatgggcaa ttgaagacag 120
tattcagtac aagacatagt ccgtttcctt gagtcaattc ctatagcatt atgaactagc 180
cgcctttaag agtgccaagc tgttcaacac cgatcatttt tgatgatttg gcgtttttgt 240
tatattgata gatttctttt gaattttgtc attttcactt ttccactcgc aacggaatcc 300
ggtggcaaaa aagggaaaag cattgaaatg caatctttaa cagtatttta aacaagttgc 360
gacacggtgt acaattacga taagaattgc tacttcaaag tacacacaga aagttaacat 420
gaatggaatt caagtggaca tcaatcgttt gaaaaagggc gaagtcagtt taggtacctc 480
aatgtatgta tataagaatt tttcctccca ctttattgtt tctaaaagtt caatgaagta 540
aagtctcaat tggccttatt actaactaat aggtatctta taatcaccta ataaaataga 600
Claims (15)
8.根据前述权利要求中任一项所述的甜菊醇糖苷,其为发酵制备的。
10.根据权利要求9所述的甜菊醇糖苷,其具有根据权利要求1至7中的任一项所述的结构。
11.一种用于制备根据前述权利要求中任一项所述的甜菊醇糖苷的方法,所述方法包括:
提供包含编码多肽的重组核酸序列的重组酵母细胞,所述多肽包含由下列编码的氨基酸序列:SEQ ID NO:61、SEQ ID NO:65、SEQ ID NO:23、SEQ ID NO:33、SEQ ID NO:77、SEQID NO:71、SEQ ID NO:87、SEQ ID NO:73和SEQ ID NO:75;
在合适的发酵培养基中发酵所述重组酵母细胞;以及,可选地,
回收根据前述权利要求中任一项所述的甜菊醇糖苷。
12.一种组合物,其包括根据权利要求1至11中的任一项所述的甜菊醇糖苷以及一种或更多种不同的甜菊醇糖苷。
13.一种食品、饲料或饮料,其包括根据权利要求1至10中的任一项所述的甜菊醇糖苷或根据权利要求12所述的组合物。
14.根据权利要求1至10中的任一项所述的甜菊醇糖苷或根据权利要求12所述的组合物在甜味剂组合物或风味组合物中的用途。
15.根据权利要求1至10中的任一项所述的甜菊醇糖苷或根据权利要求12所述的组合物在食品、饲料或饮料中的用途。
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201562142631P | 2015-04-03 | 2015-04-03 | |
US62/142,631 | 2015-04-03 | ||
PCT/EP2016/057360 WO2016156616A1 (en) | 2015-04-03 | 2016-04-04 | Steviol glycosides |
CN201680020186.4A CN107666834B (zh) | 2015-04-03 | 2016-04-04 | 甜菊醇糖苷 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201680020186.4A Division CN107666834B (zh) | 2015-04-03 | 2016-04-04 | 甜菊醇糖苷 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113683712A true CN113683712A (zh) | 2021-11-23 |
CN113683712B CN113683712B (zh) | 2022-10-21 |
Family
ID=55646619
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110909254.5A Active CN113683712B (zh) | 2015-04-03 | 2016-04-04 | 甜菊醇糖苷 |
CN201680020186.4A Active CN107666834B (zh) | 2015-04-03 | 2016-04-04 | 甜菊醇糖苷 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201680020186.4A Active CN107666834B (zh) | 2015-04-03 | 2016-04-04 | 甜菊醇糖苷 |
Country Status (6)
Country | Link |
---|---|
US (2) | US11344051B2 (zh) |
EP (2) | EP3277829B1 (zh) |
CN (2) | CN113683712B (zh) |
BR (1) | BR112017021066B1 (zh) |
CA (1) | CA2980090A1 (zh) |
WO (1) | WO2016156616A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116751317A (zh) * | 2023-06-26 | 2023-09-15 | 沈阳药科大学 | 一种具有抗氧化活性的中华小苦荬多糖的制备方法及其应用 |
CN116751317B (zh) * | 2023-06-26 | 2024-10-22 | 沈阳药科大学 | 一种具有抗氧化活性的中华小苦荬多糖的制备方法及其应用 |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107249356B (zh) | 2014-12-17 | 2021-08-06 | 嘉吉公司 | 用于口服摄入或使用的甜菊醇糖苷化合物、组合物以及用于增强甜菊醇糖苷溶解度的方法 |
WO2016120486A1 (en) * | 2015-01-30 | 2016-08-04 | Evolva Sa | Production of steviol glycosides in recombinant hosts |
BR112017021066B1 (pt) | 2015-04-03 | 2022-02-08 | Dsm Ip Assets B.V. | Glicosídeos de esteviol, método para a produção de um glicosídeo de esteviol, composição, usos relacionados, gênero alimentício, alimento para animais e bebida |
BR112018076303B1 (pt) * | 2016-06-17 | 2022-10-11 | Cargill, Incorporated | Composição adoçante, bebida e método de modificação de uma característica sensorial de uma composição |
EP3764815A4 (en) * | 2018-03-16 | 2022-01-26 | PureCircle USA Inc. | HIGH PURITY STEVIOL GLYCOSIDES |
US20210002318A1 (en) * | 2018-03-16 | 2021-01-07 | Purecircle Usa Inc. | High-purity steviol glycosides |
CN108795897B (zh) * | 2018-05-29 | 2019-06-07 | 首都医科大学 | 一种糖基转移酶ugte1、其编码基因和应用 |
AU2019389030A1 (en) * | 2018-11-27 | 2021-06-17 | Purecircle Usa Inc. | High-purity steviol glycosides |
CN109628421B (zh) * | 2019-01-11 | 2022-11-01 | 安徽农业大学 | 一种特异合成呋喃酮葡萄糖苷的糖基转移酶及其应用 |
EP3927714A4 (en) * | 2019-02-15 | 2023-01-18 | PureCircle USA Inc. | HIGH PURITY STEVIOL GLYCOSIDES |
CN110846305B (zh) * | 2019-11-11 | 2023-11-28 | 中化健康产业发展有限公司 | 一种固定化糖基转移酶催化莱鲍迪苷a生成莱鲍迪苷m的方法 |
IL293855A (en) * | 2019-12-16 | 2022-08-01 | Manus Bio Inc | Production of mogrol and mogrosides using bacteria |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090162500A1 (en) * | 2007-12-21 | 2009-06-25 | The Quaker Oats Company | Grain products having a potent natural sweetener |
CN103404833A (zh) * | 2013-08-20 | 2013-11-27 | 济南汉定生物工程有限公司 | 甜菊糖甙复配甜味剂 |
US20140227421A1 (en) * | 2011-02-17 | 2014-08-14 | Purecircle Usa Inc. | Glucosyl stevia composition |
Family Cites Families (65)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS58149697A (ja) | 1982-02-27 | 1983-09-06 | Dainippon Ink & Chem Inc | β−1,3グリコシルステビオシドの製造方法 |
WO1990014423A1 (en) | 1989-05-18 | 1990-11-29 | The Infergene Company | Microorganism transformation |
JP3273609B2 (ja) | 1989-07-07 | 2002-04-08 | ユニリーバー・ナームローゼ・ベンノートシヤープ | 発現ベクターのマルチコピー組込みにより形質転換した真菌による蛋白質の製法 |
ES2198410T3 (es) | 1993-07-23 | 2004-02-01 | Dsm Ip Assets B.V. | Cepas recombinantes de hongos filamentosos libres de gen marcador de seleccion: un metodo para obtener dichas cepas y su utilizacion. |
US6265186B1 (en) | 1997-04-11 | 2001-07-24 | Dsm N.V. | Yeast cells comprising at least two copies of a desired gene integrated into the chromosomal genome at more than one non-ribosomal RNA encoding domain, particularly with Kluyveromyces |
PL336345A1 (en) | 1997-04-11 | 2000-06-19 | Dsm Nv | Genic conversion as a tool for constructing recombined filiform fungi |
ID27054A (id) | 1998-05-19 | 2001-02-22 | Dsm Nv | Perbaikan produksi sepalosporin di dalam organisme yang hidup |
EP1141372A2 (en) | 1998-12-22 | 2001-10-10 | Dsm N.V. | Improved in vivo production of cephalosporins |
US6180157B1 (en) | 1999-02-18 | 2001-01-30 | The Nutrasweet Company | Process for preparing an N-[N-(3,3-dimethylbutyl)-L-α-aspartyl]-L-phenylalanine 1-methyl ester agglomerate |
EP1164872A1 (en) | 1999-03-26 | 2002-01-02 | The NutraSweet Company | PARTICLES OF N- N-(3,3-DIMETHYLBUTYL)-L-$g(a)-ASPARTYL]-L-PHENYLALANINE 1-METHYL ESTER |
JP2001048727A (ja) | 1999-08-10 | 2001-02-20 | Nonogawa Shoji Kk | 可溶化剤及びこれを含有する可溶化組成物 |
EP1259534A2 (en) | 2000-02-16 | 2002-11-27 | The NutraSweet Company | Process for making granulated n-[n-(3,3-dimethylbutyl)-l-alpha-aspartyl]-l-phenylalanine 1-methyl ester |
DE60325457D1 (de) | 2002-01-23 | 2009-02-05 | Royal Nedalco B V | Fermentation von pentosezuckern |
SE0202090D0 (sv) | 2002-05-08 | 2002-07-04 | Forskarpatent I Syd Ab | A modifierd yeast consuming L-arabinose |
EP1626979B1 (en) | 2003-05-02 | 2012-06-06 | Cargill, Incorporated | Genetically modified yeast species and fermentation processes using genetically modified yeast |
CN101052706B (zh) | 2004-07-16 | 2015-02-18 | 帝斯曼知识产权资产管理有限公司 | 发酵木糖的真核细胞的代谢工程 |
US7923552B2 (en) | 2004-10-18 | 2011-04-12 | SGF Holdings, LLC | High yield method of producing pure rebaudioside A |
EP1863901A1 (en) | 2005-03-11 | 2007-12-12 | Forskarpatent i Syd AB | Arabinose- and xylose-fermenting saccharomyces cerevisiae strains |
US9107436B2 (en) * | 2011-02-17 | 2015-08-18 | Purecircle Sdn Bhd | Glucosylated steviol glycoside as a flavor modifier |
AU2009234283B2 (en) | 2008-04-11 | 2015-04-02 | Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College | Diterpene glycosides as natural solubilizers |
EP3101023B1 (en) | 2008-10-03 | 2023-08-30 | Morita Kagaku Kogyo Co., Ltd. | New steviol glycosides |
US8551507B2 (en) | 2009-06-24 | 2013-10-08 | Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College | Terpene glycosides and their combinations as solubilizing agents |
US20110027446A1 (en) * | 2009-07-28 | 2011-02-03 | Heartland Sweeteners, LLC | No-calorie sweetener compositions |
US9205268B2 (en) | 2009-10-30 | 2015-12-08 | Medtronic, Inc. | Configuring operating parameters of a medical device based on a type of source of a disruptive energy field |
US20130309389A1 (en) | 2010-12-13 | 2013-11-21 | Cargill, Incorporated | Glycoside blends |
BR112014003037B1 (pt) | 2011-08-08 | 2022-04-05 | Evolva Sa | Hospedeiro recombinante e método para produzir um glicosídeo de esteviol |
BR112014005109A2 (pt) | 2011-09-06 | 2017-04-18 | Pepsico Inc | adoçantes a base de rebaudioside d e produtos alimentares adoçados com rebaudioside d |
US20140303036A1 (en) | 2011-11-23 | 2014-10-09 | Dsm Ip Assets B.V. | Nucleic Acid Assembly System |
CN103159808B (zh) | 2011-12-09 | 2017-03-29 | 上海泓博智源医药股份有限公司 | 一种制备天然甜味剂的工艺方法 |
CN104684414A (zh) | 2011-12-19 | 2015-06-03 | 可口可乐公司 | 纯化甜叶菊醇糖苷的方法和其用途 |
EP3444338A1 (en) | 2012-01-23 | 2019-02-20 | DSM IP Assets B.V. | Diterpene production |
US10292412B2 (en) | 2012-02-15 | 2019-05-21 | Kraft Foods Global Brands Llc | High solubility natural sweetener compositions |
WO2013135728A1 (en) | 2012-03-12 | 2013-09-19 | Dsm Ip Assets B.V. | Recombination system |
US9060537B2 (en) | 2012-03-26 | 2015-06-23 | Pepsico, Inc. | Method for enhancing rebaudioside D solubility in water |
WO2013144257A1 (en) | 2012-03-27 | 2013-10-03 | Dsm Ip Assets B.V. | Cloning method |
US9752174B2 (en) | 2013-05-28 | 2017-09-05 | Purecircle Sdn Bhd | High-purity steviol glycosides |
US20150237898A1 (en) | 2012-09-25 | 2015-08-27 | Cargill, Incorporated | Stevioside blends |
EP2928321A1 (en) | 2012-12-05 | 2015-10-14 | Evolva SA | Steviol glycoside compositions sensory properties |
US20140171519A1 (en) | 2012-12-19 | 2014-06-19 | Indra Prakash | Compositions and methods for improving rebaudioside x solubility |
EP2954058B1 (en) | 2013-02-06 | 2021-03-31 | Evolva SA | Methods for improved production of rebaudioside d and rebaudioside m |
BR112015019160A2 (pt) | 2013-02-11 | 2017-08-22 | Dalgaard Mikkelsen Michael | Produção de glicosídeos de esteviol em hospedeiros recombinantes |
US20160029677A1 (en) | 2013-03-15 | 2016-02-04 | The Coca-Cola Company | Novel glucosyl steviol glycosides, their compositions and their purification |
US10570164B2 (en) | 2013-03-15 | 2020-02-25 | The Coca-Cola Company | Steviol glycosides, their compositions and their purification |
US20140342043A1 (en) | 2013-05-14 | 2014-11-20 | Pepsico, Inc. | Rebaudioside Sweetener Compositions and Food Products Sweetened with Same |
MX2015016379A (es) * | 2013-05-31 | 2016-04-13 | Dsm Ip Assets Bv | Produccion de diterpeno extracelular. |
AU2014273055A1 (en) | 2013-05-31 | 2015-12-17 | Dsm Ip Assets B.V. | Microorganisms for diterpene production |
CN105339065B (zh) | 2013-05-31 | 2018-06-05 | 国际壳牌研究有限公司 | 利用溶剂提取回收二醇 |
US10905146B2 (en) | 2013-07-12 | 2021-02-02 | The Coca-Cola Company | Compositions for improving rebaudioside M solubility |
AU2014306548B2 (en) | 2013-08-15 | 2018-02-22 | Cargill, Incorporated | Sweetener and sweetened compositions incorporating Rebaudoside N |
WO2015051454A1 (en) | 2013-10-07 | 2015-04-16 | Vineland Research And Innovation Centre | Compositions and methods for producing steviol and steviol glycosides |
CN106103729B (zh) | 2013-11-01 | 2020-07-07 | 科纳根公司 | 甜菊醇糖苷的重组制备 |
BR112016018855B1 (pt) | 2014-02-18 | 2021-08-31 | Mcneil Nutritionals, Llc | Processo para produção de uma combinação glicosídeos de esteviol |
US9522929B2 (en) | 2014-05-05 | 2016-12-20 | Conagen Inc. | Non-caloric sweetener |
EP2954785B1 (de) * | 2014-06-13 | 2018-06-06 | Symrise AG | Neue Stoffmischung zur Verbesserung des Süssgeschmacks enthaltend Rubusosid oder alpha-Glycosylrubusosid |
SG11201700651RA (en) | 2014-08-11 | 2017-02-27 | Evolva Sa | Production of steviol glycosides in recombinant hosts |
US20170275666A1 (en) | 2014-08-19 | 2017-09-28 | The Coca-Cola Company | Methods for Preparing Rebaudioside I and Uses |
CN107109358B (zh) * | 2014-09-09 | 2022-08-02 | 埃沃尔瓦公司 | 在重组宿主中生产甜菊醇糖苷 |
MX2017003666A (es) | 2014-09-19 | 2018-02-01 | Purecircle Sdn Bhd | Esteviol glicosidos de alta pureza. |
MY179680A (en) | 2014-10-03 | 2020-11-11 | Conagen Inc | Non-caloric sweeteners and methods for synthesizing |
WO2016086233A1 (en) * | 2014-11-29 | 2016-06-02 | The Coca-Cola Company | Novel diterpene glycosides, compositions and purification methods |
CN107249356B (zh) | 2014-12-17 | 2021-08-06 | 嘉吉公司 | 用于口服摄入或使用的甜菊醇糖苷化合物、组合物以及用于增强甜菊醇糖苷溶解度的方法 |
WO2016120486A1 (en) * | 2015-01-30 | 2016-08-04 | Evolva Sa | Production of steviol glycosides in recombinant hosts |
BR112017021066B1 (pt) | 2015-04-03 | 2022-02-08 | Dsm Ip Assets B.V. | Glicosídeos de esteviol, método para a produção de um glicosídeo de esteviol, composição, usos relacionados, gênero alimentício, alimento para animais e bebida |
CN108289489B (zh) | 2015-11-30 | 2022-09-16 | 嘉吉公司 | 用于口服摄取或使用的甜菊醇糖苷组合物 |
MX2019010537A (es) | 2017-03-06 | 2019-11-21 | Conagen Inc | Produccion biosintetica del rebaudiosido d4 de glucosido de esteviol a partir del rebaudiosido e. |
-
2016
- 2016-04-04 BR BR112017021066-5A patent/BR112017021066B1/pt active IP Right Grant
- 2016-04-04 EP EP16713488.1A patent/EP3277829B1/en active Active
- 2016-04-04 CA CA2980090A patent/CA2980090A1/en active Pending
- 2016-04-04 EP EP20162530.8A patent/EP3718417A1/en active Pending
- 2016-04-04 CN CN202110909254.5A patent/CN113683712B/zh active Active
- 2016-04-04 US US15/563,475 patent/US11344051B2/en active Active
- 2016-04-04 CN CN201680020186.4A patent/CN107666834B/zh active Active
- 2016-04-04 WO PCT/EP2016/057360 patent/WO2016156616A1/en active Application Filing
-
2022
- 2022-04-22 US US17/727,130 patent/US11540544B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090162500A1 (en) * | 2007-12-21 | 2009-06-25 | The Quaker Oats Company | Grain products having a potent natural sweetener |
US20140227421A1 (en) * | 2011-02-17 | 2014-08-14 | Purecircle Usa Inc. | Glucosyl stevia composition |
CN103404833A (zh) * | 2013-08-20 | 2013-11-27 | 济南汉定生物工程有限公司 | 甜菊糖甙复配甜味剂 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116751317A (zh) * | 2023-06-26 | 2023-09-15 | 沈阳药科大学 | 一种具有抗氧化活性的中华小苦荬多糖的制备方法及其应用 |
CN116751317B (zh) * | 2023-06-26 | 2024-10-22 | 沈阳药科大学 | 一种具有抗氧化活性的中华小苦荬多糖的制备方法及其应用 |
Also Published As
Publication number | Publication date |
---|---|
BR112017021066A2 (pt) | 2018-08-14 |
CN113683712B (zh) | 2022-10-21 |
US20180070622A1 (en) | 2018-03-15 |
EP3718417A1 (en) | 2020-10-07 |
US20220256903A1 (en) | 2022-08-18 |
BR112017021066B1 (pt) | 2022-02-08 |
EP3277829B1 (en) | 2020-07-08 |
CA2980090A1 (en) | 2016-10-06 |
EP3277829A1 (en) | 2018-02-07 |
US11540544B2 (en) | 2023-01-03 |
WO2016156616A1 (en) | 2016-10-06 |
CN107666834B (zh) | 2021-08-24 |
CN107666834A (zh) | 2018-02-06 |
US11344051B2 (en) | 2022-05-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2014298420B2 (en) | Recovery of steviol glycosides | |
KR102170340B1 (ko) | 다이테르펜 제조 | |
CN113683712A (zh) | 甜菊醇糖苷 | |
AU2020273296B2 (en) | Methods for improved production of rebaudioside D and rebaudioside M | |
CN107567492B (zh) | Udp-糖基转移酶 | |
US11725223B2 (en) | Microorganisms for diterpene production | |
AU2014292150B2 (en) | Diterpene production | |
EP3004365A1 (en) | Extracellular diterpene production | |
CN107922465B (zh) | 甜菊醇糖苷转运 | |
CN107922913B (zh) | 甜菊醇糖苷转运 | |
AU2021367159A1 (en) | Microorganisms for diterpene production |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |