CA2356492A1 - Nematode resistant plants and seeds - Google Patents
Nematode resistant plants and seeds Download PDFInfo
- Publication number
- CA2356492A1 CA2356492A1 CA002356492A CA2356492A CA2356492A1 CA 2356492 A1 CA2356492 A1 CA 2356492A1 CA 002356492 A CA002356492 A CA 002356492A CA 2356492 A CA2356492 A CA 2356492A CA 2356492 A1 CA2356492 A1 CA 2356492A1
- Authority
- CA
- Canada
- Prior art keywords
- leu
- ala
- ser
- glu
- arg
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 241000244206 Nematoda Species 0.000 title claims abstract description 58
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 55
- 230000009466 transformation Effects 0.000 claims abstract description 40
- 239000013598 vector Substances 0.000 claims abstract description 10
- 241000196324 Embryophyta Species 0.000 claims description 131
- 108090000623 proteins and genes Proteins 0.000 claims description 88
- 244000068988 Glycine max Species 0.000 claims description 42
- 239000002773 nucleotide Substances 0.000 claims description 39
- 125000003729 nucleotide group Chemical group 0.000 claims description 39
- 235000018102 proteins Nutrition 0.000 claims description 32
- 102000004169 proteins and genes Human genes 0.000 claims description 32
- 240000008042 Zea mays Species 0.000 claims description 22
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 claims description 19
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 19
- 235000009973 maize Nutrition 0.000 claims description 19
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 18
- 239000013612 plasmid Substances 0.000 claims description 11
- 108010064851 Plant Proteins Proteins 0.000 claims description 6
- 235000021118 plant-derived protein Nutrition 0.000 claims description 6
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 4
- 241000209510 Liliopsida Species 0.000 claims description 3
- 241001233957 eudicotyledons Species 0.000 claims description 3
- 235000013339 cereals Nutrition 0.000 claims 1
- 238000000034 method Methods 0.000 abstract description 56
- 108020004414 DNA Proteins 0.000 abstract description 32
- 230000014509 gene expression Effects 0.000 abstract description 20
- 230000002068 genetic effect Effects 0.000 abstract description 5
- 230000001172 regenerating effect Effects 0.000 abstract description 2
- 230000001131 transforming effect Effects 0.000 abstract description 2
- 210000001519 tissue Anatomy 0.000 description 46
- 210000004027 cell Anatomy 0.000 description 41
- 235000010469 Glycine max Nutrition 0.000 description 38
- 239000012634 fragment Substances 0.000 description 17
- 108010049041 glutamylalanine Proteins 0.000 description 16
- 150000007523 nucleic acids Chemical class 0.000 description 16
- 108020004707 nucleic acids Proteins 0.000 description 15
- 102000039446 nucleic acids Human genes 0.000 description 15
- 108010050848 glycylleucine Proteins 0.000 description 14
- 108090000765 processed proteins & peptides Chemical group 0.000 description 14
- 238000006467 substitution reaction Methods 0.000 description 13
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 11
- 230000000408 embryogenic effect Effects 0.000 description 11
- 108010065920 Insulin Lispro Proteins 0.000 description 10
- 235000001014 amino acid Nutrition 0.000 description 10
- 238000009396 hybridization Methods 0.000 description 10
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 9
- 108010093581 aspartyl-proline Proteins 0.000 description 9
- 239000002299 complementary DNA Substances 0.000 description 9
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 9
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 9
- 239000000523 sample Substances 0.000 description 9
- 230000002103 transcriptional effect Effects 0.000 description 9
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 8
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 8
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 8
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 8
- 108010087924 alanylproline Proteins 0.000 description 8
- 229940024606 amino acid Drugs 0.000 description 8
- 150000001413 amino acids Chemical group 0.000 description 8
- 238000012217 deletion Methods 0.000 description 8
- 230000037430 deletion Effects 0.000 description 8
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 8
- 239000003550 marker Substances 0.000 description 8
- 210000001938 protoplast Anatomy 0.000 description 8
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 7
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 7
- 108700008625 Reporter Genes Proteins 0.000 description 7
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 7
- 238000007792 addition Methods 0.000 description 7
- 230000000977 initiatory effect Effects 0.000 description 7
- 108091033319 polynucleotide Proteins 0.000 description 7
- 102000040430 polynucleotide Human genes 0.000 description 7
- 239000002157 polynucleotide Substances 0.000 description 7
- 102000004196 processed proteins & peptides Human genes 0.000 description 7
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 6
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 6
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 6
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 6
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 6
- 241000282326 Felis catus Species 0.000 description 6
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 6
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 6
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 6
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 6
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 6
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 6
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 6
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 6
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 6
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 6
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 6
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 6
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 6
- 108010025306 histidylleucine Proteins 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 230000009261 transgenic effect Effects 0.000 description 6
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 5
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 5
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 5
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 5
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 5
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 5
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 5
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 5
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 5
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 5
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 5
- LKXANTUNFMVCNF-IHPCNDPISA-N Leu-His-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LKXANTUNFMVCNF-IHPCNDPISA-N 0.000 description 5
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 5
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 5
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 5
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 5
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 5
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 5
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 5
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 5
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 5
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 5
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 5
- 108010044940 alanylglutamine Proteins 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 210000002257 embryonic structure Anatomy 0.000 description 5
- 108010012058 leucyltyrosine Proteins 0.000 description 5
- 230000001404 mediated effect Effects 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 238000010369 molecular cloning Methods 0.000 description 5
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 5
- 108010031719 prolyl-serine Proteins 0.000 description 5
- 230000008929 regeneration Effects 0.000 description 5
- 238000011069 regeneration method Methods 0.000 description 5
- 108010051110 tyrosyl-lysine Proteins 0.000 description 5
- 241000589158 Agrobacterium Species 0.000 description 4
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 4
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 4
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 4
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 4
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 4
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 4
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 4
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 4
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 4
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 4
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 4
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 4
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 4
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 4
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 4
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 4
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 4
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 4
- NNBWMLHQXBTIIT-HVTMNAMFSA-N His-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N NNBWMLHQXBTIIT-HVTMNAMFSA-N 0.000 description 4
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 4
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 4
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 4
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 4
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 4
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 4
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 4
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 4
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 4
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 4
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 4
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 4
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 4
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 4
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 4
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 4
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 4
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 4
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 4
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 4
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 4
- XIFAHCUNWWKUDE-DCAQKATOSA-N Val-Cys-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XIFAHCUNWWKUDE-DCAQKATOSA-N 0.000 description 4
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 4
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 4
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 4
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 4
- 238000009395 breeding Methods 0.000 description 4
- 230000001488 breeding effect Effects 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- 230000013020 embryo development Effects 0.000 description 4
- 108010040030 histidinoalanine Proteins 0.000 description 4
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 4
- 238000002844 melting Methods 0.000 description 4
- 230000008018 melting Effects 0.000 description 4
- 239000011859 microparticle Substances 0.000 description 4
- 210000000056 organ Anatomy 0.000 description 4
- 108010073101 phenylalanylleucine Proteins 0.000 description 4
- 229920001184 polypeptide Chemical group 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 3
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 3
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 3
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 3
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 3
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 3
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 3
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 3
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 3
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 3
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 3
- RAKKBBHMTJSXOY-XVYDVKMFSA-N Asn-His-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O RAKKBBHMTJSXOY-XVYDVKMFSA-N 0.000 description 3
- XFJKRRCWLTZIQA-XIRDDKMYSA-N Asn-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N XFJKRRCWLTZIQA-XIRDDKMYSA-N 0.000 description 3
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 3
- QTKYFZCMSQLYHI-UBHSHLNASA-N Asn-Trp-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O QTKYFZCMSQLYHI-UBHSHLNASA-N 0.000 description 3
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 3
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 3
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 3
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 3
- 241001200922 Gagata Species 0.000 description 3
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 3
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 3
- DFRYZTUPVZNRLG-KKUMJFAQSA-N Gln-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DFRYZTUPVZNRLG-KKUMJFAQSA-N 0.000 description 3
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 3
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 3
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 3
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 3
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 3
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 3
- 241000498254 Heterodera glycines Species 0.000 description 3
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 3
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 3
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 3
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 3
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 3
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 3
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 3
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 3
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 3
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 3
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 3
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 3
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 3
- ZJSXCIMWLPSTMG-HSCHXYMDSA-N Lys-Trp-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJSXCIMWLPSTMG-HSCHXYMDSA-N 0.000 description 3
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 3
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- 208000000291 Nematode infections Diseases 0.000 description 3
- 240000007594 Oryza sativa Species 0.000 description 3
- 235000007164 Oryza sativa Nutrition 0.000 description 3
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 3
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 3
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 3
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 3
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 3
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 3
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 3
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 3
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 3
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 3
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 3
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 3
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 3
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 3
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 3
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 3
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 3
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 3
- DCRHJDRLCFMEBI-RHYQMDGZSA-N Thr-Lys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O DCRHJDRLCFMEBI-RHYQMDGZSA-N 0.000 description 3
- IQGJAHMZWBTRIF-UBHSHLNASA-N Trp-Asp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IQGJAHMZWBTRIF-UBHSHLNASA-N 0.000 description 3
- NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 3
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 3
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 3
- JRMCISZDVLOTLR-BVSLBCMMSA-N Tyr-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N JRMCISZDVLOTLR-BVSLBCMMSA-N 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 3
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 3
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 3
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 3
- 235000007244 Zea mays Nutrition 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 108010091871 leucylmethionine Proteins 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 239000008188 pellet Substances 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 235000009566 rice Nutrition 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000004114 suspension culture Methods 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 108010044292 tryptophyltyrosine Proteins 0.000 description 3
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- WQVYAWIMAWTGMW-ZLUOBGJFSA-N Ala-Asp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WQVYAWIMAWTGMW-ZLUOBGJFSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 2
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 2
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 2
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 2
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- JQFJNGVSGOUQDH-XIRDDKMYSA-N Arg-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JQFJNGVSGOUQDH-XIRDDKMYSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 2
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 2
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 2
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 2
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 2
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 2
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 2
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 2
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- 241000219310 Beta vulgaris subsp. vulgaris Species 0.000 description 2
- 235000006008 Brassica napus var napus Nutrition 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- ZOLXQKZHYOHHMD-DLOVCJGASA-N Cys-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N ZOLXQKZHYOHHMD-DLOVCJGASA-N 0.000 description 2
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 2
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 2
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 2
- UDDITVWSXPEAIQ-IHRRRGAJSA-N Cys-Phe-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UDDITVWSXPEAIQ-IHRRRGAJSA-N 0.000 description 2
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 2
- DQBRIEGWTLXALA-GQGQLFGLSA-N Cys-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N DQBRIEGWTLXALA-GQGQLFGLSA-N 0.000 description 2
- KXHAPEPORGOXDT-UWJYBYFXSA-N Cys-Tyr-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O KXHAPEPORGOXDT-UWJYBYFXSA-N 0.000 description 2
- 208000035240 Disease Resistance Diseases 0.000 description 2
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 2
- 229920002148 Gellan gum Polymers 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 2
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 2
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 2
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 2
- LSTFYPOGBGFIPP-FXQIFTODSA-N Glu-Cys-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O LSTFYPOGBGFIPP-FXQIFTODSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 2
- CBOVGULVQSVMPT-CIUDSAMLSA-N Glu-Pro-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CBOVGULVQSVMPT-CIUDSAMLSA-N 0.000 description 2
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 2
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 2
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 2
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 2
- SFKMXFWWDUGXRT-NWLDYVSISA-N Glu-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N)O SFKMXFWWDUGXRT-NWLDYVSISA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 2
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 2
- QHGBCRCMBCWMBJ-UHFFFAOYSA-N Ile-Glu-Ala-Lys Natural products CCC(C)C(N)C(=O)NC(CCC(O)=O)C(=O)NC(C)C(=O)NC(C(O)=O)CCCCN QHGBCRCMBCWMBJ-UHFFFAOYSA-N 0.000 description 2
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 2
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 2
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 2
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 2
- VBGCPJBKUXRYDA-DSYPUSFNSA-N Ile-Trp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N VBGCPJBKUXRYDA-DSYPUSFNSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 2
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 2
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- LCPYQJIKPJDLLB-UWVGGRQHSA-N Leu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C LCPYQJIKPJDLLB-UWVGGRQHSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 2
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- VSJXPNCQYGOLFM-XIRDDKMYSA-N Lys-Cys-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VSJXPNCQYGOLFM-XIRDDKMYSA-N 0.000 description 2
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 2
- TYEJPFJNAHIKRT-DCAQKATOSA-N Lys-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N TYEJPFJNAHIKRT-DCAQKATOSA-N 0.000 description 2
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 2
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 2
- JQEBITVYKUCBMC-SRVKXCTJSA-N Met-Arg-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JQEBITVYKUCBMC-SRVKXCTJSA-N 0.000 description 2
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 2
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 2
- NBEFNGUZUOUGFG-KKUMJFAQSA-N Met-Tyr-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NBEFNGUZUOUGFG-KKUMJFAQSA-N 0.000 description 2
- 244000061176 Nicotiana tabacum Species 0.000 description 2
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 2
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 2
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- RETPETNFPLNLRV-JYJNAYRXSA-N Pro-Asn-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O RETPETNFPLNLRV-JYJNAYRXSA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 2
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 2
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 235000021536 Sugar beet Nutrition 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 2
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 2
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 2
- MKDXQPMIQPTTAW-SIXJUCDHSA-N Trp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N MKDXQPMIQPTTAW-SIXJUCDHSA-N 0.000 description 2
- KULBQAVOXHQLIY-HSCHXYMDSA-N Trp-Ile-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 KULBQAVOXHQLIY-HSCHXYMDSA-N 0.000 description 2
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 2
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 2
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 2
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 2
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 2
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 2
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 2
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- 241000607479 Yersinia pestis Species 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 238000000975 co-precipitation Methods 0.000 description 2
- 210000001072 colon Anatomy 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 235000021186 dishes Nutrition 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 239000004009 herbicide Substances 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010091798 leucylleucine Proteins 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- WFKWXMTUELFFGS-UHFFFAOYSA-N tungsten Chemical compound [W] WFKWXMTUELFFGS-UHFFFAOYSA-N 0.000 description 2
- 229910052721 tungsten Inorganic materials 0.000 description 2
- 239000010937 tungsten Substances 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- IKWHIGGRTYBSIW-OBJOEFQTSA-N (2s)-2-[[(2s)-2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-methylbutanoic acid Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN IKWHIGGRTYBSIW-OBJOEFQTSA-N 0.000 description 1
- SADYNMDJGAWAEW-JKQORVJESA-N (2s)-2-[[(2s)-3-carboxy-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]propanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN SADYNMDJGAWAEW-JKQORVJESA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- UPMXNNIRAGDFEH-UHFFFAOYSA-N 3,5-dibromo-4-hydroxybenzonitrile Chemical compound OC1=C(Br)C=C(C#N)C=C1Br UPMXNNIRAGDFEH-UHFFFAOYSA-N 0.000 description 1
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- RGDKRCPIFODMHK-HJWJTTGWSA-N Ala-Leu-Leu-His Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RGDKRCPIFODMHK-HJWJTTGWSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- RFJNDTQGEJRBHO-DCAQKATOSA-N Ala-Val-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)[NH3+] RFJNDTQGEJRBHO-DCAQKATOSA-N 0.000 description 1
- 244000291564 Allium cepa Species 0.000 description 1
- 235000002732 Allium cepa var. cepa Nutrition 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- QYLJIYOGHRGUIH-CIUDSAMLSA-N Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N QYLJIYOGHRGUIH-CIUDSAMLSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- CFGHCPUPFHWMCM-FDARSICLSA-N Arg-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N CFGHCPUPFHWMCM-FDARSICLSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- ZFSIGJMSVGZVGP-DHATWTDPSA-N Arg-Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCN=C(N)N)[C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZFSIGJMSVGZVGP-DHATWTDPSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- FFMIYIMKQIMDPK-BQBZGAKWSA-N Asn-His Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 FFMIYIMKQIMDPK-BQBZGAKWSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- LXKLDWVHXNZQGB-SRVKXCTJSA-N Asp-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O LXKLDWVHXNZQGB-SRVKXCTJSA-N 0.000 description 1
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 1
- JTRDJYIZIKCIRC-AJNGGQMLSA-N Asp-Leu-Leu-Gln Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JTRDJYIZIKCIRC-AJNGGQMLSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- HTSSXFASOUSJQG-IHPCNDPISA-N Asp-Tyr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HTSSXFASOUSJQG-IHPCNDPISA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 244000075850 Avena orientalis Species 0.000 description 1
- 235000007319 Avena orientalis Nutrition 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 240000002791 Brassica napus Species 0.000 description 1
- 240000000385 Brassica napus var. napus Species 0.000 description 1
- 240000007124 Brassica oleracea Species 0.000 description 1
- 235000003899 Brassica oleracea var acephala Nutrition 0.000 description 1
- 235000011301 Brassica oleracea var capitata Nutrition 0.000 description 1
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 239000005489 Bromoxynil Substances 0.000 description 1
- 101100163949 Caenorhabditis elegans asp-3 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 235000002566 Capsicum Nutrition 0.000 description 1
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 1
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 1
- YHDXIZKDOIWPBW-WHFBIAKZSA-N Cys-Gln Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O YHDXIZKDOIWPBW-WHFBIAKZSA-N 0.000 description 1
- LMKYZBGVKHTLTN-NKWVEPMBSA-N D-nopaline Chemical compound NC(=N)NCCC[C@@H](C(O)=O)N[C@@H](C(O)=O)CCC(O)=O LMKYZBGVKHTLTN-NKWVEPMBSA-N 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 244000000626 Daucus carota Species 0.000 description 1
- 235000002767 Daucus carota Nutrition 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 206010071602 Genetic polymorphism Diseases 0.000 description 1
- FAQVCWVVIYYWRR-WHFBIAKZSA-N Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O FAQVCWVVIYYWRR-WHFBIAKZSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 1
- LUGUNEGJNDEBLU-DCAQKATOSA-N Gln-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LUGUNEGJNDEBLU-DCAQKATOSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- 241000482313 Globodera ellingtonae Species 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- SNFUTDLOCQQRQD-ZKWXMUAHSA-N Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SNFUTDLOCQQRQD-ZKWXMUAHSA-N 0.000 description 1
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 241000379510 Heterodera schachtii Species 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- VHOLZZKNEBBHTH-YUMQZZPRSA-N His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 VHOLZZKNEBBHTH-YUMQZZPRSA-N 0.000 description 1
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 1
- PDLQNLSEJXOQNQ-IHPCNDPISA-N His-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CN=CN1 PDLQNLSEJXOQNQ-IHPCNDPISA-N 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- KTGFOCFYOZQVRJ-ZKWXMUAHSA-N Ile-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O KTGFOCFYOZQVRJ-ZKWXMUAHSA-N 0.000 description 1
- QNBYCZTZNOVDMI-HGNGGELXSA-N Ile-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QNBYCZTZNOVDMI-HGNGGELXSA-N 0.000 description 1
- JWBXCSQZLLIOCI-GUBZILKMSA-N Ile-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C JWBXCSQZLLIOCI-GUBZILKMSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- BVRPESWOSNFUCJ-LKTVYLICSA-N Ile-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 BVRPESWOSNFUCJ-LKTVYLICSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- QOOWRKBDDXQRHC-BQBZGAKWSA-N L-lysyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN QOOWRKBDDXQRHC-BQBZGAKWSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- 240000008415 Lactuca sativa Species 0.000 description 1
- 235000003228 Lactuca sativa Nutrition 0.000 description 1
- HSQGMTRYSIHDAC-BQBZGAKWSA-N Leu-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(O)=O HSQGMTRYSIHDAC-BQBZGAKWSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- FZMNAYBEFGZEIF-AVGNSLFASA-N Leu-Met-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N FZMNAYBEFGZEIF-AVGNSLFASA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- 241000234280 Liliaceae Species 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 240000004658 Medicago sativa Species 0.000 description 1
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- JUXONJROIXKHEV-GUBZILKMSA-N Met-Cys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCNC(N)=N JUXONJROIXKHEV-GUBZILKMSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- CFRRIZLGFGJEDB-SRVKXCTJSA-N Met-His-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CFRRIZLGFGJEDB-SRVKXCTJSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- 206010061291 Mineral deficiency Diseases 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 239000006002 Pepper Substances 0.000 description 1
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- OLZVAVSJEUAOHI-UNQGMJICSA-N Phe-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O OLZVAVSJEUAOHI-UNQGMJICSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N Phosphinothricin Natural products CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 1
- 235000016761 Piper aduncum Nutrition 0.000 description 1
- 240000003889 Piper guineense Species 0.000 description 1
- 235000017804 Piper guineense Nutrition 0.000 description 1
- 235000008184 Piper nigrum Nutrition 0.000 description 1
- 108020005120 Plant DNA Proteins 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- HMNSRTLZAJHSIK-YUMQZZPRSA-N Pro-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 HMNSRTLZAJHSIK-YUMQZZPRSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- RZEQTVHJZCIUBT-WDSKDSINSA-N Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-WDSKDSINSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 240000003829 Sorghum propinquum Species 0.000 description 1
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- HYLXOQURIOCKIH-VQVTYTSYSA-N Thr-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N HYLXOQURIOCKIH-VQVTYTSYSA-N 0.000 description 1
- FDQXPJCLVPFKJW-KJEVXHAQSA-N Thr-Met-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O FDQXPJCLVPFKJW-KJEVXHAQSA-N 0.000 description 1
- IQHUITKNHOKGFC-MIMYLULJSA-N Thr-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IQHUITKNHOKGFC-MIMYLULJSA-N 0.000 description 1
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- NDLHSJWPCXKOGG-VLCNGCBASA-N Thr-Trp-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N)O NDLHSJWPCXKOGG-VLCNGCBASA-N 0.000 description 1
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- GRQCSEWEPIHLBI-JQWIXIFHSA-N Trp-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 GRQCSEWEPIHLBI-JQWIXIFHSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- TYYLDKGBCJGJGW-WMZOPIPTSA-N Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 TYYLDKGBCJGJGW-WMZOPIPTSA-N 0.000 description 1
- YCQKQFKXBPJXRY-PMVMPFDFSA-N Trp-Tyr-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CCCCN)C(=O)O)N YCQKQFKXBPJXRY-PMVMPFDFSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 1
- BJCILVZEZRDIDR-PMVMPFDFSA-N Tyr-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 BJCILVZEZRDIDR-PMVMPFDFSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- BNQVUHQWZGTIBX-IUCAKERBSA-N Val-His Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CN=CN1 BNQVUHQWZGTIBX-IUCAKERBSA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 1
- 241000209149 Zea Species 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- QGLZXHRNAYXIBU-WEVVVXLNSA-N aldicarb Chemical compound CNC(=O)O\N=C\C(C)(C)SC QGLZXHRNAYXIBU-WEVVVXLNSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000001066 destructive effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 230000035784 germination Effects 0.000 description 1
- IAJOBQBIJHVGMQ-BYPYZUCNSA-N glufosinate-P Chemical compound CP(O)(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-N 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010083327 glycyl-prolyl-arginyl-valine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- 208000037824 growth disorder Diseases 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010071185 leucyl-alanine Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010083942 mannopine synthase Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000005645 nematicide Substances 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 239000003415 peat Substances 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 230000003032 phytopathogenic effect Effects 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000002271 resection Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 229940124530 sulfonamide Drugs 0.000 description 1
- 150000003456 sulfonamides Chemical class 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 229910052902 vermiculite Inorganic materials 0.000 description 1
- 239000010455 vermiculite Substances 0.000 description 1
- 235000019354 vermiculite Nutrition 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
The present invention relates to isolated DNA sequences capable of conferring nematode resistance in plants. The isolated DNA sequences can be inserted into a DNA vector to form a transformation construct for the expression of the isolated DNA sequences in plants. The transformation construct can be introduced into plant cells. Plants expressing the isolated DNA sequences can be regenerated from the transformed cells. Methods for improving genetic traits for nematode resistance in plants are also provided, comprising transforming cells with the isolated DNA
sequences and regenerating plants from the transformed cells expressing the isolated DNA sequences necessary for nematode resistance.
sequences and regenerating plants from the transformed cells expressing the isolated DNA sequences necessary for nematode resistance.
Description
NEMATODE RESISTANT PLANTS AND SEEDS
This application is related to international publication number WO 99/60141, and is a divisional of Canadian patent application SN 2,323,312.
FIELD OF THE INVENTION
The present invention relates to isolated DNA sequences involved in nematode resistance inplants.The invention also relates to~methods for improving the genetic traits for nematode resistance in plants b~~ utilizing such isolated DNA
sequences.
BACKGROUND
Plants are continually attacked by a diverse range of phytopathogenic organisms. These organisms cause substantial losses to crops each year.
Traditional approaches for control of plant diseases have been the use of chemical treatment arid the construction of interspecific hybrids between resistant crops and their wild-type relatives as sources of resistant germplasm. However, environmental and economic concerns make chemical pesticides undesirable, while the traditional interspecific breeding is inefEcient and often cannot eliminate the undesired traits of the wild species. Thus, the discovery of pest and pathogen-resistant genes provides a new approach to control plant disease.
Several genes responsible for disease resistance have been identified and ' isolated from plants. See Staskawicz et al. (1995) Science 268:661-667.
Recently, 2 o the sugar beet Hsl°'°'~ gene that confers resistance to the beet cyst nematode was cloned. See Cai et al: (1997) Science 275:832-834; and Moffat (1997) Science 275:77: Transformation of plants or plant tissues with the resistance genes can confer disease resistance to susceptible strains. See, for example, PCT
Publication W093/19181; and Cai et al. (1997) Science 275:832-834.
2 5 Nematode infection is prevalent in many crops. For example, soybean cyst nematode (Heterodera glycines) is a widespread pest that causes substantial damage 62451-860 (D) to soybeans every year. Such damage is the result of the stunting of the soybean plant caused by the cyst nematode.
The stunted plants have smaller root systems, show symptoms of mineral deficiencies in their leaves, and wilt easily.
The soybean cyst nematode is believed to be responsible for yield losses in soybeans that are estimated to be in excess of $500 million per year.
Nematicides such as Aldicarb and its breakdown products are known to be highly toxic to mammals. As a result, government restrictions have been imposed on the use of these chemicals. Thus, there is a great need for the isolation of genes that can provide an effective method of controlling nematodes without causing health and environmental problems.
SUMMARY OF THE INVENTION
This invention relates to DNA sequences isolated from soybean and maize. The sequences alone, or in combination with other sequences, confer nematode resistance in a plant. The sequences are useful in methods for the protection of plants from nematodes. Additionally, allelic variants of the resistance gene from a susceptible plant are included. In another aspect of the present invention, expression cassettes and transformation vectors comprising the isolated nucleotide sequences are disclosed. The transformation vectors can be used to transform plants and express the nematode resistance genes in the transformed cells. Plants susceptible to nematode infection can be targeted to confer nematode resistance. The transformed cells as well as the regenerated transgenic plants containing and expressing the isolated DNA and protein sequences are also disclosed.
62451-860(D) 2a In one aspect, the invention provides a plant which has been transformed with a transformation vector comprising a DNA sequence that encodes an amino acid sequence selected from the group consisting of the sequences set forth in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ ID NO: 8.
Another aspect of the invention provides a plant which has been transformed with a transformation vector comprising a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence having at least 70%
identity to the nucleotide sequence set forth in SEQ ID NO:
1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID NO: 7; and (b) a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence encoding a plant protein, wherein said sequence encoding said plant protein is contained in a plasmid having ATCC accession number 209366, 209365, 209614, 209363, or 209364.
Another aspect of the invention provides a plant which has been transformed with a transformation vector comprising a nucleotide sequence selected from the group consisting of: (a) the nucleotide sequence set forth in SEQ
ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7; (b) a nucleotide sequence encoding a plant protein, wherein said sequence is contained in a plasmid having ATCC accession number 209366, 209365, 209614, 209363, or 209364; and (c) a nucleotide sequence having at least 85% identity to the sequence of (a) or (b).
Another aspect of the invention provides a plant transformed with a DNA sequence encoding a protein comprising an amino acid sequence selected from the group consisting of the amino acid sequences set forth in SEQ ID
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ ID NO: 8, wherein 62451-860(D) 2b said plant exhibits improved resistance to nematodes over the native untransformed plant.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 schematically illustrates the plasmid vector comprising a nematode resistance DNA sequence of the present invention operably linked to the ubiquitin promoter.
Constitutive expression of this sequence confers resistance to nematodes in a transformed plant.
DETAILED DESCRIPTION OF THE INVENTION
Compositions and methods for the control of nematodes in susceptible plants are provided. The compositions comprise isolated proteins and DNA sequences encoding such proteins involved in nematode resistance. Such isolated 'DNA
sequences can be transferred into plants to confer or improve nematode resistance in the transformed plants. Sequences of the invention have been isolated from maize and soybean. By "involved in nematode resistance" is intended that the proteins or sequences, either alone or in combination with other proteins or sequences, confer nematode resistance in a plant. In this manner, resistance to nematodes can be enhanced or improved in the transformed plant as at least one of the sequences required for nematode resistance is provided.
DNA sequences isolated from the genomes of maize and soybean are disclosed. The nucleotide sequences and amino acid sequences from two maize isolates are set forth in SEQ ID NOs: 1-2 and 3-4, and the corresponding sequences from two soybean isolates are set forth in SEQ ID NOs: 5-6 and 7-8. The nucleotide sequences in accordance with this invention are involved in nematode resistance in 1 S plants and may confer, alone or in combination with other sequences, nematode resistance in plants. Also discussed are DNA sequences isolated from a susceptible genotype of soybean. The nucleotide and amino acid sequences for this isolate are set forth in SEQ ID NOs: 9-10. Nucleotide sequences of the invention also include the maize and soybean nematode resistance gene sequences as contained in plasmids deposited with American Type Culture Collection (ATCC) and assigned Accession Numbers 209366, 209365, 209614, 209363, and 209364.
Using the sequence information set forth in the SEQ ID NOs or the sequences as contained in ATTC deposits assigned Accession Numbers 209366, 209365, 209614, 209363, and 209364, other plant DNA sequences comprising the nucleotide sequences disclosed above can be isolated based on sequence homology at either the amino acid or nucleotide sequence level. Any suitable molecular cloning method can be used including, but not limited to, PCR'amplification and DNA
hybridization. In the same manner, synthetic nucleotide sequences can be designed based on the amino acid sequences of the invention. Methods to design and make such synthetic sequences are available in the art.
In a hybridization method, the hybridization probes may be genomic DNA
fragments, cDNA fragments, RNA fragments, or other oligonucleotides, and may be labeled with a detectable group such as 3ZP, or any other detectable marker.
Probes for hybridization can be made by labeling synthetic oligonucleotides based on the sequence of the soybean and/or maize sequence. Degenerate primers designed on the basis of conserved nucleotide or amino acid sequences in the maize and soybean sequences can additionally be used. Preparation of probes for hybridization is generally know in the art and is disclosed in Sambrook et al. ( 1989) Molecular Cloning: A Laboratory Manual (2d ed.. Cold Spring Harbor Laboratory Press.
Plainview, New Yorkj.~ The labeled probes can be used to screen cDNA or genomic libraries made from nematode resistant plants.
Methods for construction of such cDNA and genomic libraries are generally known in the art and are disclosed in Sambrook et al. (1989).Molecular Cloninh: A
Laboratort:
Manual (2d ed.. Cold Spring Harbor Laboratory Press. Plainviev-. New York).
In a PCR method. the DNA or amino acid sequence encoded by the soybean or maize sequences of the invention can be aligned with each other. Nucleotide 1 S primers can be designed based on any conserved short stretches of amino acid sequences or nucleotide sequences. Pairs of primers can be used in PCR
reactions for amplification of DNA sequences from cDNA or genomic DNA extracted from plants of interest. In addition. a single specific primer with a sequence corresponding to one of the nucleotide sequences disclosed herein can be paired with a primer having a sequence of the DNA vector in the cDNA or genomic libraries for PCR
amplification of the sequences ~' or 3' to the nucleotide sequences disclosed herein.
Similarly.
nested primers may be used instead of a single specific primer for the purposes of the invention. Methods for designing PCR primers and PCR cloning are generally known in the art and are disclosed in Sambrook et al. (1989) Molecular Cloning: A
Laboratory Manual (2d ed.. Cold Spring Harbor Laboratory Press, Plainview, New York).
The sequences of the invention comprise coding sequences from other plants that may be isolated according to well-known techniques based on their sequence homology to the maize or soybean coding sequences set forth herein. In these techniques, all or part of the known coding sequence is used as a probe that selectively hybridizes to other possible nematode resistance coding sequences present in a population of cloned genomic DNA fragments or cDNA fragments (i. e., genomic or cDNA libraries) from a chosen organism. To achieve specific hybridization under a variety of conditions, such probes include sequences that are unique and are preferably at least about 10 nucleotides in length. and most preferably at least about 20 nucleotides in length. Such probes may be used to amplify corresponding coding S sequences from a chosen organism by PCR. This technique may be used to isolate other possible nematode resistance coding sequences from a desired organism or as a diagnostic assay to determine the presence of the nematode resistance coding sequence m an organism.
Such techniques include hybridization screening of plated DNA libraries (either plaques or colonies; see, e.g., Innis et al.. eds. (1990) PCR
Protocols: A Guide to Methods and Applications (Academic Press. New York)).
The isolated DNA sequences further comprise DNA sequences isolated from other plants by hybridization with partial sequences obtained from maize and soybean. Conditions that will permit other DNA sequences to hybridize to the DNA
1 S sequences disclosed herein can be determined in accordance with techniques generally known in the art. For example, hybridization of such sequences may be carried out under conditions of reduced stringency. medium stringency. or high stringency conditions (e.g., conditions represented by a wash stringency of 3~-40°ro Formamide with Sx Denhardt's solution. 0.~% SDS. and lx SSPE at 37°C;
conditions represented by a wash stringency of 40-45% Forniamide with Sx Denhardt's solution.
0.5% SDS. and lx SSPE at 42°C; and conditions represented by a wash stringenc~~ of ~0% Formamide with Sx Denhardt's solution, 0.~% SDS, and lx SSPE at 42°C.
respectively. See Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, New York). In general.
sequences that confer nematode resistance and hybridize to the DNA sequences disclosed herein will be at least 70-75% homologous, 80-85% homologous. and even 90-95% homologous or more.
The following terms are used to describe the sequence relationships bet«~een two or more nucleic acids or polynucleotides: (a) "reference sequence", (b) "comparison window", (c) "sequence identity". (d) "percentage of sequence identiy", and (e) "substantial identity".
(a) As used herein, "reference sequence" is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset of or the entire specified sequence; for example, as a segment of a full-length cDNA
or gene sequence, or the complete cDNA or gene sequence.
(b) As used herein, "comparison window" makes reference to a contiguous and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. Generally, the comparison window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or more contiguous nucleotides in length. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and is subtracted from the number of matches.
Methods of alignment of sequences for comparison are well-known in the art.
I S Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman ( 1981 ) Adv. Appl. Math. 2:482; b~~
the homology alignment algorithm of Needleman and Wunsch ( 1970) J. Mol. Biol.
48:443; by the search for similarity method of Pearson and Lipman ( 1988) Proc. Natl.
Acad Sci. 85:2444; by computerized implementations of these algorithms.
including, but not limited to: CLUSTAL in the PC/Gene program by Intelligenetics (Mountain View, California). GAP. BESTFIT. BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG) (57~ Science Drive, Madison, Wisconsin); the CLUSTAL program is well described by Higgins and Sharp (1988) Gene 73:237-244; Higgins and Sharp (1989); CABIOS S:l~l-153:' Corpet et al. ( I 988) Nucleic Acids Res. 16:10881-90; Huang et al. ( 1992) Computer Applications in the Biosciences 8:155-65; and Person et al. ( 1994) Meth. of Mol. Biol.
24:307-331; preferred computer alignment methods also include the BLASTP.
BLASTN, and BLASTX algorithms. See Altschul et al. (1990) J. Mol. Biol. 21 ~:403-410. Alignment is also often performed by visual inspection and manual alignment.
(c) As used herein. "sequence identity" or "identity" in the context of two nucleic acid or polypeptide sequences includes reference to the residues in the two sequences that are the same when aligned for maximum correspondence over a v~0 99/60141 PCT/US98/27450 specified comparison window. When percentage of sequence identity is used in reference to proteins, it is recognized that residue positions that are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not substantially change the functional properties of the molecule. Where sequences differ in conservative substitutions, the percentage of sequence identity may be adjusted upward to correct for the conservative nature of the substitution. Sequences that differ by such conservative substitutions are said to have "sequence similarity" or "similarity." Means for making this adjustment are well-known to those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch. thereby increasing the percentage of sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and I . The scoring of conservative 1 S substitutions is calculated, e.g., as implemented in the program PC/GENE
(Intelligenetics, Mountain View, California).
(d) As used herein, "percentage of sequence identity" means the value determined by comparing two optimally aligned sequences over a comparison windov~~, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i. e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
(e)(i) The term "substantial identity" of polynucleotide sequences means that a polynucleotide comprises a sequence that has at least 70%
sequence identity, preferably at least 80%, more preferably at least 90% and most preferably at least 95%, compared to a reference sequence using one of the alignment programs described using standard parameters. One of skill in the art will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins _g_ encoded by two nucleotide sequences by taking into account codon degeneracy amino acid similarity, reading frame positioning, and the like. Substantial identity of amino acid sequences for these purposes normally means sequence identity of at least 60%, more preferably at least 70%, 80%, 90%, and most preferably at least 95%.
Another indication that nucleotide sequences are substantially identical is if two molecules hybridize to each other under stringent conditions. Generally, stringent temperature conditions are selected to be about 5°C to about 2°C lower than the melting point (Tm) for the specific sequence at a defined ionic strength and pH.
The denaturation or melting of DNA occurs over a narrow temperature range and represents the disruption of the double helix into its complementary single strands.
The process usually is characterized by the temperature of the midpoint of transition, T~,. which is sometimes described as the melting temperature. Formulas are available in the art for the determination of melting temperatures. Typically. stringent wash conditions are those in which the salt concentration is about 0.02 molar at pH
7 and I S the temperature is at 50, 55, or 60°C. However, nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides that they encode are substantially identical. This may occur, for example, when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. One indication that two nucleic acid sequences are substantially identical is that the poIypeptide that the first nucleic acid encodes is immunologically cross reactive with the polypeptide encoded by the second nucleic acid.
(e)(ii) The terms "substantial identity" in the context of a peptide indicates that a peptide comprises a sequence with at least 70% sequence identity to a reference sequence, preferably 80%, more preferably 85%, most preferably at least 90°io or 95% sequence identity to the reference sequence over a specified comparison windov<-. Preferably, optimal alignment is conducted using the homology alignment algorithm of Needleman and Wunsch ( 1970) J. Mol. Biol. 48:443. An indication that two peptide sequences are substantially identical is that one peptide is immunologically reactive with antibodies raised against the second peptide.
Thus, a peptide is substantially identical to a second peptide, for example, where the two peptides differ only by a conservative substitution. Polypeptides that are . ,'s "substantially similar" share sequences as noted above except that residue positions that are not identical may differ by conservative amino acid changes.
The present invention also encompasses the proteins and peptides encoded by the nucleotide sequences of this invention. It is recognized that the proteins of the invention may be oligomeric and will vary in molecular weight. component peptides, activity, and in other characteristics. The proteins of the invention can be used to protect plants against nematodes. Such methods are described in more detail below.
Fragments and variants of the disclosed nucleotide sequences and proteins encoded thereby are also encompassed by the present invention. By "fragment"
is intended a portion of the nucleotide sequence or a portion of the amino acid sequence and hence protein encoded thereby. Fragments of a nucleotide sequence may encode protein fragments that retain the biological activity of the native protein and hence confer resistance to nematodes. Alternatively. fragments of a nucleotide sequence that are useful as hybridization probes generally do not encode fragment proteins retaining biological activity. Thus, fragments of a nucleotide sequence may range from at least about 20 nucleotides, about 50 nucleotides, about 100 nucleotides. and up to the entire nucleotide sequence encoding the proteins of the invention.
By "variants" is intended substantially similar sequences. For nucleotide sequences, conservative variants include those sequences that, because of the degeneracy of the genetic code, encode the amino acid sequence of one of the proteins conferring resistance to nematodes. Generally. nucleotide sequence variants of the invention will have at least 70%, generally, 80%, preferably up to 90%
sequence identity to its respective native nucleotide sequence.
By "variant" protein is intended a protein derived from the native protein by deletion (so-called truncation) or addition of one or more amino acids to the N-terminal and/or C-terminal end of the native protein; deletion or addition of one or more amino acids at one or more sites~in the native protein; or substitution of one or more amino acids at one or more sites in the native protein. Such variants may result from. for example, genetic polymorphism or from human manipulation. Methods for such manipulations are generally known in the art.
Thus, the proteins of the invention may be altered in various ways including amino acid substitutions, deletions, truncations, and insertions. Methods for such 62451-860(S) manipulations are generally known in the art. For example, amino acid sequence variants of the proteins can be prepared by mutations in the DNA. Methods for mutagenesis and nucleotide sequence alterations are well known in the art.
See_ for example, Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488-49?; Kunkel et al.
(1987) Methods in Enrymol. 154:367-382; U.S. Patent No. 4.873,192; Walker and Gaastra, eds. (1983) Techniques in Molecular Biology (MacMillan Publishing Company. New York) and the references cited therein. Thus, the genes and nucleotide sequences of the invention include both the naturally occurring sequences as well as mutant forms.
Likewise, the proteins of the invention encompass both naturally occurring proteins as _ w~el1 as variations, fragments, and modified forms thereof. Such variants will;
continue to possess the desired activity of conferring resistance to nematodes.
Obviously. the mutations.that will be made in the DNA encoding the variant must not place the sequence out of reading frame and preferably will not create sequences deleterious to expression of the gene.product. See, EP Patent Application Publication No.75,444.
The nematode resistance genes of the invention can be optimized for enhanced expression in plants of interest. See, for example, EPA0359472; EPA0385962:
W091/16432; Perlak et al. (1991) Prnc. Natl. Acad. Sci. USA 88:3324-3328: and Murray et al. (1989) Nucleic Acids Res. 17:477-498. In this manner, the genes can be synthesized utilizing.plant-preferred colons. See, for example, Murray et al.
(1989) . Nucleic Acids Res. 17:477-498. In this manner, synthetic genes can also be made based on the distribution of colons a particular host uses for a particular amino acid. Thus, the nucleotide sequences can be optimized for expression in any plant. It is recognized that ali or any part of the gene sequence may be optimized or synthetic. That is, synthetic or partially optimized sequences may also be used.
The present invention also relates to a recombinant DNA transformation construct comprising the isolated DNA sequences involved in nematode resistance in plants. The recombinant DNA transformation construct can be introduced into plant cells, prbtoplasts, calli, tissues, or whole plants to confer nematode-resistance properties in plants.
The sequences of the invention can be constructed in expression cassettes for 62451-860 (S) expression in a plant. Such expression cassettes will comprise a transcriptional initiation region linked to the gene encoding the gene of interest. Such an expression cassette is provided with a plurality of restriction sites for insertion of the gene of interest behind the regulatory control of a designated promoter. The expression cassette may additionally contain selectable marker genes suitable for the particular host organism.
The transcriptional initiation region, the promoter, may be native or analogous or foreign or heterologous to the host. Additionally, the promoter may be the natural sequence or alternatively a synthetic sequence. By foreign is intended that the transcriptional initiation region is not found in the wild-type host into which the transcriptional initiation region is introduced. As used herein a chimeric gene comprises a coding sequence operably linked to a transcription initiation region which is heterologous to the coding sequence. While any promoter or promoter element capable of driving expression of a coding sequence can be utilized. of particular interest for expression in plants are root promoters (Bevan et al. (1993) in Gene Conservation and Exploitation: Proceedings of the 20th Stadler Genetics Symposium, ed. Gustafson et al. (Plenum Press, New York) pp. 109-129; Brears et al.
( 1991 ) Plant J. 1:235-244; Lorenz et al. ( 1993) Plant J. 4:545-554; U.S.
Patent Nos.
5.459.252; 5,608,149; 5,599,670); pith promoter (U.S. Patent Nos. 5.466,785;
5.451.514; 5,391.725); or other tissue specific and constitutive promoters (see, for example, U.S. Patent Nos. 5.608.149; 5,608,144; 5,604.121; 5.569.597;
5,466.785;
5,399,680; 5,268,463; 5,608;142).
The ttanscriptional cassette will include in the 5'-to-3' direction of transcription, transcriptional and translational initiation regions, a DNA
sequence of interest, and transcriptional and translational termination regions functional in plants.
The termination region may be native with the transcriptional initiation region, may be native with the DNA sequence of interest, or may be derived from another source.
Convenient termination regions are available from the Ti-plasmid of A.
tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See also, Guerineau et al. ( 1991 ) Mol. Gen. Genet. 262:141-144; Proudfoot ( 1991 ) Cell 64:671-674; Sanfacon et al. (1991) Genes Dev. 5:141-149; Mogen et al. (1990) Plant Cell 2:1261-1272; Munroe et al. (1990) Gene 91:151-I58; Ballas et al. (1989) Nucleic Acids Res. 17:7891-7903; Joshi et al. (1987) Nucleic Acid Res. 15:9627-9639.
Methodologies for the construction of plant transformation constructs are described in the art. The construct may include any necessary regulatory elements such as promoters, terminators (Guerineau et al. ( I 991 ) Mol. Gen. Genet.
226: I 41-144; Proudfoot ( 1991 ) Cell 64:671-674; Sanfacon et al. ( 1991 ) Genes Dev.
5:141-149; Molten et al. (1990) Plant Cell 2:1261-1272; Munroe et al. (1990) Gene 91:151-158; Ballas et al. (1989) Nucleic Acids Res. 17:7891-7903; .Ioshi et al. (1987) Nucleic Acid Res. I 5:962?-9639); plant translational consensus sequences (Joshi, C.P.
(1987) Nucleic Acids Research 15:6643-6653). enhancers, introns (Luehrsen and Walbot (1991) Mol. Gen. Genet. 225:81-93) and the like, operably linked to the nucleotide sequence. It may be beneficial to include 5' leader sequences in the transformation construct. Such leader sequences can act to enhance translation. See.
for example, Elroy-Stein et al. ( 1989) PNAS USA 86:6126-6130; Allison et al.
( 1986):
Macejak and Sarnow (1991) Nature 353:90-94; Jobling and Gehrke (1987) Nature 325:622-625; Gallie et al. (1989) Molecular Biology of RNA, pp. 237-256;
Lommel et al. (1991) Virology 81:382-385; and Della-Cioppa et al. (1987) Plant Physiol.
84:965-968.
Transcriptional and translational regulatory signals include but are not limited to promoters. transcriptional initiation start sites. operators, activators, enhancers, other regulatory elements, ribosomal binding sites. an initiation codon.
termination signals, and the like. See, for example. U.S. Patent No. 5,039,523: U.S.
Patent No.
4,853.331; EPO 0480762A2; Sambrook et al. (1989) Molecular Cloning A
Laboratory Manual (2d .ed., Cold Spring Harbor Laboratory Press, Plainview, New York); Davis et al., eds. (1980) Advanced Bacterial Genetics (2d ed., Cold Spring Harbor Laboratory. Cold Spring Harbor, New York); and the references cited therein.
For the expression of the proteins encoded by the isolated DNA sequences of the present invention. a promoter capable of facilitating gene transcription in plant cells must be operable linked to the nematode resistance gene sequence. A
variety of suitable promoters are generally known in the art. Both constitutive promoter and tissue-specific promoters can be used. A constitutive promoter is a promoter that can initiate RNA transcription in any tissue or cell in a plant, while tissue-specific promoters can do so only in specific tissues. Suitable promoters are known in the art WO 99/60141 PCT/US98t2745~b and include 35S and I 9S promoter of CaMV. Agrobacterium NOS (nopaline symthase) gene promoter, and the Agrobacterium mannopine synthase gene promoter.
For tissue specific expression, the isolated DNA sequences of the invention conferring nematode resistance can be operably linked to tissue specific promoters.
In addition, a marker gene for identifying and selecting transformed cells.
tissues, or plants may be included in the transformation construct. By marker gene is intended to be either reporter genes or selectable marker genes.
Reporter genes are generally known in the an. The reporter gene used should be exogenous and not expressed endogenously. Ideally the reporter gene will exhibit low- background activity and should not interfere with plant biochemical and physiological activities. The products expressed by the.reporter gene should be stable and readily detectable. It is important that the reporter gene expression should be able to be assayed by a non-destructive., quantitative, sensitive, easy to perform and inexpensive method.
Examples of suitable reporter genes known in the art can be found in, for example, Jefferson et al. (1991 ) in Plant Molecular- Biology Manual, ed.
Gelvin et al.
(Kiuwer Academic Publishers), pp. 1-33; (DeWet et al. (1987) Mol. Cell. Biol.
7:725-737: Goff et al. (1990) EMBO J. 9:2517-2522: Kain et al. (1995) BioTechnigues 19:60-655; Chiu et al. (1996) Current Biology 6:32-330.
Selectable marker genes for selection of transformed cells or tissues can include genes that confer antibiotic resistance or resistance to herbicides.
Examples of suitable seiectable marker genes include, but are not limited to, genes encoding resistance to chloramphenicol (Herrera Estrella et al. (1983) EMBO,I. 2:987-992:
methotrexate (Herrera Estrella et al. ( 1983) Nature 303:209-213; Meijer et al. ( 1991 ) Plant Mol. Biol. 16:807-820); hygromycin Waldron et al. (1985) Plant Mol.
Biol.
x:103-108; Zhijian et al. (1995) Plant Science 108:219-227); streptomycin (Jones et al. ( 1987) Mol. Gen. Genet. 210:86-91; spectinomycin (Bretagne-Sagnard et al.
(1996) Transgenic Res. 5:131-137); bleomycin (Hille et al. {1990) Plant Mol.
Biol.
7:171-176); sulfonamide (Guerineau et al. (1990) Plant Mol. Biol. 15:127-136):
bromoxynil (Stalker et al .(1988) Science 242:419-423); glyphosate (Shaw et al.
( 1986) Science 233:478-481 ); phosphinothricin (DeBlock et al. ( 1987) EMBO
J.
6:'_' ~ I 3-2518); kanomycin, and the like.
It is further recognized that the components of the transformation construct may be modified to increase expression. For example, truncated sequences, nucleotide substitutions or other modifications may be employed. See, for example, Perlak et al. (1991 ) Proc. Natl. Acad Sci. USA 88:3324-3328; Murray et al. ( 1989) Arcleic Acids Res. 17:477-498; and W091/16432.
In preparing the transformation construct, the various DNA fragments may be manipulated, so as to provide for the DNA sequences in the proper orientation and, as appropriate in the proper reading frame. Toward this end, adapters or linkers may be employed to join the DNA fragments or other manipulations may be involved to provide for convenient restriction sites, removal of superfluous DNA, removal of restriction sites. or the like. For this purpose, in vitro mutagenesis, primer repair.
restriction. annealing, resection, ligation. PCR, or the like may be employed.
where insertions, deletions. or substitutions, e.g., transitions and transversions, may be involved.
The present invention also relates to the introduction of the transformation constructs into plant protoplasts, calli, tissues, or organ explants and the regeneration of transformed plants expressing the nematode resistance gene. The compositions of the present invention can be used to transform any plant. In this manner, genetically modified plants, plant cells, plant tissue, seed, and the like can be obtained.
Transformation protocols may vary depending on the type of plant or plant cell. i. e., monocot or dicot. targeted for transformation. Suitable methods of transforming plant cells include microinjection (Crossway et al. (1986) Biotechniques 7:320-334).
electroporation (Riggs et al. (1986) Proc. Natl. Acad. Sci. USA 83:5602-5606):
Agrobacterium-mediated transformation (Hinchee et al. (1988) Biotechnology 6:91 ~-921 ); direct gene transfer (Paszkowski et al. ( 1984) EMBO .I. 3:2717-2722):
and ballistic panicle bombardment (see, for example, Sanford et al., U.S.
Patent 4.945,050; Tomes et al. (1995) in Plant Cell, Tissue and Organ Culture:
Fundamental Methods, ed. Gamborg and Phillips (Springer-Verlag, Berlin): and McCabe et al. (1988) Biotechnology 6:923-926). Also see Weissinger et al.
(1988) Ann. Rev. Genet. 22:421-477; Sanford et al. (1987) Particulate Science and Technology x:27-37 (onion); Christou et al. (1988) Plant Physiol. 87:671-674 (soybean); McCabe et al. (1988) Biotechnolo~v 6:923-926 (soybean); Finer and 62451-860(5) McMullen (1991) In Yitro Cell Deo. Biol. 27P:175-182 (soybean); Singh et al.
(1998) Theor. Appl. Genet. 96:319-324 (soybean); Datta et al. (1990) Biotechnology 8:736-740 (rice); Klein et al. (1988) Proc. Natl. Acad Sci. USA 85:4305-4309 (maize); Klein et al. (1988) Biotechnology 6:559-563 (maize); Klein et al.
(1988) Plant Physiol. 91:440-444 (maize); Fromm et al. ( 1990) Biotechnology 8:833-839;
and 'Tomes et al. ( I 995) in Plant Cell, Tissue, and Organ Culture:
Fundamental Methods, ed. Gamborg and Phillips (Springer-Veilag, Berlin) (maize); Hooydaas-Van Slogteren and Hooykaas (1984) Nature (London) 311:763-764; Bytebier et al.
(1987) Proc. Natl. Acad Sci. USA 84:5345-5349 (Liliaceae); De Wet et al. (1985) in The Experimental Manipulation oJOvule Tissues, (G.H.P. Chapman et al., Longman. NY
eds. pp. I 97-209) ('pollen); Kaeppler et al. ( 1990) Plant Cell Reports 9.:41 ~-418:
Kaeppler et al. ( 1992) Theor. Appl. Genet. 84:560-566 (whisker-mediated transformation); DeHalluin et al. (1992) Plant Cell 4:1495-1505 (electroporation); Li et al. (1993) Plant Cell Reports 1'':250-255, and Christou and Ford (1995)Annals of I 5 Botany 75:407-413 (rice); Osjoda et al. (1996) Nature Biotechnology 14:745-(maize via Agrobacterium tumefaciens).
Plant tissues suitable for transformation include but are not limited to leaf tissues, root tissues, shoots, meristems. and protoplasts. For soybean it is often preferred to utilize explants of cotyledons.
For example, the Agrobacterium tumefaciens strain A208 is known to be highly virulent on soybean and to give rise to a higher rate of transformation. See Byme et al. (1987) Plant Cell Tissue and Organ Culture 8:3-I5. The transformation of soybean protoplasts by co-culturing them with Agrobacterium tumefaciens or Agrobacterium rhizogenes has been known.
See Facciotti et al. (1985) Biotechnology (NeH~ y'ork) 3:241. Tissue explants may be inoculated with the bacterium for transformation. For example, U.S. Patent No.
5.569,834 issued to Hinchee et al. discloses a method for soybean transformation and regeneration by inoculating a cotyledon explant that is tom apart at the cotyledonary node.
Alternatively, plants can also be transformed successfully by the biolistic technique. which involves using high velocity microprojectiles carrying microparticles containing the transformation construct to propel the microparticles into a plant cell, protoplast, or tissue. The high velocity microprojectile penetrates the outer cell surface without destroying the cell and injects the microparticles into the cells. The transformation construct in the microparticles is thereafter released and incorporated into the cell genome. This technique is also known as particle bombardment and is disclosed in U.S. Patent Nos. 4,945.050, 5,036,006, and x.100,792, which are hereby incorporated by reference. The key advantage of this technique is that it works on virtually any plant tissue. An example of successful transformation of soybean using this particle bombardment technique is demonstrated in McCabe et al. (1988) Biotechnology 6:923-926.
In yet another method of transformation, protoplasts are transfected directly with expression vector DNA that contains the nematode-resistance gene by electroporation or DNA-protoplast co-precipitation in accordance with procedures generally known in the art. See Christou et al. (1987) Proc. Natl. Acad. Sci.
USA
84:3962-3966; Lin et al. (1987) Plant Physiol. 84:856-861.
Once the transformation construct containing the isolated DNA sequences of this invention has been delivered, protoplasts, cells, or tissues expressing the protein encoded by the isolated nematode resistance gene are selected. Selection can be based on the selectable marker that is incorporated in the transformation construct or by culturing the protoplasts, cells, or tissues in media containing one of the antibiotics or herbicides. Alternatively, nematode-resistance may be directly selected by inoculating nematodes into the transformed protoplasts, cells, or tissues.
Both methods of selection are generally known in the art.
A further aspect of the present invention relates to the regeneration of transgenic plants that express nematode resistance genes of the invention. The cells that have been transformed and selected for expression of the sequence of this invention may be grown into plants in accordance with conventional ways. See.
for example, McCormick et al. (1986) Plant Cell Reports 5:81-84. These plants may then be grown. and either pollinated with the same transformed strain or different strains, the resulting hybrid having the desired genetic traits necessary for nematode-resistance.
For example. in soybean, transgenic soybean regeneration has been successful 62451-860(S) from tissues such as nodal axillary buds transformed with elec~roporation-mediated gene transfer technique (Chowrira et al. (1996) Mol. Biotechnol. 5:85-96);
somatic embryos transformed using microprojectile bombardment (Stewart et al. (1996) Plafzt Physiol. 112:121-129); and cotyledon explants that are tom apart at the cotyledonary node and are uansformed by Agrobacterium inoculation (LJ.S. Patent No.
5,569.834 issued to Hinchee et al.). Other methods for regenerating soybean plants are disclosed in U.S. Patent No. 4,684,612 issued to Hemphill et al., and U.S.
Patent No.
4,992.375 issued to Wright.
The sequences of the invention are generally introduced into plants wherein the plant in its native state does not contain the DNA sequences. However. it is recognized that in some plants the gene may occur but does not confer resistance because of aberrant expression, a mutation in the sequence, a nonfunctional protein.
and the like. It will be beneficial to transform such plants with the sequences of the invention.
Using cells and tissues of the present invention that are resistant to nematodes helps to obviate the problem of nematode infection of the host cells and tissues in the culture. In addition, the cells and tissues according to the present invention can also be valuable in the elucidation of the mechanism underlying the plant resistance to pathogens. Such plants include maize, oats, wheat. rice, barley. sorghum, alfalfa.
tobacco, cotton. sugar beet, sunflower, carrot, canola, tomato, potato, oilseed rape.
cabbage, pepper. lettuce, brassicas. tobacco, and soybean.
It is recognized that resistance to nematodes may be multigenic and quantitative in certain plants. Thus, the sequences disclosed herein may be useful alone or in combination with other sequences. Breeding programs have produced mane genotypes that have varying numbers of the genes responsible for nematode resistance.
Thus, the isolated DNA sequences of the invention are preferably used to transform plants expressing one or more other nematode resistance genes. Such plants may be naturally occurring, produced by breeding programs, or produced b~~
transformation with other nematode resistance genes. The result of the transformation with the isolated nematode resistance gene of this invention improves the plants capacity for nematode resistance.
Cotransformation may be conducted to introduce the DNA sequences of this invention into plants together with one or more other nematode resistance genes. In the transformation construct, the other known nematode resistance genes may be contained on the same plasmid as the DNA sequence of this invention or may be contained on a separate plasmid or DNA molecule. The methods for making transformation constructs having the other known nematode resistance gene with or without a DNA sequence isolated in this invention are similar to the methods described above and should be apparent to a person skilled in the art.
Several methods of cotransformation of plants have been developed.
Cotransformation is easily accomplished by DNA mediated processes, such as the co-precipitation method, biolistic method. and electroporation. Each of these methods is adequately suited for the introduction of the DNA sequences of this invention and other nematode resistance genes, on the same or separate plasmids, into the plant cells. Alternatively, Agrobacterium tumefaciens-mediated cotransformation techniques can be employed. Examples of such techniques can be found in, for example, Depicker et al. (1985) Mol. Gen. Genet. 201:477-484; McKnight et al.
(1987) Plant Mol. Biol. 8:439-445; De Block et al. (1991) Theor. Appl. Genet.
8?:257-263; de Framond et al. (1986) Mol. Gerz Genet. 202:125-131; and Komari et al. ( 1996) The Plant Journal 10: I 65-174. In an alternative method, multiple transgenes may be brought together by breeding of separately transformed parent plants.
The following examples are offered by way of illustration and not by wav of limitation.
EXAMPLES
Example 1: Incorporation of DNA Sequences Conferring Nematode Resistance into Expression Vectors Genomic DNA sequences spanning the full length coding regions of gene fragments conferring nematode-resistance to maize and soybean were isolated and cloned. These sequences are set forth in SEQ ID NOs: 1 and 3 (maize) and 5 and (soybean). Plasmids containing these sequences have been deposited with American Type Culture Collection (ATCC) on October 1 S, 1997, and on February 4, 1998 and are assigned Accession Numbers 209366, 209365, 209614, 209363, and 209364.
Gene fragments are cloned into a plasmid vector, such as that shown in Figure 6, in the sense orientation so that they are under the transcriptional control of a constitutive promoter. The transformation construct is then available for introduction into soybean cells by bombardment methods as described in Example 2.
Example 2: Transformation of Soybean Cells and Regeneration of Transgenic Plants Having Improved Nematode Resistance Initiation and Maicitenance of EmbryoQenic Suspension Cultures Embryogenic suspension cultures of soybean (Glycine max Merrill) are initiated and maintained in a 10A40N medium supplemented with 5 mM asparagine as described previously (Finer and Nagasowa (1988) Plant Cell Tissue Org.
Culi.
15:125-136). For subculture, two clumps of embryogenic tissue, 4 mm in diameter, are transferred to 35 ml of 10A40N medium in a 125-ml delong flask. High quality embryogenic material is selectively subcultured monthly at this low inoculum density.
Preparation of DNA and Tungsten Pellets Plasmid DNA from Example 1 is precipitated onto 1.1 ~tm (average diameter) tungsten pellets using a CaCl2 precipitation procedure (Finer and McMullen (1990) Plant Cell Rep. 8:586-589). The pellet mixture containing the precipitated DNA
is gently resuspended after precipitation, and 2 ~1 is removed for bombardment.
errneTr~rr~ cmr~r~r .v WO 99/60141 PCT/US98l27456 Preparation of Plant Tissue for Bombardment Approximately 1 g of embryogenic suspension culture tissue (taken 3 weeks after subculture) is transferred to a 3.5-cm-diameter petri dish. The tissue is centered in the dish, the excess liquid is removed with a pipette, and a sterile 500 pm pore size nylon screen (Tetko Inc., Elmsford, New York) is placed over the embryonic tissue.
Open petri dishes are placed in a laminar-flow hood for 10 to 15 minutes to evaporate residual liquid medium from the tissue. The 3.5-cm-diameter petri dish is placed in the center of a 9-cm-diameter petri dish immediately before bombardment.
Bombardments are performed using a DuPont Biolistics Particle Delivery System (model BPG). Each sample of embryogenic soybean tissue is bombarded once.
Selection for Transgenic Clones Bombarded tissues are resuspended in the 1 OA40N maintenance medium.
One to two weeks after bombardment the clumps of embryogenic tissue are resuspended in fresh 10A40N medium containing a selection agent, such as kanomycin or hygromycin. The selection agent is filter-sterilized before addition to liquid media. The medium containing a selection agent is replaced with fresh antibiotic-containing medium weekly for 3 additional weeks.
Six to eight weeks after the initial bombardment, brown clumps of tissue that contain yellow-green lobes of embryogenic tissue are removed and separately subcultured in 10A40N medium containing selection agent. After 3 to 4 months of maintenance in this medium, proliferating embryogenic tissues are maintained by standard subculture in 10A40N without added antibiotic. Embryogenic tissues are periodically removed from I OA40N medium containing selection agent and 10A40N
for embryo development and Southern hybridization analyses.
Embrvo Development and Germination For embryo development. clumps of kanamycin-resistant embryogenic tissues are placed at 23°C on the embryo development medium, which contains MS
salts (Murashige and Skoog (1962) Physiol. Plant 15:474-497), B5 vitamins (Gamborg et al. (1968) Exp. Cell. Res. 50:151-158). 6% maltose, and 0.2% gelrite (pH 5.7).
One WO 99/60141 PCT/US98I2745b month after plating, the developing embryos are cultured as individual embryos, 25 per 9-cm-diameter petri dish in fresh embryo development medium. After an additional 4 weeks, the mature embryos are placed in dry petri dishes for 2 to 3 days.
After the desiccation treatment, the embryos are transferred to a medium containing MS salts, BS vitamins, 3% sucrose, and 0.2% Gelrite (pH 5.7). After root and shoot elongation, plantlets are transferred to pots containing a 1:1:1 mixture of vermiculite, topsoil. and peat, and maintained under high humidity. Plantlets are gradually exposed to ambient humidity over a 2-week period and placed in the greenhouse.
where they are grown to maturity and monitored for expression of the nematode resistance gene.
DNA Extraction and~Southern Hybridization Analysis DNA is extracted from embryogenic tissue and leaves using the CTAB
procedure (Saghai-Maroof et al. (1984) Proc. Natl. Acad. Sci. USA 81:8014-8018).
Digested DNAs are electrophoresed on a 0.8% agarose gel. The DNA in the gels is 1 ~ treated with 0.2 N HCI, twice for 1 ~ minutes, followed with 0.5 M
NaOH/0.1 M 1.~
M NaCI, twice for 30 minutes, and finally 1 M NHaCzH302/0.1 M NaOH, for 40 minutes. The DNA is transferred (Vollrath et al. ( 1988) Proc. Natl. Acad.
Sci. USA
8:6027-6031 ) to nylon membranes (Zetaprobe-BioRad, Richmond, California) overnight by capillary transfer using l M NH4C~H30z/0.1 M NaOH. The membranes are baked at 80°C for 2 hours under vacuum and then prehybridized for 4 to 6 hours at 6~°C in SO mM Tris pH 8.0, Sx standard saline citrate (SSC). 2x Denhardt's. 10 mM
Na~EDTA, 0.2% sodium dodecyl sulfate (SDS). and 62.~ ~tg/ml salmon sperm DNA.
All publications and patent applications mentioned in the specification are indicative of the level of those skilled in the art to which this invention pertains. All publications and patent applications are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.
Although the foregoing invention has been described in some detail by way of illusuation and example for purposes of clarity of understanding. it will be obvious that certain changes and modifications may be practiced within the scope of the appended claims.
SEQUENCE LISTING
<110> Jessen, Holly J.
Meyer, Terry E.
<120> Genes and Methods for Control of Nematodes in Plants <130> 5718-18-1, 035718/171690 <140> PCT/US/98/27456 <141> 1998-12-23 <160> 10 <170> PatentIn Ver. 2.0 < 210 > 1 <211> 1347 <212> DNA
<213> Zea mays <220>
2 0 <221> CDS
<222> (146)..(991) <400> 1 ccacgcgtcc gcggacgcgt gggtgcccgg gagcgccgcc gcggtcgtgt gccaggtcag 60 cgaggccagc ctgctcccgc gcctcgccgc gtgggacaag tccgagacgc tcgcggccaa 120 gatcatgtac gccatcgaga gccag atg cag ggc tgc gcc ttc acg ctc gga 172 Met Gln Gly Cys Ala Phe Thr Leu Gly ctc ggc gag ccc aac ctc gcc ggc aag ccc gtg ctc gag tac gac cgc 220 3 0 Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro Val Leu Glu Tyr Asp Arg gtc gtg cgc ccg cac gag ctg cac gcg ctc aag ccc aag cca gcg ccg 268 Val Val Arg Pro His Glu Leu His Ala Leu Lys Pro Lys Pro Ala Pro gag ccc aag tct ggg tac ctc aac agg gag aac gag acg ctg ttc acc 316 Glu Pro Lys Ser Gly Tyr Leu Asn Arg Glu Asn Glu Thr Leu Phe Thr atg tac cag ata ctc gaa tcg tgg ctg cgc gcc gcg tcg caa ctc ctc 364 Met Tyr Gln Ile Leu Glu Ser Trp Leu Arg Ala Ala Ser Gln Leu Leu gcc cgc ctc aac gaa cgg atc gaa gcc aag aac tgg gaa gcg gcg get 412 Ala Arg Leu Asn Glu Arg Ile Glu Ala Lys Asn Trp Glu Ala Ala Ala gcc gac tgc tgg atc ctg gag cgc gtg tgg aag ctg ctc gcc gac gtc 460 Ala Asp Cys Trp Ile Leu Glu Arg Val Trp Lys Leu Leu Ala Asp Val gag gac ctc cac ctg ctg atg gac ccg gac gac ttc ctg cgg ctc aag 508 Glu Asp Leu His Leu Leu Met Asp Pro Asp Asp Phe Leu Arg Leu Lys ggc cag ctc get gta cga gcg get cca tgg tct gac gcg tcg ttc tgt 556 Gly Gln Leu Ala Val Arg Ala Ala Pro Trp Ser Asp Ala Ser Phe Cys ttc cgg tcc agg gcg ctc ctg cac gtc get aac acc act agg gac ctc 604 Phe Arg Ser Arg Ala Leu Leu His Val Ala Asn Thr Thr Arg Asp Leu aag aag cgt gtg ccc tgg gtg ctc ggt gtc gag gtg gac ccc aac ggc 652 Lys Lys Arg Val Pro Trp Val Leu Gly Val Glu Val Asp Pro Asn Gly ggc ccg cgg gtg cag gag gca gcc atg atg ctg tac cac agc cgt agg 700 Gly Pro Arg Val Gln Glu Ala Ala Met Met Leu Tyr His Ser Arg Arg cgc ggc gag ggc gag gag gcg ggc aag gtg gag ctg ctc cag gcc ttc 748 Arg Gly Glu Gly Glu Glu Ala Gly Lys Val Glu Leu Leu Gln Ala Phe caa gca gtg gag gtg gcc gtg aga gga ttc ttc ttc gcg tac cgg cag 796 2 0 Gln Ala Val Glu Val Ala Val Arg Gly Phe Phe Phe Ala Tyr Arg Gln 205 , 210 _ 215 ctc gtg gcg gcg gtg atg ggc acg gcg gag gcg ttg ggc aac cgg gcg 844 Leu Val Ala Ala Val Met Gly Thr Ala Glu Ala Leu Gly Asn Arg Ala ctg ttc gtg ccg gcg gag ggg atg gat cca ttg gcc cag atg ttc ctc 892 Leu Phe Val Pro Ala Glu Gly Met Asp Pro Leu Ala Gln Met Phe Leu gag cca ccc tac tac ccc agc ctg gat gcc gcc aag acg ttc cta gcg 940 Glu Pro Pro Tyr Tyr Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Ala gat tac tgg gtt cag cag atg gcg ggg gcc tct get ccg tca ata caa 988 Asp Tyr Trp Val Gln Gln Met Ala Gly Ala Ser Ala Pro Ser Ile Gln agc tgaaacggcg aaatggcgcg gctggatagc gaccgaatcg cgcagttttg 1041 Ser cagcctgaag atactatgta tgcatgcatc gtaatttcgc tgtggccttg tggtgataga 1101 gtgattcatt tctatagcga tcctgtacta gtgtagtaca tgtagcacta aattgtctta 1161 ttatcgttgt gcttgtgcac tgcgttgtgt tgtgttctac atagagattg attcagttag 1221 atgccatttg tcactctagg caagtgtttc aattgggcac cgtgtatata tagaactttt 1281 gtaaacactg gtagatggat tcatcaatta cagaatgttg atgttgacaa aaaaaaaaaa 1341 aaaaaa 1347 <210> 2 5 0 <211> 282 <212> PRT
<213> Zea mays <400> 2 Met Gln Gly Cys Ala Phe Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro Val Leu Glu Tyr Asp Arg Val Val Arg Pro His Glu Leu 60 His Ala Leu Lys Pro Lys Pro Ala Pro Glu Pro Lys Ser Gly Tyr Leu Asn Arg Glu Asn Glu Thr Leu Phe Thr Met Tyr Gln Ile Leu Glu Ser Trp Leu Arg Ala Ala Ser Gln Leu Leu Ala Arg Leu Asn Glu Arg Ile Glu Ala Lys Asn Trp Glu Ala Ala Ala Ala Asp Cys Trp Ile Leu Glu Arg Val Trp Lys Leu Leu Ala Asp Val Glu Asp Leu His Leu Leu Met Asp Pro Asp Asp Phe Leu Arg Leu Lys Gly Gln Leu Ala Val Arg Ala Ala Pro Trp Ser Asp Ala Ser Phe Cys Phe Arg Ser Arg Ala Leu Leu His Val Ala Asn Thr Thr Arg Asp Leu Lys Lys Arg Val Pro Trp Val Leu Gly Val Glu Val Asp Pro Asn Gly Gly Pro Arg Val Gln Glu Ala 165 ~ 170 175 Ala Met Met Leu Tyr His Ser Arg Arg Arg Gly Glu Gly Glu Glu Ala 3 0 Gly Lys Val Glu Leu Leu Gln Ala Phe Gln Ala Val Glu Val Ala Val Arg Gly Phe Phe Phe Ala Tyr Arg Gln Leu Val Ala Ala Val Met Gly Thr Ala Glu Ala Leu Gly Asn Arg Ala Leu Phe Val Pro Ala Glu Gly Met Asp Pro Leu Ala Gln Met Phe Leu Glu Pro Pro Tyr Tyr Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Ala Asp Tyr Trp Val Gln Gln Met Ala Gly Ala Ser Ala Pro Ser Ile Gln Ser <210> 3 <211> 1325 50 <212> DNA
<213> Zea mays <220>
<221> CDS
<222> (126)..(980) <400> 3 ccacgcgtcc gagcgccgcc gcggtcgtgt gccgggccag caaggccagc ctgctcccgc 60 gcctcgccgc gtgggagaag tctgaggcgc tcgcggccag gatcacgtac gccgtcgagg 120 gccag atg cag ggc tgc gcc tcc acg ctc ggc ctc ggc gag ccc aac ctc 170 Met Gln Gly Cys Ala Ser Thr Leu Gly Leu Gly Glu Pro Asn Leu gccggcaagccc gtgctcgag tacgaccgc gtcgtgcgc ccgcacgag 218 AlaGlyLysPro ValLeuGlu TyrAspArg ValValArg ProHisGlu ctgcacgcgctg aagcccgac cctgcgccg gagcccatg tccggctac 266 LeuHisAlaLeu LysProAsp ProAlaPro GluProMet SerGlyTyr cgcaaccgggag ctcgagact ctgttcacc atgtaccag atactcgag 314 ArgAsnArgGlu LeuGluThr LeuPheThr MetTyrGln IleLeuGlu tcctggctccgc gtcgcgtcg cagctgctc acccgcctc gacgagcgg 362 SerTrpLeuArg ValAlaSer GlnLeuLeu ThrArgLeu AspGluArg atc gaa gac aag tgc tgg gag gcg gcg gcc ggc gac tgc tgg atc ctg 410 2 0 Ile Glu Asp Lys Cys Trp Glu Ala Ala Ala Gly Asp Cys Trp Ile Leu 80 ~ 85 90 95 gag cgc gtg tgg aag ctg ctc gcg gac gtc gag gac ctc cac ctg ctg 458 Glu Arg Val Trp Lys Leu Leu Ala Asp Val Glu Asp Leu His Leu Leu atg gac ccg gac gag ttc cta cgg ctc aag agc cag ctc gcc gta cga 506 Met Asp Pro Asp Glu Phe Leu Arg Leu Lys Ser Gln Leu Ala Val Arg gcg gcg ccg ggg tct gag tcc gcg tcc ttc tgt ttc cgg tcc acg gcg 554 Ala Ala Pro Gly Ser Glu Ser Ala Ser Phe Cys Phe Arg Ser Thr Ala ctc ctg cac gtc get agc gcc act agg gac ctc aag aag cgt gtg ccc 602 Leu Leu His Val Ala Ser Ala Thr Arg Asp Leu Lys Lys Arg Val Pro tgg gtg ctc ggt gtc gag gcg gac ccc agc ggc ggc cca cgg gtg cag 650 4 0 Trp Val Leu Gly Val Glu Ala Asp Pro Ser Gly Gly Pro Arg Val Gln gag gcg gcc atg aag ctg tac cac agc cgt agg cgc ggt gag ggc gag 698 Glu Ala Ala Met Lys Leu Tyr His Ser Arg Arg Arg Gly Glu Gly Glu gag gca ggc aag gtg gac ctg ctc cag gcc ttc cag gcg gtg gag gtg 746 Glu Ala Gly Lys Val Asp Leu Leu Gln Ala Phe Gln Ala Val Glu Val gcc gtg aga gca ttc ttc ttc ggg tac cgg cag ctg gtg gcg gcg gtc 794 Ala Val Arg Ala Phe Phe Phe Gly Tyr Arg Gln Leu Val Ala Ala Val atg ggc acg gcg gag gcg tcg ggc aac cgg gcg ctg ttc gtg ccg gcg 842 Met Gly Thr Ala Glu Ala Ser Gly Asn Arg Ala Leu Phe Val Pro Ala gag gag atg gat ccg ctc gcc caa atg ttc ctg gag ccg cca tac tac 890 Glu Glu Met Asp Pro Leu Ala Gln Met Phe Leu Glu Pro Pro Tyr Tyr cct agc ctg gac gcc gcc aag acg ttt cta gcg gat tac tgg gtt cag 938 Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Ala Asp Tyr Trp Val Gln ctt cag cag atg gcg gag gcc tct get ccg tca aga caa agc 980 Leu Gln Gln Met Ala Glu Ala Ser Ala Pro Ser Arg Gln Ser tgaaacggcg aaatggcacg gctgagccac cgaatcgcgc agttttgcag gactgaagat 1040 actatgcatg catttcgttg gggccttttg cccttgtggt gaatggtgat agagtgattc 1100 atttctatag cgatcatgta ctattgcagt acatgtcgca ctagaatact agattctctt 1160 actatcgttg tgcactgcgt tgtacgtgtt gtgttctacg tagatataga ttgattcagt 1220 tagatgtcat ttgtattgcc aagtaggtca attggatatg gaacttttgt aaataccgaa 1280 atactgttgt tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 1325 <210> 4 <211> 285 2 0 <212> PRT
<213> Zea mat's - -<400> 4 Met Gln Gly Cys Ala Ser Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro Val Leu Glu Tyr Asp Arg Val Val Arg Pro His Glu Leu His Ala Leu Lys Pro Asp Pro Ala Pro Glu Pro Met Ser Gly Tyr Arg Asn Arg Glu Leu Glu Thr Leu Phe Thr Met Tyr Gln Ile Leu Glu Ser Trp Leu Arg Val Ala Ser Gln Leu Leu Thr Arg Leu Asp Glu Arg Ile Glu Asp Lys Cys Trp Glu Ala Ala Ala Gly Asp Cys Trp Ile Leu Glu Arg Val Trp Lys Leu heu Ala Asp Val Glu Asp Leu His Leu Leu Met Asp Pro Asp Glu Phe Leu Arg Leu Lys Ser Gln Leu Ala Val Arg Ala Ala Pro Gly Ser Glu Ser Ala Ser Phe Cys Phe Arg Ser Thr Ala Leu 50 Leu His Val Ala Ser Ala Thr Arg Asp Leu Lys Lys Arg Val Pro Trp Val Leu Gly Val Glu Ala Asp Pro Ser Gly Gly Pro Arg Val Gln Glu Ala Ala Met Lys Leu Tyr His Ser Arg Arg Arg Gly Glu Gly Glu Glu Ala Gly Lys Val Asp Leu Leu Gln Ala Phe Gln Ala Val Glu Val Ala Val Arg Ala Phe Phe Phe Gly Tyr Arg Gln Leu Val Ala Ala Val Met Gly Thr Ala Glu Ala Ser Gly Asn Arg Ala Leu Phe Val Pro Ala Glu Glu Met Asp Pro Leu Ala Gln Met Phe Leu Glu Pro Pro Tyr Tyr Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Ala Asp Tyr Trp Val Gln Leu Gln Gln Met Ala Glu Ala Ser Ala Pro Ser Arg Gln Ser <210> 5 <211> 1498 2 0 <212> DNA
<213> Glycine max <220>
<221> CDS
<222> (69)..(1433) <220>
<223> Immediate source: Clone- P12568 <400> 5 cgacaccaat ttctccatcc tctcattgaa aaacaaaatt aatcatctta cttatttatt 60 ctccgaaa atg gtt gat tta cat tgg aaa tca aag atg cca agt tcc gac 110 3 0 Met Val Asp Leu His Trp Lys Ser Lys Met Pro Ser Ser Asp atg cct tcc aaa act cta aaa ctc tct ctc tcc gac aac aag tcc tta 158 Met Pro Ser Lys Thr Leu Lys Leu Ser Leu Ser Asp Asn Lys Ser Leu ccc tct ttg caa cta ccc ttc cgc acc aca gat atc tct cac gcc gca 206 Pro Ser Leu Gln Leu Pro Phe Arg Thr Thr Asp Ile Ser His Ala Ala cct tct gtt tgc gcc act tac gac tac tat ctc cgt ctt cct caa ctc 254 Pro Ser Val Cys Ala Thr Tyr Asp Tyr Tyr Leu Arg Leu Pro Gln Leu aga aag ctt tgg aac tcc tca gat ttt cct aat tgg aac aac gaa cca 302 Arg Lys Leu Trp Asn Ser Ser Asp Phe Pro Asn Trp Asn Asn Glu Pro atc tta aaa cct atc ttg caa get ctc gaa atc acc ttc cgc ttt ctc 350 50 Ile Leu Lys Pro Ile Leu Gln Ala Leu Glu Ile Thr Phe Arg Phe Leu tcc att gtt ctc tcc gat cca aga cct tac tcc aac cac aga gaa tgg 398 Ser Ile Val Leu Ser Asp Pro Arg Pro Tyr Ser Asn His Arg Glu Trp act cgc agg ata gag tct ctt atc aca cat caa att gaa atc att gcc 446 Thr Arg Arg Ile Glu Ser Leu Ile Thr His Gln Ile Glu Ile Ile Ala atactttgtgaa gatgag gaacaaaat tccgacacacgt ggcact gca 494 IleLeuCysGlu AspGlu GluGlnAsn SerAspThrArg GlyThr Ala ccaaccgetgat ctcagc aggaacaat agcagcgagagc agaagc tac 542 ProThrAlaAsp LeuSer ArgAsnAsn SerSerGluSer ArgSer Tyr agcgaggcaagc ctgctt ccgcggctt gccacgtggtac aaatcc aag 590 SerGluAlaSer LeuLeu ProArgLeu AlaThrTrpTyr LysSer Lys gacgtagcgcag aggatc cttctctca gttgaatgccaa atgagg agg 638 AspValAlaGln ArgIle LeuLeuSer ValGluCysGln MetArg Arg tgt tcc tac acg ctg ggt ttg ggt gag ccg aac cta gcg ggc aaa ccg 686 2 0 Cys Ser Tyr Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro 195 . 200 . 205 agc ctg ctc tac gac ctc gtg tgt aag ccg aac gag atc cac gcg ctg 734 Ser Leu Leu Tyr Asp Leu Val Cys Lys Pro Asn Glu Ile His Ala Leu aag acg acg ccg tac gat gag cgc gta gag aat cac gag aac cac gcg 782 Lys Thr Thr Pro Tyr Asp Glu Arg Val Glu Asn His Glu Asn His Ala ttg cac gcg acg cac cag atc gcc gag tcg tgg atc cac gcg tcg cgg 830 Leu His Ala Thr His Gln Ile Ala Glu Ser Trp Ile His Ala Ser Arg aag gtt cta gag agg atc gca gac gcg gtg ctc tcc aga acc ttc gag 878 Lys Val Leu Glu Arg Ile Ala Asp Ala Val Leu Ser Arg Thr Phe Glu aag gcg get gag gac tgc tac gcc gtg gaa agg atc tgg aag ctt ctc 926 4 0 Lys Ala Ala Glu Asp Cys Tyr Ala Val Glu Arg Ile Trp Lys Leu Leu gcg gag gtg gag gac ctc cac ctg atg atg gat ccg gac gat ttc ttg 974 Ala Glu Val Glu Asp Leu His Leu Met Met Asp Pro Asp Asp Phe Leu aga ctg aag aat cag ctc tcg gtg aaa tcc tcc ggc ggc gaa acg get 1022 Arg Leu Lys Asn Gln Leu Ser Val Lys Ser Ser Gly Gly Glu Thr Ala tcg ttc tgc ttc agg tcg aag gag ttg gtt gaa ctg acg aag atg tgc 1070 Ser Phe Cys Phe Arg Ser Lys Glu Leu Val Glu Leu Thr Lys Met Cys aga gat ctg agg cac aag gtg ccg gag ata ttg gag gtg gag gtg gat 1118 Arg Asp Leu Arg His Lys Val Pro Glu Ile Leu Glu Val Glu Val Asp ccg aag gga gga ccg agg att caa gag gcg gcg atg aag ctc tac gtt 1166 Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala Ala Met Lys Leu Tyr Val tcg aag agc gcg ttc gag aag gtt cac ttg ttg cag gcg atg cag gcg 1214 Ser Lys Ser Ala Phe Glu Lys Val His Leu Leu Gln Ala Met Gln Ala att gag gcg gcg atg aag aga ttc ttc tac gcg tat aag cag gtg ttg 1262 Ile Glu Ala Ala Met Lys Arg Phe Phe Tyr Ala Tyr Lys Gln Val Leu gcg gtg gtg atg gga agc tcc gag get aac ggt aac cga gtt ggg ttg 1310 Ala Val Val Met Gly Ser Ser Glu Ala Asn Gly Asn Arg Val Gly Leu agt tgc gac tcg get gac tcg ttg act cag att ttc ctt gaa ccg acg 1358 Ser Cys Asp Ser Ala Asp Ser Leu Thr Gln Ile Phe Leu Glu Pro Thr tat ttt cca agc ttg gat gcc gcc aag act ttt ctt gga tac ttg tgg 1406 2 0 Tyr Phe Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Gly Tyr Leu Trp 435 440 ~ 445 gat aat aac gat aat aac aaa tgg ata tgataaggga aaaaaaaaaa 1453 Asp Asn Asn Asp Asn Asn Lys Trp Ile acggcacaaa aacgatggcc aaagtgagat tttcggtttg ggcac 1498 <210> 6 3 0 <211> 455 <212> PRT
<213> Glycine max <400> 6 Met Val Asp Leu His Trp Lys Ser Lys Met Pro Ser Ser Asp Met Pro Ser Lys Thr Leu Lys Leu Ser Leu Ser Asp Asn Lys Ser Leu Pro Ser 4 0 Leu Gln Leu Pro Phe Arg Thr Thr Asp Ile Ser His Ala Ala Pro Ser Val Cys Ala Thr Tyr Asp Tyr Tyr Leu Arg Leu Pro Gln Leu Arg Lys Leu Trp Asn Ser Ser Asp Phe Pro Asn Trp Asn Asn Glu Pro Ile Leu Lys Pro Ile Leu Gln Ala Leu Glu Ile Thr Phe Arg Phe Leu Ser Ile Val Leu Ser Asp Pro Arg Pro Tyr Ser Asn His Arg Glu Trp Thr Arg Arg Ile Glu Ser Leu Ile Thr His Gln Ile Glu Ile Ile Ala Ile Leu Cys Glu Asp Glu Glu Gln Asn Ser Asp Thr Arg Gly Thr Ala Pro Thr Ala Asp Leu Ser Arg Asn Asn Ser Ser Glu Ser Arg Ser Tyr Ser Glu x ,. _M
r Ala Ser Leu Leu Pro Arg Leu Ala Thr Trp Tyr Lys Ser Lys Asp Val Ala Gln Arg Ile Leu Leu Ser Val Glu Cys Gln Met Arg Arg Cys Ser Tyr Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro Ser Leu Leu Tyr Asp Leu Val Cys Lys Pro Asn Glu Ile His Ala Leu Lys Thr Thr Pro Tyr Asp Glu Arg Val Glu Asn His Glu Asn His Ala Leu His Ala Thr His Gln Ile Ala Glu Ser Trp Ile His Ala Ser Arg Lys Val Leu Glu Arg I1e Ala Asp Ala Val Leu Ser Arg Thr Phe Glu Lys Ala Ala Glu Asp Cys Tyr Ala Val Glu Arg Ile Trp Lys Leu Leu Ala Glu 275 . 280 285 Val Glu Asp Leu His Leu Met Met Asp Pro Asp Asp Phe Leu Arg Leu 3 0 Lys Asn Gln,Leu Ser Val Lys Ser Ser Gly Gly Glu Thr Ala Ser Phe Cys Phe Arg Ser Lys Glu Leu Val Glu Leu Thr Lys Met Cys Arg Asp Leu Arg His Lys Val Pro Glu Ile Leu Glu Val Glu Val Asp Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala Ala Met Lys Leu Tyr Val Ser Lys Ser Ala Phe Glu Lys Val His Leu Leu Gln Ala Met Gln Ala Ile Glu Ala Ala Met Lys Arg Phe Phe Tyr Ala Tyr Lys Gln Val Leu Ala Val Val Met Gly Ser Ser Glu Ala Asn Gly Asn Arg Val Gly Leu Ser Cys Asp Ser Ala Asp Ser Leu Thr Gln Ile Phe Leu Glu Pro Thr Tyr Phe Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Gly Tyr Leu Trp Asp Asn Asn Asp Asn Asn Lys Trp Ile <210> 7 <211> 1418 <212> DNA
<213> Glycine max <220>
<221> CDS
<222> (46)..(1398) <400> 7 caccaaacaa aaaaatcaat cattttattt tatttttcta cgaaa atg gtt gat tta 57 Met Val Asp Leu cat tgg aaa tca aag atg cct agt tcc aaa aca cca aaa ctc tct ctc 105 His Trp Lys Ser Lys Met Pro Ser Ser Lys Thr Pro Lys Leu Ser Leu tcc gac aac aag tcc tta ccc tct ttg caa cta ccc ttc cgc acc aca 153 Ser Asp Asn Lys Ser Leu Pro Ser Leu Gln Leu Pro Phe Arg Thr Thr 2 0 gat atc tct ccc gcc get cct tcc gtt tgc gcc get tac gac tac tat 201 Asp Ile Ser Pro Ala Ala Pro Ser Val Cys Ala Ala Tyr Asp Tyr Tyr ctc cgt ctt cct caa ctc aga aag ctt tgg aac tcc act gat ttt cct 249 Leu Arg Leu Pro Gln Leu Arg Lys Leu Trp Asn Ser Thr Asp Phe Pro aat tgg aac aac gaa ccg att cta aaa cca att ttg caa get ctc gaa 297 Asn Trp Asn Asn Glu Pro Ile Leu Lys Pro Ile Leu Gln Ala Leu Glu ata acg ttc cgc ttt ctt tcc att gtt ctc tcc gat ccc aga cct tac 345 Ile Thr Phe Arg Phe Leu Ser Ile Val Leu Ser Asp Pro Arg Pro Tyr tcc aac cac aga gaa tgg act cgc cgg ata gag tct ctc atc atg cat 393 Ser Asn His Arg Glu Trp Thr Arg Arg Ile Glu Ser Leu Ile Met His 4 0 caa att gaa atc att gcc ata ctt tgt gaa gaa gag gaa caa aat tcc 441 Gln Ile Glu Ile Ile Ala Ile Leu Cys Glu Glu Glu Glu Gln Asn Ser gac aca cgt ggc act gca cca acc get gat ctc agc agc agc aat agc 489 Asp Thr Arg Gly Thr Ala Pro Thr Ala Asp Leu Ser Ser Ser Asn Ser agc gtg agc aga agc tac agc gag gcg agc ctg ctt cct cgg ctt gcc 537 Ser Val Ser Arg Ser Tyr Ser Glu Ala Ser Leu Leu Pro Arg Leu Ala acg tgg tac aaa tcc agg gac gtg gcg cag agg atc ctt ctc tcc gtg 585 Thr Trp Tyr Lys Ser Arg Asp Val Ala Gln Arg Ile Leu Leu Ser Val gaa tgc caa atg agg agg tgc tcc tac acg ctt ggt ttg ggc gag ccg 633 Glu Cys Gln Met Arg Arg Cys Ser Tyr Thr Leu Gly Leu Gly Glu Pro 60 aac cta gcg ggg aag ccg agc ctg ctc tac gac ctc gtg tgc aag ccg 681 Asn Leu Ala Gly Lys Pro Ser Leu Leu Tyr Asp Leu Val Cys Lys Pro aatgagatc cacgcgctg aagacg acgccgtacgac gagcgcgtg gag 729 AsnGluIle HisAlaLeu LysThr ThrProTyrAsp GluArgVal Glu aaccacgag aaccacgcg gtgcac gccacgcaccag atcgcggag tcg 777 AsnHisGlu AsnHisAla ValHis AlaThrHisGln IleAlaGlu Ser tggattcac gcgtcgcgg aaggtt ctggagagaatc gcggacgcg gtg 825 TrpIleHis AlaSerArg LysVal LeuGluArgIle AlaAspAla Val ctctccaga accttcctg aaagca gcagaggactgc tacgccgtg gag 873 LeuSerArg ThrPheLeu LysAla AlaGluAspCys TyrAlaVal Glu agg atc tgg aag ctt ctc gcg gag gtg gag gac ctc cac ctg atg atg 921 2 0 Arg Ile Trp Lys Leu Leu Ala Glu Val Glu Asp Leu His Leu Met Met 280 285 . 290 gat ccg gac gat ttc ttg agg cta aag aat caa ctc tcg gtg aaa tcc 969 Asp Pro Asp Asp Phe Leu Arg Leu Lys Asn Gln Leu Ser Val Lys Ser tcg agc ggc gaa acg gca tcg ttc tgc ttc aga tcg aat gag tta gtg 1017 Ser Ser Gly Glu Thr Ala Ser Phe Cys Phe Arg Ser Asn Glu Leu Val gaa ctg acg aag atg tgc aga gat ctg agg cac aag gtg ccg gag ata 1065 Glu Leu Thr Lys Met Cys Arg Asp Leu Arg His Lys Val Pro Glu Ile ttg gag gtg gag gtg gat ccg aag gga gga ccg agg att caa gag gcg 1113 Leu Glu Val Glu Val Asp Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala gcg atg aag ctc tac gtt tcg aag agc gag ttc gag aag gtt cac ttg 1161 4 0 Ala Met Lys Leu Tyr Val Ser Lys Ser Glu Phe Glu Lys Val His Leu ttg cag gcg atg cag gcg att gag gcg gcg atg aag aga ttc ttc tac 1209 Leu Gln Ala Met Gln Ala Ile Glu Ala Ala Met Lys Arg Phe Phe Tyr gcg tat aag cag gtg ttg gcg gtg gtg atg gga agt tca gag get aac 1257 Ala Tyr Lys Gln Val Leu Ala Val Val Met Gly Ser Ser Glu Ala Asn ggt aac cga gtt ggg ttg agt tgc gac tcg get gac tcg ttg act cag 1305 Gly Asn Arg Val Gly Leu Ser Cys Asp Ser Ala Asp Ser Leu Thr Gln att ttc ctt gaa ccg acg tat ttt cca agc ttg gat gcc gcc aag act 1353 Ile Phe Leu Glu.Pro Thr Tyr Phe Pro Ser Leu Asp Ala Ala Lys Thr ttt ctt gga tac ctg tgg gat aat aac gat aat aac aaa tgg ata 1398 Phe Leu Gly Tyr Leu Trp Asp Asn Asn Asp Asn Asn Lys Trp Ile tgaaaacgaa aaaaaaaaaa 1418 <210> 8 <211> 451 <212> PRT
<213> Glycine max <400> 8 Met Val Asp Leu His Trp Lys Ser Lys Met Pro Ser Ser Lys Thr Pro Lys Leu Ser Leu Ser Asp Asn Lys Ser Leu Pro Ser Leu Gln Leu Pro Phe Arg Thr Thr Asp Ile Ser Pro Ala Ala Pro Ser Val Cys Ala Ala Tyr Asp Tyr Tyr Leu Arg Leu Pro Gln Leu Arg Lys Leu Trp Asn Ser Thr Asp PheProAsn TrpAsn AsnGluPro IleLeuLysPro IleLeu 65 70 . 75 80 Gln Ala LeuGluIle ThrPhe ArgPheLeu SerIleValLeu SerAsp Pro Arg ProTyrSer AsnHis ArgGluTrp ThrArgArgIle GluSer Leu Ile MetHisGln IleGlu IleIleAla IleLeuCysGlu GluGlu Glu Gln AsnSerAsp ThrArg GlyThrAla ProThrAlaAsp LeuSer Ser Ser AsnSerSer ValSer ArgSerTyr SerGluAlaSer LeuLeu 4 Pro Arg LeuAlaThr TrpTyr LysSerArg AspValAlaGln ArgIle Leu Leu SerValGlu CysGln MetArgArg CysSerTyrThr LeuGly Leu Gly GluProAsn LeuAla GlyLysPro SerLeuLeuTyr AspLeu Val Cys LysProAsn GluIle HisAlaLeu LysThrThrPro TyrAsp Glu Arg ValGluAsn HisGlu AsnHisAla ValHisAlaThr HisGln Ile Ala GluSerTrp IleHis AlaSerArg LysValLeuGlu ArgIle Ala Asp AlaValLeu SerArg ThrPheLeu LysAlaAlaGlu AspCys Tyr Ala ValGluArg IleTrp LysLeuLeu AlaGluValGlu AspLeu His Leu Met Met Asp Pro Asp Asp Phe Leu Arg Leu Lys Asn Gln Leu Ser Val Lys Ser Ser Ser Gly Glu Thr Ala Ser Phe Cys Phe Arg Ser Asn Glu Leu Val Glu Leu Thr Lys Met Cys Arg Asp Leu Arg His Lys Val Pro Glu Ile Leu Glu Val Glu Val Asp Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala Ala Met Lys Leu Tyr Val Ser Lys Ser Glu Phe Glu Lys Val His Leu Leu Gln Ala Met Gln Ala Ile Glu Ala Ala Met Lys Arg Phe Phe Tyr Ala Tyr Lys Gln Val Leu Ala Val Val Mgt Gly Ser Ser Glu Ala Asn Gly Asn Arg Val Gly Leu Ser Cys Asp Ser Ala Asp 405 410 ~ 415 Ser Leu Thr Gln Ile Phe Leu Glu Pro Thr Tyr Phe Pro Ser Leu Asp 3 0 Ala Ala Lys Thr Phe Leu Gly Tyr Leu Trp Asp Asn Asn Asp Asn Asn Lys Trp Ile <210> 9 <211> 1498 <212> DNA
<213> Glycine max 40 <220>
<221> CDS
<222> (69)..(1433) <400> 9 cgacaccaat ttctccatcc tctcattgaa aaacaaaatt aatcatctta tttatttatt 60 ctccgaaa atg gtt gat tta cat tgg aaa tca aag atg cca agt tcc gac 110 Met Val Asp Leu His Trp Lys Ser Lys Met Pro Ser Ser Asp atg cct tcc aaa act ctc aaa ctc tct ctc tcc gac aac aag tcc tta 158 50 Met Pro Ser Lys Thr Leu Lys Leu Ser Leu Ser Asp Asn Lys Ser Leu ccc tct ttg caa cta ccc ttc cgc acc aca gat atc tct cac gcc gca 206 Pro Ser Leu Gln Leu Pro Phe Arg Thr Thr Asp Ile Ser His Ala Ala cct tct gtt tgc gcc act tac gac tac tat ctc cgt ctt cct caa ctc 254 Pro Ser Val Cys Ala Thr Tyr Asp Tyr Tyr Leu Arg Leu Pro Gln Leu aga aag ctt tgg aac tcc tca gat ttt cct aat tgg aac aac gaa cca 302 Arg Lys Leu Trp Asn Ser Ser Asp Phe Pro Asn Trp Asn Asn Glu Pro atc tta aaa cct atc ttg caa get ctc gaa atc acc ttc cgc ttt ctc 350 Ile Leu Lys Pro Ile Leu Gln Ala Leu Glu Ile Thr Phe Arg Phe Leu tcc att gtt ctc tcc gat cca aga cct tac tcc aac cac aga gaa tgg 398 Ser Ile Val Leu Ser Asp Pro Arg Pro Tyr Ser Asn His Arg Glu Trp act cgc agg ata gag tct ctt atc aca cat caa att gaa atc att gcc 446 Thr Arg Arg Ile Glu Ser Leu Ile Thr His Gln Ile Glu Ile Ile Ala ata ctt tgt gaa gat gag gaa caa aat tcc gac aca cgt ggc act gca 494 2 0 Ile Leu Cys Glu Asp Glu Glu Gln Asn Ser Asp Thr Arg Gly Thr Ala 130 . 135 140 cca acc get gat ctc agc agg aac aat agc agc gag agc aga agc tac 542 Pro Thr Ala Asp Leu Ser Arg Asn Asn Ser Ser Glu Ser Arg Ser Tyr agc gag gca agc ctg ctt ccg cgg ctt gcc acg tgg tac aaa tcc aag 590 Ser Glu Ala Ser Leu Leu Pro Arg Leu Ala Thr Trp Tyr Lys Ser Lys gac gta gcg cag agg atc ctt ctc tca gtt gaa tgc caa atg agg agg 638 Asp Val Ala Gln Arg Ile Leu Leu Ser Val Glu Cys Gln Met Arg Arg tgt tcc tac acg ctg ggt ttg ggt gag ccg aac cta gcg ggc aaa ccg 686 Cys Ser Tyr Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro agc ctg ctc tac gac ctc gtg tgc aag ccg aac gag atc cac gcg ctg 734 4 0 Ser Leu Leu Tyr Asp Leu Val Cys Lys Pro Asn Glu Ile His Ala Leu aag acg acg ccg tac gat gag cgc gta gag aat cac gag aac cac gcg 782 Lys Thr Thr Pro Tyr Asp Glu Arg Val Glu Asn His Glu Asn His Ala ttg cac gcg acg cac cag atc gcc gag tcg tgg atc cac gcg tcg cgg 830 Leu His Ala Thr His Gln Ile Ala Glu Ser Trp Ile His Ala Ser Arg aag gtt cta gag agg atc gca gac gcg gtc ctc tcc aga acc ttc gag 878 Lys Val Leu Glu Arg Ile Ala Asp Ala Val Leu Ser Arg Thr Phe Glu aag gcg get gag gac tgc tac gcc gtg gaa agg atc tgg aag ctt ctc 926 Lys Ala Ala Glu Asp Cys Tyr Ala Val Glu Arg Ile Trp Lys Leu Leu gcg gag gtg gag gac ctc cac ctg atg atg gat ccg gac gat ttc ttg 974 Ala Glu Val Glu Asp Leu His Leu Met Met Asp Pro Asp Asp Phe Leu aga ctg aag aat cag ctc tcg gtg aaa tcc tcc ggc ggc gaa acg get 1022 Arg Leu Lys Asn Gln Leu Ser Val Lys Ser Ser Gly Gly Glu Thr Ala tcg ttc tgc ttc agg tcg aag gag ttg gtt gaa ctg acg aag atg tgc 1070 Ser Phe Cys Phe Arg Ser Lys Glu Leu Val Glu Leu Thr Lys Met Cys aga gat ctg agg cac aag gtg ccg gag ata ttg gag gtg gag gtg gat 1118 Arg Asp Leu Arg His Lys Val Pro Glu Ile Leu Glu Val Glu Val Asp ccg aag gga gga ccg agg att caa gag gcg gcg atg aag ctc tac gtt 1166 Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala Ala Met Lys Leu Tyr Val tcg aag agc gcg ttc gag aag gtt cac ttg ttg cag gcg atg cag gcg 1214 2 0 Ser Lys Ser Ala Phe Glu Lys Val His Leu Leu Gln Ala Met Gln Ala att gag gcg gcg atg aag aga ttc ttc tac gcg tat aag cag gtg ttg 1262 Ile Glu Ala Ala Met Lys Arg Phe Phe Tyr Ala Tyr Lys Gln Val Leu gcg gtg gtg atg gga agc tcc gag get aac ggt aac cga gtt ggg ttg 1310 Ala Val Val Met Gly Ser Ser Glu Ala Asn Gly Asn Arg Val Gly Leu agt tgc gac tcg cgt gac tcg ttg act cag att ttc ctt gaa ccg acg 1358 Ser Cys Asp Ser Arg Asp Ser Leu Thr Gln Ile Phe Leu Glu Pro Thr tat ttt cca agc ttg gat gcc gcc aag act ttt ctt gga tac ttg tgg 1406 Tyr Phe Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Gly Tyr Leu Trp gat aat aac gat aat aac aaa tgg ata tgataaggga aaaaaaaaaa 1453 4 0 Asp Asn Asn Asp Asn Asn Lys Trp Ile acggcacaaa aacgatggcc aaagtgagat tttcggtttg ggcac 1498 <210> 10 <211> 455 <212> PRT
<213> Glycine max <400> 10 SO Met Val Asp Leu His Trp Lys Ser Lys Met Pro Ser Ser Asp Met Pro Ser Lys Thr Leu Lys Leu Ser Leu Ser Asp Asn Lys Ser Leu Pro Ser Leu Gln Leu Pro Phe Arg Thr Thr Asp Ile Ser His Ala Ala Pro Ser Val Cys Ala Thr Tyr Asp Tyr Tyr Leu Arg Leu Pro Gln Leu Arg Lys Leu Trp Asn Ser Ser Asp Phe Pro Asn Trp Asn Asn Glu Pro Ile Leu Lys Pro Ile Leu Gln Ala Leu Glu Ile Thr Phe Arg Phe Leu Ser Ile Val Leu Ser Asp Pro Arg Pro Tyr Ser Asn His Arg Glu Trp Thr Arg Arg Ile Glu Ser Leu Ile Thr His Gln Ile Glu Ile Ile Ala Ile Leu Cys Glu Asp Glu Glu Gln Asn Ser Asp Thr Arg Gly Thr Ala Pro Thr Ala Asp Leu Ser Arg Asn Asn Ser Ser Glu Ser Arg Ser Tyr Ser Glu Ala.Ser Leu Leu Pro Arg Leu Ala Thr Trp Tyr Lys Ser Lys Asp Val Ala Gln Arg Ile Leu Leu Ser Val Glu Cys Gln Met Arg Arg Cys Ser 180 ~ 185 190 Tyr Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro Ser Leu 3 0 Leu Tyr Asp Leu Val Cys Lys Pro Asn Glu Ile His Ala Leu Lys Thr Thr Pro Tyr Asp Glu Arg Val Glu Asn His Glu Asn His Ala Leu His Ala Thr His Gln Ile Ala Glu Ser Trp Ile His Ala Ser Arg Lys Val Leu Glu Arg Ile Ala Asp Ala Val Leu Ser Arg Thr Phe Glu Lys Ala Ala Glu Asp Cys Tyr Ala Val Glu Arg Ile Trp Lys Leu Leu Ala Glu Val Glu Asp Leu His Leu Met Met Asp Pro Asp Asp Phe Leu Arg Leu Lys Asn Gln Leu Ser Val Lys Ser Ser Gly Gly Glu Thr Ala Ser Phe Cys Phe Arg Ser Lys Glu Leu Val Glu Leu Thr Lys Met Cys Arg Asp Leu Arg His Lys Val Pro Glu Ile Leu Glu Val Glu Val Asp Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala Ala Met Lys Leu Tyr Val Ser Lys Ser Ala Phe Glu Lys Val His Leu Leu Gln Ala Met Gln Ala Ile Glu 3a Ala Ala Met Lys Arg Phe Phe Tyr Ala Tyr Lys Gln Val Leu Ala Val Val Met Gly Ser Ser Glu Ala Asn Gly Asn Arg Val Gly Leu Ser Cys Asp Ser Arg Asp Ser Leu Thr Gln Ile Phe Leu Glu Pro Thr Tyr Phe Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Gly Tyr Leu Trp Asp Asn Asn Asp Asn Asn Lys Trp Ile
This application is related to international publication number WO 99/60141, and is a divisional of Canadian patent application SN 2,323,312.
FIELD OF THE INVENTION
The present invention relates to isolated DNA sequences involved in nematode resistance inplants.The invention also relates to~methods for improving the genetic traits for nematode resistance in plants b~~ utilizing such isolated DNA
sequences.
BACKGROUND
Plants are continually attacked by a diverse range of phytopathogenic organisms. These organisms cause substantial losses to crops each year.
Traditional approaches for control of plant diseases have been the use of chemical treatment arid the construction of interspecific hybrids between resistant crops and their wild-type relatives as sources of resistant germplasm. However, environmental and economic concerns make chemical pesticides undesirable, while the traditional interspecific breeding is inefEcient and often cannot eliminate the undesired traits of the wild species. Thus, the discovery of pest and pathogen-resistant genes provides a new approach to control plant disease.
Several genes responsible for disease resistance have been identified and ' isolated from plants. See Staskawicz et al. (1995) Science 268:661-667.
Recently, 2 o the sugar beet Hsl°'°'~ gene that confers resistance to the beet cyst nematode was cloned. See Cai et al: (1997) Science 275:832-834; and Moffat (1997) Science 275:77: Transformation of plants or plant tissues with the resistance genes can confer disease resistance to susceptible strains. See, for example, PCT
Publication W093/19181; and Cai et al. (1997) Science 275:832-834.
2 5 Nematode infection is prevalent in many crops. For example, soybean cyst nematode (Heterodera glycines) is a widespread pest that causes substantial damage 62451-860 (D) to soybeans every year. Such damage is the result of the stunting of the soybean plant caused by the cyst nematode.
The stunted plants have smaller root systems, show symptoms of mineral deficiencies in their leaves, and wilt easily.
The soybean cyst nematode is believed to be responsible for yield losses in soybeans that are estimated to be in excess of $500 million per year.
Nematicides such as Aldicarb and its breakdown products are known to be highly toxic to mammals. As a result, government restrictions have been imposed on the use of these chemicals. Thus, there is a great need for the isolation of genes that can provide an effective method of controlling nematodes without causing health and environmental problems.
SUMMARY OF THE INVENTION
This invention relates to DNA sequences isolated from soybean and maize. The sequences alone, or in combination with other sequences, confer nematode resistance in a plant. The sequences are useful in methods for the protection of plants from nematodes. Additionally, allelic variants of the resistance gene from a susceptible plant are included. In another aspect of the present invention, expression cassettes and transformation vectors comprising the isolated nucleotide sequences are disclosed. The transformation vectors can be used to transform plants and express the nematode resistance genes in the transformed cells. Plants susceptible to nematode infection can be targeted to confer nematode resistance. The transformed cells as well as the regenerated transgenic plants containing and expressing the isolated DNA and protein sequences are also disclosed.
62451-860(D) 2a In one aspect, the invention provides a plant which has been transformed with a transformation vector comprising a DNA sequence that encodes an amino acid sequence selected from the group consisting of the sequences set forth in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ ID NO: 8.
Another aspect of the invention provides a plant which has been transformed with a transformation vector comprising a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence having at least 70%
identity to the nucleotide sequence set forth in SEQ ID NO:
1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID NO: 7; and (b) a nucleotide sequence having at least 70% sequence identity to a nucleotide sequence encoding a plant protein, wherein said sequence encoding said plant protein is contained in a plasmid having ATCC accession number 209366, 209365, 209614, 209363, or 209364.
Another aspect of the invention provides a plant which has been transformed with a transformation vector comprising a nucleotide sequence selected from the group consisting of: (a) the nucleotide sequence set forth in SEQ
ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7; (b) a nucleotide sequence encoding a plant protein, wherein said sequence is contained in a plasmid having ATCC accession number 209366, 209365, 209614, 209363, or 209364; and (c) a nucleotide sequence having at least 85% identity to the sequence of (a) or (b).
Another aspect of the invention provides a plant transformed with a DNA sequence encoding a protein comprising an amino acid sequence selected from the group consisting of the amino acid sequences set forth in SEQ ID
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ ID NO: 8, wherein 62451-860(D) 2b said plant exhibits improved resistance to nematodes over the native untransformed plant.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 schematically illustrates the plasmid vector comprising a nematode resistance DNA sequence of the present invention operably linked to the ubiquitin promoter.
Constitutive expression of this sequence confers resistance to nematodes in a transformed plant.
DETAILED DESCRIPTION OF THE INVENTION
Compositions and methods for the control of nematodes in susceptible plants are provided. The compositions comprise isolated proteins and DNA sequences encoding such proteins involved in nematode resistance. Such isolated 'DNA
sequences can be transferred into plants to confer or improve nematode resistance in the transformed plants. Sequences of the invention have been isolated from maize and soybean. By "involved in nematode resistance" is intended that the proteins or sequences, either alone or in combination with other proteins or sequences, confer nematode resistance in a plant. In this manner, resistance to nematodes can be enhanced or improved in the transformed plant as at least one of the sequences required for nematode resistance is provided.
DNA sequences isolated from the genomes of maize and soybean are disclosed. The nucleotide sequences and amino acid sequences from two maize isolates are set forth in SEQ ID NOs: 1-2 and 3-4, and the corresponding sequences from two soybean isolates are set forth in SEQ ID NOs: 5-6 and 7-8. The nucleotide sequences in accordance with this invention are involved in nematode resistance in 1 S plants and may confer, alone or in combination with other sequences, nematode resistance in plants. Also discussed are DNA sequences isolated from a susceptible genotype of soybean. The nucleotide and amino acid sequences for this isolate are set forth in SEQ ID NOs: 9-10. Nucleotide sequences of the invention also include the maize and soybean nematode resistance gene sequences as contained in plasmids deposited with American Type Culture Collection (ATCC) and assigned Accession Numbers 209366, 209365, 209614, 209363, and 209364.
Using the sequence information set forth in the SEQ ID NOs or the sequences as contained in ATTC deposits assigned Accession Numbers 209366, 209365, 209614, 209363, and 209364, other plant DNA sequences comprising the nucleotide sequences disclosed above can be isolated based on sequence homology at either the amino acid or nucleotide sequence level. Any suitable molecular cloning method can be used including, but not limited to, PCR'amplification and DNA
hybridization. In the same manner, synthetic nucleotide sequences can be designed based on the amino acid sequences of the invention. Methods to design and make such synthetic sequences are available in the art.
In a hybridization method, the hybridization probes may be genomic DNA
fragments, cDNA fragments, RNA fragments, or other oligonucleotides, and may be labeled with a detectable group such as 3ZP, or any other detectable marker.
Probes for hybridization can be made by labeling synthetic oligonucleotides based on the sequence of the soybean and/or maize sequence. Degenerate primers designed on the basis of conserved nucleotide or amino acid sequences in the maize and soybean sequences can additionally be used. Preparation of probes for hybridization is generally know in the art and is disclosed in Sambrook et al. ( 1989) Molecular Cloning: A Laboratory Manual (2d ed.. Cold Spring Harbor Laboratory Press.
Plainview, New Yorkj.~ The labeled probes can be used to screen cDNA or genomic libraries made from nematode resistant plants.
Methods for construction of such cDNA and genomic libraries are generally known in the art and are disclosed in Sambrook et al. (1989).Molecular Cloninh: A
Laboratort:
Manual (2d ed.. Cold Spring Harbor Laboratory Press. Plainviev-. New York).
In a PCR method. the DNA or amino acid sequence encoded by the soybean or maize sequences of the invention can be aligned with each other. Nucleotide 1 S primers can be designed based on any conserved short stretches of amino acid sequences or nucleotide sequences. Pairs of primers can be used in PCR
reactions for amplification of DNA sequences from cDNA or genomic DNA extracted from plants of interest. In addition. a single specific primer with a sequence corresponding to one of the nucleotide sequences disclosed herein can be paired with a primer having a sequence of the DNA vector in the cDNA or genomic libraries for PCR
amplification of the sequences ~' or 3' to the nucleotide sequences disclosed herein.
Similarly.
nested primers may be used instead of a single specific primer for the purposes of the invention. Methods for designing PCR primers and PCR cloning are generally known in the art and are disclosed in Sambrook et al. (1989) Molecular Cloning: A
Laboratory Manual (2d ed.. Cold Spring Harbor Laboratory Press, Plainview, New York).
The sequences of the invention comprise coding sequences from other plants that may be isolated according to well-known techniques based on their sequence homology to the maize or soybean coding sequences set forth herein. In these techniques, all or part of the known coding sequence is used as a probe that selectively hybridizes to other possible nematode resistance coding sequences present in a population of cloned genomic DNA fragments or cDNA fragments (i. e., genomic or cDNA libraries) from a chosen organism. To achieve specific hybridization under a variety of conditions, such probes include sequences that are unique and are preferably at least about 10 nucleotides in length. and most preferably at least about 20 nucleotides in length. Such probes may be used to amplify corresponding coding S sequences from a chosen organism by PCR. This technique may be used to isolate other possible nematode resistance coding sequences from a desired organism or as a diagnostic assay to determine the presence of the nematode resistance coding sequence m an organism.
Such techniques include hybridization screening of plated DNA libraries (either plaques or colonies; see, e.g., Innis et al.. eds. (1990) PCR
Protocols: A Guide to Methods and Applications (Academic Press. New York)).
The isolated DNA sequences further comprise DNA sequences isolated from other plants by hybridization with partial sequences obtained from maize and soybean. Conditions that will permit other DNA sequences to hybridize to the DNA
1 S sequences disclosed herein can be determined in accordance with techniques generally known in the art. For example, hybridization of such sequences may be carried out under conditions of reduced stringency. medium stringency. or high stringency conditions (e.g., conditions represented by a wash stringency of 3~-40°ro Formamide with Sx Denhardt's solution. 0.~% SDS. and lx SSPE at 37°C;
conditions represented by a wash stringency of 40-45% Forniamide with Sx Denhardt's solution.
0.5% SDS. and lx SSPE at 42°C; and conditions represented by a wash stringenc~~ of ~0% Formamide with Sx Denhardt's solution, 0.~% SDS, and lx SSPE at 42°C.
respectively. See Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, New York). In general.
sequences that confer nematode resistance and hybridize to the DNA sequences disclosed herein will be at least 70-75% homologous, 80-85% homologous. and even 90-95% homologous or more.
The following terms are used to describe the sequence relationships bet«~een two or more nucleic acids or polynucleotides: (a) "reference sequence", (b) "comparison window", (c) "sequence identity". (d) "percentage of sequence identiy", and (e) "substantial identity".
(a) As used herein, "reference sequence" is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset of or the entire specified sequence; for example, as a segment of a full-length cDNA
or gene sequence, or the complete cDNA or gene sequence.
(b) As used herein, "comparison window" makes reference to a contiguous and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. Generally, the comparison window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or more contiguous nucleotides in length. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and is subtracted from the number of matches.
Methods of alignment of sequences for comparison are well-known in the art.
I S Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman ( 1981 ) Adv. Appl. Math. 2:482; b~~
the homology alignment algorithm of Needleman and Wunsch ( 1970) J. Mol. Biol.
48:443; by the search for similarity method of Pearson and Lipman ( 1988) Proc. Natl.
Acad Sci. 85:2444; by computerized implementations of these algorithms.
including, but not limited to: CLUSTAL in the PC/Gene program by Intelligenetics (Mountain View, California). GAP. BESTFIT. BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG) (57~ Science Drive, Madison, Wisconsin); the CLUSTAL program is well described by Higgins and Sharp (1988) Gene 73:237-244; Higgins and Sharp (1989); CABIOS S:l~l-153:' Corpet et al. ( I 988) Nucleic Acids Res. 16:10881-90; Huang et al. ( 1992) Computer Applications in the Biosciences 8:155-65; and Person et al. ( 1994) Meth. of Mol. Biol.
24:307-331; preferred computer alignment methods also include the BLASTP.
BLASTN, and BLASTX algorithms. See Altschul et al. (1990) J. Mol. Biol. 21 ~:403-410. Alignment is also often performed by visual inspection and manual alignment.
(c) As used herein. "sequence identity" or "identity" in the context of two nucleic acid or polypeptide sequences includes reference to the residues in the two sequences that are the same when aligned for maximum correspondence over a v~0 99/60141 PCT/US98/27450 specified comparison window. When percentage of sequence identity is used in reference to proteins, it is recognized that residue positions that are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not substantially change the functional properties of the molecule. Where sequences differ in conservative substitutions, the percentage of sequence identity may be adjusted upward to correct for the conservative nature of the substitution. Sequences that differ by such conservative substitutions are said to have "sequence similarity" or "similarity." Means for making this adjustment are well-known to those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch. thereby increasing the percentage of sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and I . The scoring of conservative 1 S substitutions is calculated, e.g., as implemented in the program PC/GENE
(Intelligenetics, Mountain View, California).
(d) As used herein, "percentage of sequence identity" means the value determined by comparing two optimally aligned sequences over a comparison windov~~, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i. e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
(e)(i) The term "substantial identity" of polynucleotide sequences means that a polynucleotide comprises a sequence that has at least 70%
sequence identity, preferably at least 80%, more preferably at least 90% and most preferably at least 95%, compared to a reference sequence using one of the alignment programs described using standard parameters. One of skill in the art will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins _g_ encoded by two nucleotide sequences by taking into account codon degeneracy amino acid similarity, reading frame positioning, and the like. Substantial identity of amino acid sequences for these purposes normally means sequence identity of at least 60%, more preferably at least 70%, 80%, 90%, and most preferably at least 95%.
Another indication that nucleotide sequences are substantially identical is if two molecules hybridize to each other under stringent conditions. Generally, stringent temperature conditions are selected to be about 5°C to about 2°C lower than the melting point (Tm) for the specific sequence at a defined ionic strength and pH.
The denaturation or melting of DNA occurs over a narrow temperature range and represents the disruption of the double helix into its complementary single strands.
The process usually is characterized by the temperature of the midpoint of transition, T~,. which is sometimes described as the melting temperature. Formulas are available in the art for the determination of melting temperatures. Typically. stringent wash conditions are those in which the salt concentration is about 0.02 molar at pH
7 and I S the temperature is at 50, 55, or 60°C. However, nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides that they encode are substantially identical. This may occur, for example, when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. One indication that two nucleic acid sequences are substantially identical is that the poIypeptide that the first nucleic acid encodes is immunologically cross reactive with the polypeptide encoded by the second nucleic acid.
(e)(ii) The terms "substantial identity" in the context of a peptide indicates that a peptide comprises a sequence with at least 70% sequence identity to a reference sequence, preferably 80%, more preferably 85%, most preferably at least 90°io or 95% sequence identity to the reference sequence over a specified comparison windov<-. Preferably, optimal alignment is conducted using the homology alignment algorithm of Needleman and Wunsch ( 1970) J. Mol. Biol. 48:443. An indication that two peptide sequences are substantially identical is that one peptide is immunologically reactive with antibodies raised against the second peptide.
Thus, a peptide is substantially identical to a second peptide, for example, where the two peptides differ only by a conservative substitution. Polypeptides that are . ,'s "substantially similar" share sequences as noted above except that residue positions that are not identical may differ by conservative amino acid changes.
The present invention also encompasses the proteins and peptides encoded by the nucleotide sequences of this invention. It is recognized that the proteins of the invention may be oligomeric and will vary in molecular weight. component peptides, activity, and in other characteristics. The proteins of the invention can be used to protect plants against nematodes. Such methods are described in more detail below.
Fragments and variants of the disclosed nucleotide sequences and proteins encoded thereby are also encompassed by the present invention. By "fragment"
is intended a portion of the nucleotide sequence or a portion of the amino acid sequence and hence protein encoded thereby. Fragments of a nucleotide sequence may encode protein fragments that retain the biological activity of the native protein and hence confer resistance to nematodes. Alternatively. fragments of a nucleotide sequence that are useful as hybridization probes generally do not encode fragment proteins retaining biological activity. Thus, fragments of a nucleotide sequence may range from at least about 20 nucleotides, about 50 nucleotides, about 100 nucleotides. and up to the entire nucleotide sequence encoding the proteins of the invention.
By "variants" is intended substantially similar sequences. For nucleotide sequences, conservative variants include those sequences that, because of the degeneracy of the genetic code, encode the amino acid sequence of one of the proteins conferring resistance to nematodes. Generally. nucleotide sequence variants of the invention will have at least 70%, generally, 80%, preferably up to 90%
sequence identity to its respective native nucleotide sequence.
By "variant" protein is intended a protein derived from the native protein by deletion (so-called truncation) or addition of one or more amino acids to the N-terminal and/or C-terminal end of the native protein; deletion or addition of one or more amino acids at one or more sites~in the native protein; or substitution of one or more amino acids at one or more sites in the native protein. Such variants may result from. for example, genetic polymorphism or from human manipulation. Methods for such manipulations are generally known in the art.
Thus, the proteins of the invention may be altered in various ways including amino acid substitutions, deletions, truncations, and insertions. Methods for such 62451-860(S) manipulations are generally known in the art. For example, amino acid sequence variants of the proteins can be prepared by mutations in the DNA. Methods for mutagenesis and nucleotide sequence alterations are well known in the art.
See_ for example, Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488-49?; Kunkel et al.
(1987) Methods in Enrymol. 154:367-382; U.S. Patent No. 4.873,192; Walker and Gaastra, eds. (1983) Techniques in Molecular Biology (MacMillan Publishing Company. New York) and the references cited therein. Thus, the genes and nucleotide sequences of the invention include both the naturally occurring sequences as well as mutant forms.
Likewise, the proteins of the invention encompass both naturally occurring proteins as _ w~el1 as variations, fragments, and modified forms thereof. Such variants will;
continue to possess the desired activity of conferring resistance to nematodes.
Obviously. the mutations.that will be made in the DNA encoding the variant must not place the sequence out of reading frame and preferably will not create sequences deleterious to expression of the gene.product. See, EP Patent Application Publication No.75,444.
The nematode resistance genes of the invention can be optimized for enhanced expression in plants of interest. See, for example, EPA0359472; EPA0385962:
W091/16432; Perlak et al. (1991) Prnc. Natl. Acad. Sci. USA 88:3324-3328: and Murray et al. (1989) Nucleic Acids Res. 17:477-498. In this manner, the genes can be synthesized utilizing.plant-preferred colons. See, for example, Murray et al.
(1989) . Nucleic Acids Res. 17:477-498. In this manner, synthetic genes can also be made based on the distribution of colons a particular host uses for a particular amino acid. Thus, the nucleotide sequences can be optimized for expression in any plant. It is recognized that ali or any part of the gene sequence may be optimized or synthetic. That is, synthetic or partially optimized sequences may also be used.
The present invention also relates to a recombinant DNA transformation construct comprising the isolated DNA sequences involved in nematode resistance in plants. The recombinant DNA transformation construct can be introduced into plant cells, prbtoplasts, calli, tissues, or whole plants to confer nematode-resistance properties in plants.
The sequences of the invention can be constructed in expression cassettes for 62451-860 (S) expression in a plant. Such expression cassettes will comprise a transcriptional initiation region linked to the gene encoding the gene of interest. Such an expression cassette is provided with a plurality of restriction sites for insertion of the gene of interest behind the regulatory control of a designated promoter. The expression cassette may additionally contain selectable marker genes suitable for the particular host organism.
The transcriptional initiation region, the promoter, may be native or analogous or foreign or heterologous to the host. Additionally, the promoter may be the natural sequence or alternatively a synthetic sequence. By foreign is intended that the transcriptional initiation region is not found in the wild-type host into which the transcriptional initiation region is introduced. As used herein a chimeric gene comprises a coding sequence operably linked to a transcription initiation region which is heterologous to the coding sequence. While any promoter or promoter element capable of driving expression of a coding sequence can be utilized. of particular interest for expression in plants are root promoters (Bevan et al. (1993) in Gene Conservation and Exploitation: Proceedings of the 20th Stadler Genetics Symposium, ed. Gustafson et al. (Plenum Press, New York) pp. 109-129; Brears et al.
( 1991 ) Plant J. 1:235-244; Lorenz et al. ( 1993) Plant J. 4:545-554; U.S.
Patent Nos.
5.459.252; 5,608,149; 5,599,670); pith promoter (U.S. Patent Nos. 5.466,785;
5.451.514; 5,391.725); or other tissue specific and constitutive promoters (see, for example, U.S. Patent Nos. 5.608.149; 5,608,144; 5,604.121; 5.569.597;
5,466.785;
5,399,680; 5,268,463; 5,608;142).
The ttanscriptional cassette will include in the 5'-to-3' direction of transcription, transcriptional and translational initiation regions, a DNA
sequence of interest, and transcriptional and translational termination regions functional in plants.
The termination region may be native with the transcriptional initiation region, may be native with the DNA sequence of interest, or may be derived from another source.
Convenient termination regions are available from the Ti-plasmid of A.
tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See also, Guerineau et al. ( 1991 ) Mol. Gen. Genet. 262:141-144; Proudfoot ( 1991 ) Cell 64:671-674; Sanfacon et al. (1991) Genes Dev. 5:141-149; Mogen et al. (1990) Plant Cell 2:1261-1272; Munroe et al. (1990) Gene 91:151-I58; Ballas et al. (1989) Nucleic Acids Res. 17:7891-7903; Joshi et al. (1987) Nucleic Acid Res. 15:9627-9639.
Methodologies for the construction of plant transformation constructs are described in the art. The construct may include any necessary regulatory elements such as promoters, terminators (Guerineau et al. ( I 991 ) Mol. Gen. Genet.
226: I 41-144; Proudfoot ( 1991 ) Cell 64:671-674; Sanfacon et al. ( 1991 ) Genes Dev.
5:141-149; Molten et al. (1990) Plant Cell 2:1261-1272; Munroe et al. (1990) Gene 91:151-158; Ballas et al. (1989) Nucleic Acids Res. 17:7891-7903; .Ioshi et al. (1987) Nucleic Acid Res. I 5:962?-9639); plant translational consensus sequences (Joshi, C.P.
(1987) Nucleic Acids Research 15:6643-6653). enhancers, introns (Luehrsen and Walbot (1991) Mol. Gen. Genet. 225:81-93) and the like, operably linked to the nucleotide sequence. It may be beneficial to include 5' leader sequences in the transformation construct. Such leader sequences can act to enhance translation. See.
for example, Elroy-Stein et al. ( 1989) PNAS USA 86:6126-6130; Allison et al.
( 1986):
Macejak and Sarnow (1991) Nature 353:90-94; Jobling and Gehrke (1987) Nature 325:622-625; Gallie et al. (1989) Molecular Biology of RNA, pp. 237-256;
Lommel et al. (1991) Virology 81:382-385; and Della-Cioppa et al. (1987) Plant Physiol.
84:965-968.
Transcriptional and translational regulatory signals include but are not limited to promoters. transcriptional initiation start sites. operators, activators, enhancers, other regulatory elements, ribosomal binding sites. an initiation codon.
termination signals, and the like. See, for example. U.S. Patent No. 5,039,523: U.S.
Patent No.
4,853.331; EPO 0480762A2; Sambrook et al. (1989) Molecular Cloning A
Laboratory Manual (2d .ed., Cold Spring Harbor Laboratory Press, Plainview, New York); Davis et al., eds. (1980) Advanced Bacterial Genetics (2d ed., Cold Spring Harbor Laboratory. Cold Spring Harbor, New York); and the references cited therein.
For the expression of the proteins encoded by the isolated DNA sequences of the present invention. a promoter capable of facilitating gene transcription in plant cells must be operable linked to the nematode resistance gene sequence. A
variety of suitable promoters are generally known in the art. Both constitutive promoter and tissue-specific promoters can be used. A constitutive promoter is a promoter that can initiate RNA transcription in any tissue or cell in a plant, while tissue-specific promoters can do so only in specific tissues. Suitable promoters are known in the art WO 99/60141 PCT/US98t2745~b and include 35S and I 9S promoter of CaMV. Agrobacterium NOS (nopaline symthase) gene promoter, and the Agrobacterium mannopine synthase gene promoter.
For tissue specific expression, the isolated DNA sequences of the invention conferring nematode resistance can be operably linked to tissue specific promoters.
In addition, a marker gene for identifying and selecting transformed cells.
tissues, or plants may be included in the transformation construct. By marker gene is intended to be either reporter genes or selectable marker genes.
Reporter genes are generally known in the an. The reporter gene used should be exogenous and not expressed endogenously. Ideally the reporter gene will exhibit low- background activity and should not interfere with plant biochemical and physiological activities. The products expressed by the.reporter gene should be stable and readily detectable. It is important that the reporter gene expression should be able to be assayed by a non-destructive., quantitative, sensitive, easy to perform and inexpensive method.
Examples of suitable reporter genes known in the art can be found in, for example, Jefferson et al. (1991 ) in Plant Molecular- Biology Manual, ed.
Gelvin et al.
(Kiuwer Academic Publishers), pp. 1-33; (DeWet et al. (1987) Mol. Cell. Biol.
7:725-737: Goff et al. (1990) EMBO J. 9:2517-2522: Kain et al. (1995) BioTechnigues 19:60-655; Chiu et al. (1996) Current Biology 6:32-330.
Selectable marker genes for selection of transformed cells or tissues can include genes that confer antibiotic resistance or resistance to herbicides.
Examples of suitable seiectable marker genes include, but are not limited to, genes encoding resistance to chloramphenicol (Herrera Estrella et al. (1983) EMBO,I. 2:987-992:
methotrexate (Herrera Estrella et al. ( 1983) Nature 303:209-213; Meijer et al. ( 1991 ) Plant Mol. Biol. 16:807-820); hygromycin Waldron et al. (1985) Plant Mol.
Biol.
x:103-108; Zhijian et al. (1995) Plant Science 108:219-227); streptomycin (Jones et al. ( 1987) Mol. Gen. Genet. 210:86-91; spectinomycin (Bretagne-Sagnard et al.
(1996) Transgenic Res. 5:131-137); bleomycin (Hille et al. {1990) Plant Mol.
Biol.
7:171-176); sulfonamide (Guerineau et al. (1990) Plant Mol. Biol. 15:127-136):
bromoxynil (Stalker et al .(1988) Science 242:419-423); glyphosate (Shaw et al.
( 1986) Science 233:478-481 ); phosphinothricin (DeBlock et al. ( 1987) EMBO
J.
6:'_' ~ I 3-2518); kanomycin, and the like.
It is further recognized that the components of the transformation construct may be modified to increase expression. For example, truncated sequences, nucleotide substitutions or other modifications may be employed. See, for example, Perlak et al. (1991 ) Proc. Natl. Acad Sci. USA 88:3324-3328; Murray et al. ( 1989) Arcleic Acids Res. 17:477-498; and W091/16432.
In preparing the transformation construct, the various DNA fragments may be manipulated, so as to provide for the DNA sequences in the proper orientation and, as appropriate in the proper reading frame. Toward this end, adapters or linkers may be employed to join the DNA fragments or other manipulations may be involved to provide for convenient restriction sites, removal of superfluous DNA, removal of restriction sites. or the like. For this purpose, in vitro mutagenesis, primer repair.
restriction. annealing, resection, ligation. PCR, or the like may be employed.
where insertions, deletions. or substitutions, e.g., transitions and transversions, may be involved.
The present invention also relates to the introduction of the transformation constructs into plant protoplasts, calli, tissues, or organ explants and the regeneration of transformed plants expressing the nematode resistance gene. The compositions of the present invention can be used to transform any plant. In this manner, genetically modified plants, plant cells, plant tissue, seed, and the like can be obtained.
Transformation protocols may vary depending on the type of plant or plant cell. i. e., monocot or dicot. targeted for transformation. Suitable methods of transforming plant cells include microinjection (Crossway et al. (1986) Biotechniques 7:320-334).
electroporation (Riggs et al. (1986) Proc. Natl. Acad. Sci. USA 83:5602-5606):
Agrobacterium-mediated transformation (Hinchee et al. (1988) Biotechnology 6:91 ~-921 ); direct gene transfer (Paszkowski et al. ( 1984) EMBO .I. 3:2717-2722):
and ballistic panicle bombardment (see, for example, Sanford et al., U.S.
Patent 4.945,050; Tomes et al. (1995) in Plant Cell, Tissue and Organ Culture:
Fundamental Methods, ed. Gamborg and Phillips (Springer-Verlag, Berlin): and McCabe et al. (1988) Biotechnology 6:923-926). Also see Weissinger et al.
(1988) Ann. Rev. Genet. 22:421-477; Sanford et al. (1987) Particulate Science and Technology x:27-37 (onion); Christou et al. (1988) Plant Physiol. 87:671-674 (soybean); McCabe et al. (1988) Biotechnolo~v 6:923-926 (soybean); Finer and 62451-860(5) McMullen (1991) In Yitro Cell Deo. Biol. 27P:175-182 (soybean); Singh et al.
(1998) Theor. Appl. Genet. 96:319-324 (soybean); Datta et al. (1990) Biotechnology 8:736-740 (rice); Klein et al. (1988) Proc. Natl. Acad Sci. USA 85:4305-4309 (maize); Klein et al. (1988) Biotechnology 6:559-563 (maize); Klein et al.
(1988) Plant Physiol. 91:440-444 (maize); Fromm et al. ( 1990) Biotechnology 8:833-839;
and 'Tomes et al. ( I 995) in Plant Cell, Tissue, and Organ Culture:
Fundamental Methods, ed. Gamborg and Phillips (Springer-Veilag, Berlin) (maize); Hooydaas-Van Slogteren and Hooykaas (1984) Nature (London) 311:763-764; Bytebier et al.
(1987) Proc. Natl. Acad Sci. USA 84:5345-5349 (Liliaceae); De Wet et al. (1985) in The Experimental Manipulation oJOvule Tissues, (G.H.P. Chapman et al., Longman. NY
eds. pp. I 97-209) ('pollen); Kaeppler et al. ( 1990) Plant Cell Reports 9.:41 ~-418:
Kaeppler et al. ( 1992) Theor. Appl. Genet. 84:560-566 (whisker-mediated transformation); DeHalluin et al. (1992) Plant Cell 4:1495-1505 (electroporation); Li et al. (1993) Plant Cell Reports 1'':250-255, and Christou and Ford (1995)Annals of I 5 Botany 75:407-413 (rice); Osjoda et al. (1996) Nature Biotechnology 14:745-(maize via Agrobacterium tumefaciens).
Plant tissues suitable for transformation include but are not limited to leaf tissues, root tissues, shoots, meristems. and protoplasts. For soybean it is often preferred to utilize explants of cotyledons.
For example, the Agrobacterium tumefaciens strain A208 is known to be highly virulent on soybean and to give rise to a higher rate of transformation. See Byme et al. (1987) Plant Cell Tissue and Organ Culture 8:3-I5. The transformation of soybean protoplasts by co-culturing them with Agrobacterium tumefaciens or Agrobacterium rhizogenes has been known.
See Facciotti et al. (1985) Biotechnology (NeH~ y'ork) 3:241. Tissue explants may be inoculated with the bacterium for transformation. For example, U.S. Patent No.
5.569,834 issued to Hinchee et al. discloses a method for soybean transformation and regeneration by inoculating a cotyledon explant that is tom apart at the cotyledonary node.
Alternatively, plants can also be transformed successfully by the biolistic technique. which involves using high velocity microprojectiles carrying microparticles containing the transformation construct to propel the microparticles into a plant cell, protoplast, or tissue. The high velocity microprojectile penetrates the outer cell surface without destroying the cell and injects the microparticles into the cells. The transformation construct in the microparticles is thereafter released and incorporated into the cell genome. This technique is also known as particle bombardment and is disclosed in U.S. Patent Nos. 4,945.050, 5,036,006, and x.100,792, which are hereby incorporated by reference. The key advantage of this technique is that it works on virtually any plant tissue. An example of successful transformation of soybean using this particle bombardment technique is demonstrated in McCabe et al. (1988) Biotechnology 6:923-926.
In yet another method of transformation, protoplasts are transfected directly with expression vector DNA that contains the nematode-resistance gene by electroporation or DNA-protoplast co-precipitation in accordance with procedures generally known in the art. See Christou et al. (1987) Proc. Natl. Acad. Sci.
USA
84:3962-3966; Lin et al. (1987) Plant Physiol. 84:856-861.
Once the transformation construct containing the isolated DNA sequences of this invention has been delivered, protoplasts, cells, or tissues expressing the protein encoded by the isolated nematode resistance gene are selected. Selection can be based on the selectable marker that is incorporated in the transformation construct or by culturing the protoplasts, cells, or tissues in media containing one of the antibiotics or herbicides. Alternatively, nematode-resistance may be directly selected by inoculating nematodes into the transformed protoplasts, cells, or tissues.
Both methods of selection are generally known in the art.
A further aspect of the present invention relates to the regeneration of transgenic plants that express nematode resistance genes of the invention. The cells that have been transformed and selected for expression of the sequence of this invention may be grown into plants in accordance with conventional ways. See.
for example, McCormick et al. (1986) Plant Cell Reports 5:81-84. These plants may then be grown. and either pollinated with the same transformed strain or different strains, the resulting hybrid having the desired genetic traits necessary for nematode-resistance.
For example. in soybean, transgenic soybean regeneration has been successful 62451-860(S) from tissues such as nodal axillary buds transformed with elec~roporation-mediated gene transfer technique (Chowrira et al. (1996) Mol. Biotechnol. 5:85-96);
somatic embryos transformed using microprojectile bombardment (Stewart et al. (1996) Plafzt Physiol. 112:121-129); and cotyledon explants that are tom apart at the cotyledonary node and are uansformed by Agrobacterium inoculation (LJ.S. Patent No.
5,569.834 issued to Hinchee et al.). Other methods for regenerating soybean plants are disclosed in U.S. Patent No. 4,684,612 issued to Hemphill et al., and U.S.
Patent No.
4,992.375 issued to Wright.
The sequences of the invention are generally introduced into plants wherein the plant in its native state does not contain the DNA sequences. However. it is recognized that in some plants the gene may occur but does not confer resistance because of aberrant expression, a mutation in the sequence, a nonfunctional protein.
and the like. It will be beneficial to transform such plants with the sequences of the invention.
Using cells and tissues of the present invention that are resistant to nematodes helps to obviate the problem of nematode infection of the host cells and tissues in the culture. In addition, the cells and tissues according to the present invention can also be valuable in the elucidation of the mechanism underlying the plant resistance to pathogens. Such plants include maize, oats, wheat. rice, barley. sorghum, alfalfa.
tobacco, cotton. sugar beet, sunflower, carrot, canola, tomato, potato, oilseed rape.
cabbage, pepper. lettuce, brassicas. tobacco, and soybean.
It is recognized that resistance to nematodes may be multigenic and quantitative in certain plants. Thus, the sequences disclosed herein may be useful alone or in combination with other sequences. Breeding programs have produced mane genotypes that have varying numbers of the genes responsible for nematode resistance.
Thus, the isolated DNA sequences of the invention are preferably used to transform plants expressing one or more other nematode resistance genes. Such plants may be naturally occurring, produced by breeding programs, or produced b~~
transformation with other nematode resistance genes. The result of the transformation with the isolated nematode resistance gene of this invention improves the plants capacity for nematode resistance.
Cotransformation may be conducted to introduce the DNA sequences of this invention into plants together with one or more other nematode resistance genes. In the transformation construct, the other known nematode resistance genes may be contained on the same plasmid as the DNA sequence of this invention or may be contained on a separate plasmid or DNA molecule. The methods for making transformation constructs having the other known nematode resistance gene with or without a DNA sequence isolated in this invention are similar to the methods described above and should be apparent to a person skilled in the art.
Several methods of cotransformation of plants have been developed.
Cotransformation is easily accomplished by DNA mediated processes, such as the co-precipitation method, biolistic method. and electroporation. Each of these methods is adequately suited for the introduction of the DNA sequences of this invention and other nematode resistance genes, on the same or separate plasmids, into the plant cells. Alternatively, Agrobacterium tumefaciens-mediated cotransformation techniques can be employed. Examples of such techniques can be found in, for example, Depicker et al. (1985) Mol. Gen. Genet. 201:477-484; McKnight et al.
(1987) Plant Mol. Biol. 8:439-445; De Block et al. (1991) Theor. Appl. Genet.
8?:257-263; de Framond et al. (1986) Mol. Gerz Genet. 202:125-131; and Komari et al. ( 1996) The Plant Journal 10: I 65-174. In an alternative method, multiple transgenes may be brought together by breeding of separately transformed parent plants.
The following examples are offered by way of illustration and not by wav of limitation.
EXAMPLES
Example 1: Incorporation of DNA Sequences Conferring Nematode Resistance into Expression Vectors Genomic DNA sequences spanning the full length coding regions of gene fragments conferring nematode-resistance to maize and soybean were isolated and cloned. These sequences are set forth in SEQ ID NOs: 1 and 3 (maize) and 5 and (soybean). Plasmids containing these sequences have been deposited with American Type Culture Collection (ATCC) on October 1 S, 1997, and on February 4, 1998 and are assigned Accession Numbers 209366, 209365, 209614, 209363, and 209364.
Gene fragments are cloned into a plasmid vector, such as that shown in Figure 6, in the sense orientation so that they are under the transcriptional control of a constitutive promoter. The transformation construct is then available for introduction into soybean cells by bombardment methods as described in Example 2.
Example 2: Transformation of Soybean Cells and Regeneration of Transgenic Plants Having Improved Nematode Resistance Initiation and Maicitenance of EmbryoQenic Suspension Cultures Embryogenic suspension cultures of soybean (Glycine max Merrill) are initiated and maintained in a 10A40N medium supplemented with 5 mM asparagine as described previously (Finer and Nagasowa (1988) Plant Cell Tissue Org.
Culi.
15:125-136). For subculture, two clumps of embryogenic tissue, 4 mm in diameter, are transferred to 35 ml of 10A40N medium in a 125-ml delong flask. High quality embryogenic material is selectively subcultured monthly at this low inoculum density.
Preparation of DNA and Tungsten Pellets Plasmid DNA from Example 1 is precipitated onto 1.1 ~tm (average diameter) tungsten pellets using a CaCl2 precipitation procedure (Finer and McMullen (1990) Plant Cell Rep. 8:586-589). The pellet mixture containing the precipitated DNA
is gently resuspended after precipitation, and 2 ~1 is removed for bombardment.
errneTr~rr~ cmr~r~r .v WO 99/60141 PCT/US98l27456 Preparation of Plant Tissue for Bombardment Approximately 1 g of embryogenic suspension culture tissue (taken 3 weeks after subculture) is transferred to a 3.5-cm-diameter petri dish. The tissue is centered in the dish, the excess liquid is removed with a pipette, and a sterile 500 pm pore size nylon screen (Tetko Inc., Elmsford, New York) is placed over the embryonic tissue.
Open petri dishes are placed in a laminar-flow hood for 10 to 15 minutes to evaporate residual liquid medium from the tissue. The 3.5-cm-diameter petri dish is placed in the center of a 9-cm-diameter petri dish immediately before bombardment.
Bombardments are performed using a DuPont Biolistics Particle Delivery System (model BPG). Each sample of embryogenic soybean tissue is bombarded once.
Selection for Transgenic Clones Bombarded tissues are resuspended in the 1 OA40N maintenance medium.
One to two weeks after bombardment the clumps of embryogenic tissue are resuspended in fresh 10A40N medium containing a selection agent, such as kanomycin or hygromycin. The selection agent is filter-sterilized before addition to liquid media. The medium containing a selection agent is replaced with fresh antibiotic-containing medium weekly for 3 additional weeks.
Six to eight weeks after the initial bombardment, brown clumps of tissue that contain yellow-green lobes of embryogenic tissue are removed and separately subcultured in 10A40N medium containing selection agent. After 3 to 4 months of maintenance in this medium, proliferating embryogenic tissues are maintained by standard subculture in 10A40N without added antibiotic. Embryogenic tissues are periodically removed from I OA40N medium containing selection agent and 10A40N
for embryo development and Southern hybridization analyses.
Embrvo Development and Germination For embryo development. clumps of kanamycin-resistant embryogenic tissues are placed at 23°C on the embryo development medium, which contains MS
salts (Murashige and Skoog (1962) Physiol. Plant 15:474-497), B5 vitamins (Gamborg et al. (1968) Exp. Cell. Res. 50:151-158). 6% maltose, and 0.2% gelrite (pH 5.7).
One WO 99/60141 PCT/US98I2745b month after plating, the developing embryos are cultured as individual embryos, 25 per 9-cm-diameter petri dish in fresh embryo development medium. After an additional 4 weeks, the mature embryos are placed in dry petri dishes for 2 to 3 days.
After the desiccation treatment, the embryos are transferred to a medium containing MS salts, BS vitamins, 3% sucrose, and 0.2% Gelrite (pH 5.7). After root and shoot elongation, plantlets are transferred to pots containing a 1:1:1 mixture of vermiculite, topsoil. and peat, and maintained under high humidity. Plantlets are gradually exposed to ambient humidity over a 2-week period and placed in the greenhouse.
where they are grown to maturity and monitored for expression of the nematode resistance gene.
DNA Extraction and~Southern Hybridization Analysis DNA is extracted from embryogenic tissue and leaves using the CTAB
procedure (Saghai-Maroof et al. (1984) Proc. Natl. Acad. Sci. USA 81:8014-8018).
Digested DNAs are electrophoresed on a 0.8% agarose gel. The DNA in the gels is 1 ~ treated with 0.2 N HCI, twice for 1 ~ minutes, followed with 0.5 M
NaOH/0.1 M 1.~
M NaCI, twice for 30 minutes, and finally 1 M NHaCzH302/0.1 M NaOH, for 40 minutes. The DNA is transferred (Vollrath et al. ( 1988) Proc. Natl. Acad.
Sci. USA
8:6027-6031 ) to nylon membranes (Zetaprobe-BioRad, Richmond, California) overnight by capillary transfer using l M NH4C~H30z/0.1 M NaOH. The membranes are baked at 80°C for 2 hours under vacuum and then prehybridized for 4 to 6 hours at 6~°C in SO mM Tris pH 8.0, Sx standard saline citrate (SSC). 2x Denhardt's. 10 mM
Na~EDTA, 0.2% sodium dodecyl sulfate (SDS). and 62.~ ~tg/ml salmon sperm DNA.
All publications and patent applications mentioned in the specification are indicative of the level of those skilled in the art to which this invention pertains. All publications and patent applications are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.
Although the foregoing invention has been described in some detail by way of illusuation and example for purposes of clarity of understanding. it will be obvious that certain changes and modifications may be practiced within the scope of the appended claims.
SEQUENCE LISTING
<110> Jessen, Holly J.
Meyer, Terry E.
<120> Genes and Methods for Control of Nematodes in Plants <130> 5718-18-1, 035718/171690 <140> PCT/US/98/27456 <141> 1998-12-23 <160> 10 <170> PatentIn Ver. 2.0 < 210 > 1 <211> 1347 <212> DNA
<213> Zea mays <220>
2 0 <221> CDS
<222> (146)..(991) <400> 1 ccacgcgtcc gcggacgcgt gggtgcccgg gagcgccgcc gcggtcgtgt gccaggtcag 60 cgaggccagc ctgctcccgc gcctcgccgc gtgggacaag tccgagacgc tcgcggccaa 120 gatcatgtac gccatcgaga gccag atg cag ggc tgc gcc ttc acg ctc gga 172 Met Gln Gly Cys Ala Phe Thr Leu Gly ctc ggc gag ccc aac ctc gcc ggc aag ccc gtg ctc gag tac gac cgc 220 3 0 Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro Val Leu Glu Tyr Asp Arg gtc gtg cgc ccg cac gag ctg cac gcg ctc aag ccc aag cca gcg ccg 268 Val Val Arg Pro His Glu Leu His Ala Leu Lys Pro Lys Pro Ala Pro gag ccc aag tct ggg tac ctc aac agg gag aac gag acg ctg ttc acc 316 Glu Pro Lys Ser Gly Tyr Leu Asn Arg Glu Asn Glu Thr Leu Phe Thr atg tac cag ata ctc gaa tcg tgg ctg cgc gcc gcg tcg caa ctc ctc 364 Met Tyr Gln Ile Leu Glu Ser Trp Leu Arg Ala Ala Ser Gln Leu Leu gcc cgc ctc aac gaa cgg atc gaa gcc aag aac tgg gaa gcg gcg get 412 Ala Arg Leu Asn Glu Arg Ile Glu Ala Lys Asn Trp Glu Ala Ala Ala gcc gac tgc tgg atc ctg gag cgc gtg tgg aag ctg ctc gcc gac gtc 460 Ala Asp Cys Trp Ile Leu Glu Arg Val Trp Lys Leu Leu Ala Asp Val gag gac ctc cac ctg ctg atg gac ccg gac gac ttc ctg cgg ctc aag 508 Glu Asp Leu His Leu Leu Met Asp Pro Asp Asp Phe Leu Arg Leu Lys ggc cag ctc get gta cga gcg get cca tgg tct gac gcg tcg ttc tgt 556 Gly Gln Leu Ala Val Arg Ala Ala Pro Trp Ser Asp Ala Ser Phe Cys ttc cgg tcc agg gcg ctc ctg cac gtc get aac acc act agg gac ctc 604 Phe Arg Ser Arg Ala Leu Leu His Val Ala Asn Thr Thr Arg Asp Leu aag aag cgt gtg ccc tgg gtg ctc ggt gtc gag gtg gac ccc aac ggc 652 Lys Lys Arg Val Pro Trp Val Leu Gly Val Glu Val Asp Pro Asn Gly ggc ccg cgg gtg cag gag gca gcc atg atg ctg tac cac agc cgt agg 700 Gly Pro Arg Val Gln Glu Ala Ala Met Met Leu Tyr His Ser Arg Arg cgc ggc gag ggc gag gag gcg ggc aag gtg gag ctg ctc cag gcc ttc 748 Arg Gly Glu Gly Glu Glu Ala Gly Lys Val Glu Leu Leu Gln Ala Phe caa gca gtg gag gtg gcc gtg aga gga ttc ttc ttc gcg tac cgg cag 796 2 0 Gln Ala Val Glu Val Ala Val Arg Gly Phe Phe Phe Ala Tyr Arg Gln 205 , 210 _ 215 ctc gtg gcg gcg gtg atg ggc acg gcg gag gcg ttg ggc aac cgg gcg 844 Leu Val Ala Ala Val Met Gly Thr Ala Glu Ala Leu Gly Asn Arg Ala ctg ttc gtg ccg gcg gag ggg atg gat cca ttg gcc cag atg ttc ctc 892 Leu Phe Val Pro Ala Glu Gly Met Asp Pro Leu Ala Gln Met Phe Leu gag cca ccc tac tac ccc agc ctg gat gcc gcc aag acg ttc cta gcg 940 Glu Pro Pro Tyr Tyr Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Ala gat tac tgg gtt cag cag atg gcg ggg gcc tct get ccg tca ata caa 988 Asp Tyr Trp Val Gln Gln Met Ala Gly Ala Ser Ala Pro Ser Ile Gln agc tgaaacggcg aaatggcgcg gctggatagc gaccgaatcg cgcagttttg 1041 Ser cagcctgaag atactatgta tgcatgcatc gtaatttcgc tgtggccttg tggtgataga 1101 gtgattcatt tctatagcga tcctgtacta gtgtagtaca tgtagcacta aattgtctta 1161 ttatcgttgt gcttgtgcac tgcgttgtgt tgtgttctac atagagattg attcagttag 1221 atgccatttg tcactctagg caagtgtttc aattgggcac cgtgtatata tagaactttt 1281 gtaaacactg gtagatggat tcatcaatta cagaatgttg atgttgacaa aaaaaaaaaa 1341 aaaaaa 1347 <210> 2 5 0 <211> 282 <212> PRT
<213> Zea mays <400> 2 Met Gln Gly Cys Ala Phe Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro Val Leu Glu Tyr Asp Arg Val Val Arg Pro His Glu Leu 60 His Ala Leu Lys Pro Lys Pro Ala Pro Glu Pro Lys Ser Gly Tyr Leu Asn Arg Glu Asn Glu Thr Leu Phe Thr Met Tyr Gln Ile Leu Glu Ser Trp Leu Arg Ala Ala Ser Gln Leu Leu Ala Arg Leu Asn Glu Arg Ile Glu Ala Lys Asn Trp Glu Ala Ala Ala Ala Asp Cys Trp Ile Leu Glu Arg Val Trp Lys Leu Leu Ala Asp Val Glu Asp Leu His Leu Leu Met Asp Pro Asp Asp Phe Leu Arg Leu Lys Gly Gln Leu Ala Val Arg Ala Ala Pro Trp Ser Asp Ala Ser Phe Cys Phe Arg Ser Arg Ala Leu Leu His Val Ala Asn Thr Thr Arg Asp Leu Lys Lys Arg Val Pro Trp Val Leu Gly Val Glu Val Asp Pro Asn Gly Gly Pro Arg Val Gln Glu Ala 165 ~ 170 175 Ala Met Met Leu Tyr His Ser Arg Arg Arg Gly Glu Gly Glu Glu Ala 3 0 Gly Lys Val Glu Leu Leu Gln Ala Phe Gln Ala Val Glu Val Ala Val Arg Gly Phe Phe Phe Ala Tyr Arg Gln Leu Val Ala Ala Val Met Gly Thr Ala Glu Ala Leu Gly Asn Arg Ala Leu Phe Val Pro Ala Glu Gly Met Asp Pro Leu Ala Gln Met Phe Leu Glu Pro Pro Tyr Tyr Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Ala Asp Tyr Trp Val Gln Gln Met Ala Gly Ala Ser Ala Pro Ser Ile Gln Ser <210> 3 <211> 1325 50 <212> DNA
<213> Zea mays <220>
<221> CDS
<222> (126)..(980) <400> 3 ccacgcgtcc gagcgccgcc gcggtcgtgt gccgggccag caaggccagc ctgctcccgc 60 gcctcgccgc gtgggagaag tctgaggcgc tcgcggccag gatcacgtac gccgtcgagg 120 gccag atg cag ggc tgc gcc tcc acg ctc ggc ctc ggc gag ccc aac ctc 170 Met Gln Gly Cys Ala Ser Thr Leu Gly Leu Gly Glu Pro Asn Leu gccggcaagccc gtgctcgag tacgaccgc gtcgtgcgc ccgcacgag 218 AlaGlyLysPro ValLeuGlu TyrAspArg ValValArg ProHisGlu ctgcacgcgctg aagcccgac cctgcgccg gagcccatg tccggctac 266 LeuHisAlaLeu LysProAsp ProAlaPro GluProMet SerGlyTyr cgcaaccgggag ctcgagact ctgttcacc atgtaccag atactcgag 314 ArgAsnArgGlu LeuGluThr LeuPheThr MetTyrGln IleLeuGlu tcctggctccgc gtcgcgtcg cagctgctc acccgcctc gacgagcgg 362 SerTrpLeuArg ValAlaSer GlnLeuLeu ThrArgLeu AspGluArg atc gaa gac aag tgc tgg gag gcg gcg gcc ggc gac tgc tgg atc ctg 410 2 0 Ile Glu Asp Lys Cys Trp Glu Ala Ala Ala Gly Asp Cys Trp Ile Leu 80 ~ 85 90 95 gag cgc gtg tgg aag ctg ctc gcg gac gtc gag gac ctc cac ctg ctg 458 Glu Arg Val Trp Lys Leu Leu Ala Asp Val Glu Asp Leu His Leu Leu atg gac ccg gac gag ttc cta cgg ctc aag agc cag ctc gcc gta cga 506 Met Asp Pro Asp Glu Phe Leu Arg Leu Lys Ser Gln Leu Ala Val Arg gcg gcg ccg ggg tct gag tcc gcg tcc ttc tgt ttc cgg tcc acg gcg 554 Ala Ala Pro Gly Ser Glu Ser Ala Ser Phe Cys Phe Arg Ser Thr Ala ctc ctg cac gtc get agc gcc act agg gac ctc aag aag cgt gtg ccc 602 Leu Leu His Val Ala Ser Ala Thr Arg Asp Leu Lys Lys Arg Val Pro tgg gtg ctc ggt gtc gag gcg gac ccc agc ggc ggc cca cgg gtg cag 650 4 0 Trp Val Leu Gly Val Glu Ala Asp Pro Ser Gly Gly Pro Arg Val Gln gag gcg gcc atg aag ctg tac cac agc cgt agg cgc ggt gag ggc gag 698 Glu Ala Ala Met Lys Leu Tyr His Ser Arg Arg Arg Gly Glu Gly Glu gag gca ggc aag gtg gac ctg ctc cag gcc ttc cag gcg gtg gag gtg 746 Glu Ala Gly Lys Val Asp Leu Leu Gln Ala Phe Gln Ala Val Glu Val gcc gtg aga gca ttc ttc ttc ggg tac cgg cag ctg gtg gcg gcg gtc 794 Ala Val Arg Ala Phe Phe Phe Gly Tyr Arg Gln Leu Val Ala Ala Val atg ggc acg gcg gag gcg tcg ggc aac cgg gcg ctg ttc gtg ccg gcg 842 Met Gly Thr Ala Glu Ala Ser Gly Asn Arg Ala Leu Phe Val Pro Ala gag gag atg gat ccg ctc gcc caa atg ttc ctg gag ccg cca tac tac 890 Glu Glu Met Asp Pro Leu Ala Gln Met Phe Leu Glu Pro Pro Tyr Tyr cct agc ctg gac gcc gcc aag acg ttt cta gcg gat tac tgg gtt cag 938 Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Ala Asp Tyr Trp Val Gln ctt cag cag atg gcg gag gcc tct get ccg tca aga caa agc 980 Leu Gln Gln Met Ala Glu Ala Ser Ala Pro Ser Arg Gln Ser tgaaacggcg aaatggcacg gctgagccac cgaatcgcgc agttttgcag gactgaagat 1040 actatgcatg catttcgttg gggccttttg cccttgtggt gaatggtgat agagtgattc 1100 atttctatag cgatcatgta ctattgcagt acatgtcgca ctagaatact agattctctt 1160 actatcgttg tgcactgcgt tgtacgtgtt gtgttctacg tagatataga ttgattcagt 1220 tagatgtcat ttgtattgcc aagtaggtca attggatatg gaacttttgt aaataccgaa 1280 atactgttgt tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 1325 <210> 4 <211> 285 2 0 <212> PRT
<213> Zea mat's - -<400> 4 Met Gln Gly Cys Ala Ser Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro Val Leu Glu Tyr Asp Arg Val Val Arg Pro His Glu Leu His Ala Leu Lys Pro Asp Pro Ala Pro Glu Pro Met Ser Gly Tyr Arg Asn Arg Glu Leu Glu Thr Leu Phe Thr Met Tyr Gln Ile Leu Glu Ser Trp Leu Arg Val Ala Ser Gln Leu Leu Thr Arg Leu Asp Glu Arg Ile Glu Asp Lys Cys Trp Glu Ala Ala Ala Gly Asp Cys Trp Ile Leu Glu Arg Val Trp Lys Leu heu Ala Asp Val Glu Asp Leu His Leu Leu Met Asp Pro Asp Glu Phe Leu Arg Leu Lys Ser Gln Leu Ala Val Arg Ala Ala Pro Gly Ser Glu Ser Ala Ser Phe Cys Phe Arg Ser Thr Ala Leu 50 Leu His Val Ala Ser Ala Thr Arg Asp Leu Lys Lys Arg Val Pro Trp Val Leu Gly Val Glu Ala Asp Pro Ser Gly Gly Pro Arg Val Gln Glu Ala Ala Met Lys Leu Tyr His Ser Arg Arg Arg Gly Glu Gly Glu Glu Ala Gly Lys Val Asp Leu Leu Gln Ala Phe Gln Ala Val Glu Val Ala Val Arg Ala Phe Phe Phe Gly Tyr Arg Gln Leu Val Ala Ala Val Met Gly Thr Ala Glu Ala Ser Gly Asn Arg Ala Leu Phe Val Pro Ala Glu Glu Met Asp Pro Leu Ala Gln Met Phe Leu Glu Pro Pro Tyr Tyr Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Ala Asp Tyr Trp Val Gln Leu Gln Gln Met Ala Glu Ala Ser Ala Pro Ser Arg Gln Ser <210> 5 <211> 1498 2 0 <212> DNA
<213> Glycine max <220>
<221> CDS
<222> (69)..(1433) <220>
<223> Immediate source: Clone- P12568 <400> 5 cgacaccaat ttctccatcc tctcattgaa aaacaaaatt aatcatctta cttatttatt 60 ctccgaaa atg gtt gat tta cat tgg aaa tca aag atg cca agt tcc gac 110 3 0 Met Val Asp Leu His Trp Lys Ser Lys Met Pro Ser Ser Asp atg cct tcc aaa act cta aaa ctc tct ctc tcc gac aac aag tcc tta 158 Met Pro Ser Lys Thr Leu Lys Leu Ser Leu Ser Asp Asn Lys Ser Leu ccc tct ttg caa cta ccc ttc cgc acc aca gat atc tct cac gcc gca 206 Pro Ser Leu Gln Leu Pro Phe Arg Thr Thr Asp Ile Ser His Ala Ala cct tct gtt tgc gcc act tac gac tac tat ctc cgt ctt cct caa ctc 254 Pro Ser Val Cys Ala Thr Tyr Asp Tyr Tyr Leu Arg Leu Pro Gln Leu aga aag ctt tgg aac tcc tca gat ttt cct aat tgg aac aac gaa cca 302 Arg Lys Leu Trp Asn Ser Ser Asp Phe Pro Asn Trp Asn Asn Glu Pro atc tta aaa cct atc ttg caa get ctc gaa atc acc ttc cgc ttt ctc 350 50 Ile Leu Lys Pro Ile Leu Gln Ala Leu Glu Ile Thr Phe Arg Phe Leu tcc att gtt ctc tcc gat cca aga cct tac tcc aac cac aga gaa tgg 398 Ser Ile Val Leu Ser Asp Pro Arg Pro Tyr Ser Asn His Arg Glu Trp act cgc agg ata gag tct ctt atc aca cat caa att gaa atc att gcc 446 Thr Arg Arg Ile Glu Ser Leu Ile Thr His Gln Ile Glu Ile Ile Ala atactttgtgaa gatgag gaacaaaat tccgacacacgt ggcact gca 494 IleLeuCysGlu AspGlu GluGlnAsn SerAspThrArg GlyThr Ala ccaaccgetgat ctcagc aggaacaat agcagcgagagc agaagc tac 542 ProThrAlaAsp LeuSer ArgAsnAsn SerSerGluSer ArgSer Tyr agcgaggcaagc ctgctt ccgcggctt gccacgtggtac aaatcc aag 590 SerGluAlaSer LeuLeu ProArgLeu AlaThrTrpTyr LysSer Lys gacgtagcgcag aggatc cttctctca gttgaatgccaa atgagg agg 638 AspValAlaGln ArgIle LeuLeuSer ValGluCysGln MetArg Arg tgt tcc tac acg ctg ggt ttg ggt gag ccg aac cta gcg ggc aaa ccg 686 2 0 Cys Ser Tyr Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro 195 . 200 . 205 agc ctg ctc tac gac ctc gtg tgt aag ccg aac gag atc cac gcg ctg 734 Ser Leu Leu Tyr Asp Leu Val Cys Lys Pro Asn Glu Ile His Ala Leu aag acg acg ccg tac gat gag cgc gta gag aat cac gag aac cac gcg 782 Lys Thr Thr Pro Tyr Asp Glu Arg Val Glu Asn His Glu Asn His Ala ttg cac gcg acg cac cag atc gcc gag tcg tgg atc cac gcg tcg cgg 830 Leu His Ala Thr His Gln Ile Ala Glu Ser Trp Ile His Ala Ser Arg aag gtt cta gag agg atc gca gac gcg gtg ctc tcc aga acc ttc gag 878 Lys Val Leu Glu Arg Ile Ala Asp Ala Val Leu Ser Arg Thr Phe Glu aag gcg get gag gac tgc tac gcc gtg gaa agg atc tgg aag ctt ctc 926 4 0 Lys Ala Ala Glu Asp Cys Tyr Ala Val Glu Arg Ile Trp Lys Leu Leu gcg gag gtg gag gac ctc cac ctg atg atg gat ccg gac gat ttc ttg 974 Ala Glu Val Glu Asp Leu His Leu Met Met Asp Pro Asp Asp Phe Leu aga ctg aag aat cag ctc tcg gtg aaa tcc tcc ggc ggc gaa acg get 1022 Arg Leu Lys Asn Gln Leu Ser Val Lys Ser Ser Gly Gly Glu Thr Ala tcg ttc tgc ttc agg tcg aag gag ttg gtt gaa ctg acg aag atg tgc 1070 Ser Phe Cys Phe Arg Ser Lys Glu Leu Val Glu Leu Thr Lys Met Cys aga gat ctg agg cac aag gtg ccg gag ata ttg gag gtg gag gtg gat 1118 Arg Asp Leu Arg His Lys Val Pro Glu Ile Leu Glu Val Glu Val Asp ccg aag gga gga ccg agg att caa gag gcg gcg atg aag ctc tac gtt 1166 Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala Ala Met Lys Leu Tyr Val tcg aag agc gcg ttc gag aag gtt cac ttg ttg cag gcg atg cag gcg 1214 Ser Lys Ser Ala Phe Glu Lys Val His Leu Leu Gln Ala Met Gln Ala att gag gcg gcg atg aag aga ttc ttc tac gcg tat aag cag gtg ttg 1262 Ile Glu Ala Ala Met Lys Arg Phe Phe Tyr Ala Tyr Lys Gln Val Leu gcg gtg gtg atg gga agc tcc gag get aac ggt aac cga gtt ggg ttg 1310 Ala Val Val Met Gly Ser Ser Glu Ala Asn Gly Asn Arg Val Gly Leu agt tgc gac tcg get gac tcg ttg act cag att ttc ctt gaa ccg acg 1358 Ser Cys Asp Ser Ala Asp Ser Leu Thr Gln Ile Phe Leu Glu Pro Thr tat ttt cca agc ttg gat gcc gcc aag act ttt ctt gga tac ttg tgg 1406 2 0 Tyr Phe Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Gly Tyr Leu Trp 435 440 ~ 445 gat aat aac gat aat aac aaa tgg ata tgataaggga aaaaaaaaaa 1453 Asp Asn Asn Asp Asn Asn Lys Trp Ile acggcacaaa aacgatggcc aaagtgagat tttcggtttg ggcac 1498 <210> 6 3 0 <211> 455 <212> PRT
<213> Glycine max <400> 6 Met Val Asp Leu His Trp Lys Ser Lys Met Pro Ser Ser Asp Met Pro Ser Lys Thr Leu Lys Leu Ser Leu Ser Asp Asn Lys Ser Leu Pro Ser 4 0 Leu Gln Leu Pro Phe Arg Thr Thr Asp Ile Ser His Ala Ala Pro Ser Val Cys Ala Thr Tyr Asp Tyr Tyr Leu Arg Leu Pro Gln Leu Arg Lys Leu Trp Asn Ser Ser Asp Phe Pro Asn Trp Asn Asn Glu Pro Ile Leu Lys Pro Ile Leu Gln Ala Leu Glu Ile Thr Phe Arg Phe Leu Ser Ile Val Leu Ser Asp Pro Arg Pro Tyr Ser Asn His Arg Glu Trp Thr Arg Arg Ile Glu Ser Leu Ile Thr His Gln Ile Glu Ile Ile Ala Ile Leu Cys Glu Asp Glu Glu Gln Asn Ser Asp Thr Arg Gly Thr Ala Pro Thr Ala Asp Leu Ser Arg Asn Asn Ser Ser Glu Ser Arg Ser Tyr Ser Glu x ,. _M
r Ala Ser Leu Leu Pro Arg Leu Ala Thr Trp Tyr Lys Ser Lys Asp Val Ala Gln Arg Ile Leu Leu Ser Val Glu Cys Gln Met Arg Arg Cys Ser Tyr Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro Ser Leu Leu Tyr Asp Leu Val Cys Lys Pro Asn Glu Ile His Ala Leu Lys Thr Thr Pro Tyr Asp Glu Arg Val Glu Asn His Glu Asn His Ala Leu His Ala Thr His Gln Ile Ala Glu Ser Trp Ile His Ala Ser Arg Lys Val Leu Glu Arg I1e Ala Asp Ala Val Leu Ser Arg Thr Phe Glu Lys Ala Ala Glu Asp Cys Tyr Ala Val Glu Arg Ile Trp Lys Leu Leu Ala Glu 275 . 280 285 Val Glu Asp Leu His Leu Met Met Asp Pro Asp Asp Phe Leu Arg Leu 3 0 Lys Asn Gln,Leu Ser Val Lys Ser Ser Gly Gly Glu Thr Ala Ser Phe Cys Phe Arg Ser Lys Glu Leu Val Glu Leu Thr Lys Met Cys Arg Asp Leu Arg His Lys Val Pro Glu Ile Leu Glu Val Glu Val Asp Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala Ala Met Lys Leu Tyr Val Ser Lys Ser Ala Phe Glu Lys Val His Leu Leu Gln Ala Met Gln Ala Ile Glu Ala Ala Met Lys Arg Phe Phe Tyr Ala Tyr Lys Gln Val Leu Ala Val Val Met Gly Ser Ser Glu Ala Asn Gly Asn Arg Val Gly Leu Ser Cys Asp Ser Ala Asp Ser Leu Thr Gln Ile Phe Leu Glu Pro Thr Tyr Phe Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Gly Tyr Leu Trp Asp Asn Asn Asp Asn Asn Lys Trp Ile <210> 7 <211> 1418 <212> DNA
<213> Glycine max <220>
<221> CDS
<222> (46)..(1398) <400> 7 caccaaacaa aaaaatcaat cattttattt tatttttcta cgaaa atg gtt gat tta 57 Met Val Asp Leu cat tgg aaa tca aag atg cct agt tcc aaa aca cca aaa ctc tct ctc 105 His Trp Lys Ser Lys Met Pro Ser Ser Lys Thr Pro Lys Leu Ser Leu tcc gac aac aag tcc tta ccc tct ttg caa cta ccc ttc cgc acc aca 153 Ser Asp Asn Lys Ser Leu Pro Ser Leu Gln Leu Pro Phe Arg Thr Thr 2 0 gat atc tct ccc gcc get cct tcc gtt tgc gcc get tac gac tac tat 201 Asp Ile Ser Pro Ala Ala Pro Ser Val Cys Ala Ala Tyr Asp Tyr Tyr ctc cgt ctt cct caa ctc aga aag ctt tgg aac tcc act gat ttt cct 249 Leu Arg Leu Pro Gln Leu Arg Lys Leu Trp Asn Ser Thr Asp Phe Pro aat tgg aac aac gaa ccg att cta aaa cca att ttg caa get ctc gaa 297 Asn Trp Asn Asn Glu Pro Ile Leu Lys Pro Ile Leu Gln Ala Leu Glu ata acg ttc cgc ttt ctt tcc att gtt ctc tcc gat ccc aga cct tac 345 Ile Thr Phe Arg Phe Leu Ser Ile Val Leu Ser Asp Pro Arg Pro Tyr tcc aac cac aga gaa tgg act cgc cgg ata gag tct ctc atc atg cat 393 Ser Asn His Arg Glu Trp Thr Arg Arg Ile Glu Ser Leu Ile Met His 4 0 caa att gaa atc att gcc ata ctt tgt gaa gaa gag gaa caa aat tcc 441 Gln Ile Glu Ile Ile Ala Ile Leu Cys Glu Glu Glu Glu Gln Asn Ser gac aca cgt ggc act gca cca acc get gat ctc agc agc agc aat agc 489 Asp Thr Arg Gly Thr Ala Pro Thr Ala Asp Leu Ser Ser Ser Asn Ser agc gtg agc aga agc tac agc gag gcg agc ctg ctt cct cgg ctt gcc 537 Ser Val Ser Arg Ser Tyr Ser Glu Ala Ser Leu Leu Pro Arg Leu Ala acg tgg tac aaa tcc agg gac gtg gcg cag agg atc ctt ctc tcc gtg 585 Thr Trp Tyr Lys Ser Arg Asp Val Ala Gln Arg Ile Leu Leu Ser Val gaa tgc caa atg agg agg tgc tcc tac acg ctt ggt ttg ggc gag ccg 633 Glu Cys Gln Met Arg Arg Cys Ser Tyr Thr Leu Gly Leu Gly Glu Pro 60 aac cta gcg ggg aag ccg agc ctg ctc tac gac ctc gtg tgc aag ccg 681 Asn Leu Ala Gly Lys Pro Ser Leu Leu Tyr Asp Leu Val Cys Lys Pro aatgagatc cacgcgctg aagacg acgccgtacgac gagcgcgtg gag 729 AsnGluIle HisAlaLeu LysThr ThrProTyrAsp GluArgVal Glu aaccacgag aaccacgcg gtgcac gccacgcaccag atcgcggag tcg 777 AsnHisGlu AsnHisAla ValHis AlaThrHisGln IleAlaGlu Ser tggattcac gcgtcgcgg aaggtt ctggagagaatc gcggacgcg gtg 825 TrpIleHis AlaSerArg LysVal LeuGluArgIle AlaAspAla Val ctctccaga accttcctg aaagca gcagaggactgc tacgccgtg gag 873 LeuSerArg ThrPheLeu LysAla AlaGluAspCys TyrAlaVal Glu agg atc tgg aag ctt ctc gcg gag gtg gag gac ctc cac ctg atg atg 921 2 0 Arg Ile Trp Lys Leu Leu Ala Glu Val Glu Asp Leu His Leu Met Met 280 285 . 290 gat ccg gac gat ttc ttg agg cta aag aat caa ctc tcg gtg aaa tcc 969 Asp Pro Asp Asp Phe Leu Arg Leu Lys Asn Gln Leu Ser Val Lys Ser tcg agc ggc gaa acg gca tcg ttc tgc ttc aga tcg aat gag tta gtg 1017 Ser Ser Gly Glu Thr Ala Ser Phe Cys Phe Arg Ser Asn Glu Leu Val gaa ctg acg aag atg tgc aga gat ctg agg cac aag gtg ccg gag ata 1065 Glu Leu Thr Lys Met Cys Arg Asp Leu Arg His Lys Val Pro Glu Ile ttg gag gtg gag gtg gat ccg aag gga gga ccg agg att caa gag gcg 1113 Leu Glu Val Glu Val Asp Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala gcg atg aag ctc tac gtt tcg aag agc gag ttc gag aag gtt cac ttg 1161 4 0 Ala Met Lys Leu Tyr Val Ser Lys Ser Glu Phe Glu Lys Val His Leu ttg cag gcg atg cag gcg att gag gcg gcg atg aag aga ttc ttc tac 1209 Leu Gln Ala Met Gln Ala Ile Glu Ala Ala Met Lys Arg Phe Phe Tyr gcg tat aag cag gtg ttg gcg gtg gtg atg gga agt tca gag get aac 1257 Ala Tyr Lys Gln Val Leu Ala Val Val Met Gly Ser Ser Glu Ala Asn ggt aac cga gtt ggg ttg agt tgc gac tcg get gac tcg ttg act cag 1305 Gly Asn Arg Val Gly Leu Ser Cys Asp Ser Ala Asp Ser Leu Thr Gln att ttc ctt gaa ccg acg tat ttt cca agc ttg gat gcc gcc aag act 1353 Ile Phe Leu Glu.Pro Thr Tyr Phe Pro Ser Leu Asp Ala Ala Lys Thr ttt ctt gga tac ctg tgg gat aat aac gat aat aac aaa tgg ata 1398 Phe Leu Gly Tyr Leu Trp Asp Asn Asn Asp Asn Asn Lys Trp Ile tgaaaacgaa aaaaaaaaaa 1418 <210> 8 <211> 451 <212> PRT
<213> Glycine max <400> 8 Met Val Asp Leu His Trp Lys Ser Lys Met Pro Ser Ser Lys Thr Pro Lys Leu Ser Leu Ser Asp Asn Lys Ser Leu Pro Ser Leu Gln Leu Pro Phe Arg Thr Thr Asp Ile Ser Pro Ala Ala Pro Ser Val Cys Ala Ala Tyr Asp Tyr Tyr Leu Arg Leu Pro Gln Leu Arg Lys Leu Trp Asn Ser Thr Asp PheProAsn TrpAsn AsnGluPro IleLeuLysPro IleLeu 65 70 . 75 80 Gln Ala LeuGluIle ThrPhe ArgPheLeu SerIleValLeu SerAsp Pro Arg ProTyrSer AsnHis ArgGluTrp ThrArgArgIle GluSer Leu Ile MetHisGln IleGlu IleIleAla IleLeuCysGlu GluGlu Glu Gln AsnSerAsp ThrArg GlyThrAla ProThrAlaAsp LeuSer Ser Ser AsnSerSer ValSer ArgSerTyr SerGluAlaSer LeuLeu 4 Pro Arg LeuAlaThr TrpTyr LysSerArg AspValAlaGln ArgIle Leu Leu SerValGlu CysGln MetArgArg CysSerTyrThr LeuGly Leu Gly GluProAsn LeuAla GlyLysPro SerLeuLeuTyr AspLeu Val Cys LysProAsn GluIle HisAlaLeu LysThrThrPro TyrAsp Glu Arg ValGluAsn HisGlu AsnHisAla ValHisAlaThr HisGln Ile Ala GluSerTrp IleHis AlaSerArg LysValLeuGlu ArgIle Ala Asp AlaValLeu SerArg ThrPheLeu LysAlaAlaGlu AspCys Tyr Ala ValGluArg IleTrp LysLeuLeu AlaGluValGlu AspLeu His Leu Met Met Asp Pro Asp Asp Phe Leu Arg Leu Lys Asn Gln Leu Ser Val Lys Ser Ser Ser Gly Glu Thr Ala Ser Phe Cys Phe Arg Ser Asn Glu Leu Val Glu Leu Thr Lys Met Cys Arg Asp Leu Arg His Lys Val Pro Glu Ile Leu Glu Val Glu Val Asp Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala Ala Met Lys Leu Tyr Val Ser Lys Ser Glu Phe Glu Lys Val His Leu Leu Gln Ala Met Gln Ala Ile Glu Ala Ala Met Lys Arg Phe Phe Tyr Ala Tyr Lys Gln Val Leu Ala Val Val Mgt Gly Ser Ser Glu Ala Asn Gly Asn Arg Val Gly Leu Ser Cys Asp Ser Ala Asp 405 410 ~ 415 Ser Leu Thr Gln Ile Phe Leu Glu Pro Thr Tyr Phe Pro Ser Leu Asp 3 0 Ala Ala Lys Thr Phe Leu Gly Tyr Leu Trp Asp Asn Asn Asp Asn Asn Lys Trp Ile <210> 9 <211> 1498 <212> DNA
<213> Glycine max 40 <220>
<221> CDS
<222> (69)..(1433) <400> 9 cgacaccaat ttctccatcc tctcattgaa aaacaaaatt aatcatctta tttatttatt 60 ctccgaaa atg gtt gat tta cat tgg aaa tca aag atg cca agt tcc gac 110 Met Val Asp Leu His Trp Lys Ser Lys Met Pro Ser Ser Asp atg cct tcc aaa act ctc aaa ctc tct ctc tcc gac aac aag tcc tta 158 50 Met Pro Ser Lys Thr Leu Lys Leu Ser Leu Ser Asp Asn Lys Ser Leu ccc tct ttg caa cta ccc ttc cgc acc aca gat atc tct cac gcc gca 206 Pro Ser Leu Gln Leu Pro Phe Arg Thr Thr Asp Ile Ser His Ala Ala cct tct gtt tgc gcc act tac gac tac tat ctc cgt ctt cct caa ctc 254 Pro Ser Val Cys Ala Thr Tyr Asp Tyr Tyr Leu Arg Leu Pro Gln Leu aga aag ctt tgg aac tcc tca gat ttt cct aat tgg aac aac gaa cca 302 Arg Lys Leu Trp Asn Ser Ser Asp Phe Pro Asn Trp Asn Asn Glu Pro atc tta aaa cct atc ttg caa get ctc gaa atc acc ttc cgc ttt ctc 350 Ile Leu Lys Pro Ile Leu Gln Ala Leu Glu Ile Thr Phe Arg Phe Leu tcc att gtt ctc tcc gat cca aga cct tac tcc aac cac aga gaa tgg 398 Ser Ile Val Leu Ser Asp Pro Arg Pro Tyr Ser Asn His Arg Glu Trp act cgc agg ata gag tct ctt atc aca cat caa att gaa atc att gcc 446 Thr Arg Arg Ile Glu Ser Leu Ile Thr His Gln Ile Glu Ile Ile Ala ata ctt tgt gaa gat gag gaa caa aat tcc gac aca cgt ggc act gca 494 2 0 Ile Leu Cys Glu Asp Glu Glu Gln Asn Ser Asp Thr Arg Gly Thr Ala 130 . 135 140 cca acc get gat ctc agc agg aac aat agc agc gag agc aga agc tac 542 Pro Thr Ala Asp Leu Ser Arg Asn Asn Ser Ser Glu Ser Arg Ser Tyr agc gag gca agc ctg ctt ccg cgg ctt gcc acg tgg tac aaa tcc aag 590 Ser Glu Ala Ser Leu Leu Pro Arg Leu Ala Thr Trp Tyr Lys Ser Lys gac gta gcg cag agg atc ctt ctc tca gtt gaa tgc caa atg agg agg 638 Asp Val Ala Gln Arg Ile Leu Leu Ser Val Glu Cys Gln Met Arg Arg tgt tcc tac acg ctg ggt ttg ggt gag ccg aac cta gcg ggc aaa ccg 686 Cys Ser Tyr Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro agc ctg ctc tac gac ctc gtg tgc aag ccg aac gag atc cac gcg ctg 734 4 0 Ser Leu Leu Tyr Asp Leu Val Cys Lys Pro Asn Glu Ile His Ala Leu aag acg acg ccg tac gat gag cgc gta gag aat cac gag aac cac gcg 782 Lys Thr Thr Pro Tyr Asp Glu Arg Val Glu Asn His Glu Asn His Ala ttg cac gcg acg cac cag atc gcc gag tcg tgg atc cac gcg tcg cgg 830 Leu His Ala Thr His Gln Ile Ala Glu Ser Trp Ile His Ala Ser Arg aag gtt cta gag agg atc gca gac gcg gtc ctc tcc aga acc ttc gag 878 Lys Val Leu Glu Arg Ile Ala Asp Ala Val Leu Ser Arg Thr Phe Glu aag gcg get gag gac tgc tac gcc gtg gaa agg atc tgg aag ctt ctc 926 Lys Ala Ala Glu Asp Cys Tyr Ala Val Glu Arg Ile Trp Lys Leu Leu gcg gag gtg gag gac ctc cac ctg atg atg gat ccg gac gat ttc ttg 974 Ala Glu Val Glu Asp Leu His Leu Met Met Asp Pro Asp Asp Phe Leu aga ctg aag aat cag ctc tcg gtg aaa tcc tcc ggc ggc gaa acg get 1022 Arg Leu Lys Asn Gln Leu Ser Val Lys Ser Ser Gly Gly Glu Thr Ala tcg ttc tgc ttc agg tcg aag gag ttg gtt gaa ctg acg aag atg tgc 1070 Ser Phe Cys Phe Arg Ser Lys Glu Leu Val Glu Leu Thr Lys Met Cys aga gat ctg agg cac aag gtg ccg gag ata ttg gag gtg gag gtg gat 1118 Arg Asp Leu Arg His Lys Val Pro Glu Ile Leu Glu Val Glu Val Asp ccg aag gga gga ccg agg att caa gag gcg gcg atg aag ctc tac gtt 1166 Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala Ala Met Lys Leu Tyr Val tcg aag agc gcg ttc gag aag gtt cac ttg ttg cag gcg atg cag gcg 1214 2 0 Ser Lys Ser Ala Phe Glu Lys Val His Leu Leu Gln Ala Met Gln Ala att gag gcg gcg atg aag aga ttc ttc tac gcg tat aag cag gtg ttg 1262 Ile Glu Ala Ala Met Lys Arg Phe Phe Tyr Ala Tyr Lys Gln Val Leu gcg gtg gtg atg gga agc tcc gag get aac ggt aac cga gtt ggg ttg 1310 Ala Val Val Met Gly Ser Ser Glu Ala Asn Gly Asn Arg Val Gly Leu agt tgc gac tcg cgt gac tcg ttg act cag att ttc ctt gaa ccg acg 1358 Ser Cys Asp Ser Arg Asp Ser Leu Thr Gln Ile Phe Leu Glu Pro Thr tat ttt cca agc ttg gat gcc gcc aag act ttt ctt gga tac ttg tgg 1406 Tyr Phe Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Gly Tyr Leu Trp gat aat aac gat aat aac aaa tgg ata tgataaggga aaaaaaaaaa 1453 4 0 Asp Asn Asn Asp Asn Asn Lys Trp Ile acggcacaaa aacgatggcc aaagtgagat tttcggtttg ggcac 1498 <210> 10 <211> 455 <212> PRT
<213> Glycine max <400> 10 SO Met Val Asp Leu His Trp Lys Ser Lys Met Pro Ser Ser Asp Met Pro Ser Lys Thr Leu Lys Leu Ser Leu Ser Asp Asn Lys Ser Leu Pro Ser Leu Gln Leu Pro Phe Arg Thr Thr Asp Ile Ser His Ala Ala Pro Ser Val Cys Ala Thr Tyr Asp Tyr Tyr Leu Arg Leu Pro Gln Leu Arg Lys Leu Trp Asn Ser Ser Asp Phe Pro Asn Trp Asn Asn Glu Pro Ile Leu Lys Pro Ile Leu Gln Ala Leu Glu Ile Thr Phe Arg Phe Leu Ser Ile Val Leu Ser Asp Pro Arg Pro Tyr Ser Asn His Arg Glu Trp Thr Arg Arg Ile Glu Ser Leu Ile Thr His Gln Ile Glu Ile Ile Ala Ile Leu Cys Glu Asp Glu Glu Gln Asn Ser Asp Thr Arg Gly Thr Ala Pro Thr Ala Asp Leu Ser Arg Asn Asn Ser Ser Glu Ser Arg Ser Tyr Ser Glu Ala.Ser Leu Leu Pro Arg Leu Ala Thr Trp Tyr Lys Ser Lys Asp Val Ala Gln Arg Ile Leu Leu Ser Val Glu Cys Gln Met Arg Arg Cys Ser 180 ~ 185 190 Tyr Thr Leu Gly Leu Gly Glu Pro Asn Leu Ala Gly Lys Pro Ser Leu 3 0 Leu Tyr Asp Leu Val Cys Lys Pro Asn Glu Ile His Ala Leu Lys Thr Thr Pro Tyr Asp Glu Arg Val Glu Asn His Glu Asn His Ala Leu His Ala Thr His Gln Ile Ala Glu Ser Trp Ile His Ala Ser Arg Lys Val Leu Glu Arg Ile Ala Asp Ala Val Leu Ser Arg Thr Phe Glu Lys Ala Ala Glu Asp Cys Tyr Ala Val Glu Arg Ile Trp Lys Leu Leu Ala Glu Val Glu Asp Leu His Leu Met Met Asp Pro Asp Asp Phe Leu Arg Leu Lys Asn Gln Leu Ser Val Lys Ser Ser Gly Gly Glu Thr Ala Ser Phe Cys Phe Arg Ser Lys Glu Leu Val Glu Leu Thr Lys Met Cys Arg Asp Leu Arg His Lys Val Pro Glu Ile Leu Glu Val Glu Val Asp Pro Lys Gly Gly Pro Arg Ile Gln Glu Ala Ala Met Lys Leu Tyr Val Ser Lys Ser Ala Phe Glu Lys Val His Leu Leu Gln Ala Met Gln Ala Ile Glu 3a Ala Ala Met Lys Arg Phe Phe Tyr Ala Tyr Lys Gln Val Leu Ala Val Val Met Gly Ser Ser Glu Ala Asn Gly Asn Arg Val Gly Leu Ser Cys Asp Ser Arg Asp Ser Leu Thr Gln Ile Phe Leu Glu Pro Thr Tyr Phe Pro Ser Leu Asp Ala Ala Lys Thr Phe Leu Gly Tyr Leu Trp Asp Asn Asn Asp Asn Asn Lys Trp Ile
Claims (14)
1. A plant which has been transformed with a transformation vector comprising a DNA sequence that encodes an amino acid sequence selected from the group consisting of the sequences set forth in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO:
6, and SEQ ID NO: 8.
6, and SEQ ID NO: 8.
2. A plant which has been transformed with a transformation vector comprising a nucleotide sequence selected from the group consisting of:
(a) a nucleotide sequence having at least 70%
identity to the nucleotide sequence set forth in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID NO: 7; and (b) a nucleotide sequence having at least 70%
sequence identity to a nucleotide sequence encoding a plant protein, wherein said sequence encoding said plant protein is contained in a plasmid having ATCC accession number 209366, 209365, 209614, 209363, or 209364.
(a) a nucleotide sequence having at least 70%
identity to the nucleotide sequence set forth in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID NO: 7; and (b) a nucleotide sequence having at least 70%
sequence identity to a nucleotide sequence encoding a plant protein, wherein said sequence encoding said plant protein is contained in a plasmid having ATCC accession number 209366, 209365, 209614, 209363, or 209364.
3. A plant which has been transformed with a transformation vector comprising a nucleotide sequence selected from the group consisting of:
(a) the nucleotide sequence set forth in SEQ ID NO:
1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7;
(b) a nucleotide sequence encoding a plant protein, wherein said sequence is contained in a plasmid having ATCC
accession number 209366, 209365, 209614, 209363, or 209364; and (c) a nucleotide sequence having at least 85%
identity to the sequence of (a) or (b).
(a) the nucleotide sequence set forth in SEQ ID NO:
1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7;
(b) a nucleotide sequence encoding a plant protein, wherein said sequence is contained in a plasmid having ATCC
accession number 209366, 209365, 209614, 209363, or 209364; and (c) a nucleotide sequence having at least 85%
identity to the sequence of (a) or (b).
4. The plant of claim 3, wherein said plant is a dicot.
5. The plant of claim 3, wherein said plant is a monocot.
6. The plant of claim 4, wherein said plant is a soybean plant.
7. The plant of claim 5, wherein said plant is maize.
8. A plant transformed with a DNA sequence encoding a protein comprising an amino acid sequence selected from the group consisting of the amino acid sequences set forth in SEQ
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ ID NO: 8, wherein said plant exhibits improved resistance to nematodes over the native untransformed plant.
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and SEQ ID NO: 8, wherein said plant exhibits improved resistance to nematodes over the native untransformed plant.
9. The plant of claim 8, wherein said DNA sequence is selected from the group consisting of the sequences set forth in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, and SEQ ID NO: 7.
10. The plant according to claim 9, wherein said plant is a dicot.
11. The plant according to claim 9, wherein said plant is a monocot.
12. The plant according to claim 10, wherein said plant is a soybean plant.
13. The plant according to claim 11, wherein said plant is a cereal plant.
14. Seed of the plant of any one of claims 1 to 13.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US8583898P | 1998-05-18 | 1998-05-18 | |
US60/085,838 | 1998-05-18 | ||
CA002323312A CA2323312A1 (en) | 1998-05-18 | 1998-12-23 | Genes and methods for control of nematodes in plants |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002323312A Division CA2323312A1 (en) | 1998-05-18 | 1998-12-23 | Genes and methods for control of nematodes in plants |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2356492A1 true CA2356492A1 (en) | 1999-11-25 |
Family
ID=25682156
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002356492A Abandoned CA2356492A1 (en) | 1998-05-18 | 1998-12-23 | Nematode resistant plants and seeds |
Country Status (1)
Country | Link |
---|---|
CA (1) | CA2356492A1 (en) |
-
1998
- 1998-12-23 CA CA002356492A patent/CA2356492A1/en not_active Abandoned
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6228992B1 (en) | Proteins for control of nematodes in plants | |
AU779111B2 (en) | Modulation of plant response to abscisic acid | |
CA2348755A1 (en) | Polypeptide compositions toxic to diabrotica insects, obtained from bacillus thuringiensis; cryet70, and methods of use | |
US7943753B2 (en) | Auxin transport proteins | |
EP1849868B1 (en) | Plant defensins | |
US6452069B1 (en) | SF3 promoter and methods of use | |
AU2002310288B2 (en) | Alteration of embryo/endosperm size during seed development | |
AU2002310288A1 (en) | Alteration of embryo/endosperm size during seed development | |
US7122717B2 (en) | Enzymes involved in squalene metabolism | |
US6271437B1 (en) | Soybean gene promoters | |
US7193132B2 (en) | Plant MYB transcription factor homologs | |
CA2356492A1 (en) | Nematode resistant plants and seeds | |
US8153860B2 (en) | Alteration of embryo/endosperm size during seed development | |
CA2484760A1 (en) | Increasing host plant susceptibility to acrobacterium infection by overexpression of the arabidopsis vip1 gene | |
US7112722B2 (en) | Plant genes encoding pantothenate synthetase | |
US7250558B2 (en) | Disease resistance factors | |
EP0945508A1 (en) | The insect-resistant use of sweet potato sporamin gene and method for controlling pests using the gene | |
US20030088083A1 (en) | Metal-binding proteins | |
US7122723B2 (en) | Plant recombination proteins | |
CA2408224C (en) | Cold stress-responsive crtintp gene and use thereof | |
WO2000031142A2 (en) | Plant syr2 homologs |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |