AU741600B2 - Improved bacillus thuringiensis toxin - Google Patents
- Publication number
- AU741600B2 (AU 88044/98)
- Authority
- AU
- Australia
- Prior art keywords
- amino acid
- protein
- cry9c
- leu
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 239000003053 toxin Substances 0.000 title description 26
- 231100000765 toxin Toxicity 0.000 title description 26
- 241000193388 Bacillus thuringiensis Species 0.000 title description 9
- 229940097012 bacillus thuringiensis Drugs 0.000 title description 8
- 108090000623 proteins and genes Proteins 0.000 claims description 240
- 235000018102 proteins Nutrition 0.000 claims description 190
- 102000004169 proteins and genes Human genes 0.000 claims description 190
- 235000001014 amino acid Nutrition 0.000 claims description 147
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 91
- 150000001413 amino acids Chemical class 0.000 claims description 78
- 241000196324 Embryophyta Species 0.000 claims description 71
- 241000238631 Hexapoda Species 0.000 claims description 70
- 231100000419 toxicity Toxicity 0.000 claims description 52
- 230000001988 toxicity Effects 0.000 claims description 52
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 27
- 235000004279 alanine Nutrition 0.000 claims description 26
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 22
- 238000000034 method Methods 0.000 claims description 21
- 108020004414 DNA Proteins 0.000 claims description 16
- 241000256244 Heliothis virescens Species 0.000 claims description 15
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 14
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 14
- 235000005822 corn Nutrition 0.000 claims description 14
- 241000879145 Diatraea grandiosella Species 0.000 claims description 12
- 241001147398 Ostrinia nubilalis Species 0.000 claims description 12
- 239000012634 fragment Substances 0.000 claims description 11
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 claims description 4
- 240000003259 Brassica oleracea var. botrytis Species 0.000 claims description 4
- 229920000742 Cotton Polymers 0.000 claims description 4
- 241000219146 Gossypium Species 0.000 claims description 4
- 230000002068 genetic effect Effects 0.000 claims description 3
- 235000007319 Avena orientalis Nutrition 0.000 claims description 2
- 241000209763 Avena sativa Species 0.000 claims description 2
- 235000007558 Avena sp Nutrition 0.000 claims description 2
- 235000016068 Berberis vulgaris Nutrition 0.000 claims description 2
- 241000335053 Beta vulgaris Species 0.000 claims description 2
- 240000002791 Brassica napus Species 0.000 claims description 2
- 235000006008 Brassica napus var napus Nutrition 0.000 claims description 2
- 235000017647 Brassica oleracea var italica Nutrition 0.000 claims description 2
- 235000002566 Capsicum Nutrition 0.000 claims description 2
- 235000007516 Chrysanthemum Nutrition 0.000 claims description 2
- 244000189548 Chrysanthemum x morifolium Species 0.000 claims description 2
- 244000115658 Dahlia pinnata Species 0.000 claims description 2
- 235000012040 Dahlia pinnata Nutrition 0.000 claims description 2
- 241000245654 Gladiolus Species 0.000 claims description 2
- 244000068988 Glycine max Species 0.000 claims description 2
- 235000010469 Glycine max Nutrition 0.000 claims description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 2
- 244000061176 Nicotiana tabacum Species 0.000 claims description 2
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 2
- 240000007594 Oryza sativa Species 0.000 claims description 2
- 235000007164 Oryza sativa Nutrition 0.000 claims description 2
- 239000006002 Pepper Substances 0.000 claims description 2
- 235000016761 Piper aduncum Nutrition 0.000 claims description 2
- 235000017804 Piper guineense Nutrition 0.000 claims description 2
- 235000008184 Piper nigrum Nutrition 0.000 claims description 2
- 240000004713 Pisum sativum Species 0.000 claims description 2
- 235000010582 Pisum sativum Nutrition 0.000 claims description 2
- 240000003768 Solanum lycopersicum Species 0.000 claims description 2
- 244000061458 Solanum melongena Species 0.000 claims description 2
- 235000002597 Solanum melongena Nutrition 0.000 claims description 2
- 244000061456 Solanum tuberosum Species 0.000 claims description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 2
- 235000011684 Sorghum saccharatum Nutrition 0.000 claims description 2
- 210000000081 body of the sternum Anatomy 0.000 claims description 2
- 235000009566 rice Nutrition 0.000 claims description 2
- 235000009917 Crataegus X brevipes Nutrition 0.000 claims 1
- 235000013204 Crataegus X haemacarpa Nutrition 0.000 claims 1
- 235000009685 Crataegus X maligna Nutrition 0.000 claims 1
- 235000009444 Crataegus X rubrocarnea Nutrition 0.000 claims 1
- 235000009486 Crataegus bullatus Nutrition 0.000 claims 1
- 235000017181 Crataegus chrysocarpa Nutrition 0.000 claims 1
- 235000009682 Crataegus limnophila Nutrition 0.000 claims 1
- 235000004423 Crataegus monogyna Nutrition 0.000 claims 1
- 240000000171 Crataegus monogyna Species 0.000 claims 1
- 235000002313 Crataegus paludosa Nutrition 0.000 claims 1
- 235000009840 Crataegus x incaedua Nutrition 0.000 claims 1
- 244000203593 Piper nigrum Species 0.000 claims 1
- 240000006394 Sorghum bicolor Species 0.000 claims 1
- 240000008042 Zea mays Species 0.000 claims 1
- 238000003306 harvesting Methods 0.000 claims 1
- 210000004027 cell Anatomy 0.000 description 37
- 108700012359 toxins Proteins 0.000 description 25
- 108091026890 Coding region Proteins 0.000 description 20
- 230000014509 gene expression Effects 0.000 description 19
- 108091005804 Peptidases Proteins 0.000 description 17
- 239000004365 Protease Substances 0.000 description 17
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 16
- 101100497222 Bacillus thuringiensis cry1Af gene Proteins 0.000 description 13
- 241000209149 Zea Species 0.000 description 13
- 101150041868 cry1Aa gene Proteins 0.000 description 13
- 102000005962 receptors Human genes 0.000 description 12
- 108020003175 receptors Proteins 0.000 description 12
- 229920001940 conductive polymer Polymers 0.000 description 10
- 238000009616 inductively coupled plasma Methods 0.000 description 10
- 108020004705 Codon Proteins 0.000 description 9
- 230000000749 insecticidal effect Effects 0.000 description 9
- 108010021466 Mutant Proteins Proteins 0.000 description 8
- 102000008300 Mutant Proteins Human genes 0.000 description 8
- 231100000171 higher toxicity Toxicity 0.000 description 8
- 230000009466 transformation Effects 0.000 description 8
- 239000004475 Arginine Substances 0.000 description 7
- 108700010070 Codon Usage Proteins 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 7
- 238000011161 development Methods 0.000 description 7
- 239000012528 membrane Substances 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 240000002024 Gossypium herbaceum Species 0.000 description 6
- 235000004341 Gossypium herbaceum Nutrition 0.000 description 6
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 6
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 6
- 239000002253 acid Substances 0.000 description 6
- 150000007513 acids Chemical class 0.000 description 6
- 230000001404 mediated effect Effects 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 5
- 101710151559 Crystal protein Proteins 0.000 description 5
- 241001057636 Dracaena deremensis Species 0.000 description 5
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 5
- 238000003556 assay Methods 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 102000035118 modified proteins Human genes 0.000 description 5
- 108091005573 modified proteins Proteins 0.000 description 5
- 230000007170 pathology Effects 0.000 description 5
- 230000008488 polyadenylation Effects 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 230000002588 toxic effect Effects 0.000 description 5
- 230000009261 transgenic effect Effects 0.000 description 5
- 239000004474 valine Substances 0.000 description 5
- 206010020649 Hyperkeratosis Diseases 0.000 description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 4
- 239000004472 Lysine Substances 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 210000000110 microvilli Anatomy 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 239000002773 nucleotide Substances 0.000 description 4
- 125000003729 nucleotide group Chemical group 0.000 description 4
- 239000011148 porous material Substances 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 231100000331 toxic Toxicity 0.000 description 4
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 3
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 3
- 108700003918 Bacillus Thuringiensis insecticidal crystal Proteins 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 3
- 102000004142 Trypsin Human genes 0.000 description 3
- 108090000631 Trypsin Proteins 0.000 description 3
- 241000607479 Yersinia pestis Species 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- -1 alanine amino acid Chemical class 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 238000004166 bioassay Methods 0.000 description 3
- 230000000853 biopesticidal effect Effects 0.000 description 3
- 230000006378 damage Effects 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 230000000408 embryogenic effect Effects 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 239000012588 trypsin Substances 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical group C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 2
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 2
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 2
- 241000254173 Coleoptera Species 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N Leu-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- 241000209510 Liliopsida Species 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 2
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 2
- 241000255893 Pyralidae Species 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 2
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 2
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 2
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- GVRKWABULJAONN-VQVTYTSYSA-N Val-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVRKWABULJAONN-VQVTYTSYSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 235000021405 artificial diet Nutrition 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 235000011389 fruit/vegetable juice Nutrition 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 238000000159 protein binding assay Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- ZWZOCNTYMUOGPQ-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylpentanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylpentanoic acid Chemical compound CCC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)CC)C(O)=O ZWZOCNTYMUOGPQ-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- 241000566547 Agrotis ipsilon Species 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- JWUZOJXDJDEQEM-ZLIFDBKOSA-N Ala-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 JWUZOJXDJDEQEM-ZLIFDBKOSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- XNSKSTRGQIPTSE-ACZMJKKPSA-N Arg-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XNSKSTRGQIPTSE-ACZMJKKPSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- ZFSIGJMSVGZVGP-DHATWTDPSA-N Arg-Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCN=C(N)N)[C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZFSIGJMSVGZVGP-DHATWTDPSA-N 0.000 description 1
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- SJUXYGVRSGTPMC-IMJSIDKUSA-N Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O SJUXYGVRSGTPMC-IMJSIDKUSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- NPDLYUOYAGBHFB-WDSKDSINSA-N Asn-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NPDLYUOYAGBHFB-WDSKDSINSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- GADKFYNESXNRLC-WDSKDSINSA-N Asn-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GADKFYNESXNRLC-WDSKDSINSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 1
- VBKIFHUVGLOJKT-FKZODXBYSA-N Asn-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)N)O VBKIFHUVGLOJKT-FKZODXBYSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- UKGGPJNBONZZCM-WDSKDSINSA-N Asp-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O UKGGPJNBONZZCM-WDSKDSINSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 241000186650 Clavibacter Species 0.000 description 1
- 101150102464 Cry1 gene Proteins 0.000 description 1
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- WYVKPHCYMTWUCW-YUPRTTJUSA-N Cys-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)N)O WYVKPHCYMTWUCW-YUPRTTJUSA-N 0.000 description 1
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 1
- 101710112752 Cytotoxin Proteins 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 1
- SNFUTDLOCQQRQD-ZKWXMUAHSA-N Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SNFUTDLOCQQRQD-ZKWXMUAHSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 1
- SITLTJHOQZFJGG-XPUUQOCRSA-N Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- UYPPAMNTTMJHJW-KCTSRDHCSA-N Gly-Ile-Trp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UYPPAMNTTMJHJW-KCTSRDHCSA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- 241001147381 Helicoverpa armigera Species 0.000 description 1
- 241000255967 Helicoverpa zea Species 0.000 description 1
- MMFKFJORZBJVNF-UWVGGRQHSA-N His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 MMFKFJORZBJVNF-UWVGGRQHSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- CTEMYIWDSVICKS-WDSOQIARSA-N His-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N CTEMYIWDSVICKS-WDSOQIARSA-N 0.000 description 1
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- TWVKGYNQQAUNRN-ACZMJKKPSA-N Ile-Ser Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O TWVKGYNQQAUNRN-ACZMJKKPSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- DRCKHKZYDLJYFQ-YWIQKCBGSA-N Ile-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRCKHKZYDLJYFQ-YWIQKCBGSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- 241000255777 Lepidoptera Species 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- SENJXOPIZNYLHU-IUCAKERBSA-N Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-IUCAKERBSA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- 241000555303 Mamestra brassicae Species 0.000 description 1
- 241000255908 Manduca sexta Species 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- XPCLRYNQMZOOFB-ULQDDVLXSA-N Met-His-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N XPCLRYNQMZOOFB-ULQDDVLXSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 241001045988 Neogene Species 0.000 description 1
- 241000256259 Noctuidae Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241001147397 Ostrinia Species 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- QPQDWBAJWOGAMJ-IHPCNDPISA-N Phe-Asp-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 QPQDWBAJWOGAMJ-IHPCNDPISA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- GKZIWHRNKRBEOH-HOTGVXAUSA-N Phe-Phe Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C=CC=CC=1)C([O-])=O)C1=CC=CC=C1 GKZIWHRNKRBEOH-HOTGVXAUSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- GZGPMBKUJDRICD-ULQDDVLXSA-N Phe-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O GZGPMBKUJDRICD-ULQDDVLXSA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- 240000003889 Piper guineense Species 0.000 description 1
- 108020005089 Plant RNA Proteins 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 1
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 1
- GVUVRRPYYDHHGK-VQVTYTSYSA-N Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GVUVRRPYYDHHGK-VQVTYTSYSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- LTFSLKWFMWZEBD-IMJSIDKUSA-N Ser-Asn Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O LTFSLKWFMWZEBD-IMJSIDKUSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 1
- 240000003829 Sorghum propinquum Species 0.000 description 1
- 241000256247 Spodoptera exigua Species 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- NHUHCSRWZMLRLA-UHFFFAOYSA-N Sulfisoxazole Chemical compound CC1=NOC(NS(=O)(=O)C=2C=CC(N)=CC=2)=C1C NHUHCSRWZMLRLA-UHFFFAOYSA-N 0.000 description 1
- VPZKQTYZIVOJDV-LMVFSUKVSA-N Thr-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(O)=O VPZKQTYZIVOJDV-LMVFSUKVSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- BIYXEUAFGLTAEM-WUJLRWPWSA-N Thr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(O)=O BIYXEUAFGLTAEM-WUJLRWPWSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- KAFKKRJQHOECGW-JCOFBHIZSA-N Thr-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(O)=O)=CNC2=C1 KAFKKRJQHOECGW-JCOFBHIZSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- CKHWEVXPLJBEOZ-VQVTYTSYSA-N Thr-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O CKHWEVXPLJBEOZ-VQVTYTSYSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 1
- JXNRXNCCROJZFB-RYUDHWBXSA-N Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JXNRXNCCROJZFB-RYUDHWBXSA-N 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- WITCOKQIPFWQQD-FSPLSTOPSA-N Val-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O WITCOKQIPFWQQD-FSPLSTOPSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- OBTCMSPFOITUIJ-FSPLSTOPSA-N Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O OBTCMSPFOITUIJ-FSPLSTOPSA-N 0.000 description 1
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 1
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 1
- 238000001994 activation Methods 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000012867 alanine scanning Methods 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000006287 biotinylation Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- CYDMQBQPVICBEU-UHFFFAOYSA-N chlorotetracycline Natural products C1=CC(Cl)=C2C(O)(C)C3CC4C(N(C)C)C(O)=C(C(N)=O)C(=O)C4(O)C(O)=C3C(=O)C2=C1O CYDMQBQPVICBEU-UHFFFAOYSA-N 0.000 description 1
- 229960004475 chlortetracycline Drugs 0.000 description 1
- CYDMQBQPVICBEU-XRNKAMNCSA-N chlortetracycline Chemical compound C1=CC(Cl)=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(O)=C(C(N)=O)C(=O)[C@@]4(O)C(O)=C3C(=O)C2=C1O CYDMQBQPVICBEU-XRNKAMNCSA-N 0.000 description 1
- 235000019365 chlortetracycline Nutrition 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 238000001142 circular dichroism spectrum Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000002016 colloidosmotic effect Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 231100000599 cytotoxic agent Toxicity 0.000 description 1
- 239000002619 cytotoxin Substances 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 1
- 235000013312 flour Nutrition 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 239000002917 insecticide Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 231100001231 less toxic Toxicity 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- LXCFILQKKLGQFO-UHFFFAOYSA-N methylparaben Chemical compound COC(=O)C1=CC=C(O)C=C1 LXCFILQKKLGQFO-UHFFFAOYSA-N 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 101150091879 neo gene Proteins 0.000 description 1
- 238000007857 nested PCR Methods 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 231100000916 relative toxicity Toxicity 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 230000007928 solubilization Effects 0.000 description 1
- 238000005063 solubilization Methods 0.000 description 1
- 235000010199 sorbic acid Nutrition 0.000 description 1
- 229940075582 sorbic acid Drugs 0.000 description 1
- 239000004334 sorbic acid Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000003335 steric effect Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 230000008961 swelling Effects 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 238000002723 toxicity assay Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000010497 wheat germ oil Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8286—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for insect resistance
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/32—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
- C07K14/325—Bacillus thuringiensis crystal peptides, i.e. delta-endotoxins
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Zoology (AREA)
- Pest Control & Pesticides (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Plant Pathology (AREA)
- Insects & Arthropods (AREA)
- Crystallography & Structural Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Agricultural Chemicals And Associated Chemicals (AREA)
Description
WO 99/00407 PCT/EP98/04033
IMPROVED BACILLUS THURINGIENSIS TOXIN
BACKGROUND OF THE INVENTION
(i) Field of the Invention
The present invention provides new improved proteins derived from a Bacillus thuringiensis Cry9C crystal protein. In accordance with this invention, amino acid positions in a Cry9C protein were identified as involved in insect toxicity. Further in accordance with this invention are provided modified Cry9C proteins with increased or decreased toxicity to an insect species, and DNA sequences encoding such modified Cry9C proteins. Plants can be protected from insect damage by expressing a chimeric gene encoding an improved Cry9C protein with an increased toxicity to an insect species.
(ii) Description of Related Art
Bacillus thuringiensis (Bt)-derived proteins are currently widely used to protect plants from insects by expression of such proteins in transgenic plants. Concerns about the development of insect resistance, and the desire to achieve optimum toxicity and control of additional insect species, have resulted in efforts to modify existing Bt-derived proteins so as to increase their toxicity or alter their mode of action.
Most studies on the mode of action of Bacillus thuringiensis toxins have focused on lepidopteran-specific Cry1 insecticidal crystal proteins (ICPs). The following picture has emerged from these studies (Gill et al., 1992, Annu. Rev. Entomol. 37, 615-36; Knowles, 1993, BioEssays, 15, 469-476). Following ingestion of the crystals by a susceptible insect, they are dissolved in the alkaline reducing environment of the insect midgut lumen. The liberated proteins, the protoxins, are then proteolytically processed by insect midgut proteases to a protease-resistant fragment. This active fragment, the toxin, then passes through the peritrophic membrane and binds to specific receptors located on the brush border membrane of gut epithelial cells. Subsequent to binding, the toxin or part thereof inserts in the membrane, resulting in the formation of pores. These pores lead to colloid osmotic swelling and ultimately lysis of the midgut cells, causing death of the insect.
Binding studies have demonstrated that receptor binding is a crucial step in the mode of action of ICPs (Hofmann et al., 1988, 173, 85-91; Hofmann et al., 1988, Proc. Natl. Acad. Sci. USA, 85, 7844-7848; Van Rie et al., 1990, Appl. Environm. Microbiol. 56, 1378-85).
The three-dimensional structure of two ICPs, Cry3A and the Cry1Aa toxic fragment, has been solved (Li et al., 1991, Nature 353, 815-21; Grochulski et al., 1994, Journal of Molecular Biology 254, 1-18). The Cry proteins have been found to have three structural domains: the N-terminal domain I consists of 7 alpha helices, domain II contains three beta-sheets and the C-terminal domain III is a beta-sandwich. Based on this structure, a hypothesis has been formulated regarding the structure-function relationships of ICPs. The bundle of long, hydrophobic and amphipathic helices (domain I) is equipped for pore formation in the insect membrane, and regions of the three-sheet domain (domain II) are probably responsible for receptor binding (Li et al., 1991, supra). The function of domain III is less clear. When different ICP amino acid sequences are aligned, five conserved sequence blocks are evident (Höfte and Whiteley, 1989, Microbiol. Revs. 53, 242-255). These conserved blocks are all located in the interior of a structural domain or at the interface between domains. The high degree of conservation of these internal residues implies that homologous proteins would adopt a similar fold (Li et al., 1991, supra).
Data from Ahmad et al. (1991, FEMS Microbiol. Lett. 68, 97-104), Wu et al. (1992, J. Biol. Chem. 267, 2311-2317) and Gazit et al. (1993, Biochemistry 32, 3429-3436) provide evidence for the function of domain I of ICPs as a pore formation unit.
Deletions and alanine substitutions in the Cry1Aa protoxin at a position predicted to be at or near the second loop of domain II significantly altered toxicity and receptor binding ability (Lu et al., 1993, XXVIth Annual meeting of the Society for Invertebrate Pathology, Asheville, USA, Conference book, page 31, Abstract 17).
Smith and Ellar (1992, XXVth Annual meeting of the Society for Invertebrate Pathology, Heidelberg, Germany, Conference book, page 111, abstract 68) observed dramatic effects on toxicity towards in vitro insect cell cultures with mutant Cry1C proteins, differing in the amino acid sequence of the predicted loop regions.
They formulated the hypothesis that it should be possible to map the putative receptor binding domain of this toxin and eventually generate toxins with increased potency. In some cases, however, a contribution to specificity and binding from domain III of the Cry toxin could not be excluded (Schnepf et al., 1990, supra; Ge et al., 1991, J. Biol. Chem. 266, 17954-17958). Furthermore, a recent study using hybrid ICPs, constructed by exchanging gene fragments between cry1C and cry1E, has indicated that domain II of Cry1C is not sufficient to confer the high activity of this protein towards Spodoptera exigua and Mamestra brassicae (Schipper et al., 1993, Seventh International Conference on Bacillus, Institut Pasteur, July 18-23, Abstracts of lectures, p. L69). Site-directed mutagenesis experiments on Cry1Ac indicated that certain amino acids in domain I are important for receptor binding (Wu et al., 1992, supra). Rajamohan et al. (1996, J. Biol. Chem. 271, 2390-2396) explored the role of loop 2 residues in domain II of the Cry1Ab protein in reversible and irreversible binding to Manduca sexta and Heliothis virescens.
Also, changes outside the 60 kD toxin region of the Bt protoxin were found to influence toxicity. It was suggested that this may be related to the activation processes by the gut juice (Nakamura et al., 1990, Agric. Biol. Chem. 54, 715-24).
Visser et al. (1993, In "Bacillus thuringiensis, an Environmental Biopesticide: Theory and Practice", pp. 71-88, eds.: Entwistle, Cory, Bailey, and Higgs, John Wiley & Sons, NY) reviewed the domain-function studies with Bt ICPs and concluded that, in general, the function of essential stretches of the toxic fragment of Bt ICPs is unknown. Studies of mutant proteins showed that several amino acid residues from different regions of the toxic fragment, either conserved or variable, affect toxic activity.
Lambert et al. (1996, Appl. Environm. Microbiol. 62, p. 80-86) and PCT patent publication WO 94/05771 describe a new Bt protein which is currently named Cry9Ca1 (abbreviated as Cry9C) (Peferoen et al., 1997, in Advances in Insect Control: The role of transgenic plants, pp. 21-48, Taylor & Francis Ltd., London).
This protein was found to have a broad insect target range within the group of lepidopteran pest insects making it interesting for insect control applications in agriculture.
De Roeck et al. (1995, 28th annual meeting of the Society for Invertebrate Pathology, Cornell University, Ithaca, New York, p. 52) suggested determining the likely position of the binding epitope of the Cry1H protein by making alanine mutants, so as to determine the contribution of individual amino acid positions to the binding of the Cry1H protein to different insects. The Cry1H protein is currently named Cry9C in the new nomenclature (Crickmore et al., 1995, 28th annual meeting of the Society for Invertebrate Pathology, Cornell University, Ithaca, New York, p. 14). De Roeck et al. (1997, 6th International Conference on Perspectives in Protein Engineering, John Innes Centre, Norwich, UK, June 28-July 1, p. 34) determined the likely position of residues in the loops at the apex of the molecule in domain II of the Cry9C protein.
SUMMARY OF THE INVENTION
This invention provides a modified Cry9C protein with an improved toxicity to an insect species, comprising the amino acid sequence of SEQ ID No. 2 or an insecticidally-effective fragment thereof, wherein at least one amino acid at the following amino acid positions in SEQ ID No. 2 is replaced by another amino acid: 316, 317, 319, 321, 329, 330, 364, 369, 422, or 488.
This invention further provides preferred improved Cry9C proteins that comprise the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein the amino acid at at least one of the following positions is replaced by another amino acid: 316, 317, 319, 321, 329, 330, 364, 369, 422, and 488.
This invention also provides a modified Cry9C protein with improved toxicity to Ostrinia nubilalis, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein at least the amino acids at position 488 or at least at positions 364 and 488 are replaced by other amino acids, preferably by alanine.
This invention also provides modified Cry9C proteins with improved toxicity to Heliothis virescens, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein the amino acid at position 321 or position 329 is replaced by another amino acid, preferably by alanine.
This invention further provides modified Cry9C proteins with improved toxicity to Diatraea grandiosella, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein the amino acid at any or all of positions 316, 317, 319, 321, 330, 369, or 422 is replaced by another amino acid, preferably by alanine.
Further in accordance with this invention are provided DNA sequences encoding the modified Cry9C proteins, and particularly chimeric genes designed for expression in plants comprising these DNA sequences.
In another preferred embodiment of this invention, a plant transformed with a DNA sequence encoding a modified Cry9C protein is provided, so that the plant acquires increased resistance to insects, particularly a corn plant transformed with a modified Cry9C protein yielding increased toxicity towards Heliothis virescens, Ostrinia nubilalis, or Diatraea grandiosella insects.
Other objects and advantages of this invention will become evident from the following description.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
In this invention, certain amino acid residues important for toxicity of the Cry9C protein have been identified. These amino acid residues can be replaced by other amino acids to increase the toxicity to a specific insect species.
The "Cry9C protein", as used herein, refers to an insecticidal protein characterized by the amino acid sequence of SEQ ID No. 2 or any equivalents thereof such as the insecticidally effective truncated proteins or the fusion proteins of the Cry9C protein described in PCT patent publications WO 94/05771 and WO 94/24264. Particularly preferred Cry9C proteins, in accordance with this invention, are proteins containing at least the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658. Throughout the description and the claims, the new nomenclature for Bt crystal proteins as suggested by Crickmore et al. (1995, 28th annual meeting of the Society for Invertebrate Pathology, Cornell University, Ithaca, New York, p. 14) and reported in Peferoen et al. (1997, in Advances in Insect Control: The role of transgenic plants, pp. 21-48, Taylor & Francis Ltd., London) has been used.
"Cry9C protein variants", for a particular insect species, are insecticidal proteins that differ from but are indirectly or directly derived from the Cry9C protein.
Indeed, several variants of a Bt protein in which some amino acids are changed into others without significantly changing activity and/or specificity to a particular insect species can be found in nature (Höfte and Whiteley, 1989, supra) or can be made by recombinant DNA techniques. Variants of a Cry9C protein, as used herein, also include proteins containing the specificity- or toxicity-determining domain or region of the Cry9C protein, in a hybrid with another protein, such as another Bt ICP, a membrane-permeating protein domain, a cytotoxin or an antibody fragment, provided that the Cry9C specificity- or toxicity-determining domain or region contributes to the toxicity or specificity of the hybrid protein. Particularly preferred Cry9C protein variants are those proteins comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the arginine at position 164 has been replaced by another amino acid, preferably alanine or lysine. These variants with a replacement of the arginine at position 164 in the sequence of SEQ ID No. 2 show a significantly lower susceptibility to breakdown upon protease treatment, and are named herein the "protease-resistant Cry9C variants". As here for the protease-resistant variants, whenever reference is made to a particular region or position in SEQ ID No. 2, this does not necessarily imply that the protein referred to is the full-length protein of SEQ ID No. 2; the statement merely refers to the position corresponding to that particular position in the reference Cry9C protein of SEQ ID No. 2. Indeed, improved Cry9C proteins of the invention can be truncated, so that the actual position of an amino acid in such a protein will differ; nevertheless, reference is made throughout this invention to the positions in the full-length reference protein shown in SEQ ID No. 2.
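The numbering convention just described can be illustrated with a minimal sketch. It assumes a fragment that is a contiguous stretch of the reference protein beginning at a known reference position (44 is used in the usage note only because the text names 1 or 44 as start positions). The function names are hypothetical and are not part of the patent.

```python
def reference_to_fragment_index(ref_pos: int, fragment_start: int) -> int:
    """Map a 1-based position in the full-length reference sequence to a
    0-based index in a contiguous fragment that begins at `fragment_start`
    (itself a 1-based reference position)."""
    if ref_pos < fragment_start:
        raise ValueError("position lies before the start of the fragment")
    return ref_pos - fragment_start


def fragment_index_to_reference(index: int, fragment_start: int) -> int:
    """Inverse mapping: 0-based fragment index to 1-based reference position."""
    if index < 0:
        raise ValueError("index must be non-negative")
    return index + fragment_start
```

Under these assumptions, reference position 164 sits at index 120 (the 121st residue) of a fragment starting at reference position 44, while it is still referred to as "position 164".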
Following the teachings of this invention, Cry9C proteins or variants thereof can be modified to have an increased toxicity for an insect species. "Modified Cry9C protein", as used herein, refers to a Cry9C protein or its protease-resistant variant wherein amino acids have been modified to analyse the contribution of amino acid positions in toxicity, particularly a Cry9C protein or its protease-resistant variant wherein amino acids have been modified in the regions at the following positions in SEQ ID No. 2: 313-334, 358-369, 418-425, 480-492. "Improved Cry9C protein", in accordance with this invention, refers to a Cry9C protein or its protease-resistant variant wherein at least one amino acid has been replaced, so that the toxicity of this improved protein towards an insect species is significantly increased. In a particularly preferred improved Cry9C protein or its protease-resistant variant, the at least one amino acid change is located in domain II of the Cry9C protein, particularly in the regions of the Cry9C protein characterized by the following positions in SEQ ID No. 2: 313-334, 358-369, 418-425, 480-492. A modified Cry9C protein, differing in one amino acid from the native protein or its protease-resistant variant and being significantly less toxic towards the target insect, allows the direct identification of this amino acid position as involved in toxicity (provided no gross structural changes are introduced), and thus has considerable value in improving toxicity. In accordance with this invention, the identification of these amino acid positions involved in toxicity allows the construction of modified proteins having increased toxicity to the target insect by amino acid randomization at these positions.
Preferred modified Cry9C proteins in accordance with this invention are the modified Cry9C proteins having altered toxicity to Ostrinia nubilalis, Heliothis virescens or Diatraea grandiosella as shown in Table 1, as well as combinations of those modifications in one modified protein.
An example of an improved Cry9C protein in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 or 666 wherein an amino acid in at least one of the following amino acid positions of SEQ ID No. 2 has been replaced by another amino acid: 313, 316, 317, 318, 319, 321, 323, 325, 329, 330, 362, 364, 368, 369, 418, 420, 421, 422, 480, 481, 483, 484, 485, 487, 488, 490 and 491; or an amino acid position located in the immediate vicinity of any one of these positions in the three-dimensional structure of the protein, preferably those amino acids whose C-alpha atom is at a maximum distance of about 7 Angstrom from the C-alpha atom of the amino acid listed above. A preferred improved Cry9C protein in accordance with this invention is the protein of SEQ ID No. 2 with at least one of the following amino acid changes: P316A, A317V, V319A, L321A, P329A, Y330A, S364A, Y369A, I422A, and I488A. "V319A" or "Cry9C(V319A)", as used herein, means a change of the valine amino acid at position 319 in SEQ ID No. 2 to an alanine amino acid.
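The substitution notation defined above ("V319A": original residue, reference position, replacement residue) lends itself to a short parsing sketch. The function names, the `first_position` parameter and the toy sequence in the usage note are hypothetical; the real SEQ ID No. 2 is not reproduced here.

```python
import re
from typing import Tuple

# One-letter codes of the 20 standard amino acids.
AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")
_LABEL = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutation(label: str) -> Tuple[str, int, str]:
    """Split a label such as 'V319A' into (original, position, replacement)."""
    match = _LABEL.match(label)
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    original, position, replacement = match.group(1), int(match.group(2)), match.group(3)
    if original not in AMINO_ACIDS or replacement not in AMINO_ACIDS:
        raise ValueError(f"unknown amino acid code in {label!r}")
    return original, position, replacement


def apply_mutation(sequence: str, label: str, first_position: int = 1) -> str:
    """Apply a substitution to `sequence`, whose first residue carries the
    reference number `first_position`. The original residue is checked so
    that a numbering mistake is caught instead of silently applied."""
    original, position, replacement = parse_mutation(label)
    index = position - first_position
    if not 0 <= index < len(sequence):
        raise ValueError(f"position {position} is outside the sequence")
    if sequence[index] != original:
        raise ValueError(
            f"expected {original} at position {position}, found {sequence[index]}"
        )
    return sequence[:index] + replacement + sequence[index + 1:]
```

For a made-up three-residue stretch "MKV" numbered from 317, `apply_mutation("MKV", "V319A", first_position=317)` gives "MKA".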
Preferred improved Cry9C proteins also include Cry9C proteins in which the arginine amino acid at position 164 in SEQ ID No. 2 is additionally altered into another amino acid, particularly alanine or lysine, to enhance stability upon protease, particularly trypsin, cleavage.
A preferred Cry9C protein for the control of Ostrinia nubilalis insects in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein an amino acid in at least one of the following amino acid positions in SEQ ID No. 2 has been replaced by another amino acid: 325, 364, 418, 421, 485, and 488. A particularly preferred improved Cry9C protein for the control of Ostrinia nubilalis insects is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the amino acids in at least position 364 or at least in positions 364 and 488 of SEQ ID No. 2 are replaced by another amino acid, particularly alanine.
A preferred Cry9C protein for the control of Heliothis virescens insects in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein an amino acid in at least one of the following amino acid positions in SEQ ID No. 2 has been replaced by another amino acid: 313, 316, 317, 318, 319, 321, 323, 325, 329, 330, 368, 369, 418, 420, 421, 422, 480, 481, 483, 484, 485, 487, 488, 490 and 492, particularly at least one of the following amino acid positions: 321, 325, 329, 418, 420, and 480. A particularly preferred improved Cry9C protein for the control of Heliothis virescens insects is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the amino acids in at least one of the amino acid positions 321 and 329 of SEQ ID No. 2 are replaced by another amino acid, particularly alanine.
A preferred Cry9C protein for the control of Diatraea grandiosella insects in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein an amino acid in at least one of the following amino acid positions in SEQ ID No. 2 has been replaced by another amino acid: 316, 317, 319, 321, 325, 330, 369, 421, 422, 480, 483, 484, 485, 487, 488, 490, and 491; particularly at least one of the following amino acid positions: 480, 484, 485, 487, and 490. A particularly preferred improved Cry9C protein for the control of Diatraea grandiosella insects is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the amino acids in at least one of the amino acid positions 316, 317, 319, 321, 330, 369 and 422 of SEQ ID No. 2 are replaced by another amino acid, particularly alanine or valine (for 317).
By using DNA sequences encoding improved Cry9C proteins in accordance with this invention, improved toxicity to a selected insect species can be obtained upon expression of such DNA in a transgenic plant.
A "cry9C gene", as used herein, is a DNA sequence comprising a DNA encoding a Cry9C protein (a coding region), and includes necessary regulatory sequences so that a Cry9C protein can be expressed in a cell, preferably a plant or bacterial cell. A cry9C gene does not necessarily need to be expressed everywhere at all times: expression can be periodic (e.g., at certain stages of development in a plant) and/or can be spatially restricted (e.g., in certain cells or tissues in a plant), mainly depending on the activity of regulatory elements provided in the chimeric gene or in the site of insertion in the plant genome. A cry9C gene can be naturally occurring or can be a hybrid or synthetic DNA, and the regulatory elements can be of prokaryotic or eukaryotic origin.
The "modified cry9C gene", as used herein, is a DNA sequence comprising a DNA encoding a modified Cry9C protein (a modified coding region), and includes necessary regulatory sequences so that a Cry9C protein can be expressed in a cell, preferably a plant or bacterial cell. An example of a modified cry9C coding region is the cry9C coding region of SEQ ID No. 3 wherein the valine codon at nucleotide positions 844-846 of SEQ ID No. 3 has been replaced by an alanine codon.
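The codon replacement in this example can be sketched as a simple string operation on a coding region. This is an illustration under stated assumptions: the function name is hypothetical, the toy sequence stands in for SEQ ID No. 3 (which is not reproduced here), and the choice of GCT among the four alanine codons is arbitrary, since the text does not say which alanine codon is used.

```python
# The four codons that encode alanine in the standard genetic code.
ALANINE_CODONS = {"GCT", "GCC", "GCA", "GCG"}


def replace_codon(dna: str, start: int, end: int, new_codon: str) -> str:
    """Replace nucleotides start..end (1-based, inclusive) by `new_codon`.

    The positions are taken as given; no assumption is made about how
    they relate to amino acid numbering in the encoded protein.
    """
    if end - start != 2:
        raise ValueError("a codon spans exactly three nucleotides")
    if len(new_codon) != 3:
        raise ValueError("replacement must be a three-nucleotide codon")
    if start < 1 or end > len(dna):
        raise ValueError("codon positions fall outside the sequence")
    return dna[:start - 1] + new_codon.upper() + dna[end:]
```

With the toy coding region "ATGGTTTAA" (Met-Val-stop), `replace_codon("ATGGTTTAA", 4, 6, "GCT")` returns "ATGGCTTAA", turning the valine codon GTT into an alanine codon.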
"Substantial sequence homology" to a DNA sequence, as used herein, refers to DNA sequences differing in some, most or all of their codons from another DNA sequence but encoding the same or substantially the same protein. Indeed, because of the degeneracy of the genetic code, the codon usage of a particular DNA coding region can be substantially modified, so as to more closely resemble the codon usage of the genes in the host cell, without changing the encoded protein. Changing the codon usage of a DNA coding region to that of the host cell has been described as desirable for gene expression in foreign hosts (Bennetzen and Hall, 1982, J. Biol. Chem. 257, 3026-3031; Itakura, 1977, Science 198, 1056-1063). Codon usage tables are available in the literature (Wada et al., 1990, Nucl. Acids Res. 18, 2367-2411; Murray et al., 1989, Nucl. Acids Res. 17(2), 477-498) and in the major DNA sequence databanks (e.g., at EMBL in Heidelberg, Germany). Accordingly, recombinant or synthetic DNA sequences can be constructed so that the same or substantially the same proteins with substantially the same insecticidal activity are produced (Koziel et al., 1993, Bio/Technology 11, 194-200; Perlak et al., 1993, Plant Mol. Biol. 22, 313-321). A modified cry9C gene has all appropriate control regions so that the modified Cry9C protein can be expressed in a host cell, e.g. for expression in plants, a plant-expressible promoter and a 3' termination and polyadenylation region active in plants.
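The idea that codon usage can be changed without changing the encoded protein can be shown with a minimal sketch based on the standard genetic code. The function names are hypothetical, and `gc_ending_preference` is only a placeholder rule (take the first synonymous codon ending in G or C); a real design would draw the preferred codons from a host codon usage table such as those cited above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding region; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_ending_preference() -> dict:
    """Placeholder preference table: for each amino acid, the first codon
    in table order that ends in G or C (an illustrative rule only)."""
    preferred = {}
    for codon, aa in CODON_TABLE.items():
        if aa not in preferred or (codon[2] in "GC" and preferred[aa][2] not in "GC"):
            preferred[aa] = codon
    return preferred


def recode(dna: str, preferred: dict) -> str:
    """Rewrite every codon as the preferred synonymous codon."""
    protein = translate(dna)
    recoded = "".join(preferred[aa] for aa in protein)
    # The encoded protein must be unchanged by construction.
    assert translate(recoded) == protein
    return recoded
```

The check inside `recode` makes the key property explicit: the nucleotide sequence may differ in many codons while the translated protein stays identical.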
A "chimeric improved cry9C gene", as used herein, refers to a chimeric gene comprising a DNA sequence encoding the improved Cry9C protein inserted in between controlling elements of different origin, e.g. a DNA sequence encoding the improved Cry9C protein under the control of a promoter transcribing the DNA in the plant cell, and fused to 3' transcription termination sequences active in plant cells.
Protection of a plant, preferably a corn or cotton plant, against an insect species which is known to feed on said plant is preferably accomplished by expressing an improved Cry9C protein in the cells of the plant. This is preferably accomplished by expressing a chimeric improved cry9C gene encoding such an improved Cry9C protein in the cells of a plant, preferably a corn or cotton plant. An improved Cry9C protein of this invention preferably only has a small number, particularly less than 20, more particularly less than 15, preferably less than amino acids replaced by other amino acids as compared to the Cry9C protein, preferably as compared to the region from between amino acid positions 1 and 45 to amino acid position 658 of the Cry9C protein of SEQ ID No. 2. A significant increase in toxicity can already be obtained by replacing only 1 amino acid, but it is preferred that more than one amino acid is changed to improve toxicity.
The following steps are followed to construct the new modified Cry9C proteins: amino acids in domain II of the Cry9C protein from amino acid positions 313-334, 358-369, 418-425, and 480-492 were chosen for modification, using alanine-scanning mutagenesis (Cunningham and Wells, 1989, Science 244, 1081-85). In case the original position is alanine, a substitution by valine is done. These regions occur at positions corresponding to the solvent-exposed positions in the loop between beta-strands 1 and 2 (comprising alpha-helix 8) and in loop 1 (located between beta strands 2 and in loop 2 (located between beta-strands 6 and and in loop 3 (located between beta-strands 10 and 11) in the three-dimensional model of the Cry3A protein (Li et al., 1991, supra). To discount any observed lower toxicity of a modified Cry9C protein which is due to misfolding or structural distortion, the structural stability of mutant ICPs can be analysed by a variety of methods including toxicity to another target insect, crystal formation, solubilization, monoclonal antibody binding analysis, protease resistance, fluorometric monitoring of unfolding and circular dichroism spectrum analysis. In the case of structural distortion, it is impossible to determine the functional role of this position by alanine replacement. However, a more conservative amino acid substitution may yield a correctly folded mutant protein which allows the functional role of this position to be determined.
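The alanine-scanning design described above (alanine at every chosen position, valine where the original residue is already alanine) can be enumerated with a short sketch. The region boundaries are taken from the text; the function name, the `first_position` parameter and any sequence passed in are assumptions for illustration, since SEQ ID No. 2 is not reproduced here.

```python
# Reference positions (inclusive) chosen for modification, as listed in the text.
REGIONS = [(313, 334), (358, 369), (418, 425), (480, 492)]


def alanine_scan(sequence: str, regions=REGIONS, first_position: int = 1) -> list:
    """Return one substitution label per scanned position, e.g. 'V319A'.

    `sequence` is numbered so that its first residue has the reference
    number `first_position`. Alanine residues are replaced by valine.
    """
    labels = []
    for start, end in regions:
        for position in range(start, end + 1):
            index = position - first_position
            if not 0 <= index < len(sequence):
                raise ValueError(f"position {position} is outside the sequence")
            original = sequence[index]
            replacement = "V" if original == "A" else "A"
            labels.append(f"{original}{position}{replacement}")
    return labels
```

The four regions together cover 22 + 12 + 8 + 13 = 55 positions, so a full scan under this scheme yields 55 single mutants.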
The amino acid positions, identified above, which yield modified proteins with significantly decreased toxicity ("down-mutants") are randomized. This means that a set of 20 different mutants, representing each type of amino acid, is generated for each position of interest (the original amino acid and the alanine substitution function as a control). This method is further referred to as "amino acid randomization".
Such mutants may be generated by a variety of methods, e.g. following the PCR overlap extension method (Ho et al., 1989, Gene 77, 51-59). These mutant proteins are then tested in toxicity assays on the target insect. Mutants at each position which are more toxic, i.e. which yield higher mortality than the wild-type protein, are selected.
Such mutants with improved toxicity are termed "up-mutants". Alternatively, it is also possible to select potential up-mutants on the basis of increased reversible binding, which can be measured following the procedures of Van Rie et al. (1990, Appl. Environm. Microbiol. 56, 1378-1385) or Liang et al. (1995, J. Biol. Chem. 270, 24719-24724), which is incorporated herein by reference.
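The "amino acid randomization" and selection steps described above can be sketched as follows. This is a minimal illustration: the function names are hypothetical, the mortality values must come from real toxicity assays (none are supplied here), and the simple "greater than wild type" comparison stands in for whatever significance criterion is actually applied.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def randomize_position(sequence: str, position: int, first_position: int = 1) -> dict:
    """Return {label: variant sequence} for all 20 amino acids at `position`.

    The set includes the unchanged sequence (e.g. label 'V319V'), which
    serves as a control alongside the alanine substitution.
    """
    index = position - first_position
    if not 0 <= index < len(sequence):
        raise ValueError(f"position {position} is outside the sequence")
    original = sequence[index]
    return {
        f"{original}{position}{aa}": sequence[:index] + aa + sequence[index + 1:]
        for aa in AMINO_ACIDS
    }


def select_up_mutants(mortality: dict, wild_type_label: str) -> list:
    """Return, sorted, the labels whose mortality exceeds the wild type's."""
    baseline = mortality[wild_type_label]
    return sorted(label for label, value in mortality.items() if value > baseline)
```

Each position of interest thus yields a set of 20 sequences, 19 of which differ from the starting protein; `select_up_mutants` then keeps the variants that outperform the wild-type control in the assay data it is given.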
All or some of the "up-mutant" amino acids, identified in step 2, are combined in a single modified protein. According to additivity principles, mutations in noninteracting parts of a protein should combine to give simple additive changes in the free energy of binding (Lowman and Wells, 1993, J. Mol. Biol., 234, 564-578).
Increases in toxicity are thus accumulated by combining several single mutants into one multiple mutant. Finally a modified protein with improved toxicity is designed, which comprises some or all, preferably all, of the up-mutant amino acids previously identified.
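The additivity principle invoked above can be stated compactly. The following is only a sketch: the symbols are introduced here for illustration and do not appear in the source. It assumes that mutation i changes the binding free energy by ΔΔG_i relative to the wild type, that K_D denotes a dissociation constant, and that the n mutations lie in non-interacting parts of the protein.

```latex
\Delta\Delta G_{\mathrm{multiple}} \;\approx\; \sum_{i=1}^{n} \Delta\Delta G_{i},
\qquad
\Delta\Delta G_{i} \;=\; RT \,\ln\frac{K_{D,i}}{K_{D,\mathrm{wt}}}
```

Under these assumptions, each up-mutant with a lower K_D than the wild type contributes a negative term, so the contributions of several such mutations accumulate in the multiple mutant.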
In accordance with this invention, amino acids of domain II of a Cry9C protein, located at the protruding regions of domain II are chosen for modification. By "protruding regions of domain II", as used herein, are meant the solvent-exposed regions organized in loops, alpha helices or beta-strands which are protruding from domain II and are located at or towards the apex of the molecule.
This invention is particularly suited for improving the toxicity to an insect species for which the Cry9C protein has a rather weak toxicity. The toxicity of this improved Cry9C protein can be increased by combining amino acid mutations in the protein, each yielding an increased toxicity when compared to the amino acid present in the native Cry9C protein. Insect species for which improved Cry9C proteins can be made also include Spodoptera frugiperda, Heliothis zea, Heliothis armigera, and Agrotis ipsilon. Also, this invention is suited to increase toxicity of a Cry9C protein or its protease-resistant variant to one insect species and to decrease toxicity of the same protein to another insect species by making the proper amino acid substitutions in the protein. This may be advantageous, to limit the likelihood of insect resistance occurrence to the protein in a particular insect species.
An insecticidally effective part of the modified cry9C gene of this invention, encoding an insecticidally effective portion of the modified Cry9C protein, can be made in a conventional manner. An "insecticidally effective part" of the modified cry9C gene refers to a gene comprising a DNA coding region encoding a polypeptide with fewer amino acids than the full length modified Cry9C protein but that still retains toxicity to insects. A preferred insecticidally effective part of the Cry9C protein is the part from amino acid position 1 or 44 to amino acid position 658 in SEQ ID No. 2.
In order to express all or an insecticidally effective part of the improved cry9C gene in E. coli, in Bt strains and in plants, suitable restriction sites can be introduced, flanking each gene or gene part. This can be done by site-directed mutagenesis, using well-known procedures (Stanssens et al., 1989, Nucl. Acids Res. 12, 4441-4454; White et al., 1989, Trends in Genet. 5, 185-189).
In order to improve expression in foreign host cells such as plant cells, it may be preferred to alter the improved cry9C coding region or its insecticidally effective part to form an equivalent, artificial improved cry9C coding region. Expression is improved by selectively inactivating certain cryptic regulatory or processing elements present in the native sequence, as described in PCT publications WO 91/16432 and WO 93/09218. This can be done by site-directed mutagenesis or site-directed intron insertion (WO 93/09218), or by introducing overall changes to the codon usage, e.g., adapting the codon usage to that most preferred by the host organism (publication of European patent application number 0 385 962, EP 0 359 472, publication of PCT patent application WO 93/07278, Murray et al., 1989, supra) without significantly changing, preferably without changing, the encoded amino acid sequence. Small modifications to a DNA sequence such as described above can be routinely made by PCR-mediated mutagenesis (Ho et al., 1989, supra; White et al., 1989, supra). For major changes to the DNA sequence, DNA synthesis methods are available in the art (Davies et al., 1991, Society for Applied Bacteriology, Technical Series 28, pp. 351-359). For obtaining enhanced expression in monocot plants such as corn, a monocot intron can be added to the chimeric improved cry9C gene (Callis et al., 1987, Genes & Development 1, 1183-1200; PCT publication WO 93/07278). Another preferred embodiment of this invention is the expression of the improved Cry9C proteins by the method described in PCT patent publication WO 97/49814, which is incorporated herein by reference.
The chimeric improved cry9C gene can be stably inserted in a conventional manner into the nuclear genome of a single plant cell, and the so-transformed plant cell can be used in a conventional manner to produce a transformed plant that is insect-resistant. Particularly preferred plants in accordance with this invention are corn plants. Corn cells can be stably transformed (e.g., by electroporation) using wounded or enzyme-degraded intact tissues capable of forming compact embryogenic callus (such as corn immature embryos), or the embryogenic callus (such as type I callus in corn) obtained therefrom, as described in PCT patent publication WO 92/09696 or US Patent 5,641,664. Other methods for transformation of corn include the methods by Fromm et al. (1990, Bio/Technology 8, 833-839), Gordon-Kamm et al. (1990, The Plant Cell 2, 603-618) and Ishida et al. (1996, Nature Biotechnology 14, 745-750).
Alternatively, a disarmed Ti plasmid, containing the insecticidally effective chimeric improved cry9C gene, in Agrobacterium tumefaciens can be used to transform the plant cell, preferably the corn or cotton cell, and thereafter, a transformed plant can be regenerated from the transformed plant cell using the procedures described, for example, in EP 0116718, EP 0270822, PCT publication WO 84/02913 and EP 0242246 (which are also incorporated herein by reference), and in Gould et al. (1991, Plant Physiol. 95, 426-434) or Ishida et al. (1996, supra), particularly the method described in PCT publication WO 94/00977. Preferred Ti-plasmid vectors each contain the insecticidally effective chimeric improved cry9C gene between the border sequences, or at least located to the left of the right border sequence, of the T-DNA of the Ti-plasmid. Of course, other types of vectors can be used to transform the plant cell, using procedures such as direct gene transfer (as described, for example, in EP 0233247), pollen-mediated transformation (as described, for example, in EP 0270356, PCT publication WO 85/01856, and US Patent 4,684,611), plant RNA virus-mediated transformation (as described, for example, in EP 0067553 and US Patent 4,407,956), and liposome-mediated transformation (as described, for example, in US Patent 4,536,475).
A resulting transformed plant, such as a transformed corn or cotton plant, can be used in a conventional plant breeding scheme to produce more transformed plants with the same characteristics or to introduce the improved cry9C gene, or an insecticidally effective part thereof, in other varieties of the same or related plant species. Seeds, which are obtained from the transformed plants, contain the chimeric improved cry9C gene or its insecticidally effective part as a stable genomic insert. Cells of the transformed plant can be cultured in a conventional manner to produce the improved Cry9C protein or insecticidally effective portions thereof, which can be recovered for use in conventional insecticide compositions against insects, particularly lepidopteran insects (US Patent 5,254,799). Preferred plants in accordance with this invention, besides corn and cotton, include rice, plants of the genus Brassica such as oilseed rape, cauliflower and broccoli, and also soybean, tomato, tobacco, potato, eggplant, beet, oat, pepper, gladiolus, dahlia, chrysanthemum, sorghum, and garden peas.
The improved cry9C coding region or its insecticidally effective part is inserted in a plant cell genome so that the inserted coding region is downstream of, and under the control of, a promoter which can direct the expression of the gene part in the plant cell. This is preferably accomplished by inserting the chimeric improved cry9C gene or its insecticidally effective part in the plant cell genome. Preferred promoters include: the strong constitutive 35S promoters (the "35S promoters") of the cauliflower mosaic virus of isolates CM 1841 (Gardner et al., 1981, Nucleic Acids Research 9, 2871-2887), CabbB-S (Franck et al., 1980, Cell 21, 285-294) and CabbB-JI (Hull and Howell, 1987, Virology 86, 482-493); the ubiquitin promoter (EP 0342926); and the TR1' promoter and the TR2' promoter, which drive the expression of the 1' and 2' genes, respectively, of the T-DNA (Velten et al., 1984, EMBO J. 3, 2723-2730). Alternatively, a promoter can be utilized which is not constitutive but rather is specific for one or more tissues or organs of the plant, preferably leaf and stem tissue, whereby the inserted chimeric improved cry9C gene or its insecticidally effective part is expressed only in cells of the specific tissue(s) or organ(s). Another alternative is to use a promoter whose expression is inducible (e.g., by insect feeding or by chemical factors). Known wound-induced promoters inducing systemic expression of their gene product throughout the plant are also of particular interest.
The improved cry9C coding region, or its insecticidally effective part, is inserted in the plant genome so that the inserted coding region is upstream of suitable 3' end transcription regulation signals (i.e., transcript termination and polyadenylation signals). Preferred polyadenylation and transcript formation signals include those of the 35S gene (Mogen et al., 1990, The Plant Cell 2, 1261-1272), the octopine synthase gene (Gielen et al., 1984, EMBO J. 3, 835-845) and the T-DNA gene 7 (Velten and Schell, 1985, Nucl. Acids Res. 13, 6981-6998), which act as 3'-untranslated DNA sequences in transformed plant cells.
The chimeric improved cry9C gene, or its insecticidally effective gene part, can optionally be inserted in the plant genome as a hybrid gene (EP 0 193 259; Vaeck et al., 1987, Nature 327, 33-37) under the control of the same promoter as the coding region of a selectable marker gene, such as the coding region of the neo gene (EP 0 242 236) encoding kanamycin resistance, so that the plant expresses a fusion protein.
Preferably, the improved cry9C gene is expressed in a plant in combination with another insect control protein, another Bt-derived crystal protein or an insecticidal fragment thereof, particularly a Cry1Ab- or Cry1B-type protein, to prevent or delay the occurrence of insect resistance development (EP 0 408 403).
All or part of the improved cry9C coding region can also be used to transform bacteria, such as a B. thuringiensis which produces other insecticidal toxins (Lereclus et al., 1992, Bio/Technology 10, 418-421; Gelernter & Schwab, 1993, In Bacillus thuringiensis, An Environmental Biopesticide: Theory and Practice, pp. 89-104, eds. Entwistle, Cory, Bailey, M.J. and Higgs, John Wiley & Sons Ltd.).
Thereby, a transformed Bt strain is produced which is useful for combating a wide spectrum of insect pests or for combating insects in such a manner that insect resistance development is prevented or delayed (EP 0 408 403). Preferred promoter and 3' termination and polyadenylation sequences for the chimeric improved cry9C gene are derived from Bacillus thuringiensis genes, such as the native ICP genes.
Alternatively, the improved coding region of the invention can be inserted and expressed in endophytic and/or root-colonizing bacteria, such as bacteria of the genus Pseudomonas or Clavibacter, under the control of a Bt ICP gene promoter and 3' termination sequences. Successful transfer and expression of ICP genes into such bacteria has been described by Stock et al. (1990, Can. J. Microbiol. 36, 879-884), Dimock et al. (1989, In Biotechnology, Biopesticides and Novel Plant Pest Resistance Management, eds. Roberts, D.W. & Granados, pp. 88-92, Boyce Thompson Institute for Plant Research, Ithaca, New York), and Waalwijk et al. (1991, FEMS Microbiol. Lett. 77, 257-264). Transformation of bacteria with all or part of the improved cry9C coding region of the invention, incorporated in a suitable cloning vehicle, can be carried out in a conventional manner, preferably using conventional electroporation techniques as described in Mahillon et al. (1989, FEMS Microbiol. Letters 60, 205-210), in PCT patent publication WO 90/06999, Chassy et al. (1988, Trends Biotechnol. 6, 303-309) or other methods, as described by Lereclus et al. (1992, Bio/Technology 10, 418).
The improved Cry9C-producing strain can also be transformed with all or an insecticidally effective part of one or more DNA sequences encoding a Bt protein or an insecticidally effective part thereof, such as: a DNA encoding the Bt2 or Cry1Ab protein (US patent 5,254,799; EP 0 193 259) or the Bt109P or Cry3C protein (PCT publication WO 91/16433), or another DNA coding for an anti-Lepidoptera or an anti-Coleoptera protein. Thereby, a transformed Bt strain can be produced which is useful for combating an even greater variety of insect pests (e.g., Coleoptera and/or additional Lepidoptera) or for preventing or delaying the development of insect resistance.
For the purpose of combating insects by contacting them with the improved Cry9C protein, e.g. in the form of transformed plants or insecticidal formulations, any DNA sequence encoding any of the above described improved Cry9C proteins, can be used.
The following Examples are offered by way of illustration and not by way of limitation. The sequence listing referred to in the description and the Examples is as follows:

SEQUENCE LISTING
SEQ ID No. 1: Nucleotide sequence of the Bacillus thuringiensis cry9C gene, showing the coding region and flanking 5' and 3' regions.
SEQ ID No. 2: Amino acid sequence of the full length Bacillus thuringiensis Cry9C protein.
SEQ ID No. 3: Nucleotide sequence of a codon-optimized DNA sequence encoding a truncated Cry9C protein wherein the arginine at amino acid position 123 (corresponding to amino acid position 164 in the protein of SEQ ID No. 2) has been replaced by lysine.
SEQ ID No. 4: Amino acid sequence of the modified Cry9C protein encoded by the DNA of SEQ ID No. 3.
Unless otherwise stated in the Examples, all general materials and methods, including procedures for making and manipulating recombinant DNA, are carried out by the standardized procedures as described in volumes 1 and 2 of Ausubel et al., Current Protocols in Molecular Biology, Current Protocols, USA (1994), in Plant Molecular Biology Labfax (1993, by R.D.D. Croy, jointly published by BIOS Scientific Publications Ltd., UK and Blackwell Scientific Publications, UK) and Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Ed., Cold Spring Harbor Laboratory Press, NY (1989).
EXAMPLES:
1. CONSTRUCTION OF MODIFIED CRY9C PROTEINS

Multiple alignments between Bt crystal protein sequences, including the sequences of Cry9C, Cry3A and Cry1Aa, allowed identification of the amino acids located in the expected binding site of the Cry9C domain II. Using known alignment programs, 52 amino acid positions were identified for amino acid replacement. The amino acids in the Cry9C protein of SEQ ID No. 2 at amino acid positions 313-334, 358-369, 418-425, and 480-492 have been identified to correspond to the solvent-accessible regions most likely involved in receptor binding in the Cry3A protein, and these positions in the Cry9C protein were chosen for amino acid modification. Since alanine substitution does not alter the main chain of a protein and does not impose extreme electrostatic or steric effects, and since it eliminates the side chain beyond the beta carbon, each of the amino acids in these identified regions was changed into alanine, one by one, using splice overlap extension PCR (Ho et al., 1989, supra) on the protease-resistant form of the native cry9C gene wherein the arginine codon at position 164 was replaced by an alanine codon. The codon most preferred in the native cry9C gene for alanine, GCA, was used for these modifications. When the original codon encodes alanine, it is replaced by a valine codon (GTA). The obtained PCR fragments were ligated in pUC19-derived vectors. If not present, suitable unique restriction sites were created in the cry9C DNA. All plasmids containing modified DNA sequences were verified by sequencing the relevant portions and were found to be correctly constructed. The modified cry9C genes were expressed in transformed WK6 cells. Every mutant protein was expressed in these E. coli cells at least twice. Mutants causing problems in expression, probably caused by structural changes in these mutants, were discarded.

No gross folding aberrations of the mutants identified to be involved in toxicity (and listed in Table 1) were found, as was evidenced by the similar SDS-PAGE patterns of solubilized mutant and Cry9C(R164A) proteins following trypsin cleavage or treatment with midgut juice of the insect larvae.
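The codon substitution rule of this Example (GCA for alanine, or GTA for valine where the original codon already encodes alanine) can be sketched as follows. This is a minimal illustration only: the function names and the toy coding sequence are hypothetical and are not taken from SEQ ID No. 1, and the sketch operates on a sequence string rather than describing the splice overlap extension PCR itself.

```python
# Sketch of the substitution rule from Example 1 (toy input, not SEQ ID No. 1).
ALANINE_CODONS = {"GCT", "GCC", "GCA", "GCG"}  # alanine codons of the standard genetic code
ALA_CODON = "GCA"  # alanine codon named in the text
VAL_CODON = "GTA"  # valine codon used when the original codon encodes alanine


def scan_codon(original_codon: str) -> str:
    """Return the replacement codon for one scanned position."""
    codon = original_codon.upper()
    return VAL_CODON if codon in ALANINE_CODONS else ALA_CODON


def mutate_cds(cds: str, position: int) -> str:
    """Replace the codon at a 1-based amino acid position of a coding sequence."""
    start = 3 * (position - 1)
    if start < 0 or start + 3 > len(cds):
        raise ValueError("position outside coding sequence")
    return cds[:start] + scan_codon(cds[start:start + 3]) + cds[start + 3:]


def mutant_label(residue: str, position: int) -> str:
    """Name a mutant in the style of Table 1, e.g. F313A or A317V."""
    return f"{residue}{position}{'V' if residue == 'A' else 'A'}"


toy_cds = "ATGTTTGCTCCA"  # hypothetical: Met-Phe-Ala-Pro
print(mutate_cds(toy_cds, 2))  # Phe codon replaced by GCA
print(mutate_cds(toy_cds, 3))  # Ala codon replaced by GTA
print(mutant_label("F", 313), mutant_label("A", 317))
```

Keeping the alanine-to-valine exception in a single helper means the same rule is applied whether a codon or a residue label is being generated.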
2. INSECT TOXICITY OF THE MODIFIED CRY9C PROTEINS

Bioassays on the modified Cry9C proteins obtained in Example 1 were carried out with first instar larvae of the Southwestern corn borer, Diatraea grandiosella (family Pyralidae); the European corn borer, Ostrinia nubilalis (family Pyralidae); and the tobacco budworm, Heliothis virescens (family Noctuidae). A dilution series of each protein was surface-layered on the artificial diet to determine the LC50 value. The artificial diet consisted of: agar (20 g), water (1,000 ml), corn flour (96 g, ICN Biochemicals), yeast (30 g), wheat germs (64 g, ICN Biochemicals), Wesson salt (7.5 g, ICN Biochemicals), casein (15 g), sorbic acid (2 g), aureomycin (0.3 g), nipagin (1 g), wheat germ oil (4 ml), sucrose (15 g), cholesterol (1 g), ascorbic acid (3.5 g), and Vanderzant modified vitamin mix (12 g, ICN Biochemicals).

Larvae were placed on the diet in multi-well plates, 1 larva per well (2 for Ostrinia nubilalis). For each dilution, 24 larvae were tested, and dead and living larvae were counted after 5 days. Prior to application, the mutant proteins were digested with trypsin to release the toxin fragments. For each mutant protein, the assays were repeated at least 5 times, using two different protein preparations. As control protein, the trypsin-digested Cry9C(R164A) protein was used. The Cry9C(R164A) protein has the amino acid sequence of SEQ ID No. 2 wherein the arginine at position 164 was replaced by alanine. This protein was found to be more stable than the wild-type Cry9C toxin while retaining its toxicity to the test insects (see PCT patent publication WO 94/24264). The LC50 values were calculated with the POLO program, which is based on probit analysis (POLO-PC, LeOra Software, 1119 Shattuck Ave., Berkeley, California 94707). The results of these assays for those protein mutants which gave an LC50 value that is significantly different from that of the control protein in repeated bioassays are summarized in Table 1. It is clear that different positions in the Cry9C protein, when substituted by alanine, cause increased toxicity in each of the tested insects.
Binding assays on isolated brush border membrane vesicles of Heliothis virescens and Ostrinia nubilalis, performed as described in Van Rie et al. (1990, Appl. Environm. Microbiol. 56, 1378-1385), showed that for all but two of the modified Cry9C proteins with altered toxicity, receptor binding is also altered (i.e., an observed shift in KD value), thus confirming that for most amino acid residues altered toxicity is due to altered receptor binding. Hence, these residues are proper candidates for improvement of toxicity by amino acid randomization at or near the identified critical position.
3. COMPETITION BINDING EXPERIMENTS

The Cry9C(R164A) protein was tested in competition binding assays using the ECL protein biotinylation system (Amersham Life Sciences, Amersham International plc., UK) as described by Lambert et al. (1996, supra) to determine if competition occurred with other Bt toxins in selected insects. For the assays, 3 ng biotinylated Cry9C(R164A) protein was added to 30 µg brush border membrane vesicles in PBS buffer (comprising 0.1% BSA) in the presence of a 300-fold excess of non-biotinylated toxin (homologous competition assays were included in every test as control). Repeated competition tests showed that in both Ostrinia nubilalis and Heliothis virescens brush border membranes, there was no detectable competition in receptor binding between the (activated) Cry9C(R164A) protein and any one of the following (activated) Bt toxins: the Cry1Aa (Schnepf et al., 1985, J. Biol. Chem. 260, 6264-6272), Cry1Ab (Hofte et al., 1986, Eur. J. Biochem. 161, 271-280), Cry1Ac (Adang et al., 1985, Gene 36, 289-300), Cry1B (Brizzard & Whiteley, 1988, Nucl. Acids Res. 16, 4168-4169) and Cry1C (Honee et al., 1988, Nucl. Acids Res. 16, 6240) toxins. Thus, in these insects the Cry9C(R164A) protein binds to a different receptor than these other Bt toxins. In Diatraea grandiosella competition assays, it was found that the Cry9C(R164A) protein does compete for a receptor site with the Cry1B and Cry1C Bt toxins, but does not compete with any one of the Cry1Aa, Cry1Ab, and Cry1Ac toxins.
The same results are found for all three insects when testing the Cry9C protein with the amino acid sequence of SEQ ID No. 2 from amino acids 1-658.
Thus, in all three of these insects, the Cry9C protein and a selected non-competitively binding Bt toxin with good toxicity to the target insect can be used simultaneously in order to prevent or delay insect resistance development. In transgenic corn plants, a particularly interesting combination would be the Cry9C protein (or its protease-resistant variant) and a Cry1B and/or any of the Cry1A-type toxins for Ostrinia nubilalis control, and the Cry9C protein (or its protease-resistant variant) and any one of the Cry1A-type toxins, preferably a Cry1Ab-type toxin, for D. grandiosella control. For Heliothis virescens control, the Cry9C protein (or its protease-resistant variant) and any of the Cry1A-type toxins are preferred toxins to be co-expressed.
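The selection logic of this Example, pairing Cry9C only with toxins that do not compete for its receptor in the target insect, can be sketched as a simple lookup. The data below restate the competition results reported above; the names `TESTED_TOXINS`, `COMPETES_WITH_CRY9C` and `non_competing_partners` are illustrative, and the sketch deliberately ignores the further requirement that the partner toxin itself have good toxicity to the target insect.

```python
# Competition results as reported in Example 3 for the Cry9C(R164A) protein.
TESTED_TOXINS = ["Cry1Aa", "Cry1Ab", "Cry1Ac", "Cry1B", "Cry1C"]

# Toxins found to compete with Cry9C(R164A) for a receptor site, per insect.
COMPETES_WITH_CRY9C = {
    "Ostrinia nubilalis": set(),
    "Heliothis virescens": set(),
    "Diatraea grandiosella": {"Cry1B", "Cry1C"},
}


def non_competing_partners(insect: str) -> list:
    """List tested toxins that showed no receptor competition with Cry9C(R164A)."""
    competitors = COMPETES_WITH_CRY9C[insect]
    return [toxin for toxin in TESTED_TOXINS if toxin not in competitors]


for insect in COMPETES_WITH_CRY9C:
    print(insect, non_competing_partners(insect))
```

For D. grandiosella the lookup leaves only the Cry1A-type toxins, which matches the combination named in the text for that insect.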
4. CONSTRUCTION OF IMPROVED CRY9C PROTEINS

The modified position in every mutant protein of Example 2 giving rise to a significantly decreased or increased toxicity to an insect species is altered to all other amino acids and the toxicity is re-evaluated. The amino acids yielding the highest toxicity at a particular position are combined to form an improved Cry9C protein.

The alanine mutants yielding an increase in toxicity (up-mutant amino acid positions) are also included in such combinations to form improved Cry9C proteins for the selected insect species. Table 1 indeed already shows two up-mutant proteins for every insect tested. Analysis of all these improved Cry9C proteins in the bioassay shows that combinations of up-mutant amino acid positions can substantially increase toxicity of the Cry9C protein towards selected insect species.
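The combination step described in this Example can be sketched as choosing, for each randomized position, the substitution with the lowest LC50, and keeping it only if it is more toxic than the control. The function name and all numeric values below are hypothetical illustrations, not data from the patent, and the sketch assumes that a lower LC50 at a single position is the only selection criterion.

```python
def best_substitutions(lc50_by_position, control_lc50):
    """Pick, per position, the residue with the lowest LC50 if it beats the control."""
    chosen = {}
    for position, lc50_by_residue in sorted(lc50_by_position.items()):
        residue, lc50 = min(lc50_by_residue.items(), key=lambda item: item[1])
        if lc50 < control_lc50:  # lower LC50 means higher toxicity
            chosen[position] = residue
    return chosen


# Hypothetical LC50 values (arbitrary units); not data from the patent.
toy_assay = {
    319: {"A": 4.0, "G": 9.0, "S": 6.5},
    321: {"A": 5.0, "I": 5.5},
    418: {"A": 160.0, "K": 12.0},
}
print(best_substitutions(toy_assay, control_lc50=10.0))  # {319: 'A', 321: 'A'}
```

Position 418 is left unchanged in this toy case because none of its substitutions is more toxic than the control.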
5. GENE CONSTRUCTION AND PLANT TRANSFORMATION

A modified DNA sequence encoding a truncated Cry9C(R164K) protein for expression in corn and cotton plants is shown in SEQ ID No. 3. This DNA sequence has an optimized codon usage for plants and encodes an N- and C-terminally truncated Cry9C protein wherein an arginine amino acid has been replaced by a lysine (at position 123 in SEQ ID No. 4). Based on this DNA sequence, DNA sequences are made encoding the above improved Cry9C proteins and comprising amino acids 1 to 666 of the Cry9C(R164K) protein. Preferred codons to encode the amino acid replacements in the improved Cry9C proteins are those which are most preferred by the plant host (see Murray, 1989, supra). A chimeric improved cry9C gene comprising the 35S promoter and 35S 3' transcription termination and polyadenylation signal is constructed by routine molecular biology techniques as described in the detailed description.
Corn cells are stably transformed by either Agrobacterium-mediated transformation (Ishida et al., 1996, supra and U.S. Patent No. 5,591,616) or by electroporation using wounded and enzyme-degraded embryogenic callus, as described in WO 92/09696 or US Patent 5,641,664 (incorporated herein by reference). The resulting transformed cells are selected by means of the incorporated selectable marker gene, grown into plants and tested for susceptibility towards insects. Corn plants expressing a truncated improved Cry9C(R164K) protein wherein the amino acids at positions 364, 488, 319 and 321 have been replaced by alanine show a significantly higher protection from Ostrinia nubilalis and Diatraea grandiosella damage in comparative tests against corn plants expressing a truncated Cry9C(R164K) protein. A positive correlation is found between the level of expression, as measured by RNA and protein analysis, and the observed insecticidal effect.
Cotton cells are stably transformed by Agrobacterium-mediated transformation (Umbeck et al., 1987, Bio/Technology 5, 263-266; US Patent 5,004,863, incorporated herein by reference). The resulting transformed cells are selected by means of the incorporated selectable marker gene, grown into plants and tested for susceptibility towards insects. Cotton plants expressing the truncated improved Cry9C(R164K, L321A, P329A) protein at levels similar to those of cotton plants expressing the truncated Cry9C(R164K) protein show a significantly higher protection from Heliothis virescens damage. A positive correlation is found between the level of expression, as measured by RNA and protein analysis, and the observed insecticidal effect.
The examples and embodiments of this invention described herein are only supplied for illustrative purposes. Many variations and modifications in accordance with the present invention are known to the person skilled in the art and are included in this invention and the scope of the claims. For instance, it is possible to alter, delete or add some nucleotides or amino acids to certain regions of the DNA or protein sequences of the invention without departing from the invention.
All publications (including patent publications) referred to in this application are hereby incorporated by reference, particularly WO 94/05771, WO 94/24264, and Lambert et al. (1996, supra).
Table 1: relative toxicity of modified trypsin-digested Cry9C proteins to different insects when compared with the Cry9C(R164A) trypsin-digested protein (mutant 'F313A': the Cry9C(R164A) trypsin-digested protein wherein also the phenylalanine at position 313 is replaced by alanine; 'down (2x)': mutant protein with a significantly lower toxicity (LC50 value about 2 times higher than that of the control protein); 'up (2x)': mutant protein with a significantly higher toxicity (LC50 value about two times lower than that of the control protein); '-': no difference in toxicity found):

mutant  H. virescens  O. nubilalis  D. grandiosella
F313A  down (2x)
P316A  up (2x)
A317V  up (2x)
N318A  down (2-3x)
V319A  -  up (3x)
L321A  up (2x)  up (2x)
R323A  down (3x)
W325A  down (4-5x)  down (2x)  down (2-3x)
P329A  up (2x)
Y330A  -  down (1.5x)  up (2x)
V362A  down (3-4x)
S364A  -  up (2x)
D368A  down (2-3x)
Y369A  up (2x)
R418A  down (16x)  down (2x)
A420V  down (12x)
L421A  down (2x)  -
I422A  up (2x)
F480A  down (5x)  -  down
Q481A  down (3x)  -
N483A  down (2x)
Q484A  down
A485V  down (3x)  down (2x)  down
S487A  down (2x)  -  down
I488A  down (2x)  up (2-3x)  down
N490A  down
A491V  down (3x)

SEQUENCE LISTING
GENERAL INFORMATION:
APPLICANT:
NAME: PLANT GENETIC SYSTEMS N.V.
STREET: Jozef Plateaustraat 22
CITY: Gent
COUNTRY: Belgium
POSTAL CODE (ZIP): B-9000
TELEPHONE: (32)(9)2358411
TELEFAX: (32)(9)2231923
(ii) TITLE OF INVENTION: Improved Bacillus thuringiensis toxin
(iii) NUMBER OF SEQUENCES: 4
(iv) COMPUTER READABLE FORM:
MEDIUM TYPE: Floppy disk
COMPUTER: IBM PC compatible
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release Version #1.30 (EPO)
(vi) PRIOR APPLICATION DATA:
APPLICATION NUMBER: US 08/884,389
FILING DATE: 27-JUN-1997
INFORMATION FOR SEQ ID NO: 1:
SEQUENCE CHARACTERISTICS:
LENGTH: 4344 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
NAME/KEY:
LOCATION: 668..4141
OTHER INFORMATION: /note= "coding sequence"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GAATTCGAGC TCGGTACCTT TTCAGTGTAT CGTTTCCCTT CCATCAGGTT TTCAAATTGA
AAAGCCGAAT
GTGTAAAAAA
AAAGATTAAG
AAACATGTAT
ATATGTGGAC
TGATCATATT
TCAAGTATCA
AATAGGAATA
GATTTGAAAC
CGTATTGAGA
GGTGTCCTTT
ACAACGGTTG
CATATTTTAA
CAAGTACGTG
GGTTTGTTTT
AATACTATCC
TTGTTTACGA
TTGATGAATG
CTTTTATCGG
ATAGAGATCC
AATATAGCGT
ATTTACAAAT
GTTTTGTATA
ATTTTTTCAA
TGTAAGTCAT
TGGACAAGTA
AAATTTCTCT
GTCTATTTCC
CCAACAACTA
CAAACTGATG
TGAGTAAGAA
GAAATATTTT
TTGTCTATGA
GAAATTGACT
ATTGAACCTA
TTAAGTTTCC
CCATATTATG
AAAGAGAATC
CCGAAGGTTT
TTTATTAGAA
CGAAAGATAC
TACAAGTATT
TTCTGTGTGA
AAGATACGGT
TAATTGATGG
CGCAATCTGC
GTAAAAAAGA
AGGAATCTTT
120 180 240 300 360 420 480 540 CTTACACGGG AAAATCCTAA GATTGAGAGT AAAGATATAT ATATATAAAT ACAATAAAGA 600
GTTTGTCAGG
AAAAAGTATG
GTGTCCATCA
AAATATGAAC
AAATCCTAGT
GAGAATACTC
CCTTTTAAAT
GGTGGAGGAA
ATTGCAAGGA
TGATCGAAAT
TGATTTTGTT
AGTATATGCA
AGA1AGGATGG
AACCGCTAAG
AGGAACAAAT
ATTTTTGAAA
AATCGAAATA
GATGACGATG
TATAAAGATT
TTATCTATTA
GGGGCTTTAG
ACACTGTGGC
CTTGTCAATC
TTAGGAGACT
GATACACGAA
AATGCTATTC
CAAGCTGTGA
GGATTCACAC
TACACTAATT
ACTGAAAGTT
GATATGATAT
ATCAAAATGA
TGAGGTATCC
ACTTACAAAT
GTGGTAGAGA
GTGTTCCGTT
CAGTTAATGA
AACAAATAAC
CTTTTAATGT
ATTTAAGTGT
CATTGTTTGC
ATTTACATTT
AGGGGGAAAT
ACTGTGAAAC
GGTTAAGATA
TATTTCCATA
AGGTATATAC
GGGGTACTAA
ATCTTTTTGA
ATTTTATGGA
TACAAGAAGA
ATGGAACAAA
ATGGCGTGAA
CTAATGGAGG
CCGGAAGTTC
CTGGATC TAT
ACCTTAATAA
CACCTGTTTC
GAAGAACAAC
GAACATGCAC
ATATGAAATT
TTTGGCAAGT
GACAGATGAG
TGCAGTTCAG
TTCTGGACAA
TACAGCTATA
AGAATTTGCA
ATATCAACGT
TGTTCGTGCT
AGTAAATGGA
GTTATTATTA
TTCCACATAT
TTGGTATAAT
TCATCAATTC
TTATGATGTA
AGATCCGATT
TCCCTATAAT
TAGGCTGAAT
TTATTGGTCA
TAGTTATGGC
CCGCATAGAG
TAGAGCTTCT
ATGTAGAGAT
AACCCATAGA
AGCTAATGCA
TACGATTACC
GGGTACTACG
TAATGGCACA
TAGATTTATA
PTTGATGCCC
GACCCAAATG
GACTACACTG
1CtGCGCTTA 1ATAGTGAGTT
TGGGAAGCTT
AGAAATCAGG
TCCCTTCAAA
CAATTTATAG
CAGCAGGTTC
AAAGATGCAT
TATGACCGTC
ACAGGTTTAG
CGTAGAGAAA
CGACTTTATC
GTATTTAATC
ACTTTTTCTG
AGCTTAACAA
GGACATACGT
CTAATTACAA
TCAACGGCAG
TTTGTCCCAG
CTCTATGATA
CTATCTCATG
GGAAGTGTAC
CCAAATAGAA
GTCTTAAAAG
TTTGGAACGT
CGTTTTGCCT
GTATAGGAGG
CCCATTGTGG
CAGCGTTACA
ATTCTTATAT
CTGTTGTTGG
TTTATCAATT
TCATGCGACA
CACTTGCAAG
ATTGGTTGGC
CTTTAGACCT
CATTACTGTC
CTCTTTTTGG
AATTGGAACT
ATCGTTTAAG
TGACTTTAGT
CAACGGGATC
CACCAGCTAA
AGCTCGAAAA
TCAGCAGTAA
TACGCCGTAG
CCACAAGAGC
TAGATTTTCG
GAGGCTTGTT
CAAATGATGA
TTACCTTTTT
CTACTTATGT
TTACACAATT
GTCCAGGATT
TAAGAGTAAC
CAACAGGAAA
660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 GGTATTAGAT GTTGTGGCGC
AAACCCACAG
TGTTGGACTT
TGCCTTCATT
TCGATTTCCA
TTATCTGAAC
AACAATTAAT
TTCTGCATTG
TAATGGTACG
ATTACCACCA
TAGCTTTCAA
TTGGACCCGT
ACCATTGGTA
TACAGGAGGG
CTTACACGTG
TGCCGACGTT
CGCCCACCAC
GTTTCATCTA
GATTCAGCAG
CCCGGAGTTG
ATAGGTATAT
ACTTCTCCTG
GATGAAAGTA
ACTAATCAGG
CGTGATGTGG
AAGGCATCTG
GGTATACTCC
GGTTAATTCA CCATTAACAC AACAATATCG CCTAAGAGTT
TTTCAGTATA
AATGAACAGA
TGGTCCGTTC
AGAAGGTGTT
TCCGGCACGA
TACACGTACA
AAATTTAGTG
AGCGGTAAGA
TAATACAATC
CGAGGGCGGT
TCCAACATAC
ACTAGATGGA
AGTCCATCTT
TTGCAGCGGA
GCATCATCCA
TACAGGGGAT
AACAGATGGG
TGAATCTCTA
ACGTGCAGAA
AGACTATCAA
AAATCTTGTA
TAACTACGAA
GTCTAGAAAT
TATGGATGCA
TGCACAAGTT
AGCAAGAAAA
AGAAACTCTT
TTCGTATATA
GAGTGAATCC
GAAGAGGGGG
AAAAAAGGTA
TGAAAAAATG
AGGGTACTCC
GGGCAGGAAC
AATCCGCCTT
AGCACCGGTG
GAAGCGGAAG
AGGGACGGAT
TCATGCTTAT
GCGGCAAAAC
AATAGTACAG
CCATTCTTTA
ATTTATCAAA
TTTGTGAAGA
GTAAAAAATG
ATCAACCGTT
ATGGATTGCT
CTAAATGCAA
TATGCGACGT
GAACGGGAAC
ATAGATCGTG
GATCAACAAT
GAGTCAATTT
ATTTACACAG
GCGGTGCAAA
TCGGTTCAGC
TCCCAACAAT
GTAGGAGGCG
ACATTTAATG
ACAGAAGAAG
GAAGGTTCAT
ATCCTAACGT
AAATGAATAG
TAGCTGTTTA
GTGGAGGGGT
TAACTTACGA
TTACATTTAC
GTGAATATTA
AGGATTTAGA
TACAGGTAAA
CCGATGAACA
GCCTCAGCCG
AAGAGAATGG
AAGGTCGTGC
AAGTAGATGC
GTAGTCAAGA
TACCAGATAA
GTGATGAACA
GTGAAGCGGC
GTGTAGATCA
TAGGAAATCT
AAAGAGATAA
TGTATTTAGC
TAAATCCAGA
CGGGTGTATA
AGTTATCCGA
ATGGAGACTT
AAGATGGCAA
TGAGAGTAAA
GAGATGGATA
CATGTGACTA
TGGTATTCTA
TCTATATAGA
ATAGCAACTA
AACCCCCTAC
CTAAGGTGTA
rTCTATCGGT kTCCTTTTTC k.CAAGCTCAA
TATAGATAGA
kGCGGCGAAG rGTGACAGAT
ATATGGGCAT
C.GAACGCAAC
C.TGGAAGGCA
ACTTCAGTTA
ATCGGTGTTA
TTTAGAAATT
TTTAGTATCT
GCATCAGGTA
TCAAACACAT
GGGCATTTGG
TGAATTGGTA
TGCGAAATGG
TGCGAAACAA
AATTGGGCTA
TAGTGATACA
TCGCTTACAA
TAACAGTGGT
TATGCATTTC
TCCGAATTGT
CGTCACAATC
CGATGTAAAT
CCCAGAGACA
CAGTATTGAG
TGAGAGGATA
TGGTAGAAGG
TAAAAAACAG
,ATGTTAGAT
kCAAGAGAGT
GAGATTCTAA
k.TTGAAATTG
AAAGCGGTGG
rATCAAGTGG
GACAAAAAGA
TTACTTCAAG
1AGTAACGGTG
GCAAGCGCAA
AAGCCTTATA
GATCTCATCC
GATACTTACT
GATATGCAGC
GAGTTTTCTT
GTTGTATTAA
GAGGTTGGGC
AATGCAGAGC
GCAATTAATC
GCAGAAATTA
CTATTACAGA
CAAGCATCGT
CTAGATAGTT
TTAGTTCTTT
AAGTATGTCT
CGAGATGGCG
GGTACGTATG
AAACATATGT
TTTATTGAAA
CTCCGTACAA
ACCGATAGGG
CATATCTGAT
rAGGGAGCAC rTACTACTAC
CAGTGA.ATGC
TCCCTGTGAA
CGAGCTTGTT
IkCCAAGCGGC rGTTATTGGA
A~TCCAGATTT
TTACTATTAG
GAGAAAATTA
CACGCTATAG
ACCATCATAA
CAGATGGTTC
TAGATGCGGA
CCTATATTAA
AAGTTCGAAC
CATTATCGGG
TAGGAAGAAA
ATCTGTTTGT
ATGAAGCTTC
TTCCTGGGAT
ATCTGTATAC
GGAATACAAC
CGCATTGGGA
TACGTGTGAC
CTCATCACCA
TCAATGACAA
GGGTAGAGGT
CACAAGAGTA
ACAAAGATTA
GGTTCTTACA
AGAAAAAAGT
2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3660 3720 3780 3840 3900 3960 4020 4080 4140 4200 4260 4320
GAGTACCTTA TAAAGAAAGA ATTC
INFORMATION FOR SEQ ID NO: 2:
SEQUENCE CHARACTERISTICS:
LENGTH: 1157 amino acids
TYPE: amino acid
STRANDEDNESS:
TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: Met Asn Arg Asn Asn Gin Asn Glu Tyr Glu Ile Ile Asp Ala Pro His 1 5 10 4344 Cys Pro Thr Ser Leu Gin Glu Glu Ser 145 Asn Asp Gin Gly Asn Asp Gly Gly Phe Ala Phe 130 Phe Asp Leu Val Cys Ala Glu Arg Ala Leu Phe 115 Ala Asn Thr Asp Pro Pro Ser Asp Ala Leu Asp Tyr Asp Ala Leu Gly Leu Asn 100 Met Arg Arg Asn Val Tyr Arg Asn 165 Phe Val 180 Leu Leu Gin Thr Val 70 Val Thr Gin Gin Gin 150 Leu Asn Ser Asp Asp Val Arg Tyr Pro Leu Ala Ser Asp 25 Asn Met Asn Tyr Lys Asp Tyr Leu Gin Met 40 Asp Ser Tyr Ile Asn Pro Ser Leu Ser Ile 55 Gin Thr Ala Leu Thr Val Val Gly Arg Ile 75 Pro Phe Ser Gly Gin Ile Val Ser Phe Tyr 90 Leu Trp Pro Val Asn Asp Thr Ala Ile Trp 105 110 Val Glu Glu Leu Val Asn Gin Gin Ile Thr 120 125 Ala Leu Ala Arg Leu Gin Gly Leu Gly Asp 135 140 Arg Ser Leu Gin Asn Trp Leu Ala Asp Arg 155 160 Ser Val Val Arg Ala Gin Phe Ile Ala Leu 170 175 Ala Ile Pro Leu Phe Ala Val Asn Gly Gin 185 190 Val Tyr Ala Gin Ala Val Asn Leu His Leu 200 205 195 Leu Leu 210 Leu Lys Asp Ala Ser 215 Leu Phe Gly Glu Trp Gly Phe Thr -29- WO 99/00407 Gin Gly 225 Lys Tyr Leu Arg Arg Glu Tyr Asp 290 Glu Val 305 Leu Cys Glu Asn Leu Thr Tyr Trp 370 Val Gin 385 Asn Pro Phe Arg Val Pro Cys Arg 450 Thr Gly 465 Gin Thr Tyr Val Asn Arg Gly Thr 530 Glu Thr Gly Met 275 Val Tyr Arg Ala Ile 355 Ser Glu Gly Ser Gly 435 Asp Ser Asn Trp Ile 515 Thr Ile Asn Thr 260 Thr Arg Thr Arg Phe 340 Ser Gly Asp Va1 Ala 420 Gly Leu Ser Gin Thr 500 Thr Val Ser Tyr 245 Asn Leu Leu Asp Trp 325 Ile Ser His Ser Asp 405 Leu Leu Tyr Thr Ala 485 Arg Gin Leu rhr 230 Cys Thr Val Tyr Pro 310 Gly Arg Asn Thr Tyr 390 Gly Ile Phe Asp His 470 Gly Arg Leu Lys Tyr Glu Glu Val Pro 295 Ile Thr Pro Arg Leu 375 Gly Thr Gly Asn Thr 455 Arg Ser Asp Pro Gly 535 Tyr rhr Ser Leu 280 Thr Vai Asn Pro Phe 360 Arg Leu Asn Ile Gly 440 Asn Leu Ile Val Leu 520 Pro Asp Trp Trp 265 Asp Gly Phe Pro His 345 Pro Arg Ile Arg Tyr 425 Thr Asp Ser Ala Asp 505 Val Gly Arg Tyr 250 
Leu Va1 Ser Asn Tyr 330 Leu Vai Ser Thr Ile 410 Gly Thr Glu His Asn 490 Leu Lys Phe Gin 235 Asn Arg Vai Asn Pro 315 Asn Phe Ser Tyr Thr 395 Glu Vai Ser Leu Val 475 Ala Asn Ala Thr Leu Thr Tyr I Ala Pro 300 Pro Thr Asp Ser Leu 380 Thr Ser Asn Pro Pro 460 Thr Gly Asn Ser Gly 540 3lu ;iy His Leu 285 ln Ala Phe Arg Asn 365 Asn Arg Thr Arg Ala 445 Pro Phe Ser Thr Ala 525 Gly Leu Leu Gin 270 Phe Leu Asn Ser Leu 350 Phe Asp Ala Ala Ala 430 Asn Asp Phe Val lE 51c Prc G1 PCT/EP98/04033 Thr Ala 240 Asp Arg 255 Phe Arg Pro Tyr Thr Arg Val Gly 320 Glu Leu 335 Asn Ser Met Asp Ser Ala Thr Ile 400 Val Asp 415 Ser Phe Gly Gly Glu Ser Ser Phe 480 Pro Thr 495 Thr Pro Val Ser Ile Leu Arg Thr Thr Asn Thr Phe Gly Thr Leu 555 Arg Vai Thr Val Asn 560 WO 99/00407 PCT/EP98/04033 Ser Pro Leu Thr Gin Gin Tyr Arg Leu Arg Val Arg Phe Ala Ser Thr 565 570 575 Gly Asn Phe Ser Ile Arg Val Leu Arg Gly Gly Val Ser Ile Gly Asp 580 585 590 Val Arg Leu Gly Ser Thr Met Asn Arg Gly Gin Glu Leu Thr Tyr Glu 595 600 605 Ser Phe Phe Thr Arg Glu Phe Thr Thr Thr Gly Pro Phe Asn Pro Pro 610 615 620 Phe Thr Phe Thr Gin Ala Gin Glu Ile Leu Thr Val Asn Ala Glu Gly 625 630 635 640 Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Arg Ile Glu Ile Val Pro 645 650 655 Val Asn Pro Ala Arg Glu Ala Glu Glu Asp Leu Glu Ala Ala Lys Lys 660 665 670 Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gin Val Asn 675 680 685 Val Thr Asp Tyr Gin Val Asp Gin Ala Ala Asn Leu Val Ser Cys Leu 690 695 700 Ser Asp Glu Gin Tyr Gly His Asp Lys Lys Met Leu Leu Glu Ala Val 705 710 715 720 Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gin Asp Pro 725 730 735 Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser 740 745 750 Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe Phe Lys Gly Arg Ala 755 760 765 Leu Gin Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile Tyr Gin 770 775 780 Lys Val Asp Ala Ser Val Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp 785 790 795 800 Gly Phe Val Lys Ser Ser Gin Asp Leu Glu Ile Asp Leu Ile His His 805 810 815 His 
Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser Asp 820 825 830
Thr Tyr Ser Asp Gly Ser Cys Ser Gly Ile Asn Arg Cys Asp Glu Gln 835 840 845
His Gln Val Asp Met Gln Leu Asp Ala Glu His His Pro Met Asp Cys 850 855 860
Cys Glu Ala Ala Gln Thr His Glu Phe Ser Ser Tyr Ile Asn Thr Gly 865 870 875 880
Asp Leu Asn Ala Ser Val Asp Gln Gly Ile Trp Val Val Leu Lys Val 885 890 895
Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu 900 905 910
Val Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg Asp Asn 915 920 925
Ala Lys Trp Asn Ala Glu Leu Gly Arg Lys Arg Ala Glu Ile Asp Arg 930 935 940
Val Tyr Leu Ala Ala Lys Gln Ala Ile Asn His Leu Phe Val Asp Tyr 945 950 955 960
Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly Leu Ala Glu Ile Asn Glu 965 970 975
Ala Ser Asn Leu Val Glu Ser Ile Ser Gly Val Tyr Ser Asp Thr Leu 980 985 990
Leu Gln Ile Pro Gly Ile Asn Tyr Glu Ile Tyr Thr Glu Leu Ser Asp 995 1000 1005
Arg Leu Gln Gln Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val Gln 1010 1015 1020
Asn Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Thr Thr Met Asp 1025 1030 1035 1040
Ala Ser Val Gln Gln Asp Gly Asn Met His Phe Leu Val Leu Ser His 1045 1050 1055
Trp Asp Ala Gln Val Ser Gln Gln Leu Arg Val Asn Pro Asn Cys Lys 1060 1065 1070
Tyr Val Leu Arg Val Thr Ala Arg Lys Val Gly Gly Gly Asp Gly Tyr 1075 1080 1085
Val Thr Ile Arg Asp Gly Ala His His Gln Glu Thr Leu Thr Phe Asn 1090 1095 1100
Ala Cys Asp Tyr Asp Val Asn Gly Thr Tyr Val Asn Asp Asn Ser Tyr 1105 1110 1115 1120
Ile Thr Glu Glu Val Val Phe Tyr Pro Glu Thr Lys His Met Trp Val 1125 1130 1135
Glu Val Ser Glu Ser Glu Gly Ser Phe Tyr Ile Asp Ser Ile Glu Phe 1140 1145 1150
Ile Glu Thr Gln Glu 1155
INFORMATION FOR SEQ ID NO: 3:
SEQUENCE CHARACTERISTICS:
LENGTH: 1897 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid
DESCRIPTION: /desc "synthetic DNA"
(ix) FEATURE:
NAME/KEY:
LOCATION: 13..1890
OTHER INFORMATION: /note= "coding sequence"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GGTACCAAAA
ATCAACCCCA
GGTCGCATCC
TTCCTGCTGA
CAGGTGGAGG
CGCCTGCAGG
GCCGACCGCA
CTGGACTTCG
AGCGTGTACG
GGCGAGGGCT
CTGACCGCCA
AGGGGCACCA
GTGGTGCTGG
AGCAACCCCC
AACGTGGGCC
AACGCCTTCA
AATCGATTCC
AGCTACCTGA
GCCACCATCA
CGCAGCGCTC
TTCAACGGCA
GAGCTGCCAC
TTCAGCTTCC
GTGTGGACCT
CTGCCCCTGC
TTCACCGGTC
ACCGTGAATJ
CCATGGCTGA
GCCTGAGCAT
TGGGTGCCCT
ACACCCTGTG
AGCTGGTGAA
GCCTGGGCGA
ACGACACCAA
TGAACGCCAT
CCCAGGCCGT
GGGGCTTCAC
AGTACACCAA
ACACCGAGAG
ACGTGGTGGC
AGCTGACACG
TGTGCCGCAG
TCAGGCCACC
CCGTGAGCAC
ACGACAGCGC
.ACCCAGGCG1 TGATCGGCA9
CCACCAGCCC
CCGACGAGAC
AGACCAACC2
GGAGGGACG'
TGAAGGCCA(
GCGGTATAC'
P' CCCCACTGAI
CTACCTGCAG
CAGCGGTCGC
GGGCGTGCCC
GCCAGTGAAC
CCAGCAGATC
CAGCTTCAAC
GAACCTGAGC
CCCCCTGTTC
GAACCTGCAC
CCAGGGCGAG
CTACTGCGAG
CTGGCTGCGC
CCTGTTCCCC
TGAGGTGTAC
GTGGGGCACC
CCACCTGTTC
CAACTTCATG
CGTGCAGGAG
GGACGGCACC
CTACGGCGTC
AGCCAACGGI
CACCGGCAGC
k. GGCTGGCAGC r' GGACCTGAAC 3CGCTCCCGTC r GCGCAGGACC
:CCAGCAGTAC
ATGACCGACG AGGACTACAC
GACGCCGTGC
TTCAGCGGTC
GACACCGCCA
ACCGAGTTCG
GTGTACCAGC
GTGGTGAGGG
GCCGTGAACG
CTGCTGCTGC
ATCAGCACCT
ACCTGGTACA
TACCACCAGT
TACTACGACG
ACCGACCCCA
AACCCCTACA
GACCGCCTGA
GACTACTGGA
GACAGC TACO
AACCGCATCG
AACAGGGCC.P
GGCTGCCGAC
*AGCACCCACC
*ATCGCCAACC
-AACACCATC1
AGCGGCACCI
-ACCAACGGC2
SCGCCTGCGC(
AGACCGCTCT
AGATCGTGAG
TCTGGGAAGC
CTCGCAACCA
GCAGCCTGCA
CCCAGTTCAT
GCCAGCAGGT
TGAAGGATGC
ACTACGACCG
ACACCGGTCT
TCCGCAGGGA
TGCGCCTGTA
TCGTGTTCAA
ACACCTTCAG
ACAGCCTGAC
*GCGGTCACAC
GCCTGATCAC
AGAGCACCGC
GCTTCGTGCC
ATCTGTACG.A
GCCTGAGCCA
CTGGCAGCG7
SCCCCCAACCC
k. CCGTGCTGM7 k. CCTTCGGCAC 3 TGCGCTTCGC
CGACAGCTAC
GACCGTGGTG
CTTCTACCAG
TTTCATGCGC
GGCCCTGGCT
GAACTGGCTG
CGCCCTGGAC
GCCCCTGCTG
ATCCCTGTTC
CCAGCTCGAG
GGACCGCCTG
GATGACCCTG
CCCCACCGGC
CCCACCAGCC
CGAGCTGGAG
CATCAGCAGC
CCTGCGCAGG
CACCACCAGG
TGTGGACTTC
AGGTGGCCTG
*CACCAACGAC
CGTCACCTTC
GCCCACCTAC
SCATCACCCAG
LGGGTCCAGGC
CCTGCGCGTG
CAGCACCGGC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680
AACTTCAGCA TCCGCGTGCT GAGGGGTGGC GTGAGCATCG GCGACGTGCG CCTGGGCAGC
ACCATGAACA GGGGCCAGGA GCTGACCTAC GAGAGCTTCT TCACCCGCGA GTTCACCACC
ACCGGTCCCT TCAACCCACC CTTCACCTTC ACCCAGGCCC AGGAGATCCT GACCGTGAAC
GCCGAGGGCG TGAGCACCGG TGGCGAGTAC TACATCGACC GCATCGAGAT CGTGCCCGTG
AACCCAGCTC GCGAGGCCGA GGAGGACTGA GGCTAGC
INFORMATION FOR SEQ ID NO: 4:
SEQUENCE CHARACTERISTICS:
LENGTH: 625 amino acids
TYPE: amino acid
STRANDEDNESS:
TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: Met Ala Asp Tyr Leu Gin Met Thr Asp Giu Asp Tyr Thr Asp Ser Tyr 1 5 10 1740 1800 1860 1897 Ile Leu Gly Vai Leu Arg Gin Arg Leu 145 Gln Giy Asn Thr Gin Asn Val1 Leu Asn Ala 130 Phe Ala Glu Pro Val1 Ile Asp Asn Gin Trp, 115 Gin Ala Val Gly Ser Val Val Thr Gin Gly 100 Leu Phe Val Asn Trp 180 Leu Gly Ser Ala Gin Leu Ala Ile Asn Leu 165 Gly Ser Arg Phe Ile 70 Ile Gly Asp Ala Gly 150 His Phe Ile Ile Tyr 55 Trp Thr Asp Arg Leu 135 Gin Leu Thr Ser Leu 40 Gin Giu Giu Ser Asn 120 Asp Gin Leu Gin Lys 200 Gly 25 Gly Phe Aia Phe Phe 105 Asp Leu Val Leu Giy 185 Tyr Arg Al a Leu Phe Al a 90 Asn Thr Asp Pro Leu 170 Giu Thr Asp Leu Leu Met 75 Arg Vai Lys Phe Leu 155 Lys Ile Asn Ala Gly Asn Arg Asn Tyr Asn Val 140 Leu Asp Ser Tyr Val Val Thr Gln Gln Gin Leu 125 Asn Ser Ala Thr Cys 205 Gln Pro Leu Val1 Al a Arg 110 Ser Ala Vai Ser Tyr 190 Glu Thr Phe Trp Giu Leu Ser Val Ile Tyr Leu 175 Tyr Thr Ala Ser Pro Glu Ala Leu Val Pro Ala 160 Phe Asp Trp Arg Gin Leu Glu Leu Thr Ala 195 -34- WO -99/00407 PCTIEP98/04033 Tyr Asn Thr Gly Leu Asp Arg Leu Arg Gly Thr Asn Thr Glu Ser Trp 210 215 220 Leu 225 Val Ser Asn Tyr Leu 305 Val Ser Thr Ile Gly 385 Thr Glu His Asn Leu 465 Lys Phe Thr krg Ja1 Asn Pro Asn 290 Phe Ser Tyr Thr Glu 370 Va1 Ser Leu Val Ala 450 Asn Ala Thi Let Tyr Ala I Pro Pro 275 Thr Asp Ser Leu Thr 355 Ser Asn Pro Pro Thr 435 Gly Asn Ser Gly 1 Arg 515 is eu 31n i 260 kla Phe Arg Asn Asn 340 Arg Thr Arg Ala Pro 420 Phe Ser Thr Ala Gly 500 Val .ln Phe 245 Leu Asn Ser Leu Phe 325 Asp Ala Ala Ala Asn 405 Asp Phe Val Ile Pro 485 Gly Thi Phe 230 Pro Thr Val Glu Asn 310 Met Ser Thr Val Ser 390 Gly Glu Ser Pro Thr 470 Val Ile Val krg Iyr krg Gly Leu 295 Ser Asp Ala Ile Asp 375 Phe Gly Ser Phe Thr 455 Pro Ser Let Asr Arg Tyr Glu Leu 280 Glu Leu Tyr Val Asn 360 Phe Va1 Cys Thr Gin 440 STyr Asr Glj Ar 1 Se 52( Glu D Asp Val 265 Cys 2 Asn 2 Thr Trp Gin 4 345 Pro Arg Pro Arg Gly 425 Thr Val Arg Thr Arg 505 Pro 
0 let Tal ,50 yr krg kia Ile Ser 330 Glu Gly Ser Gly Asp 410 Ser Asn Trp Ile Thr 490 Thy Let Thr I 235 Arg Thr 2 Arg Phe Ser 315 Gly Asp Val Ala Gly 395 Leu Ser Gin Thr Thr 475 Val Thr 1 Thr .eu eu ksp rrp Ile 300 Ser His Ser Asp Leu 380 Leu Tyr Thr Ala Arg 460 Gin Leu Asn Glr Val N Tyr I Pro Gly 285 Arg Asn Thr Tyr Gly 365 Ile Phe Asp His Gly 445 Arg Leu Lys Gly Gin 525 Tal ?ro Ile 270 rhr Pro Arg Leu Gly 350 Thr Gly Asn Thr Arg 430 Ser Asp Pro Gly Thr 510 TyT.
Leu 2 Thr 255 Val Asn Pro Phe Arg 335 Leu Asn Ile Gly Asn 415 Leu Ile Val Leu Pro 495 Phe Arg Asp 240 Gly Phe Pro His Pro 320 Arg Ile Arg Tyr Thr 400 Asp Ser Ala Asp Val 480 Gly Gly Leu Arg Val Arg Phe Ala Ser Thr Gly Asn Phe Ser Ile Arg Val Leu Arg 530 535 540 Gly Gly Val 545 Gly Gln Glu Thr Gly Pro Leu Thr Val 595 Asp Arg Ile 610 Ser Ile Gly Asp Val 550 Leu Thr Tyr Glu Ser 565 Arg Leu Gly Ser 555 Phe Phe Thr Arg 570 Thr Phe Thr Gln 585 Ser Thr Gly Gly Thr Met Asn Arg 560 Phe 580 Asn Asn Pro Pro Phe Ala Glu Gly Val 600 Glu Phe Thr Thr 575 Ala Gln Glu Ile 590 Glu Tyr Tyr Ile 605 Glu Ala Glu Glu Glu Ile Val Pro Val Asn Pro Ala Arg 615 620 Asp 625
"Comprises/comprising" when used in this specification is taken to specify the presence of stated features, integers, steps or components but does not preclude the presence or addition of one or more other features, integers, steps, components or groups thereof.
Claims (12)
1. A modified Cry9C protein with an improved toxicity to an insect species, comprising the amino acid sequence of SEQ ID No. 2 or an insecticidally-effective fragment thereof, wherein at least one amino acid at the following amino acid positions in SEQ ID No. 2 is replaced by another amino acid: 316, 317, 319, 321, 329, 330, 364, 369, 422, or 488.
2. The modified Cry9C protein of claim 1 with improved toxicity to Ostrinia nubilalis, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein at least the amino acids at positions 364 and 488 in SEQ ID No. 2 are replaced by other amino acids.
3. The modified Cry9C protein of claim 1 with improved toxicity to Heliothis virescens, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein the amino acid at position 321 or position 329 in SEQ ID No. 2, is replaced by another amino acid.
4. The modified Cry9C protein of claim 1 with improved toxicity to Diatraea grandiosella, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein the amino acid at any one or all of amino acid positions 316, 317, 319, 321, 330, 369, or 422 in SEQ ID No. 2 is replaced by another amino acid.
5. The modified Cry9C protein of any one of claims 1 to 4 wherein the arginine at position 164 in SEQ ID No. 2 is replaced by another amino acid.
6. The modified Cry9C protein of any one of claims 1 to 4 wherein said at least one amino acid position is replaced by alanine.
7. A DNA sequence encoding the protein of any one of claims 1 to 4.
8. A DNA sequence encoding the protein of claim 5 or 6.
9. A plant, comprising the DNA of claim 7 or 8.
10. A seed, comprising the DNA of claim 7 or 8.
11. The plant of claim 9 which is selected from the group consisting of: corn, cotton, rice, oilseed rape, cauliflower, broccoli, soybean, tomato, tobacco, potato, eggplant, beet, oat, pepper, gladiolus, dahlia, chrysanthemum, sorghum, and garden peas.
12. A method for controlling insects feeding on a plant, comprising expressing the protein of any one of claims 1 to 4 in a plant.
13. A method for controlling insects feeding on a plant, comprising growing the plant of claim 9.
14. A method of obtaining a seed comprising the DNA of claim 7 or 8 comprising inserting said DNA into the genome of a plant and harvesting the seed from said plant.
DATED this 13th day of September 2001
PLANT GENETIC SYSTEMS N.V.
WATERMARK PATENT TRADEMARK ATTORNEYS 290 BURWOOD ROAD HAWTHORN VICTORIA 3122 AUSTRALIA
P16581AU00 KJS:AMT:SLB
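By way of illustration only, the position-based substitutions recited in claims 1 to 6 can be sketched in Python. The function names, the one-letter residue encoding, and the placeholder sequence are assumptions made for this sketch and do not appear in the specification; in particular, the placeholder string is not SEQ ID No. 2.

```python
"""Sketch of the position-based substitutions recited in claims 1 to 6."""

# Positions named in claim 1, numbered from 1 as in SEQ ID No. 2.
CLAIM_1_POSITIONS = (316, 317, 319, 321, 329, 330, 364, 369, 422, 488)

# Positions tied to each target insect in claims 2 to 4.
POSITIONS_BY_INSECT = {
    "Ostrinia nubilalis": (364, 488),
    "Heliothis virescens": (321, 329),
    "Diatraea grandiosella": (316, 317, 319, 321, 330, 369, 422),
}


def substitute(sequence: str, position: int, new_residue: str = "A") -> str:
    """Return a copy of `sequence` with the residue at a 1-based position replaced.

    The default "A" is the one-letter code for alanine (claim 6).
    """
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside 1..{len(sequence)}")
    index = position - 1  # convert 1-based numbering to a 0-based string index
    return sequence[:index] + new_residue + sequence[index + 1:]


def changed_claim_positions(reference: str, variant: str) -> list[int]:
    """List the claim 1 positions at which `variant` differs from `reference`."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    return [
        p for p in CLAIM_1_POSITIONS
        if p <= len(reference) and reference[p - 1] != variant[p - 1]
    ]


# Hypothetical placeholder: 658 glycines stand in for residues 1 to 658
# of SEQ ID No. 2. It is NOT the real sequence.
placeholder = "G" * 658
variant = placeholder
for pos in POSITIONS_BY_INSECT["Ostrinia nubilalis"]:
    variant = substitute(variant, pos)

print(changed_claim_positions(placeholder, variant))  # [364, 488]
```

Positions are kept 1-based throughout, matching the numbering of SEQ ID No. 2 in the claims, and are converted to string indices only inside `substitute`.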
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US88438997A | 1997-06-27 | 1997-06-27 | |
US08/884389 | 1997-06-27 | ||
PCT/EP1998/004033 WO1999000407A2 (en) | 1997-06-27 | 1998-06-25 | Improved bacillus thuringiensis toxin |
Publications (2)
Publication Number | Publication Date |
---|---|
AU8804498A AU8804498A (en) | 1999-01-19 |
AU741600B2 (en) | 2001-12-06 |
Family
ID=25384517
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU88044/98A Ceased AU741600B2 (en) | 1997-06-27 | 1998-06-25 | Improved bacillus thuringiensis toxin |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP0989998A1 (en) |
AU (1) | AU741600B2 (en) |
CA (1) | CA2290718A1 (en) |
WO (1) | WO1999000407A2 (en) |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6570005B1 (en) | 1996-07-01 | 2003-05-27 | Mycogen Corporation | Toxins active against pests |
US6369213B1 (en) | 1996-07-01 | 2002-04-09 | Mycogen Corporation | Toxins active against pests |
AU7518100A (en) * | 1999-09-17 | 2001-04-24 | Aventis Cropscience N.V. | Insect-resistant rice plants |
ES2321375T3 (en) | 1999-12-28 | 2009-06-05 | Bayer Bioscience N.V. | INSECTICIDE PROTEINS OF BACILLUS THURINGIENSIS. |
JP2004518616A (en) * | 2000-08-11 | 2004-06-24 | ザ レジェンツ オブ ザ ユニヴァースティ オブ カリフォルニア | Methods for blocking resistance of insects and nematodes to Bt toxins |
US7230167B2 (en) | 2001-08-31 | 2007-06-12 | Syngenta Participations Ag | Modified Cry3A toxins and nucleic acid sequences coding therefor |
EP1694846B1 (en) | 2003-12-10 | 2014-10-01 | Novozymes A/S | A cell with improved secretion mediated by mrga protein or homologue |
WO2005066202A2 (en) | 2003-12-22 | 2005-07-21 | E.I. Du Pont De Nemours And Company | Bacillus cry9 family members |
UA94893C2 (en) | 2004-03-25 | 2011-06-25 | Сингента Партисипейшнс Аг | Transgenic maize plant mir604 |
JP4899180B2 (en) * | 2004-12-22 | 2012-03-21 | 独立行政法人農業・食品産業技術総合研究機構 | Primer set for nucleic acid test, test kit and test method using the same |
ES2601577T3 (en) | 2007-03-28 | 2017-02-15 | Syngenta Participations Ag | Insecticidal proteins |
US9522937B2 (en) | 2007-03-28 | 2016-12-20 | Syngenta Participations Ag | Insecticidal proteins |
BRPI0924153A2 (en) * | 2009-01-23 | 2016-05-24 | Pioneer Hi Bred Int | isolated nucleic acid molecule, DNA construct, host cell, transgenic plant, transformed seed, pesticide-isolated isolated polypeptide, composition and method for controlling an insect pest population, for exterminating an insect pest, for producing an active polypeptide pesticide and to protect a plant from a pest |
DE102015113908B4 (en) | 2015-08-21 | 2023-05-04 | Truma Gerätetechnik GmbH & Co. KG | level gauge |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0317511A3 (en) * | 1987-11-18 | 1991-10-16 | Ciba-Geigy Ag | Insecticidal cotton plant cells |
EP0447493A1 (en) * | 1988-12-12 | 1991-09-25 | Plant Genetic Systems, N.V. | New strain of bacillus thuringiensis |
EP0694062B1 (en) * | 1993-04-09 | 2010-08-04 | Bayer BioScience N.V. | New bacillus thuringiensis strains and their insecticidal proteins |
1998
- 1998-06-25: EP application EP98939581A (EP0989998A1), withdrawn
- 1998-06-25: AU application AU88044/98A (AU741600B2), ceased
- 1998-06-25: CA application CA002290718A (CA2290718A1), abandoned
- 1998-06-25: WO application PCT/EP1998/004033 (WO1999000407A2), application discontinued
Non-Patent Citations (1)
Title |
---|
GENE. 1996 VOL.179 PP111-117 * |
Also Published As
Publication number | Publication date |
---|---|
EP0989998A1 (en) | 2000-04-05 |
WO1999000407A3 (en) | 1999-05-14 |
WO1999000407A2 (en) | 1999-01-07 |
CA2290718A1 (en) | 1999-01-07 |
AU8804498A (en) | 1999-01-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2702998C (en) | Axmi-066 and axmi-076: delta-endotoxin proteins and methods for their use | |
EP1594966B1 (en) | Delta-endotoxin genes and methods for their use | |
US8829279B2 (en) | Family of pesticidal proteins and methods for their use | |
AU623530B2 (en) | Prevention of bt resistance development | |
EP1844064B1 (en) | Axmi-018, axmi-020, and axmi-021, a family of delta-endotoxin genes and methods for their use | |
US5659123A (en) | Diabrotica toxins | |
US20080172764A1 (en) | Axmi-004, a delta-endotoxin gene and methods for its use | |
EP1988099B1 (en) | Bacillus thuringiensis insecticidal proteins | |
AU2002252974A1 (en) | Bacillus thuringiensis insecticidal proteins | |
EP1352068A2 (en) | Bacillus thuringiensis insecticidal proteins | |
AU741600B2 (en) | Improved bacillus thuringiensis toxin | |
JP4369758B2 (en) | Chimeric δ Endotoxin Proteins Cry1Ea and Cry1Ca | |
NZ561959A (en) | Delta-endotoxin genes of bacillus thuringiensis and methods for their use as a pesticide |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FGA | Letters patent sealed or granted (standard patent) | ||
MK14 | Patent ceased section 143(a) (annual fees not paid) or expired |