CA2290718A1 - Improved bacillus thuringiensis toxin - Google Patents
Improved bacillus thuringiensis toxin Download PDFInfo
- Publication number
- CA2290718A1 CA2290718A1 CA002290718A CA2290718A CA2290718A1 CA 2290718 A1 CA2290718 A1 CA 2290718A1 CA 002290718 A CA002290718 A CA 002290718A CA 2290718 A CA2290718 A CA 2290718A CA 2290718 A1 CA2290718 A1 CA 2290718A1
- Authority
- CA
- Canada
- Prior art keywords
- amino acid
- protein
- cry9c
- leu
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 239000003053 toxin Substances 0.000 title description 26
- 231100000765 toxin Toxicity 0.000 title description 26
- 241000193388 Bacillus thuringiensis Species 0.000 title description 7
- 229940097012 bacillus thuringiensis Drugs 0.000 title description 7
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 250
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 200
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 96
- 150000001413 amino acids Chemical class 0.000 claims abstract description 86
- 241000238631 Hexapoda Species 0.000 claims abstract description 73
- 231100000419 toxicity Toxicity 0.000 claims abstract description 56
- 230000001988 toxicity Effects 0.000 claims abstract description 56
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 28
- 235000018102 proteins Nutrition 0.000 claims description 193
- 235000001014 amino acid Nutrition 0.000 claims description 154
- 241000196324 Embryophyta Species 0.000 claims description 70
- 235000004279 alanine Nutrition 0.000 claims description 26
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 22
- 238000000034 method Methods 0.000 claims description 21
- 108020004414 DNA Proteins 0.000 claims description 16
- 241000256244 Heliothis virescens Species 0.000 claims description 14
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 13
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 13
- 235000005822 corn Nutrition 0.000 claims description 13
- 241000879145 Diatraea grandiosella Species 0.000 claims description 12
- 241001147398 Ostrinia nubilalis Species 0.000 claims description 12
- 239000012634 fragment Substances 0.000 claims description 11
- 239000004475 Arginine Substances 0.000 claims description 8
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 8
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 claims description 4
- 240000003259 Brassica oleracea var. botrytis Species 0.000 claims description 4
- 229920000742 Cotton Polymers 0.000 claims description 4
- 241000219146 Gossypium Species 0.000 claims description 4
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 claims description 4
- 235000007319 Avena orientalis Nutrition 0.000 claims description 2
- 241000209763 Avena sativa Species 0.000 claims description 2
- 235000007558 Avena sp Nutrition 0.000 claims description 2
- 235000016068 Berberis vulgaris Nutrition 0.000 claims description 2
- 241000335053 Beta vulgaris Species 0.000 claims description 2
- 240000002791 Brassica napus Species 0.000 claims description 2
- 235000006008 Brassica napus var napus Nutrition 0.000 claims description 2
- 235000017647 Brassica oleracea var italica Nutrition 0.000 claims description 2
- 235000002566 Capsicum Nutrition 0.000 claims description 2
- 235000007516 Chrysanthemum Nutrition 0.000 claims description 2
- 244000189548 Chrysanthemum x morifolium Species 0.000 claims description 2
- 244000115658 Dahlia pinnata Species 0.000 claims description 2
- 235000012040 Dahlia pinnata Nutrition 0.000 claims description 2
- 241000245654 Gladiolus Species 0.000 claims description 2
- 244000068988 Glycine max Species 0.000 claims description 2
- 235000010469 Glycine max Nutrition 0.000 claims description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 2
- 244000061176 Nicotiana tabacum Species 0.000 claims description 2
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 2
- 240000007594 Oryza sativa Species 0.000 claims description 2
- 235000007164 Oryza sativa Nutrition 0.000 claims description 2
- 239000006002 Pepper Substances 0.000 claims description 2
- 235000016761 Piper aduncum Nutrition 0.000 claims description 2
- 235000017804 Piper guineense Nutrition 0.000 claims description 2
- 235000008184 Piper nigrum Nutrition 0.000 claims description 2
- 240000004713 Pisum sativum Species 0.000 claims description 2
- 235000010582 Pisum sativum Nutrition 0.000 claims description 2
- 240000003768 Solanum lycopersicum Species 0.000 claims description 2
- 244000061458 Solanum melongena Species 0.000 claims description 2
- 235000002597 Solanum melongena Nutrition 0.000 claims description 2
- 244000061456 Solanum tuberosum Species 0.000 claims description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 2
- 235000011684 Sorghum saccharatum Nutrition 0.000 claims description 2
- 210000000081 body of the sternum Anatomy 0.000 claims description 2
- 235000009566 rice Nutrition 0.000 claims description 2
- 244000203593 Piper nigrum Species 0.000 claims 1
- 240000006394 Sorghum bicolor Species 0.000 claims 1
- 240000008042 Zea mays Species 0.000 claims 1
- 238000003306 harvesting Methods 0.000 claims 1
- 231100000171 higher toxicity Toxicity 0.000 abstract description 9
- 238000004458 analytical method Methods 0.000 abstract description 8
- 238000002703 mutagenesis Methods 0.000 abstract description 2
- 231100000350 mutagenesis Toxicity 0.000 abstract description 2
- 210000004027 cell Anatomy 0.000 description 37
- 108700012359 toxins Proteins 0.000 description 25
- 101150102464 Cry1 gene Proteins 0.000 description 22
- 108091026890 Coding region Proteins 0.000 description 20
- 230000014509 gene expression Effects 0.000 description 19
- 108091005804 Peptidases Proteins 0.000 description 17
- 239000004365 Protease Substances 0.000 description 17
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 16
- 101100497222 Bacillus thuringiensis cry1Af gene Proteins 0.000 description 14
- 101150041868 cry1Aa gene Proteins 0.000 description 14
- 241000209149 Zea Species 0.000 description 12
- 102000005962 receptors Human genes 0.000 description 12
- 108020003175 receptors Proteins 0.000 description 12
- 229920001940 conductive polymer Polymers 0.000 description 11
- 238000009616 inductively coupled plasma Methods 0.000 description 11
- 230000000749 insecticidal effect Effects 0.000 description 9
- 108020004705 Codon Proteins 0.000 description 8
- 108010021466 Mutant Proteins Proteins 0.000 description 8
- 102000008300 Mutant Proteins Human genes 0.000 description 8
- 230000009466 transformation Effects 0.000 description 8
- 238000011161 development Methods 0.000 description 7
- 239000012528 membrane Substances 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 240000002024 Gossypium herbaceum Species 0.000 description 6
- 235000004341 Gossypium herbaceum Nutrition 0.000 description 6
- 239000002253 acid Substances 0.000 description 6
- 150000007513 acids Chemical class 0.000 description 6
- 230000001404 mediated effect Effects 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 5
- 101710151559 Crystal protein Proteins 0.000 description 5
- 241001057636 Dracaena deremensis Species 0.000 description 5
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 238000003556 assay Methods 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 210000001072 colon Anatomy 0.000 description 5
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 5
- 108091005573 modified proteins Proteins 0.000 description 5
- 102000035118 modified proteins Human genes 0.000 description 5
- 230000007170 pathology Effects 0.000 description 5
- 230000008488 polyadenylation Effects 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 230000002588 toxic effect Effects 0.000 description 5
- 230000009261 transgenic effect Effects 0.000 description 5
- 239000004474 valine Substances 0.000 description 5
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 4
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 4
- 206010020649 Hyperkeratosis Diseases 0.000 description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 4
- 239000004472 Lysine Substances 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 4
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 4
- 241000607479 Yersinia pestis Species 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 4
- 210000000110 microvilli Anatomy 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 239000002773 nucleotide Substances 0.000 description 4
- 125000003729 nucleotide group Chemical group 0.000 description 4
- 239000011148 porous material Substances 0.000 description 4
- 231100000331 toxic Toxicity 0.000 description 4
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 3
- 108700003918 Bacillus Thuringiensis insecticidal crystal Proteins 0.000 description 3
- 108700010070 Codon Usage Proteins 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 3
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 3
- 102000004142 Trypsin Human genes 0.000 description 3
- 108090000631 Trypsin Proteins 0.000 description 3
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- -1 alanine amino acid Chemical class 0.000 description 3
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 238000004166 bioassay Methods 0.000 description 3
- 230000000853 biopesticidal effect Effects 0.000 description 3
- 230000006378 damage Effects 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 230000000408 embryogenic effect Effects 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 239000012588 trypsin Substances 0.000 description 3
- 108010073969 valyllysine Proteins 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 2
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 2
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 2
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 2
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 2
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 2
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 2
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 2
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 2
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 2
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 2
- YUUIAUXBNOHFRJ-IHRRRGAJSA-N Asn-Phe-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O YUUIAUXBNOHFRJ-IHRRRGAJSA-N 0.000 description 2
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 2
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 2
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 2
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 2
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 2
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 2
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 2
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 2
- HTSSXFASOUSJQG-IHPCNDPISA-N Asp-Tyr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HTSSXFASOUSJQG-IHPCNDPISA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- BDWIZLQVVWQMTB-XKBZYTNZSA-N Cys-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)O BDWIZLQVVWQMTB-XKBZYTNZSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- DXMPMSWUZVNBSG-QEJZJMRPSA-N Gln-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N DXMPMSWUZVNBSG-QEJZJMRPSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 2
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 2
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 2
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 2
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 2
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 2
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 2
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 2
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 2
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 2
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 2
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 2
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 2
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 2
- RWHRUZORDWZESH-ZQINRCPSSA-N Ile-Trp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RWHRUZORDWZESH-ZQINRCPSSA-N 0.000 description 2
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- 241000209510 Liliopsida Species 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 2
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 2
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 2
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 2
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 2
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 2
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 2
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 2
- 241000255893 Pyralidae Species 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 2
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 2
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 2
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 2
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 2
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 2
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 2
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- OEVFFOBAXHBXKM-HSHDSVGOSA-N Val-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N)O OEVFFOBAXHBXKM-HSHDSVGOSA-N 0.000 description 2
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 235000021405 artificial diet Nutrition 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 235000011389 fruit/vegetable juice Nutrition 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 210000001035 gastrointestinal tract Anatomy 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 238000000159 protein binding assay Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 108010003885 valyl-prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- 241000566547 Agrotis ipsilon Species 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- JWUZOJXDJDEQEM-ZLIFDBKOSA-N Ala-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 JWUZOJXDJDEQEM-ZLIFDBKOSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 1
- VOKWBBBXJONREA-DCAQKATOSA-N Asn-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N VOKWBBBXJONREA-DCAQKATOSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- OZBXOELNJBSJOA-UBHSHLNASA-N Asp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OZBXOELNJBSJOA-UBHSHLNASA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 101710117545 C protein Proteins 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 241000186650 Clavibacter Species 0.000 description 1
- 241000254173 Coleoptera Species 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- 101710112752 Cytotoxin Proteins 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- CJWANNXUTOATSJ-DCAQKATOSA-N Glu-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N CJWANNXUTOATSJ-DCAQKATOSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- UYPPAMNTTMJHJW-KCTSRDHCSA-N Gly-Ile-Trp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UYPPAMNTTMJHJW-KCTSRDHCSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 241001147381 Helicoverpa armigera Species 0.000 description 1
- 241000255967 Helicoverpa zea Species 0.000 description 1
- OHOXVDFVRDGFND-YUMQZZPRSA-N His-Cys-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O OHOXVDFVRDGFND-YUMQZZPRSA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- IDQNVIWPPWAFSY-AVGNSLFASA-N His-His-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O IDQNVIWPPWAFSY-AVGNSLFASA-N 0.000 description 1
- JBSLJUPMTYLLFH-MELADBBJSA-N His-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O JBSLJUPMTYLLFH-MELADBBJSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 241000255777 Lepidoptera Species 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- YXTKSLRSRXKXNV-IHRRRGAJSA-N Lys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N YXTKSLRSRXKXNV-IHRRRGAJSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- 241000555303 Mamestra brassicae Species 0.000 description 1
- 241000255908 Manduca sexta Species 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- DZTDEZSHBVRUCQ-FXQIFTODSA-N Met-Asp-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DZTDEZSHBVRUCQ-FXQIFTODSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 241001045988 Neogene Species 0.000 description 1
- 241000256259 Noctuidae Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241001147397 Ostrinia Species 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- 240000003889 Piper guineense Species 0.000 description 1
- 108020005089 Plant RNA Proteins 0.000 description 1
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- 240000003829 Sorghum propinquum Species 0.000 description 1
- 241000256247 Spodoptera exigua Species 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- SGQSAIFDESQBRA-IHPCNDPISA-N Trp-Tyr-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SGQSAIFDESQBRA-IHPCNDPISA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- YYZPVPJCOGGQPC-JYJNAYRXSA-N Tyr-His-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYZPVPJCOGGQPC-JYJNAYRXSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- 238000001994 activation Methods 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000012867 alanine scanning Methods 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000006287 biotinylation Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- CYDMQBQPVICBEU-UHFFFAOYSA-N chlorotetracycline Natural products C1=CC(Cl)=C2C(O)(C)C3CC4C(N(C)C)C(O)=C(C(N)=O)C(=O)C4(O)C(O)=C3C(=O)C2=C1O CYDMQBQPVICBEU-UHFFFAOYSA-N 0.000 description 1
- 229960004475 chlortetracycline Drugs 0.000 description 1
- CYDMQBQPVICBEU-XRNKAMNCSA-N chlortetracycline Chemical compound C1=CC(Cl)=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(O)=C(C(N)=O)C(=O)[C@@]4(O)C(O)=C3C(=O)C2=C1O CYDMQBQPVICBEU-XRNKAMNCSA-N 0.000 description 1
- 235000019365 chlortetracycline Nutrition 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 238000001142 circular dichroism spectrum Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000002016 colloidosmotic effect Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 231100000599 cytotoxic agent Toxicity 0.000 description 1
- 239000002619 cytotoxin Substances 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 1
- 235000013312 flour Nutrition 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 239000002917 insecticide Substances 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 231100001231 less toxic Toxicity 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- LXCFILQKKLGQFO-UHFFFAOYSA-N methylparaben Chemical compound COC(=O)C1=CC=C(O)C=C1 LXCFILQKKLGQFO-UHFFFAOYSA-N 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 101150091879 neo gene Proteins 0.000 description 1
- 238000007857 nested PCR Methods 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 231100000916 relative toxicity Toxicity 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 230000007928 solubilization Effects 0.000 description 1
- 238000005063 solubilization Methods 0.000 description 1
- 235000010199 sorbic acid Nutrition 0.000 description 1
- 229940075582 sorbic acid Drugs 0.000 description 1
- 239000004334 sorbic acid Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000003335 steric effect Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 230000008961 swelling Effects 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 238000002723 toxicity assay Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000010497 wheat germ oil Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8286—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for insect resistance
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/32—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
- C07K14/325—Bacillus thuringiensis crystal peptides, i.e. delta-endotoxins
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Zoology (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Pest Control & Pesticides (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Insects & Arthropods (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Gastroenterology & Hepatology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Agricultural Chemicals And Associated Chemicals (AREA)
Abstract
New improved Cry9C proteins, having significantly increased toxicity, and DNA sequences encoding these proteins, were designed. Analysis of amino acid positions in domain II of the Cry9C protein by protein mutagenesis identified amino acids involved in insect toxicity. Random replacement of these amino acids identifies proteins with improved toxicity. A combination of identified improved amino acids in a single protein yields modified Cry9C proteins with significantly improved toxicity.
Description
IMPROVED BACILLUS THURINGIENSIS TOXIN
BACKGROUND OF THE INVENTION
{i) Field of the Invention . 5 The present invention provides new improved proteins derived from a Bacillus thuringiensis Cry9C crystal protein. In accordance with this invention, amino acid positions in a Cry9C protein were identified as involved in insect toxicity.
Further in accordance with this invention are provided modified Cry9C proteins with increased of decreased toxicity to an insect species, and DNA sequences encoding such modified Cry9C proteins. Plants can be protected from insect damage by expressing a chimeric gene encoding an improved Cry9C protein with an increased toxicity to an insect species.
(ii) Description of Related Art Bacillus thuringiensis (Bt)-derived proteins are currently widely used to protect plants from insects by expression of such proteins in transgenic plants.
Concerns of insect resistance development and the desire to achieve the optimum toxicity and control of additional insect species resulted in efforts to modify existing Bt-derived proteins so as to increase their toxicity or alter their mode of action.
Most studies on the mode of action of Bacillus thuringiensJS toxins have focused on lepidopteran-specific Cry1 insecticidal crystal proteins ("ICPs").
The following picture has emerged from these studies {Gill et al., 1992, Annu.
Rev.
Entomol. 37, 615-36; Knowles, 1993, BioEssays, 15, 469-476). Following ingestion of the crystals by a susceptible insect, they are dissolved in the alkaline reducing environment of the insect midgut lumen. The liberated proteins, the protoxins, are then proteolytically processed by insect midgut proteases to a protease-resistant fragment. This active fragment, the toxin, then passes through the peritrophic membrane and binds to specific receptors located on the brush border membrane of gut epithelial cells. Subsequent to binding, the toxin or part thereof inserts in the membrane resulting in the formation of pores. These pores lead to colloid osmotic swelling and ultimately lysis of the midgut cells, causing death of the insect.
CONFIRMATION COPY
Binding studies have demonstrated that receptor binding is a crucial step in the mode of action of ICPs (Hofmann et al., 1988, 173, 85-91; Hofmann et al., 1988, Proc. Natl. Acad. Sci. USA, 85, 7844-7848; Van Rie et af., 1990, Appl.
Environm.
Microbiol. 56, 1378-85).
The three dimensional structure of two ICPs, Cry3A and the CrylAa toxic fragment, has been solved (Li et al., 1991, Nature 353, 815-21; Grochulski et al., 1994, Journal of Molecular Biology 254, 1-18). The Cry proteins have been found to have three structural domains: the N-terminal domain I consists of 7 alpha helices, domain II contains three beta-sheets and the C-terminal domain 111 is a beta-sandwich. Based on this structure, a hypothesis has been formulated regarding the structure-function relationships of ICPs. The bundle of long, hydrophobic and amphipathic helices (domain I) is equipped for pore formation in the insect membrane, and regions of the three-sheet domain (domain II) are probably responsible for receptor binding (Li et al, 1991, supra}. The function of domain 111 is less clear. When different ICP amino acid sequences are aligned, five conserved sequence blocks are evident (Hofte & Whiteley, 1989, Microbiol. Revs. 53, 242-255).
These conserved blocks are all located in the interior of a structural domain or at the interface between domains. The high degree of conservation of these internal residues implies that homologous proteins would adopt a similar fold {Li et al., 1991, supra).
Data from Ahmad et al. (1991, FEMS Microbiol. Lett. 68, 97-104}; Wu et al.
(1992, J. Biol. Chem. 267, 2311-2317) and Gazit et al. (1993, Biochemistry 32, 3429-3436) provide evidence for the function of domain I of ICPs as a pore formation unit.
Deletions and alanine substitutions in the Cry1 Aa protoxin at a position predicted to be at or near the second loop of domain fl significantly altered toxicity and receptor binding ability (Lu et al., 1993, XXVIth Annual meeting of the Society for Invertebrate Pathology, Asheville, USA, Conference book, page 31, Abstract 17).
Smith and Ellar {1992, XXVth Annual meeting of the Society for Invertebrate Pathology, Heidelberg, Germany, Conference book, page 111, abstract 68) observed dramatic effects on toxicity towards in vitro insect cell cultures with mutant Cry1 C proteins, differing in the amino acid sequence of the predicted loop regions.
BACKGROUND OF THE INVENTION
{i) Field of the Invention . 5 The present invention provides new improved proteins derived from a Bacillus thuringiensis Cry9C crystal protein. In accordance with this invention, amino acid positions in a Cry9C protein were identified as involved in insect toxicity.
Further in accordance with this invention are provided modified Cry9C proteins with increased of decreased toxicity to an insect species, and DNA sequences encoding such modified Cry9C proteins. Plants can be protected from insect damage by expressing a chimeric gene encoding an improved Cry9C protein with an increased toxicity to an insect species.
(ii) Description of Related Art Bacillus thuringiensis (Bt)-derived proteins are currently widely used to protect plants from insects by expression of such proteins in transgenic plants.
Concerns of insect resistance development and the desire to achieve the optimum toxicity and control of additional insect species resulted in efforts to modify existing Bt-derived proteins so as to increase their toxicity or alter their mode of action.
Most studies on the mode of action of Bacillus thuringiensJS toxins have focused on lepidopteran-specific Cry1 insecticidal crystal proteins ("ICPs").
The following picture has emerged from these studies {Gill et al., 1992, Annu.
Rev.
Entomol. 37, 615-36; Knowles, 1993, BioEssays, 15, 469-476). Following ingestion of the crystals by a susceptible insect, they are dissolved in the alkaline reducing environment of the insect midgut lumen. The liberated proteins, the protoxins, are then proteolytically processed by insect midgut proteases to a protease-resistant fragment. This active fragment, the toxin, then passes through the peritrophic membrane and binds to specific receptors located on the brush border membrane of gut epithelial cells. Subsequent to binding, the toxin or part thereof inserts in the membrane resulting in the formation of pores. These pores lead to colloid osmotic swelling and ultimately lysis of the midgut cells, causing death of the insect.
CONFIRMATION COPY
Binding studies have demonstrated that receptor binding is a crucial step in the mode of action of ICPs (Hofmann et al., 1988, 173, 85-91; Hofmann et al., 1988, Proc. Natl. Acad. Sci. USA, 85, 7844-7848; Van Rie et af., 1990, Appl.
Environm.
Microbiol. 56, 1378-85).
The three dimensional structure of two ICPs, Cry3A and the CrylAa toxic fragment, has been solved (Li et al., 1991, Nature 353, 815-21; Grochulski et al., 1994, Journal of Molecular Biology 254, 1-18). The Cry proteins have been found to have three structural domains: the N-terminal domain I consists of 7 alpha helices, domain II contains three beta-sheets and the C-terminal domain 111 is a beta-sandwich. Based on this structure, a hypothesis has been formulated regarding the structure-function relationships of ICPs. The bundle of long, hydrophobic and amphipathic helices (domain I) is equipped for pore formation in the insect membrane, and regions of the three-sheet domain (domain II) are probably responsible for receptor binding (Li et al, 1991, supra}. The function of domain 111 is less clear. When different ICP amino acid sequences are aligned, five conserved sequence blocks are evident (Hofte & Whiteley, 1989, Microbiol. Revs. 53, 242-255).
These conserved blocks are all located in the interior of a structural domain or at the interface between domains. The high degree of conservation of these internal residues implies that homologous proteins would adopt a similar fold {Li et al., 1991, supra).
Data from Ahmad et al. (1991, FEMS Microbiol. Lett. 68, 97-104}; Wu et al.
(1992, J. Biol. Chem. 267, 2311-2317) and Gazit et al. (1993, Biochemistry 32, 3429-3436) provide evidence for the function of domain I of ICPs as a pore formation unit.
Deletions and alanine substitutions in the Cry1 Aa protoxin at a position predicted to be at or near the second loop of domain fl significantly altered toxicity and receptor binding ability (Lu et al., 1993, XXVIth Annual meeting of the Society for Invertebrate Pathology, Asheville, USA, Conference book, page 31, Abstract 17).
Smith and Ellar {1992, XXVth Annual meeting of the Society for Invertebrate Pathology, Heidelberg, Germany, Conference book, page 111, abstract 68) observed dramatic effects on toxicity towards in vitro insect cell cultures with mutant Cry1 C proteins, differing in the amino acid sequence of the predicted loop regions.
They formulated the hypothesis that it should be possible to map the putative receptor binding domain of this toxin and eventually generate toxins with increased potency. In some cases however, a contribution to specificity and binding from domain Ill of the Cry toxin could not be excluded (Schnepf et al., 1990, supra; Ge et S al., 1991, J. Biol. Chem. 266, 17954-17958). Furthermore, a recent study using hybrid ICPs, constructed by exchanging gene fragments between cry1C and crylE, has indicated that domain II of Cry1 C is not sufficient to confer the high activity of this protein towards Spodoptera exigua and Mamestra brassicae (Schipper et al., 1993, Seventh International Conference on Bacillus, Institut Pasteur, July 18-23, Abstracts of lectures, p. L69). Site-directed mutagenesis experiments on CrylAc indicated that certain amino acids in domain I are important for receptor binding (Wu et al., 1992, supra). Rajamohan et al: (1996, J. Biol. Chem. 271, 2390-2396) explored the role of loop 2 residues in domain I I of the Cry1 Ab protein in reversible and irreversible binding to Manduca sexta and Heliothis virescens.
Also, changes outside the 60 kD toxin region of the Bt protoxin were found to influence toxicity. It was suggested that this may be related to the activation processes by the gut juice (Nakamura et al., 1990, Agric. Biol. Chem. 54, 715-24).
Visser et al. (1993, In "Bacillus thuringiensis, an Environmental Biopesticide Theory and Practice", pp.71-88, eds.: Entwistle, P.F., Cory, J.S., Bailey, M.J., and Higgs, S., John Wiley & Sons, NY) reviewed the domain-function studies with Bt ICPs and concluded that in general, the function of essential stretches of the toxic fragment of Bt ICPs is unknown. From studies of mutant proteins, it was found that several amino acid residues from different regions of the toxic fragment, either conserved or variable, were shown to affect toxic activity.
Lambert et al. (1996, Appl. & Environm. Microbiol. 62, p. 80-86) and PCT
patent publication WO 94/05771 describe a new Bt protein which is currently named cry9Ca1 (abbreviated as Cry9C) (Peferoen et al., 1997, in Advances in Insect Control: The role of transgenic plants; pp. 21-48, Taylor & Francis Ltd., London).
This protein was found to have a broad insect target range within the group of lepidopteran pest insects making it interesting for insect control applications in agriculture.
Also, changes outside the 60 kD toxin region of the Bt protoxin were found to influence toxicity. It was suggested that this may be related to the activation processes by the gut juice (Nakamura et al., 1990, Agric. Biol. Chem. 54, 715-24).
Visser et al. (1993, In "Bacillus thuringiensis, an Environmental Biopesticide Theory and Practice", pp.71-88, eds.: Entwistle, P.F., Cory, J.S., Bailey, M.J., and Higgs, S., John Wiley & Sons, NY) reviewed the domain-function studies with Bt ICPs and concluded that in general, the function of essential stretches of the toxic fragment of Bt ICPs is unknown. From studies of mutant proteins, it was found that several amino acid residues from different regions of the toxic fragment, either conserved or variable, were shown to affect toxic activity.
Lambert et al. (1996, Appl. & Environm. Microbiol. 62, p. 80-86) and PCT
patent publication WO 94/05771 describe a new Bt protein which is currently named cry9Ca1 (abbreviated as Cry9C) (Peferoen et al., 1997, in Advances in Insect Control: The role of transgenic plants; pp. 21-48, Taylor & Francis Ltd., London).
This protein was found to have a broad insect target range within the group of lepidopteran pest insects making it interesting for insect control applications in agriculture.
De Roeck et al. (1995, the 28th annual meeting of the Society for Invertebrate Pathology, Cornell University, Ithaca, New York, p. 52) suggests to determine the likely position of the binding epitope of the CryIH protein by making Alanine mutants so as to allow the determination of the contribution of amino acid positions in binding of the CryIH protein to different insects. The CryIH protein is currently named Cry9C
in the new nomenclature (Crickmore et al., 1995, 28t" annual meeting of the Society for Invertebrate Pathology, Cornell University, Ithaca, New York, p.14.). De Roeck et al. (1997, the 6th International Conference on Perspectives in Protein Engineering, John Innes Centre, Norwich, UK, June 28-July 1, p. 34) determined the likely position of residues in the loops at the apex of the molecule in domain II of the Cry9C protein.
SUMMARY OF THE INVENTION
This invention provides a modified Cry9C protein with an improved toxicity to an insect species, comprising the amino acid sequence of SEQ ID No. 2 or an insecticidally-effective fragment thereof, wherein at least one amino acid in the following regions in SEQ ID No. 2 is replaced by another amino acid: 313-334, 369, 418-425, 480-492.
This invention further provides improved Cry9C proteins comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein at least one of the amino acids at the following positions in SEQ ID No. 2 have been replaced by another amino acid: 313, 316, 317, 318, 319, 321, 323, 325, 329, 330, 368, 369, 418, 420, 421, 422, 480, 481, 483, 484, 485, 487, 488, 490 and 491. Preferred improved Cry9C proteins comprise the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein at least one of the amino acids at the following positions are replaced by other amino acids: 316, 317, 319, 321, 329, 330, 369, 422, and 488.
This invention also provides a modified Cry9C protein with improved toxicity to Ostrinia nubilalis, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein at least the amino acids at position 488 or at least at positions 364 and 488 are replaced by other amino acids, preferably by alanine.
in the new nomenclature (Crickmore et al., 1995, 28t" annual meeting of the Society for Invertebrate Pathology, Cornell University, Ithaca, New York, p.14.). De Roeck et al. (1997, the 6th International Conference on Perspectives in Protein Engineering, John Innes Centre, Norwich, UK, June 28-July 1, p. 34) determined the likely position of residues in the loops at the apex of the molecule in domain II of the Cry9C protein.
SUMMARY OF THE INVENTION
This invention provides a modified Cry9C protein with an improved toxicity to an insect species, comprising the amino acid sequence of SEQ ID No. 2 or an insecticidally-effective fragment thereof, wherein at least one amino acid in the following regions in SEQ ID No. 2 is replaced by another amino acid: 313-334, 369, 418-425, 480-492.
This invention further provides improved Cry9C proteins comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein at least one of the amino acids at the following positions in SEQ ID No. 2 have been replaced by another amino acid: 313, 316, 317, 318, 319, 321, 323, 325, 329, 330, 368, 369, 418, 420, 421, 422, 480, 481, 483, 484, 485, 487, 488, 490 and 491. Preferred improved Cry9C proteins comprise the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein at least one of the amino acids at the following positions are replaced by other amino acids: 316, 317, 319, 321, 329, 330, 369, 422, and 488.
This invention also provides a modified Cry9C protein with improved toxicity to Ostrinia nubilalis, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein at least the amino acids at position 488 or at least at positions 364 and 488 are replaced by other amino acids, preferably by alanine.
This invention also provides modified Cry9C proteins with improved toxicity to Heliothis virescens, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein the amino acid at position 321 or position 329, is replaced by another amino acid, preferably by alanine.
This invention further provides modified Cry9C proteins with improved toxicity to Diatraea grandiosella, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein the amino acid at any or all of positions 316, 317, 319, 321, 330, 369, or 422 is replaced by another amino acid, preferably by alanine.
Further in accordance are provided DNA sequences encoding the modified Cry9C proteins, and particularly chimeric genes designed for expression in plants comprising these DNA sequences.
In another preferred embodiment of this invention, a plant transformed with a DNA sequence encoding a modified Cry9C protein is provided, so that the plant acquires increased resistance to insects, particularly a corn plant transformed with a modified Cry9C protein yielding increased toxicity towards Heliothis virescens, Ostrinia nubilalis, or Diatraea grandiosella insects.
Other objects and advantages of this invention will become evident from the following description.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
In this invention, certain amino acid residues important for toxicity of the Cry9C protein have been identified. These amino acid residues can be replaced by other amino acids to increase the toxicity to a specific insect species.
The "Cry9C protein", as used herein, refers to an insecticidal protein characterized by the amino acid sequence of SEQ ID No. 2 or any equivalents thereof such as the insecticidally effective truncated proteins or the fusion proteins of the Cry9C protein described in PCT patent publications WO 94/05771 and WO
94/24264. Particularly preferred Cry9C proteins, in accordance with this invention, are proteins containing at least the amino acid sequence of SEQ lD No. 2 from amino acid position 1 or 44 to amino acid position 658. Throughout the description and the claims, the new nomenclature for Bt crystal proteins as suggested by Crickmore et al. (1995, 28th annual meeting of the Society for Invertebrate Pathology, Cornell University, Ithaca, New York, p. 14) and reported in Peferoen et al. (1997, in Advances in Insect Control: The role of transgenic plants, pp.
21-48, Taylor & Francis Ltd., London) has been used.
"Cry9C protein variants", for a particular insect species, are insecticidal proteins that differ from but are indirectly or directly derived from the Cry9C protein.
Indeed, several variants of a Bt protein in which some amino acids are changed into others without significantly changing activity and/or specificity to a particular insect species can be found in nature (Hofte & Whiteley, 1989, supra) or can be made by recombinant DNA techniques. Variants of a Cry9C protein, as used herein, also include proteins containing the specificity- or toxicity-determining domain or region of the Cry9C protein, e.g., in a hybrid with another protein, such as another Bt ICP, a membrane-permeating protein domain, a cytotoxin or an antibody fragment, provided that the Cry9C specificity- or toxicity-determining domain or region contributes to the toxicity or specificity of the hybrid protein. Particularly preferred Cry9C protein variants are those proteins comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the arginine at position 164 has been replaced by another amino acid, preferably alanine or lysine. These variants with a replacement of the arginine at position 164 in the sequence of SEQ ID No. 2 show a significantly lower susceptibility to breakdown upon protease treatment, and are named herein the "protease-resistant Cry9C variants". Like here for the protease-resistant variants, whenever reference to a particular region or position in SEQ ID No. 2 is made, this does not necessarily imply that the protein referred to is the full-length protein of SEQ ID No. 2;
this statement merely refers to the position corresponding to the particular position in the reference Cry9C protein in SEQ ID No. 2. Indeed, improved Cry9C proteins of the invention can be truncated so that the actual position of an amino acid in that protein will differ but nevertheless reference will be made throughout this invention to the positions in the full-length reference protein, shown in SEQ ID No. 2.
Following the teachings of this invention, Cry9C proteins or variants thereof can be modified to have an increased toxicity for an insect species. "Modified Cry9C
This invention further provides modified Cry9C proteins with improved toxicity to Diatraea grandiosella, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein the amino acid at any or all of positions 316, 317, 319, 321, 330, 369, or 422 is replaced by another amino acid, preferably by alanine.
Further in accordance are provided DNA sequences encoding the modified Cry9C proteins, and particularly chimeric genes designed for expression in plants comprising these DNA sequences.
In another preferred embodiment of this invention, a plant transformed with a DNA sequence encoding a modified Cry9C protein is provided, so that the plant acquires increased resistance to insects, particularly a corn plant transformed with a modified Cry9C protein yielding increased toxicity towards Heliothis virescens, Ostrinia nubilalis, or Diatraea grandiosella insects.
Other objects and advantages of this invention will become evident from the following description.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
In this invention, certain amino acid residues important for toxicity of the Cry9C protein have been identified. These amino acid residues can be replaced by other amino acids to increase the toxicity to a specific insect species.
The "Cry9C protein", as used herein, refers to an insecticidal protein characterized by the amino acid sequence of SEQ ID No. 2 or any equivalents thereof such as the insecticidally effective truncated proteins or the fusion proteins of the Cry9C protein described in PCT patent publications WO 94/05771 and WO
94/24264. Particularly preferred Cry9C proteins, in accordance with this invention, are proteins containing at least the amino acid sequence of SEQ lD No. 2 from amino acid position 1 or 44 to amino acid position 658. Throughout the description and the claims, the new nomenclature for Bt crystal proteins as suggested by Crickmore et al. (1995, 28th annual meeting of the Society for Invertebrate Pathology, Cornell University, Ithaca, New York, p. 14) and reported in Peferoen et al. (1997, in Advances in Insect Control: The role of transgenic plants, pp.
21-48, Taylor & Francis Ltd., London) has been used.
"Cry9C protein variants", for a particular insect species, are insecticidal proteins that differ from but are indirectly or directly derived from the Cry9C protein.
Indeed, several variants of a Bt protein in which some amino acids are changed into others without significantly changing activity and/or specificity to a particular insect species can be found in nature (Hofte & Whiteley, 1989, supra) or can be made by recombinant DNA techniques. Variants of a Cry9C protein, as used herein, also include proteins containing the specificity- or toxicity-determining domain or region of the Cry9C protein, e.g., in a hybrid with another protein, such as another Bt ICP, a membrane-permeating protein domain, a cytotoxin or an antibody fragment, provided that the Cry9C specificity- or toxicity-determining domain or region contributes to the toxicity or specificity of the hybrid protein. Particularly preferred Cry9C protein variants are those proteins comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the arginine at position 164 has been replaced by another amino acid, preferably alanine or lysine. These variants with a replacement of the arginine at position 164 in the sequence of SEQ ID No. 2 show a significantly lower susceptibility to breakdown upon protease treatment, and are named herein the "protease-resistant Cry9C variants". Like here for the protease-resistant variants, whenever reference to a particular region or position in SEQ ID No. 2 is made, this does not necessarily imply that the protein referred to is the full-length protein of SEQ ID No. 2;
this statement merely refers to the position corresponding to the particular position in the reference Cry9C protein in SEQ ID No. 2. Indeed, improved Cry9C proteins of the invention can be truncated so that the actual position of an amino acid in that protein will differ but nevertheless reference will be made throughout this invention to the positions in the full-length reference protein, shown in SEQ ID No. 2.
Following the teachings of this invention, Cry9C proteins or variants thereof can be modified to have an increased toxicity for an insect species. "Modified Cry9C
protein", as used herein, refers to a Cry9C protein or its protease-resistant variant wherein amino acids have been modified to analyse the contribution of amino acid positions in toxicity, particularly a Cry9C protein or its protease-resistant variant wherein amino acids have been modified in the regions at the following positions in SEQ ID No. 2: 313-334, 358-369, 418-425, 480-492. "Improved Cry9C protein", in accordance with this invention, refers to a Cry9C protein or its protease-resistant variant wherein at least one amino acid has been replaced, so that the toxicity of this improved protein towards an insect species is significantly increased. In a particularly preferred improved Cry9C protein or its protease-resistant variant, the at least one amino acid change is located in domain II of the Cry9C protein, particularly in the regions of the Cry9C protein characterized by the following positions in SEQ ID
No. 2: 313-334, 358-369, 418-425, 480-492. A modified Cry9C protein, differing in one amino acid from the native protein or its protease-resistant variant and being significantly less toxic towards the target insect, allows the direct identification of this amino acid position as involved in toxicity (provided no gross structural changes are introduced), and thus has considerable value in improving toxicity. In accordance with this invention, the identification of these amino acid positions involved in toxicity allows the construction of modified proteins having increased toxicity to the target insect by amino acid randomization at these positions. Preferred modified Cry9C
proteins in accordance with this invention are the modified Cry9C proteins having altered toxicity to Ostrinia nubilalis, Heliothis virescens or Diatraea grandiosella as shown in Table 1, as well as combinations of those modifications in one modified protein.
An example of an improved Cry9C protein in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 or 666 wherein an amino acid in at least one of the following amino acid positions of SEQ ID No. 2 has been replaced by another amino acid: 313, 316, 317, 318, 319, 321, 323, 325, 329, 330, 362, 364, 368, 369, 418, 420, 421, 422, 480, 481, 483, 484, 485, 487, 488, 490 and 491;
or an amino acid position located in the immediate vicinity of any one of these positions in the three-dimensional structure of the protein, preferably those amino acids whose C-alpha atom is at a maximum distance of about 7 Angstrom from the C-alpha atom _7_ of the amino acid listed above. A preferred improved Cry9C protein in accordance with this invention is the protein of SEQ ID No. 2 with at least one of the following amino acid changes: P316A, A317V, V319A, L321 A, P329A, Y330A, S364A, Y369A, 1422A, and 1488A. "V319A" or "Cry9C(V319A)", as used herein, means a change of the valine amino acid at position 319 in SEQ ID No. 2 to an alanine amino acid.
Preferred improved Cry9C proteins also include Cry9C proteins having also the arginine amino acid at position 164 in SEQ ID No. 2 altered into another amino acid, particularly alanine or lysine, to enhance stability upon protease, particularly trypsin, cleavage.
A preferred Cry9C protein for the control of Ostrinia nubilalis insects in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein an amino acid in at least one of the following amino acid positions in SEQ ID No.
2 has been replaced by another amino acid: 325, 364, 418, 421, 485, and 488. A
particularly preferred improved Cry9C protein for the control of Ostrinia nubilalis insects is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the amino acids in at least position 364 or at least in positions 364 and 488 of SEQ ID No. 2 are replaced by another amino acid, particularly alanine.
A preferred Cry9C protein for the control of Heliothis virescens insects in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein an amino acid in at least one of the following amino acid positions in SEQ ID No.
2 has been replaced by another amino acid: 313, 316, 317, 318, 319, 321, 323, 325, 329, 330, 368, 369, 418, 420, 421, 422, 480, 481, 483, 484, 485, 487, 488, 490 and 492, particularly at least one of the following amino acid positions: 321, 325, 329, 418, 420, and 480. A particularly preferred improved Cry9C protein for the control of Heliothis virescens insects is a protein comprising the amino acid sequence of SEQ
ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the amino acids in at least one of the amino acid positions 321 and 329 of SEQ ID
No. 2 are replaced by another amino acid, particularly alanine.
_g_ A preferred Cry9C protein for the control of Diatraea grandiosella insects in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein an amino acid in at least one of the following amino acid positions in SEQ ID No.
2 has been replaced by another amino acid: 316, 317, 319, 321, 325, 330, 369, 421, 422, 480, 483, 484, 485, 487, 488, 490, and 491; particularly at least one of the following amino acid positions: 480, 484, 485, 487, and 490. A particularly preferred improved Cry9C protein for the control of Diatraea grandiosella insects is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the amino acids in at least one of the amino acid positions 316, 317, 319, 321, 330, 369 and 422 of SEQ ID No. 2 are replaced by another amino acid, particularly alanine or valine (for 317).
By using DNA sequences encoding improved Cry9C proteins in accordance with this invention, improved toxicity to a selected insect species can be obtained upon expression of such DNA in a transgenic plant.
A "cry9C gene", as used herein, is a DNA sequence comprising a DNA
encoding a Cry9C protein (a coding region), and includes necessary regulatory sequences so that a Cry9C protein can be expressed in a cell, preferably a plant or bacterial cell. A cry9C gene does not necessarily need to be expressed everywhere at all times, expression can be periodic (e.g. at certain stages of development in a plant) and/or can be spatially restricted (e.g. in certain cells or tissues in a plant), mainly depending on the activity of regulatory elements provided in the chimeric gene or in the site of insertion in the plant genome. A cry9C gene can be naturally-occurring or can be a hybrid or synthetic DNA and the regulatory elements can be from prokaryotic or eucaryotic origin.
The "modified cry9C gene", as used herein, is a DNA sequence comprising a DNA encoding a modified Cry9C protein (a modified coding region), and includes necessary regulatory sequences so that a Cry9C protein can be expressed in a cell, preferably a plant or bacterial cell. An example of a modified cry9C coding region is the cry9C coding region of SEQ ID No. 3 wherein the valine codon at nucleotide positions 844-846 of SEQ ID No. 3 has been replaced by an alanine codon.
_g_ "Substantial sequence homology" to a DNA sequence, as used herein, refers to DNA sequences differing in some, most or. all of their colons from another DNA
sequence but encoding the same or substantially the same protein. Indeed, because of the degeneracy of the genetic code, the colon usage of a particular DNA
coding region can be substantially modified, e.g., so as to more closely resemble the colon usage of the genes in the host cell, without changing the encoded protein.
Changing the colon usage of a DNA coding region to that of the host cell has been described to be desired for gene expression in foreign hosts (e.g. Bennetzen & Hall, 1982, J.
Biol. Chem. 257, 3026-3031.; Itakura, 1977, Science 198, 1056-1063). Colon usage tables are available in the literature (Wada et al., 1990, Nucl. Acids Res.
18, 2367-1411; Murray et al., 1989, Nucl. Acids Res. 17(2), 477-498) and in the major DNA
sequence databanks (e.g. at EMBL in Heidelberg, Germany). Accordingly, recombinant or synthetic DNA sequences can be constructed so that the same or substantially the same proteins with substantially the same insecticidal activity are produced (Koziel et al., 1993, Biotechnology 11, 194-200; Perlak et al., 1993, Plant Mol. Biol. 22, 313-321 ). A modified cry9C gene has all appropriate control regions so that the modified Cry9C protein can be expressed in a host cell, e.g. for expression in plants, a plant-expressible promoter and a 3' termination and polyadenylation region active in plants.
A "chimeric improved cry9C gene", as used herein, refers to a chimeric gene comprising a DNA sequence encoding the improved Cry9C protein inserted in between controlling elements of different origin, e.g. a DNA sequence encoding the improved Cry9C protein under the control of a promoter transcribing the DNA in the plant cell, and fused to 3' transcription termination sequences active in plant cells.
Protection of a plant, preferably a corn or cotton plant, against an insect species which is known to feed on said plant is preferably accomplished by expressing an improved Cry9C protein in the cells of the plant. This is preferably accomplished by expressing a chimeric improved cry9C gene encoding such an improved Cry9C protein in the cells of a plant, preferably a corn or cotton plant. An improved Cry9C protein of this invention preferably only has a small number, particularly less than 20, more particularly less than 15, preferably less than 10 amino acids replaced by other amino acids as compared to the Cry9C protein, preferably as compared to the region from between amino acid positions 1 and 45 to amino acid position 658 of the Cry9C protein of SEQ ID No. 2. A significant increase in toxicity can already be obtained by replacing only 1 amino acid, but it is preferred that more than one amino acid is changed to improve toxicity.
The following steps are followed to construct the new modified Cry9C proteins:
amino acids in domain II of the Cry9C protein from amino acid positions 313-334, 358-369, 418-425, and 480-492 were chosen for modification, using alanine-scanning mutagenesis (Cunningham & Wells, 1989, Science 244, 1081-85). In case the original position is alanine, a substitution by valine is done. These regions occur at positions corresponding to the solvent-exposed positions in the loop between beta-strands 1 and 2 (comprising alpha-helix 8) and in loop 1 (located between beta strands 2 and 3), in loop 2 (located between beta-strands 6 and 7), and in loop 3 (located between beta-strands 10 and 11 ) in the three-dimensional model of the Cry3A protein (Li et al., 1991, supra). To discount any observed lower toxicity of a modified Cry9C protein which is due to misfolding or structural distortion, the structural stability of mutant ICPs can be analysed by a variety of methods including toxicity to another target insect, crystal formation, solubilization, monoclonal antibody binding analysis, protease resistance, fluorometric monitoring of unfolding and circular dichroism spectrum analysis. In the case of structural distortion, it is impossible to determine the functional role of this position by alanine replacement.
However, a more conservative amino acid substitution may yield a correctly folded mutant protein which allows to determine the functional role of this position.
The amino acid positions, identified above, which yield modified proteins with significantly decreased toxicity ("down-mutants") are randomized. This means that a set of 20 different mutants, representing each type of amino acid, is generated for each position of interest (the original amino acid and the alanine substitution function as a control). This method is further referred to as "amino acid randomization".
Such mutants may be generated by a variety of methods, e.g. following the PCR
overlap extension method (Ho et al., 1989, Gene 77, 51-59). These mutant proteins are then tested in toxicity assays on the target insect. Mutants at each position which are more toxic, e.g., yield higher mortality than the wild type protein, are selected.
Such mutants with improved toxicity are termed "up-mutants". Alternatively, it is also possible to select potential up-mutants on the basis of increased reversible binding which can be measured following the procedures of Van Rie et al. (1990, Appl.
Environm. Microbiol. 56, 1378-1385) or Liang et al. (1995, J. Biol. Chem. 270, 24719-24724), which is incorporated herein by reference.
All or some of the "up-mutant" amino acids, identified in step 2, are combined in a single modified protein. According to additivity principles, mutations in non-interacting parts of a protein should combine to give simple additive changes in the free energy of binding (Lowman and Wells, 1993, J. Mol. Biol., 234, 564-578).
Increases in toxicity are thus accumulated by combining several single mutants into one multiple mutant. Finally a modified protein with improved toxicity is designed, which comprises some or all, preferably all, of the up-mutant amino acids previously identified.
In accordance with this invention, amino acids of domain II of a Cry9C
protein, located at the protruding regions of domain II are chosen for modification. By "protruding regions of domain II", as used herein, are meant the solvent-exposed regions organized in loops, alpha helices or beta-strands which are protruding from domain II and are located at or towards the apex of the molecule.
This invention is particularly suited for improving the toxicity to an insect species for which the Cry9C protein has a rather weak toxicity. The toxicity of this improved Cry9C protein can be increased by combining amino acid mutations in the protein, each yielding an increased toxicity when compared to the amino acid present in the native Cry9C protein. Insect species for which improved Cry9C proteins can be made also include Spodoptera frugiperda, Heliothis zea, Heliothis armigera, and Agrotis ipsilon. Also, this invention is suited to increase toxicity of a Cry9C protein or its protease-resistant variant to one insect species and to decrease toxicity of the same protein to another insect species by making the proper amino acid substitutions in the protein. This may be advantageous, e.g., to limit the likelihood of insect resistance occurrence to the protein in a particular insect species.
An insecticidally effective part of the modified cry9C gene of this invention encoding an insecticidally effective portion of the modified Cry9C protein, can be made in a conventional manner. An "insecticidally effective part" of the modified cry9C gene refers to a gene comprising a DNA coding region encoding a polypeptide with fewer amino acids than the full length modified Cry9C protein but that still retains toxicity to insects. A preferred insecticidally effective part of the Cry9C
protein is the part from amino acid position 1 or 44 to amino acid position 658 in SEQ ID No.
2.
In order to express all or an insecticidally effective part of the improved cry9C
gene in E. coli, in Bt strains and in plants, suitable restriction sites can be introduced, flanking each gene or gene part. This can be done by site-directed mutagenesis, using well-known procedures (Stanssens et al., 1989, Nucl. Acids Res. 92, 4441-4454; White et al., 1989, Trends in Genet. 5, 185-189).
In order to improve expression in foreign host cells such as plant cells, it may be preferred to alter the improved cry9C coding region or its insecticidally effective part to form an equivalent, artificial improved cry9C coding region.
Expression is improved by selectively inactivating certain cryptic regulatory or processing elements present in the native sequence as described in PCT publications WO 91/16432 and 8. This can be done by site-directed mutagenesis or site-directed intron-insertion (WO 93/09218), or by introducing overall changes to the codon usage, e.g., adapting the codon usage to that most preferred by the host organism (publication of European patent application number ("EP") 0 385 962, EP 0 359 472, publication of PCT patent application WO 93/07278, Murray et al., 1989, supra) without significantly changing, preferably without changing, the encoded amino acid sequence. Small modifications to a DNA sequence such as described above can be routinely made by PCR-mediated mutagenesis (Ho et al., 1989, supra; White et al., 1989, supra). For major changes to the DNA sequence, DNA synthesis methods are available in the art (e.g. Davies et al., 1991, Society for Applied Bacteriology, Technical Series 28, pp. 351-359). For obtaining enhanced expression in monocot plants such as com, a monocot intron can be added to the chimeric improved cry9C
gene (Callis et al., 1987, Genes & Development 1, 1183-1200; PCT publication WO
93/07278). Another preferred embodiment of this invention is the expression of the improved Cry9C proteins by the method described in PCT patent publication WO
97/49814, which is incorporated herein by reference.
The chimeric improved cry9C gene can be stably inserted in a conventional manner into the nuclear genome of a single plant cell, and the so-transformed plant cell can be used in a conventional manner to produce a transformed plant that is insect-resistant. Particularly preferred plants in accordance with this invention are corn plants. Corn cells can be stably transformed (e.g. by electroporation) using wounded or enzyme-degraded intact tissues capable of forming compact embryogenic callus (such as corn immature embryos), or the embryogenic callus (such as type I callus in corn) obtained thereof, as described in PCT patent publication WO 92/09696 or US Patent 5,641,664. Other methods for transformation of corn include the methods by Fromm et al. (1990, Bio/Technology 8, 833-839), Cordon-Kamm et al. (1990, The Plant Cell 2, 603-618) and Ishida et al. (1996, Nature Biotechnology 14, 745-750).
Alternatively, a disarmed Ti plasmid, containing the insecticidally effective chimeric improved cry9C gene, in Agrobacterium tumefaciens can be used to transform the plant cell, preferably the corn or cotton cell, and thereafter, a transformed plant can be regenerated from the transformed plant cell using the procedures described, for example, in EP 0116718, EP 0270822, PCT publication WO 84/02913 and EP 0242246 (which are also incorporated herein by reference), and in Could et al. (1991, Plant Physiol. 95, 426-434) or Ishida et al. (1996, supra), particularly the method described in PCT publication WO 94/00977. Preferred Ti-plasmid vectors each contain the insecticidally effective chimeric improved cry9C
gene between the border sequences, or at least located to the left of the right border sequence, of the T-DNA of the Ti-plasmid. Of course, other types of vectors can be used to transform the plant cell, using procedures such as direct gene transfer (as described, for example in EP 0233247), pollen mediated transformation (as described, for example in EP 0270356, PCT publication WO 85/01856, and US
Patent 4,684,611 ), plant RNA virus-mediated transformation (as described, for example in EP 0067553 and US Patent 4,407,956), and liposome-mediated transformation (as described, for example in US Patent 4,536,475).
A resulting transformed plant, such as a transformed corn or cotton plant, can be used in a conventional plant breeding scheme to produce more transformed plants with the same characteristics or to introduce the improved cry9C gene, or an insecticidally effective part thereof in other varieties of the same or related plant species. Seeds, which are obtained from the transformed plants, contain the chimeric improved cry9C gene or its insecticidally effective part as a stable genomic insert. Cells of the transformed plant can be. cultured in a conventional manner to produce the improved Cry9C protein or insecticidally effective portions thereof, which can be recovered for use in conventional insecticide compositions against insects, particularly lepidopteran insects {U.S. Patent 5,254,799). Preferred plants in accordance with this invention, besides corn and cotton, include rice, plants of the genus Brassica such as oilseed rape, cauliflower and broccoli, and also soybean, tomato, tobacco, potato, eggplant, beet, oat, pepper, gladiolus, dahlia, chrysanthemum, sorghum, and garden peas.
The improved cry9C coding region or its insecticidally effective part is inserted in a plant cell genome so that the inserted coding region is downstream (i.e., 3') of, and under the control of, a promoter which can direct the expression of the gene part in the plant cell. This is preferably accomplished by inserting the chimeric improved cry9C gene or its insecticidally effective part in the plant cell genome.
Preferred promoters include: the strong constitutive 35S promoters (the "35S promoters") of the cauliflower mosaic virus of isolates CM 1841 (Gardner et al., 1981, Nucleic Acids Research 9, 2871-2887), CabbB-S (Franck et al., 1980, Cell 27, 285-294) and CabbB-JI (Hull and Howell, 1987, Virology 86, 482-493); the ubiquitin promoter (EP
0342926), and the TR1' promoter and the TR2' promoter which drive the expression of the 1' and 2' genes, respectively, of the T-DNA (Velten et al., 1984, EMBO
J. 3, 2723-2730). Alternatively, a promoter can be utilized which is not constitutive but rather is specific for one or more tissues or organs of the plant, preferably leaf and stem tissue, whereby the inserted chimeric improved cry9C gene or its insecticidally effective part is expressed only in cells of the specific tissues) or organ(s). Another alternative is to use a promoter whose expression is inducible (e.g., by insect feeding or by chemical factors). Known wound-induced promoters inducing systemic expression of their gene product throughout the plant are also of particular interest.
The improved cry9C coding region, or its insecticidally effective part, is inserted in the plant genome so that the inserted coding region is upstream (i.e., 5') of suitable 3' end transcription regulation signals (i.e., transcript termination and polyadenylation signals). Preferred polyadenylation and transcript formation signals include those of the 35S gene (Mogen et al., 1990, The Plant Cell 2, 1261-1272), the _ __ _ __ ..~___ _____ _,__ octopine synthase gene (Gielen et aL, 1984, EMBO J. 3, 835-845) and the T-DNA
gene 7 (Velten and Schell, 1985, Nucl. Acids Res. 13, 6981-6998), which act as 3'-untranslated DNA sequences in transformed plant cells.
The chimeric improved cry9C gene, or its insecticidally effective gene part, can optionally be inserted in the plant genome as a hybrid gene (EP 0 193 259;
Vaeck et al., 1987, Nature 327, 33-37) under the control of the same promoter as the coding region of a selectable marker gene, such as the coding region of the neo gene (EP 0 242 236) encoding kanamycin resistance, so that the plant expresses a fusion protein.
Preferably, the improved cry9C gene is expressed in a plant in combination with another insect control protein, e.g., another Bt-derived crystal protein or an insecticidal fragment thereof, particularly a Cry1 Ab- or Cry1 B-type protein, to prevent or delay the occurrence of insect resistance development (EP 0 408 403).
All or part of the improved cry9C coding region can also be used to transform bacteria, such as a B, thuringiensis which produces other insecticidal toxins (Lereclus et al., 1992, Bio/Technology 10, 418-421; Gelernter & Schwab, 1993, In Bacillus thuringiensis, An Environmental Biopesticide: theory and Practice, pp. 89-104, eds.
Entwistle, P.F., Cory, J.S., Bailey, M.J. and Higgs, S., John Wiley & Sons Ltd.).
Thereby, a transformed Bt strain is produced which is useful for combating a wide spectrum of insect pests or for combating insects in such a manner that insect resistance development is prevented or delayed (EP 0 408 403). Preferred promoter and 3' termination and polyadenylation sequences for the chimeric improved cry9C
gene are derived from Bacillus thuringiensis genes, such as the native ICP
genes.
Alternatively, the improved coding region of the invention can be inserted and expressed in endophytic and/or root-colonizing bacteria, such as bacteria of the genus Pseudomonas or Clavibacter, e.g., under the control of a Bt ICP gene promoter and 3' termination sequences. Successful transfer and expression of ICP
genes into such bacteria has been described by Stock et al. (1990, Can. J.
Microbiol.
36, 879-884), Dimock et at. (1989, In Biotechnology, Biopesticides and Novel Plant Pest Resistance Management, eds. Roberts, D.W. & Granados, R.R., pp.88-92, Boyce Thompson Institute for Plant Research, Ithaca, New York), and Waalwijk et al.
(1991, FEMS Microbiol. Lett. 77, 257-264). Transformation of bacteria with all or part of the improved cry9C coding region of the invention, incorporated in a suitable cloning vehicle, can be carried out in a conventional manner, preferably using conventional electroporation techniques as described in Mahillon et al. {1989, FEMS
Microbiol. Letters 60, 205-210), in PCT patent publication WO 90/06999, Chassy et al. (1988, Trends Biotechnol. 6, 303-309) or other methods, e.g., as described by ~ Lerecfus et al. (1992, Bio/Technology 90, 418).
The improved Cry9C-producing strain can also be transformed with all or an insecticidally effective part of one or more DNA sequences encoding a Bt protein or an insecticidally effective part thereof, such as: a DNA encoding the Bt2 or Cry1 Ab protein (US patent 5,254,799; EP 0 193 259) or the Bt109P or Cry3C protein (PCT
publication WO 91/16433), or another DNA coding for an anti-lepidoptera or an anti-Coleoptera protein. Thereby, a transformed Bt strain can be produced which is useful for combating an even greater variety of insect pests {e.g., Coleoptera and/or additional lepidoptera) or for preventing or delaying the development of insect resistance.
For the purpose of combating insects by contacting them with the improved Cry9C protein, e.g. in the form of transformed plants or insecticidal formulations, any DNA sequence encoding any of the above described improved Cry9C proteins, can be used.
The following Examples are offered by way of illustration and not by way of limitation. The sequence listing referred to in the description and the Examples is as follows:
SEQUENCE LISTING
SEQ ID No. 1: Nucleotide sequence of the Bacillus thuringiensis cry9C gene, showing the coding region and flanking 5' and 3' regions.
SEQ ID No. 2: Amino acid sequence of the full length Bacillus thuringiensis Cry9C protein.
SEQ ID No. 3: Nucleotide sequence of a codon-optimized DNA sequence encoding a truncated Cry9C protein wherein the arginine at _17_ amino acid position 123 (corresponding to amino acid position 164 in the protein of SEQ ID No. 2) has been replaced by lysine.
SEQ ID No. 4: Amino acid sequence of the modified Cry9C protein encoded by the DNA of SEQ ID No. 3.
Unless otherwise stated in the Examples, all general materials and methods, including procedures for making and manipulating recombinant DNA are carried out by the standardized procedures as described in volumes 1 and 2 of Ausubel et al., Current Protocols in Molecular Biology, Current Protocols, USA (1994), in Plant Molecular Biology Labfax (1993, by R.D.D. Croy, jointly published by BIOS
Scientific publications Ltd. UK and Blackwell Scientific Publications, UK) and Sambrook et al., Molecular Cloning - A Laboratory Manual, Second Ed., Cold Spring Harbor Laboratory Press, NY (1989).
EXAMPLES:
1. CONSTRUCTION OF MODIFIED CRY9C PROTEINS
Multiple alignments between Bt crystal protein sequences including the sequences of Cry9C, Cry3A and Cry1 Aa allowed identification of the amino acids located in the expected binding site of the Cry9C domain II. Using known alignment programs, 52 amino acid positions were identified for amino acid replacement.
The amino acids in the Cry9C protein of SEQ ID No. 2 from amino acid positions 313-334, 358-369, 418-425, 480-492 have been identified to correspond to the solvent-accessible regions most likely involved in receptor-binding in the Cry3A
protein, and these positions in the Cry9C protein were chosen for amino acid modification.
Since alanine substitution does not alter the main chain of a protein, and does not impose extreme electrostatic or steric effects and since it eliminates the side chain beyond the beta carbon, each of the amino acids in these identified regions was changed into alanine, one by one, using splice overlap extension PCR (Ho et al., 1989, supra) on the protease-resistant form of the native cry9C gene wherein the arginine codon at position 164 was replaced by an alanine codon. The codon most preferred in the cry9C native gene for alanine, GCA, was used for these modifications. When the original codon encodes alanine, then this is replaced by a valine codon (GTA).
The obtained PCR fragments were ligated in pUCl9-derived vectors. If not present, suitable unique restriction sites were created in the cry9C DNA. All plasmids containing modified DNA sequences were controlled by sequencing the relevant portions and were found to be correctly constructed. The modified cry9C genes were expressed in transformed WK6 cells. Every mutant protein was expressed in these E. coli cells at least twice. Mutants causing problems in expression, probably caused by structural changes in these mutants, were discarded. No gross folding aberrations of the mutants identified to be involved in toxicity (and fisted in Table 1 ) are found, e.g., as was evidenced by the similar SDS-PAGE patterns following trypsin cleavage or treatment with midgut juice of the insect larvae of solubilized mutant and Cry9C(R164A) proteins.
2. INSECT TOXICITY OF THE MODIFIED CRY9C PROTEINS
Bio assays on the modified Cry9C proteins obtained in Example 1 were carried out with first instar larvae of the Southwestern corn borer, Diatraea grandiosella (family Pyralidae); the European corn borer, 4strinia nubilalis (family Pyralidae); and the tobacco budworm, Heliothis virescens (family Noctuidae). A
dilution series of each protein was surface-layered on the artificial diet to determine' the LCSO value. The artificial diet consisted of: agar (20 g), water (1,000 ml), corn flour (96 g, ICN Biochemicals), yeast (30 g), wheat germs (64 g, 1CN
Biochemicals), wesson salt (7.5 g, ICN Biochemicals), casein (15 g), sorbic acid (2 g), aureomycin (0.3 g), nipagin (1 g), wheat germ oil (4 ml), sucrose (15 g), cholesterol (1 g), ascorbic acid (3.5 g), Vanderzand modified vitamin mix (12 g, ICN
Biochemicals).
Larvae were placed on the diet in multi-well plates, 1 larva per well (2 for Ostrinia nubilalis). For each dilution, 24 larvae were tested, and dead and living larvae were counted after 5 days. Prior to application, the mutant proteins were digested with trypsin to release the toxin fragments. For each mutant protein, the assays are repeated at least 5 times, using two different protein preparations. As control protein, the trypsin-digested Cry9C(R164A) protein was used. The Cry9C(R164A) protein has the amino acid sequence of SEQ ID No. 2 wherein the arginine at position was replaced by alanine. This protein was found to be more stable than the wild-type Cry9C toxin while retaining its toxicity to the test insects (see, e.g., PCT
patent publication WO 94/24264). The LCSO values were calculated with the POLO-program, which is based on the probit analysis (POLO-PC, LeOra Software, 1119 Shattuck Ave., Berkeley California 94707). The results of these assays for those protein mutants which gave an LCSO value that is significantly different from that of the control protein in repeated bio assays are summarized in Table 1. It is clear that different positions in the Cry9C protein when substituted to alanine cause increased toxicity in each of the tested insects.
Binding assays on isolated brush border membrane vesicles of Heliothis virescens and Ostrinia nubilalis performed as described in Van Rie et al.
(1990, Appl.
Environm. Microbiol. 56, 1378-1385) showed that for all, with the exception of two, of the modified Cry9C proteins with altered toxicity, receptor binding is also altered (e.g., an observed shift in K~ value), thus confirming that for most amino acid residues altered toxicity is due to altered receptor binding. Hence, these residues are proper candidates for improvement of toxicity by amino acid randomization at or near the identified critical position.
3. COMPETITION BINDING EXPERIMENTS
The Cry9C(R164A) protein was tested in competition binding assays using the ECL protein biotinylation system (Amersham Life Sciences, Amersham International plc., UK) as described by Lambent et al. (1996, supra) to determine if competition occurred with other Bt toxins in selected insects. For the assays, 3ng biotinylated Cry9C(R164A) protein was added to 30 Ng brush border membrane vesicles in PBS
buffer (comprising 0,1 % BSA) in the presence of a 300-fold excess of non-biotinylated toxin (homologous competition assays were included in every test as control). Repeated competition tests showed that in both Ostrinia nubilaiis and Heliothis virescens brush border membranes, there was no detectable competition in receptor binding between the (activated) Cry9C(R164A) protein and any one of the following (activated) Bt toxins: the Cry1 Aa (Schnepf et al., 1985, J. Biol.
Chem. 260, 6264-6272), CrylAb (Hofte et a1.,1986, Eur. J. Biochem. 161, 271-280), CrylAc (Adang et al., 1985, Gene 36, 289-300), Cry1 B (Brizzard & Whiteley, 1988, Nucl.
Acids Res. 16, 4168-4169) and Cry1 C (Honee et al., 1988, Nucl. Acids Res. 16, 6240) toxins. Thus, in these insects the Cry9C(R164A) protein binds to a different receptor than these other Bt toxins. In Diatreae grandiosella competition assays, it was found that the Cry9C(R164A) does compete for a receptor site with the Cry1 B
and Cry1 C Bt toxins, but does not compete with any one of the Cry1 Aa, Cry1 Ab, and Cry1 Ac toxins.
The same results are found for all three insects when testing the Cry9C
~ protein with the amino acid sequence of SEQ ID No. 2 from amino acids 1-658.
Thus, in all these three insects, combination of the Cry9C and a selected non-competitively binding Bt toxin with good toxicity to the target insect can be used simultaneously in order to prevent or delay insect resistance development. In transgenic corn plants, a particularly interesting combination would be the Cry9C (or its protease-resistant variant} and a Cry1 B and/or any of the Cry1 A-type toxins for Osfrinia nubilalis control and the Cry9C (or its protease-resistant variant) and any one of the Cry1 A-type toxins, preferably a Cry1 Ab-type toxin, for D.
grandiosella control. For Heliothis virescens control, the Cry9C (or its protease-resistant variant) and any of the Cry1 A-type toxins are preferred toxins to be co-expressed.
4. CONSTRUCTION OF IMPROVED CRY9C PROTEINS
The modified position in every mutant protein of Example 2 giving rise to a significantly decreased or increased toxicity to an insect species is altered to all other amino acids and the toxicity is re-evaluated. The amino acids yielding the highest toxicity at a particular position are combined to form an improved Cry9C
protein.
Also the alanine mutants yielding an increase in toxicity (up-mutant amino acid positions) are included in such combinations to form improved Cry9C proteins for the selected insect species. Table 1 indeed shows already two up-mutant proteins for every insect tested. Analysis of all these improved Cry9C proteins in the bio assay shows that combinations of up-mutant amino acid positions can substantially increase toxicity of the Cry9C protein towards selected insect species.
5. GENE CONSTRUCTION AND PLANT TRANSFORMATION
A modified DNA sequence encoding a truncated Cry9C(R164K) protein for expression in corn and cotton plants is shown in SEQ lD No. 3. This DNA
sequence has an optimized codon usage for plants and encodes an N- and C-terminally truncated Cry9C protein wherein an arginine amino acid has been replaced by a lysine (at position 123 in SEQ iD No. 3). Based on this DNA sequence, DNA
sequences are made encoding the above improved Cry9C proteins and comprising amino acids 1 to 666 of the Cry9C(R164K) protein. Preferred codons to encode the amino acid replacements in the improved Cry9C proteins are those which are most preferred by the plant host (see, e.g., Murray, 1989, supra). A chimeric improved cry9C gene comprising the 35S promoter and 35S 3' transcription termination and polyadenylation signal is constructed by routine molecular biology techniques as described in the detailed description.
Corn cells are stably transformed by either Agrobacterium-mediated transformation (Ishida et al., 1996, supra and U.S. Patent No. 5,591,616) or by electroporation using wounded and enzyme-degraded embryogenic callus, as described in WO 92/09696 or US Patent 5,641,664 (incorporated herein by reference). The resulting transformed cells are selected by means of the incorporated selectable marker gene, grown into plants and tested for susceptibility towards insects. Corn plants expressing a truncated improved Cry9C(R164K) protein wherein the amino acids at positions 364, 488, 319 and 321 have been replaced into alanine show a significantly higher protection from Ostrinia nubilalis and Diatraea grandiosella damage in comparative tests against corn plants expressing a truncated Cry9C(R164K} protein. A positive correlation is found between the level of expression, as measured by RNA and protein analysis, and the observed insecticidal effect.
Cotton cells are stably transformed by Agrobacterium-mediated transformation (Umbeck et al., 1987, Bio/Technology 5, 263-266; US Patent 5,004,863, incorporated herein by reference). The resulting transformed cells are selected by means of the incorporated selectable marker gene, grown into plants and tested for susceptibility towards insects. Cotton plants expressing the truncated improved Cry9C(R164K, L321 A, P329A) protein at similar levels than cotton plants expressing the truncated Cry9C(R164K) protein show a significantly higher protection from Heliofhis virescens damage. A positive correlation is found between the level of expression, as measured by RNA and protein analysis, and the observed insecticidal effect.
The examples and embodiments of this invention described herein are only supplied for illustrative purposes. Many variations and modifications in accordance with the present invention are known to the person skilled in the art and are included . in this invention and the scope of the claims. For instance, it is possible to alter, delete or add some nucleotides or amino acids to certain regions of the DNA or ~ protein sequences of the invention without departing from the invention.
All publications (including patent publications) referred to in this application are hereby incorporated by reference, particularly WO 94/05771, WO 94/24264, and Lambert et al. (1996, supra).
Table 1: relative toxicity of modified trypsin-digested Cry9C proteins to different insects when compared with the Cry9C(R164A) trypsin-digested protein (mutant 'F313A': the Cry9C(R164A) trypsin-digested protein wherein also the phenylalanine at position 313 is replaced by alanine; 'down(2x)': mutant protein with a significantly lower toxicity (LC50 value about 2 times higher than the control protein), 'up (2x)':
mutant with a significantly higher toxicity (LC50 value about two times lower than that of the control protein), '-': no difference in toxicity found):
mutant H. virescens O. nubilalis D. grandiosella F313A down {2x) - -P3i6A - - up (2x) A317V - - up (2x) N318A down (2-3x) - -V319A - - up (3x) L321 A up (2x) - up (2x) R323A down (3x) - -W325A down (4-5x) down (2x) down (2-3x) P329A up (2x) - -Y330A - down (1.5x) up (2x) V362A down (3-4x) - -S364A - up (2x) -D368A down (2-3x) - -Y369A - - up (2x) R418A down ( 16x) down (2x) -A420V down (12x) - -L421 A - down (2x) -1422A - - up (2x) F480A down (5x) - down (40x) mutant H. virescens O. nubilalis D. grandiosella Q481 A down (3x) - -N483A - - down (2x) Q484A - - down (20x}
A485V down (3x) down (2x) down (20x) S487A down (2x) - down (20x) 1488A down (2x) up (2-3x) down (5x}
N490A - - down (20x) A491 V - - down (3x) SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT:
(A) NAME: PLANT GENETIC SYSTEMS N.V.
(B) STREET: Jozef Plateaustraat 22 (C) CITY: Gent (E) COUNTRY: Belgium (F) POSTAL CODE (ZIP): B-9000 (G) TELEPHONE: (32)(9)2358411 (H) TELEFAX: (32)(9)2231923 (ii) TITLE OF INVENTION: Improved Bacillus thuringiensis toxin (iii) NUMBER OF SEQUENCES: 4 (iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) (vi) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/884,389 (B) FILING DATE: 27-JUN-1997 (2) INFORMATION FOR SEQ ID N0: 1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4344 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (ix) FEATURE:
(A) NAME/KEY: -(B) LOCATION:668..4141 (D) OTHER INFORMATION:/note= "coding sequence"
(xi) SEQUENCE
DESCRIPTION:
SEQ
ID NO:
1:
TCAAGTATCAGGTTTGTTTTGTTTTGTATATGAGTAAGAACCGAAGGTTTGTP.AAAAAGA480 GATTGAGAGT AAAGATATAT
ATATATAAAT ACAATAAAGA
GATATGATAT GAACATGCAC
TAGATTTATA GTATAGGAGG
CCCATTGTGG
' GTGTCCATCA GATGACGATG TGAGGTATCC TTTGGCAAGT GACCCAAATG 780 CAGCGTTACA
ATTCTTATAT
CTGTTGTTGG
TTTATCAATT
TCATGCGACA
CACTTGCAAG
ATTGGTTGGC
CTTTAGACCT
CATTACTGTC
CTCTTTTTGG
AATTGGAACT
ATCGTTTAAG
TGACTTTAGT
CAACGGGATC
CACCAGCTAA
AGCTCGAAAA
TCAGCAGTAA
TACGCCGTAG
CCACAAGAGC
TAGATTTTCG
GAGGCTTGTT
CAAATGATGA
TTACCTTTTT
CTGGATCTAT AGCTAATGCA
GGAAGTGTAC CTACTTATGT
ACCTTAATAA TACGATTACC
CCAAATAGAA TTACACAATT
CACCTGTTTC GGGTACTACG
GTCTTAAAAG GTCCAGGATT
GAAGAACAAC TAATGGCACA
TTTGGAACGT TAAGAGTAAC
AACAATATCG CCTAAGAGTT
CGTTTTGCCT CAACAGGAAA
1~AAAAAGGTAAAATGAATAGAACCCCCTACTGGTAGAAGGACCGATAGGGGGTTCTTACA 4260 TGAAAAAATGTAGCTGTTTACTAAGGTGTATAAAAAACAGCATATCTGATAGAAP.AAAGT4320 (2) INFORMATION FOR SEQ ID N0: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1157 amino acids (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
Met Asn Arg Asn Asn Gln Asn Glu Tyr Glu Ile Ile Asp Ala Pro His Cys Gly Cys Pro Ser Asp Asp Asp Val Arg Tyr Pro Leu Ala Ser Asp Pro Asn Ala Ala Leu Gln Asn Met Asn Tyr Lys Asp Tyr Leu Gln Met Thr Asp Glu Asp Tyr Thr Asp Ser Tyr Ile Asn Pro Ser Leu Ser Ile Ser Gly Arg Asp Ala Val Gln Thr Ala Leu Thr Val Val Gly Arg Ile Leu Gly Ala Leu Gly Val Pro Phe Ser Gly Gln Ile Val Ser Phe Tyr Gln Phe Leu Leu Asn Thr Leu Trp Pro Val Asn Asp Thr Ala Ile Trp Glu Ala Phe Met Arg Gln Val Glu Glu Leu Val Asn Gln Gln Ile Thr Glu Phe Ala Arg Asn Gln Ala Leu Ala Arg Leu Gln Gly Leu Gly Asp Ser Phe Asn Val Tyr Gln Arg Ser Leu Gln Asn Trp Leu Ala Asp Arg Asn Asp Thr Arg Asn Leu Ser Val Val Arg Ala Gln Phe Ile Ala Leu Asp Leu Asp Phe Val Asn Ala Ile Pro Leu Phe Ala Val Asn Gly Gln Gln Val Pro Leu Leu Ser Val Tyr Ala Gln Ala Val Asn Leu His Leu Leu Leu Leu Lys Asp Ala Ser Leu Phe Gly Glu Gly Trp Gly Phe Thr Gln Gly Glu Ile Ser Thr Tyr Tyr Asp Arg Gln Leu Glu Leu Thr Ala Lys Tyr Thr Asn Tyr Cys Glu Thr Trp Tyr Asn Thr Gly Leu Asp Arg Leu Arg Gly Thr Asn Thr Glu Ser Trp Leu Arg Tyr His Gln Phe Arg Arg Glu Met Thr Leu Val Val Leu Asp Val Val Ala Leu Phe Pro Tyr 275 280 285 _ Tyr Asp Val Arg Leu Tyr Pro Thr Gly Ser Asn Pro Gln Leu Thr Arg Glu Val Tyr Thr Asp Pro Ile Val Phe Asn Pro Pro Ala Asn Val Gly Leu Cys Arg Arg Trp Gly Thr Asn Pro Tyr Asn Thr Phe Ser Glu Leu Glu Asn Ala Phe Ile Arg Pro Pro His Leu Phe Asp Arg Leu Asn Ser Leu Thr Ile Ser Ser Asn Arg Phe Pro Val Ser Ser Asn Phe Met Asp Tyr Trp Ser Gly His Thr Leu Arg Arg Ser Tyr Leu Asn Asp Ser Ala Val Gln Glu Asp Ser Tyr Gly Leu Ile Thr Thr Thr Arg Ala Thr Ile Asn Pro Gly Val Asp Gly Thr Asn Arg Ile Glu Ser Thr Ala Val Asp Phe Arg Ser Ala Leu Ile Gly Ile Tyr Gly Val Asn Arg Ala Ser Phe Val Pro Gly Gly Leu Phe Asn Gly Thr Thr Ser Pro Ala Asn Gly Gly Cys Arg Asp Leu Tyr Asp Thr Asn Asp Glu Leu Pro Pro Asp Glu Ser Thr Gly Ser Ser Thr His Arg Leu Ser His Val Thr Phe Phe Ser Phe Gln Thr Asn Gln Ala Gly Ser Ile Ala Asn Ala Gly Ser Val Pro Thr Tyr Val Trp Thr Arg Arg Asp Val Asp Leu Asn Asn Thr Ile Thr Pro Asn Arg Ile Thr Gln Leu Pro Leu Val Lys Ala Ser Ala Pro Val Ser Gly Thr Thr Val Leu Lys Gly Pro Gly Phe Thr Gly Gly Gly Ile Leu Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu Arg Val Thr Val Asn Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Va1 Arg Phe Ala Ser Thr Gly Asn Phe Ser hle Arg Val Leu Arg Gly Gly Val Ser Ile Gly Asp Val Arg Leu Gly Ser Thr Met Asn Arg Gly Gln Glu Leu Thr Tyr Glu Ser Phe Phe Thr Arg Glu Phe Thr Thr Thr Gly Pro Phe Asn Pro Pro Phe Thr Phe Thr Gln Ala Gln Glu Ile Leu Thr Val Asn Ala Glu Gly Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Arg Ile Glu Ile Val Pro Val Asn Pro Ala Arg Glu Ala Glu Glu Asp Leu Glu A1a Ala Lys Lys Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gln Val Asn Val Thr Asp Tyr Gln Val Asp Gln Ala Ala Asn Leu Val Ser Cys Leu Ser Asp Glu Gln Tyr Gly His Asp Lys Lys Met Leu Leu Glu Ala Val Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gln Asp Pro Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe Phe Lys Gly Arg Ala Leu Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile Tyr Gln Lys Val Asp Ala Ser Val Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp Gly Phe Val Lys Ser Ser Gln Asp Leu Glu Ile Asp Leu Ile His His His Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser Asp Thr Tyr Ser Asp Gly Ser Cys Ser Gly Ile Asn Arg Cys Asp Glu Gln His Gln Val Asp Met Gln Leu Asp Ala Glu His His Pro Met Asp Cys Cys Glu Ala Ala Gln Thr His Glu Phe Ser Ser Tyr Ile Asn Thr Gly Asp Leu Asn Ala Ser Val Asp Gln Gly Ile Trp Val Val Leu Lys Val Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu Val Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg Asp Asn Ala Lys Trp Asn Ala Glu Leu Gly Arg Lys Arg Ala Glu Ile Asp Arg Val Tyr Leu Ala Ala Lys Gln Ala Ile Asn His Leu Phe Val Asp Tyr Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly Leu Ala Glu Ile Asn Glu Ala Ser Asn Leu Val Glu Ser Ile Ser Gly Val Tyr Ser Asp Thr Leu Leu Gln Ile Pro Gly Ile Asn Tyr Glu Ile Tyr Thr Glu Leu Ser Asp Arg Leu Gln Gln Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val Gln Asn Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Thr Thr Met Asp Ala Ser Val Gln Gln Asp Gly Asn Met His Phe Leu Val Leu Ser His Trp Asp Ala Gln Val Ser Gln Gln Leu Arg Val Asn Pro Asn Cys Lys Tyr Val Leu Arg Val Thr Ala Arg Lys Val Gly Gly Gly Asp Gly Tyr Val Thr Ile Arg Asp Gly Ala His His Gln Glu Thr Leu Thr Phe Asn Ala Cys Asp Tyr Asp Val Asn Gly Thr Tyr Val Asn Asp Asn Ser Tyr Ile Thr Glu Glu Val Val Phe Tyr Pro Glu Thr Lys His Met Trp Val Glu Val Ser Glu Ser Glu Gly Ser Phe Tyr Ile Asp Ser Ile Glu Phe Ile Glu Thr Gln Glu (2) INFORMATION FOR SEQ ID N0: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1897 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = "synthetic DNA"
(ix) FEATURE:
(A) NAME/KEY: -(B) LOCATION:13..1890 (D) OTHER INFORMATION:/note = ~"coding sequence"
(xi) SEQUENCE DESCRIPTION:
SEQ ID NO: 3:
ATGACCGACG
GCGACGTGCG
CCTGGGCAGC
(2) INFORMATION FOR SEQ ID N0: 4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 625 amino acids (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 4:
Met Ala Asp Tyr Leu Gln Met Thr Asp Glu Asp Tyr Thr Asp Ser Tyr Ile Asn Pro Ser Leu Ser Ile Ser Gly Arg Asp Ala Val Gln Thr Ala Leu Thr Val Val Gly Arg Ile Leu Gly Ala Leu Gly Val Pro Phe Ser Gly Gln Ile Val Ser Phe Tyr Gln Phe Leu Leu Asn Thr Leu Trp Pro Val Asn Asp Thr Ala Ile Trp Glu Ala Phe Met Arg Gln Val Glu Glu Leu Val Asn Gln Gln Ile Thr Glu Phe Ala Arg Asn Gln Ala Leu Ala Arg Leu Gln Gly Leu Gly Asp Ser Phe Asn Val Tyr Gln Arg Ser Leu Gln Asn Trp Leu Ala Asp Arg Asn Asp Thr Lys Asn Leu Ser Val Val Arg Ala Gln Phe Ile Ala Leu Asp Leu Asp Phe Val Asn Ala Ile Pro Leu Phe Ala Val Asn Gly Gln Gln Val Pro Leu Leu Ser Val Tyr Ala Gln Ala Val Asn Leu His Leu Leu Leu Leu Lys Asp Ala Ser Leu Phe Gly Glu Gly Trp Gly Phe Thr Gln Gly Glu Ile Ser Thr Tyr Tyr Asp Arg Gln Leu Glu Leu Thr Ala Lys Tyr Thr Asn Tyr Cys Glu Thr Trp W0.99/00407 PCT/EP98/04033 Tyr Asn Thr Gly Leu Asp Arg Leu Arg Gly Thr Asn Thr Glu Ser Trp Leu Arg Tyr His Gln Phe Arg Arg Glu Met Thr Leu Val Val Leu Asp Val Val Ala Leu Phe Pro Tyr Tyr Asp Val Arg Leu Tyr Pro Thr Gly Ser Asn Pro Gln Leu Thr Arg Glu Val Tyr Thr Asp Pro Ile Val Phe Asn Pro Pro Ala Asn Val Gly Leu Cys Arg Arg Trp Gly Thr Asn Pro Tyr Asn Thr Phe Ser Glu Leu Glu Asn Ala Phe Ile Arg Pro Pro His Leu Phe Asp Arg Leu Asn Ser Leu Thr Ile Ser Ser Asn Arg Phe Pro Val Ser Ser Asn Phe Met Asp Tyr Trp Ser Gly His Thr Leu Arg Arg Ser Tyr Leu Asn Asp Ser Ala Val Gln Glu Asp Ser Tyr Gly Leu Ile Thr Thr Thr Arg Ala Thr Ile Asn Pro Gly Val Asp Gly Thr Asn Arg Ile Glu Ser Thr Ala Val Asp Phe Arg Ser Ala Leu Ile Gly Ile Tyr Gly Val Asn Arg Ala Ser Phe Val Pro Gly Gly Leu Phe Asn Gly Thr Thr Ser Pro Ala Asn Gly Gly Cys Arg Asp Leu Tyr Asp Thr Asn Asp Glu Leu Pro Pro Asp Glu Ser Thr Gly Ser Ser Thr His Arg Leu Sex His Val Thr Phe Phe Ser Phe Gln Thr Asn Gln Ala Gly Ser Ile Ala Asn Ala Gly Ser Val Pro Thr Tyr Val Trp Thr Arg Arg Asp Val Asp Leu Asn Asn Thr Ile Thr Pro Asn Arg Ile Thr Gln Leu Pro Leu Val Lys Ala Ser Ala Pro Val Ser Gly Thr Thr Val Leu Lys Gly Pro Gly Phe Thr Gly Gly Gly Ile Leu Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu Arg Val Thr Val Asn Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Val Arg Phe Ala Ser Thr Gly Asn Phe Ser Ile Arg Val Leu Arg Gly Gly Val Ser Ile Gly Asp Val Arg Leu Gly Ser Thr Met Asn Arg Gly Gln Glu Leu Thr Tyr Glu Ser Phe Phe Thr Arg Glu Phe Thr Thr Thr Gly Pro Phe Asn Pro Pro Phe Thr Phe Thr Gln Ala Gln Glu Ile Leu Thr Val Asn Ala Glu Gly Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Arg Ile Glu Ile Val Pro Val Asn Pro Ala Arg Glu Ala Glu Glu Asp
No. 2: 313-334, 358-369, 418-425, 480-492. A modified Cry9C protein, differing in one amino acid from the native protein or its protease-resistant variant and being significantly less toxic towards the target insect, allows the direct identification of this amino acid position as involved in toxicity (provided no gross structural changes are introduced), and thus has considerable value in improving toxicity. In accordance with this invention, the identification of these amino acid positions involved in toxicity allows the construction of modified proteins having increased toxicity to the target insect by amino acid randomization at these positions. Preferred modified Cry9C
proteins in accordance with this invention are the modified Cry9C proteins having altered toxicity to Ostrinia nubilalis, Heliothis virescens or Diatraea grandiosella as shown in Table 1, as well as combinations of those modifications in one modified protein.
An example of an improved Cry9C protein in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 or 666 wherein an amino acid in at least one of the following amino acid positions of SEQ ID No. 2 has been replaced by another amino acid: 313, 316, 317, 318, 319, 321, 323, 325, 329, 330, 362, 364, 368, 369, 418, 420, 421, 422, 480, 481, 483, 484, 485, 487, 488, 490 and 491;
or an amino acid position located in the immediate vicinity of any one of these positions in the three-dimensional structure of the protein, preferably those amino acids whose C-alpha atom is at a maximum distance of about 7 Angstrom from the C-alpha atom _7_ of the amino acid listed above. A preferred improved Cry9C protein in accordance with this invention is the protein of SEQ ID No. 2 with at least one of the following amino acid changes: P316A, A317V, V319A, L321 A, P329A, Y330A, S364A, Y369A, 1422A, and 1488A. "V319A" or "Cry9C(V319A)", as used herein, means a change of the valine amino acid at position 319 in SEQ ID No. 2 to an alanine amino acid.
Preferred improved Cry9C proteins also include Cry9C proteins having also the arginine amino acid at position 164 in SEQ ID No. 2 altered into another amino acid, particularly alanine or lysine, to enhance stability upon protease, particularly trypsin, cleavage.
A preferred Cry9C protein for the control of Ostrinia nubilalis insects in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein an amino acid in at least one of the following amino acid positions in SEQ ID No.
2 has been replaced by another amino acid: 325, 364, 418, 421, 485, and 488. A
particularly preferred improved Cry9C protein for the control of Ostrinia nubilalis insects is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the amino acids in at least position 364 or at least in positions 364 and 488 of SEQ ID No. 2 are replaced by another amino acid, particularly alanine.
A preferred Cry9C protein for the control of Heliothis virescens insects in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein an amino acid in at least one of the following amino acid positions in SEQ ID No.
2 has been replaced by another amino acid: 313, 316, 317, 318, 319, 321, 323, 325, 329, 330, 368, 369, 418, 420, 421, 422, 480, 481, 483, 484, 485, 487, 488, 490 and 492, particularly at least one of the following amino acid positions: 321, 325, 329, 418, 420, and 480. A particularly preferred improved Cry9C protein for the control of Heliothis virescens insects is a protein comprising the amino acid sequence of SEQ
ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the amino acids in at least one of the amino acid positions 321 and 329 of SEQ ID
No. 2 are replaced by another amino acid, particularly alanine.
_g_ A preferred Cry9C protein for the control of Diatraea grandiosella insects in accordance with this invention is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein an amino acid in at least one of the following amino acid positions in SEQ ID No.
2 has been replaced by another amino acid: 316, 317, 319, 321, 325, 330, 369, 421, 422, 480, 483, 484, 485, 487, 488, 490, and 491; particularly at least one of the following amino acid positions: 480, 484, 485, 487, and 490. A particularly preferred improved Cry9C protein for the control of Diatraea grandiosella insects is a protein comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658 wherein the amino acids in at least one of the amino acid positions 316, 317, 319, 321, 330, 369 and 422 of SEQ ID No. 2 are replaced by another amino acid, particularly alanine or valine (for 317).
By using DNA sequences encoding improved Cry9C proteins in accordance with this invention, improved toxicity to a selected insect species can be obtained upon expression of such DNA in a transgenic plant.
A "cry9C gene", as used herein, is a DNA sequence comprising a DNA
encoding a Cry9C protein (a coding region), and includes necessary regulatory sequences so that a Cry9C protein can be expressed in a cell, preferably a plant or bacterial cell. A cry9C gene does not necessarily need to be expressed everywhere at all times, expression can be periodic (e.g. at certain stages of development in a plant) and/or can be spatially restricted (e.g. in certain cells or tissues in a plant), mainly depending on the activity of regulatory elements provided in the chimeric gene or in the site of insertion in the plant genome. A cry9C gene can be naturally-occurring or can be a hybrid or synthetic DNA and the regulatory elements can be from prokaryotic or eucaryotic origin.
The "modified cry9C gene", as used herein, is a DNA sequence comprising a DNA encoding a modified Cry9C protein (a modified coding region), and includes necessary regulatory sequences so that a Cry9C protein can be expressed in a cell, preferably a plant or bacterial cell. An example of a modified cry9C coding region is the cry9C coding region of SEQ ID No. 3 wherein the valine codon at nucleotide positions 844-846 of SEQ ID No. 3 has been replaced by an alanine codon.
_g_ "Substantial sequence homology" to a DNA sequence, as used herein, refers to DNA sequences differing in some, most or. all of their colons from another DNA
sequence but encoding the same or substantially the same protein. Indeed, because of the degeneracy of the genetic code, the colon usage of a particular DNA
coding region can be substantially modified, e.g., so as to more closely resemble the colon usage of the genes in the host cell, without changing the encoded protein.
Changing the colon usage of a DNA coding region to that of the host cell has been described to be desired for gene expression in foreign hosts (e.g. Bennetzen & Hall, 1982, J.
Biol. Chem. 257, 3026-3031.; Itakura, 1977, Science 198, 1056-1063). Colon usage tables are available in the literature (Wada et al., 1990, Nucl. Acids Res.
18, 2367-1411; Murray et al., 1989, Nucl. Acids Res. 17(2), 477-498) and in the major DNA
sequence databanks (e.g. at EMBL in Heidelberg, Germany). Accordingly, recombinant or synthetic DNA sequences can be constructed so that the same or substantially the same proteins with substantially the same insecticidal activity are produced (Koziel et al., 1993, Biotechnology 11, 194-200; Perlak et al., 1993, Plant Mol. Biol. 22, 313-321 ). A modified cry9C gene has all appropriate control regions so that the modified Cry9C protein can be expressed in a host cell, e.g. for expression in plants, a plant-expressible promoter and a 3' termination and polyadenylation region active in plants.
A "chimeric improved cry9C gene", as used herein, refers to a chimeric gene comprising a DNA sequence encoding the improved Cry9C protein inserted in between controlling elements of different origin, e.g. a DNA sequence encoding the improved Cry9C protein under the control of a promoter transcribing the DNA in the plant cell, and fused to 3' transcription termination sequences active in plant cells.
Protection of a plant, preferably a corn or cotton plant, against an insect species which is known to feed on said plant is preferably accomplished by expressing an improved Cry9C protein in the cells of the plant. This is preferably accomplished by expressing a chimeric improved cry9C gene encoding such an improved Cry9C protein in the cells of a plant, preferably a corn or cotton plant. An improved Cry9C protein of this invention preferably only has a small number, particularly less than 20, more particularly less than 15, preferably less than 10 amino acids replaced by other amino acids as compared to the Cry9C protein, preferably as compared to the region from between amino acid positions 1 and 45 to amino acid position 658 of the Cry9C protein of SEQ ID No. 2. A significant increase in toxicity can already be obtained by replacing only 1 amino acid, but it is preferred that more than one amino acid is changed to improve toxicity.
The following steps are followed to construct the new modified Cry9C proteins:
amino acids in domain II of the Cry9C protein from amino acid positions 313-334, 358-369, 418-425, and 480-492 were chosen for modification, using alanine-scanning mutagenesis (Cunningham & Wells, 1989, Science 244, 1081-85). In case the original position is alanine, a substitution by valine is done. These regions occur at positions corresponding to the solvent-exposed positions in the loop between beta-strands 1 and 2 (comprising alpha-helix 8) and in loop 1 (located between beta strands 2 and 3), in loop 2 (located between beta-strands 6 and 7), and in loop 3 (located between beta-strands 10 and 11 ) in the three-dimensional model of the Cry3A protein (Li et al., 1991, supra). To discount any observed lower toxicity of a modified Cry9C protein which is due to misfolding or structural distortion, the structural stability of mutant ICPs can be analysed by a variety of methods including toxicity to another target insect, crystal formation, solubilization, monoclonal antibody binding analysis, protease resistance, fluorometric monitoring of unfolding and circular dichroism spectrum analysis. In the case of structural distortion, it is impossible to determine the functional role of this position by alanine replacement.
However, a more conservative amino acid substitution may yield a correctly folded mutant protein which allows to determine the functional role of this position.
The amino acid positions, identified above, which yield modified proteins with significantly decreased toxicity ("down-mutants") are randomized. This means that a set of 20 different mutants, representing each type of amino acid, is generated for each position of interest (the original amino acid and the alanine substitution function as a control). This method is further referred to as "amino acid randomization".
Such mutants may be generated by a variety of methods, e.g. following the PCR
overlap extension method (Ho et al., 1989, Gene 77, 51-59). These mutant proteins are then tested in toxicity assays on the target insect. Mutants at each position which are more toxic, e.g., yield higher mortality than the wild type protein, are selected.
Such mutants with improved toxicity are termed "up-mutants". Alternatively, it is also possible to select potential up-mutants on the basis of increased reversible binding which can be measured following the procedures of Van Rie et al. (1990, Appl.
Environm. Microbiol. 56, 1378-1385) or Liang et al. (1995, J. Biol. Chem. 270, 24719-24724), which is incorporated herein by reference.
All or some of the "up-mutant" amino acids, identified in step 2, are combined in a single modified protein. According to additivity principles, mutations in non-interacting parts of a protein should combine to give simple additive changes in the free energy of binding (Lowman and Wells, 1993, J. Mol. Biol., 234, 564-578).
Increases in toxicity are thus accumulated by combining several single mutants into one multiple mutant. Finally a modified protein with improved toxicity is designed, which comprises some or all, preferably all, of the up-mutant amino acids previously identified.
In accordance with this invention, amino acids of domain II of a Cry9C
protein, located at the protruding regions of domain II are chosen for modification. By "protruding regions of domain II", as used herein, are meant the solvent-exposed regions organized in loops, alpha helices or beta-strands which are protruding from domain II and are located at or towards the apex of the molecule.
This invention is particularly suited for improving the toxicity to an insect species for which the Cry9C protein has a rather weak toxicity. The toxicity of this improved Cry9C protein can be increased by combining amino acid mutations in the protein, each yielding an increased toxicity when compared to the amino acid present in the native Cry9C protein. Insect species for which improved Cry9C proteins can be made also include Spodoptera frugiperda, Heliothis zea, Heliothis armigera, and Agrotis ipsilon. Also, this invention is suited to increase toxicity of a Cry9C protein or its protease-resistant variant to one insect species and to decrease toxicity of the same protein to another insect species by making the proper amino acid substitutions in the protein. This may be advantageous, e.g., to limit the likelihood of insect resistance occurrence to the protein in a particular insect species.
An insecticidally effective part of the modified cry9C gene of this invention encoding an insecticidally effective portion of the modified Cry9C protein, can be made in a conventional manner. An "insecticidally effective part" of the modified cry9C gene refers to a gene comprising a DNA coding region encoding a polypeptide with fewer amino acids than the full length modified Cry9C protein but that still retains toxicity to insects. A preferred insecticidally effective part of the Cry9C
protein is the part from amino acid position 1 or 44 to amino acid position 658 in SEQ ID No.
2.
In order to express all or an insecticidally effective part of the improved cry9C
gene in E. coli, in Bt strains and in plants, suitable restriction sites can be introduced, flanking each gene or gene part. This can be done by site-directed mutagenesis, using well-known procedures (Stanssens et al., 1989, Nucl. Acids Res. 92, 4441-4454; White et al., 1989, Trends in Genet. 5, 185-189).
In order to improve expression in foreign host cells such as plant cells, it may be preferred to alter the improved cry9C coding region or its insecticidally effective part to form an equivalent, artificial improved cry9C coding region.
Expression is improved by selectively inactivating certain cryptic regulatory or processing elements present in the native sequence as described in PCT publications WO 91/16432 and 8. This can be done by site-directed mutagenesis or site-directed intron-insertion (WO 93/09218), or by introducing overall changes to the codon usage, e.g., adapting the codon usage to that most preferred by the host organism (publication of European patent application number ("EP") 0 385 962, EP 0 359 472, publication of PCT patent application WO 93/07278, Murray et al., 1989, supra) without significantly changing, preferably without changing, the encoded amino acid sequence. Small modifications to a DNA sequence such as described above can be routinely made by PCR-mediated mutagenesis (Ho et al., 1989, supra; White et al., 1989, supra). For major changes to the DNA sequence, DNA synthesis methods are available in the art (e.g. Davies et al., 1991, Society for Applied Bacteriology, Technical Series 28, pp. 351-359). For obtaining enhanced expression in monocot plants such as com, a monocot intron can be added to the chimeric improved cry9C
gene (Callis et al., 1987, Genes & Development 1, 1183-1200; PCT publication WO
93/07278). Another preferred embodiment of this invention is the expression of the improved Cry9C proteins by the method described in PCT patent publication WO
97/49814, which is incorporated herein by reference.
The chimeric improved cry9C gene can be stably inserted in a conventional manner into the nuclear genome of a single plant cell, and the so-transformed plant cell can be used in a conventional manner to produce a transformed plant that is insect-resistant. Particularly preferred plants in accordance with this invention are corn plants. Corn cells can be stably transformed (e.g. by electroporation) using wounded or enzyme-degraded intact tissues capable of forming compact embryogenic callus (such as corn immature embryos), or the embryogenic callus (such as type I callus in corn) obtained thereof, as described in PCT patent publication WO 92/09696 or US Patent 5,641,664. Other methods for transformation of corn include the methods by Fromm et al. (1990, Bio/Technology 8, 833-839), Cordon-Kamm et al. (1990, The Plant Cell 2, 603-618) and Ishida et al. (1996, Nature Biotechnology 14, 745-750).
Alternatively, a disarmed Ti plasmid, containing the insecticidally effective chimeric improved cry9C gene, in Agrobacterium tumefaciens can be used to transform the plant cell, preferably the corn or cotton cell, and thereafter, a transformed plant can be regenerated from the transformed plant cell using the procedures described, for example, in EP 0116718, EP 0270822, PCT publication WO 84/02913 and EP 0242246 (which are also incorporated herein by reference), and in Could et al. (1991, Plant Physiol. 95, 426-434) or Ishida et al. (1996, supra), particularly the method described in PCT publication WO 94/00977. Preferred Ti-plasmid vectors each contain the insecticidally effective chimeric improved cry9C
gene between the border sequences, or at least located to the left of the right border sequence, of the T-DNA of the Ti-plasmid. Of course, other types of vectors can be used to transform the plant cell, using procedures such as direct gene transfer (as described, for example in EP 0233247), pollen mediated transformation (as described, for example in EP 0270356, PCT publication WO 85/01856, and US
Patent 4,684,611 ), plant RNA virus-mediated transformation (as described, for example in EP 0067553 and US Patent 4,407,956), and liposome-mediated transformation (as described, for example in US Patent 4,536,475).
A resulting transformed plant, such as a transformed corn or cotton plant, can be used in a conventional plant breeding scheme to produce more transformed plants with the same characteristics or to introduce the improved cry9C gene, or an insecticidally effective part thereof in other varieties of the same or related plant species. Seeds, which are obtained from the transformed plants, contain the chimeric improved cry9C gene or its insecticidally effective part as a stable genomic insert. Cells of the transformed plant can be. cultured in a conventional manner to produce the improved Cry9C protein or insecticidally effective portions thereof, which can be recovered for use in conventional insecticide compositions against insects, particularly lepidopteran insects {U.S. Patent 5,254,799). Preferred plants in accordance with this invention, besides corn and cotton, include rice, plants of the genus Brassica such as oilseed rape, cauliflower and broccoli, and also soybean, tomato, tobacco, potato, eggplant, beet, oat, pepper, gladiolus, dahlia, chrysanthemum, sorghum, and garden peas.
The improved cry9C coding region or its insecticidally effective part is inserted in a plant cell genome so that the inserted coding region is downstream (i.e., 3') of, and under the control of, a promoter which can direct the expression of the gene part in the plant cell. This is preferably accomplished by inserting the chimeric improved cry9C gene or its insecticidally effective part in the plant cell genome.
Preferred promoters include: the strong constitutive 35S promoters (the "35S promoters") of the cauliflower mosaic virus of isolates CM 1841 (Gardner et al., 1981, Nucleic Acids Research 9, 2871-2887), CabbB-S (Franck et al., 1980, Cell 27, 285-294) and CabbB-JI (Hull and Howell, 1987, Virology 86, 482-493); the ubiquitin promoter (EP
0342926), and the TR1' promoter and the TR2' promoter which drive the expression of the 1' and 2' genes, respectively, of the T-DNA (Velten et al., 1984, EMBO
J. 3, 2723-2730). Alternatively, a promoter can be utilized which is not constitutive but rather is specific for one or more tissues or organs of the plant, preferably leaf and stem tissue, whereby the inserted chimeric improved cry9C gene or its insecticidally effective part is expressed only in cells of the specific tissues) or organ(s). Another alternative is to use a promoter whose expression is inducible (e.g., by insect feeding or by chemical factors). Known wound-induced promoters inducing systemic expression of their gene product throughout the plant are also of particular interest.
The improved cry9C coding region, or its insecticidally effective part, is inserted in the plant genome so that the inserted coding region is upstream (i.e., 5') of suitable 3' end transcription regulation signals (i.e., transcript termination and polyadenylation signals). Preferred polyadenylation and transcript formation signals include those of the 35S gene (Mogen et al., 1990, The Plant Cell 2, 1261-1272), the _ __ _ __ ..~___ _____ _,__ octopine synthase gene (Gielen et aL, 1984, EMBO J. 3, 835-845) and the T-DNA
gene 7 (Velten and Schell, 1985, Nucl. Acids Res. 13, 6981-6998), which act as 3'-untranslated DNA sequences in transformed plant cells.
The chimeric improved cry9C gene, or its insecticidally effective gene part, can optionally be inserted in the plant genome as a hybrid gene (EP 0 193 259;
Vaeck et al., 1987, Nature 327, 33-37) under the control of the same promoter as the coding region of a selectable marker gene, such as the coding region of the neo gene (EP 0 242 236) encoding kanamycin resistance, so that the plant expresses a fusion protein.
Preferably, the improved cry9C gene is expressed in a plant in combination with another insect control protein, e.g., another Bt-derived crystal protein or an insecticidal fragment thereof, particularly a Cry1 Ab- or Cry1 B-type protein, to prevent or delay the occurrence of insect resistance development (EP 0 408 403).
All or part of the improved cry9C coding region can also be used to transform bacteria, such as a B, thuringiensis which produces other insecticidal toxins (Lereclus et al., 1992, Bio/Technology 10, 418-421; Gelernter & Schwab, 1993, In Bacillus thuringiensis, An Environmental Biopesticide: theory and Practice, pp. 89-104, eds.
Entwistle, P.F., Cory, J.S., Bailey, M.J. and Higgs, S., John Wiley & Sons Ltd.).
Thereby, a transformed Bt strain is produced which is useful for combating a wide spectrum of insect pests or for combating insects in such a manner that insect resistance development is prevented or delayed (EP 0 408 403). Preferred promoter and 3' termination and polyadenylation sequences for the chimeric improved cry9C
gene are derived from Bacillus thuringiensis genes, such as the native ICP
genes.
Alternatively, the improved coding region of the invention can be inserted and expressed in endophytic and/or root-colonizing bacteria, such as bacteria of the genus Pseudomonas or Clavibacter, e.g., under the control of a Bt ICP gene promoter and 3' termination sequences. Successful transfer and expression of ICP
genes into such bacteria has been described by Stock et al. (1990, Can. J.
Microbiol.
36, 879-884), Dimock et at. (1989, In Biotechnology, Biopesticides and Novel Plant Pest Resistance Management, eds. Roberts, D.W. & Granados, R.R., pp.88-92, Boyce Thompson Institute for Plant Research, Ithaca, New York), and Waalwijk et al.
(1991, FEMS Microbiol. Lett. 77, 257-264). Transformation of bacteria with all or part of the improved cry9C coding region of the invention, incorporated in a suitable cloning vehicle, can be carried out in a conventional manner, preferably using conventional electroporation techniques as described in Mahillon et al. {1989, FEMS
Microbiol. Letters 60, 205-210), in PCT patent publication WO 90/06999, Chassy et al. (1988, Trends Biotechnol. 6, 303-309) or other methods, e.g., as described by ~ Lerecfus et al. (1992, Bio/Technology 90, 418).
The improved Cry9C-producing strain can also be transformed with all or an insecticidally effective part of one or more DNA sequences encoding a Bt protein or an insecticidally effective part thereof, such as: a DNA encoding the Bt2 or Cry1 Ab protein (US patent 5,254,799; EP 0 193 259) or the Bt109P or Cry3C protein (PCT
publication WO 91/16433), or another DNA coding for an anti-lepidoptera or an anti-Coleoptera protein. Thereby, a transformed Bt strain can be produced which is useful for combating an even greater variety of insect pests {e.g., Coleoptera and/or additional lepidoptera) or for preventing or delaying the development of insect resistance.
For the purpose of combating insects by contacting them with the improved Cry9C protein, e.g. in the form of transformed plants or insecticidal formulations, any DNA sequence encoding any of the above described improved Cry9C proteins, can be used.
The following Examples are offered by way of illustration and not by way of limitation. The sequence listing referred to in the description and the Examples is as follows:
SEQUENCE LISTING
SEQ ID No. 1: Nucleotide sequence of the Bacillus thuringiensis cry9C gene, showing the coding region and flanking 5' and 3' regions.
SEQ ID No. 2: Amino acid sequence of the full length Bacillus thuringiensis Cry9C protein.
SEQ ID No. 3: Nucleotide sequence of a codon-optimized DNA sequence encoding a truncated Cry9C protein wherein the arginine at _17_ amino acid position 123 (corresponding to amino acid position 164 in the protein of SEQ ID No. 2) has been replaced by lysine.
SEQ ID No. 4: Amino acid sequence of the modified Cry9C protein encoded by the DNA of SEQ ID No. 3.
Unless otherwise stated in the Examples, all general materials and methods, including procedures for making and manipulating recombinant DNA are carried out by the standardized procedures as described in volumes 1 and 2 of Ausubel et al., Current Protocols in Molecular Biology, Current Protocols, USA (1994), in Plant Molecular Biology Labfax (1993, by R.D.D. Croy, jointly published by BIOS
Scientific publications Ltd. UK and Blackwell Scientific Publications, UK) and Sambrook et al., Molecular Cloning - A Laboratory Manual, Second Ed., Cold Spring Harbor Laboratory Press, NY (1989).
EXAMPLES:
1. CONSTRUCTION OF MODIFIED CRY9C PROTEINS
Multiple alignments between Bt crystal protein sequences including the sequences of Cry9C, Cry3A and Cry1 Aa allowed identification of the amino acids located in the expected binding site of the Cry9C domain II. Using known alignment programs, 52 amino acid positions were identified for amino acid replacement.
The amino acids in the Cry9C protein of SEQ ID No. 2 from amino acid positions 313-334, 358-369, 418-425, 480-492 have been identified to correspond to the solvent-accessible regions most likely involved in receptor-binding in the Cry3A
protein, and these positions in the Cry9C protein were chosen for amino acid modification.
Since alanine substitution does not alter the main chain of a protein, and does not impose extreme electrostatic or steric effects and since it eliminates the side chain beyond the beta carbon, each of the amino acids in these identified regions was changed into alanine, one by one, using splice overlap extension PCR (Ho et al., 1989, supra) on the protease-resistant form of the native cry9C gene wherein the arginine codon at position 164 was replaced by an alanine codon. The codon most preferred in the cry9C native gene for alanine, GCA, was used for these modifications. When the original codon encodes alanine, then this is replaced by a valine codon (GTA).
The obtained PCR fragments were ligated in pUCl9-derived vectors. If not present, suitable unique restriction sites were created in the cry9C DNA. All plasmids containing modified DNA sequences were controlled by sequencing the relevant portions and were found to be correctly constructed. The modified cry9C genes were expressed in transformed WK6 cells. Every mutant protein was expressed in these E. coli cells at least twice. Mutants causing problems in expression, probably caused by structural changes in these mutants, were discarded. No gross folding aberrations of the mutants identified to be involved in toxicity (and fisted in Table 1 ) are found, e.g., as was evidenced by the similar SDS-PAGE patterns following trypsin cleavage or treatment with midgut juice of the insect larvae of solubilized mutant and Cry9C(R164A) proteins.
2. INSECT TOXICITY OF THE MODIFIED CRY9C PROTEINS
Bio assays on the modified Cry9C proteins obtained in Example 1 were carried out with first instar larvae of the Southwestern corn borer, Diatraea grandiosella (family Pyralidae); the European corn borer, 4strinia nubilalis (family Pyralidae); and the tobacco budworm, Heliothis virescens (family Noctuidae). A
dilution series of each protein was surface-layered on the artificial diet to determine' the LCSO value. The artificial diet consisted of: agar (20 g), water (1,000 ml), corn flour (96 g, ICN Biochemicals), yeast (30 g), wheat germs (64 g, 1CN
Biochemicals), wesson salt (7.5 g, ICN Biochemicals), casein (15 g), sorbic acid (2 g), aureomycin (0.3 g), nipagin (1 g), wheat germ oil (4 ml), sucrose (15 g), cholesterol (1 g), ascorbic acid (3.5 g), Vanderzand modified vitamin mix (12 g, ICN
Biochemicals).
Larvae were placed on the diet in multi-well plates, 1 larva per well (2 for Ostrinia nubilalis). For each dilution, 24 larvae were tested, and dead and living larvae were counted after 5 days. Prior to application, the mutant proteins were digested with trypsin to release the toxin fragments. For each mutant protein, the assays are repeated at least 5 times, using two different protein preparations. As control protein, the trypsin-digested Cry9C(R164A) protein was used. The Cry9C(R164A) protein has the amino acid sequence of SEQ ID No. 2 wherein the arginine at position was replaced by alanine. This protein was found to be more stable than the wild-type Cry9C toxin while retaining its toxicity to the test insects (see, e.g., PCT
patent publication WO 94/24264). The LCSO values were calculated with the POLO-program, which is based on the probit analysis (POLO-PC, LeOra Software, 1119 Shattuck Ave., Berkeley California 94707). The results of these assays for those protein mutants which gave an LCSO value that is significantly different from that of the control protein in repeated bio assays are summarized in Table 1. It is clear that different positions in the Cry9C protein when substituted to alanine cause increased toxicity in each of the tested insects.
Binding assays on isolated brush border membrane vesicles of Heliothis virescens and Ostrinia nubilalis performed as described in Van Rie et al.
(1990, Appl.
Environm. Microbiol. 56, 1378-1385) showed that for all, with the exception of two, of the modified Cry9C proteins with altered toxicity, receptor binding is also altered (e.g., an observed shift in K~ value), thus confirming that for most amino acid residues altered toxicity is due to altered receptor binding. Hence, these residues are proper candidates for improvement of toxicity by amino acid randomization at or near the identified critical position.
3. COMPETITION BINDING EXPERIMENTS
The Cry9C(R164A) protein was tested in competition binding assays using the ECL protein biotinylation system (Amersham Life Sciences, Amersham International plc., UK) as described by Lambent et al. (1996, supra) to determine if competition occurred with other Bt toxins in selected insects. For the assays, 3ng biotinylated Cry9C(R164A) protein was added to 30 Ng brush border membrane vesicles in PBS
buffer (comprising 0,1 % BSA) in the presence of a 300-fold excess of non-biotinylated toxin (homologous competition assays were included in every test as control). Repeated competition tests showed that in both Ostrinia nubilaiis and Heliothis virescens brush border membranes, there was no detectable competition in receptor binding between the (activated) Cry9C(R164A) protein and any one of the following (activated) Bt toxins: the Cry1 Aa (Schnepf et al., 1985, J. Biol.
Chem. 260, 6264-6272), CrylAb (Hofte et a1.,1986, Eur. J. Biochem. 161, 271-280), CrylAc (Adang et al., 1985, Gene 36, 289-300), Cry1 B (Brizzard & Whiteley, 1988, Nucl.
Acids Res. 16, 4168-4169) and Cry1 C (Honee et al., 1988, Nucl. Acids Res. 16, 6240) toxins. Thus, in these insects the Cry9C(R164A) protein binds to a different receptor than these other Bt toxins. In Diatreae grandiosella competition assays, it was found that the Cry9C(R164A) does compete for a receptor site with the Cry1 B
and Cry1 C Bt toxins, but does not compete with any one of the Cry1 Aa, Cry1 Ab, and Cry1 Ac toxins.
The same results are found for all three insects when testing the Cry9C
~ protein with the amino acid sequence of SEQ ID No. 2 from amino acids 1-658.
Thus, in all these three insects, combination of the Cry9C and a selected non-competitively binding Bt toxin with good toxicity to the target insect can be used simultaneously in order to prevent or delay insect resistance development. In transgenic corn plants, a particularly interesting combination would be the Cry9C (or its protease-resistant variant} and a Cry1 B and/or any of the Cry1 A-type toxins for Osfrinia nubilalis control and the Cry9C (or its protease-resistant variant) and any one of the Cry1 A-type toxins, preferably a Cry1 Ab-type toxin, for D.
grandiosella control. For Heliothis virescens control, the Cry9C (or its protease-resistant variant) and any of the Cry1 A-type toxins are preferred toxins to be co-expressed.
4. CONSTRUCTION OF IMPROVED CRY9C PROTEINS
The modified position in every mutant protein of Example 2 giving rise to a significantly decreased or increased toxicity to an insect species is altered to all other amino acids and the toxicity is re-evaluated. The amino acids yielding the highest toxicity at a particular position are combined to form an improved Cry9C
protein.
Also the alanine mutants yielding an increase in toxicity (up-mutant amino acid positions) are included in such combinations to form improved Cry9C proteins for the selected insect species. Table 1 indeed shows already two up-mutant proteins for every insect tested. Analysis of all these improved Cry9C proteins in the bio assay shows that combinations of up-mutant amino acid positions can substantially increase toxicity of the Cry9C protein towards selected insect species.
5. GENE CONSTRUCTION AND PLANT TRANSFORMATION
A modified DNA sequence encoding a truncated Cry9C(R164K) protein for expression in corn and cotton plants is shown in SEQ lD No. 3. This DNA
sequence has an optimized codon usage for plants and encodes an N- and C-terminally truncated Cry9C protein wherein an arginine amino acid has been replaced by a lysine (at position 123 in SEQ iD No. 3). Based on this DNA sequence, DNA
sequences are made encoding the above improved Cry9C proteins and comprising amino acids 1 to 666 of the Cry9C(R164K) protein. Preferred codons to encode the amino acid replacements in the improved Cry9C proteins are those which are most preferred by the plant host (see, e.g., Murray, 1989, supra). A chimeric improved cry9C gene comprising the 35S promoter and 35S 3' transcription termination and polyadenylation signal is constructed by routine molecular biology techniques as described in the detailed description.
Corn cells are stably transformed by either Agrobacterium-mediated transformation (Ishida et al., 1996, supra and U.S. Patent No. 5,591,616) or by electroporation using wounded and enzyme-degraded embryogenic callus, as described in WO 92/09696 or US Patent 5,641,664 (incorporated herein by reference). The resulting transformed cells are selected by means of the incorporated selectable marker gene, grown into plants and tested for susceptibility towards insects. Corn plants expressing a truncated improved Cry9C(R164K) protein wherein the amino acids at positions 364, 488, 319 and 321 have been replaced into alanine show a significantly higher protection from Ostrinia nubilalis and Diatraea grandiosella damage in comparative tests against corn plants expressing a truncated Cry9C(R164K} protein. A positive correlation is found between the level of expression, as measured by RNA and protein analysis, and the observed insecticidal effect.
Cotton cells are stably transformed by Agrobacterium-mediated transformation (Umbeck et al., 1987, Bio/Technology 5, 263-266; US Patent 5,004,863, incorporated herein by reference). The resulting transformed cells are selected by means of the incorporated selectable marker gene, grown into plants and tested for susceptibility towards insects. Cotton plants expressing the truncated improved Cry9C(R164K, L321 A, P329A) protein at similar levels than cotton plants expressing the truncated Cry9C(R164K) protein show a significantly higher protection from Heliofhis virescens damage. A positive correlation is found between the level of expression, as measured by RNA and protein analysis, and the observed insecticidal effect.
The examples and embodiments of this invention described herein are only supplied for illustrative purposes. Many variations and modifications in accordance with the present invention are known to the person skilled in the art and are included . in this invention and the scope of the claims. For instance, it is possible to alter, delete or add some nucleotides or amino acids to certain regions of the DNA or ~ protein sequences of the invention without departing from the invention.
All publications (including patent publications) referred to in this application are hereby incorporated by reference, particularly WO 94/05771, WO 94/24264, and Lambert et al. (1996, supra).
Table 1: relative toxicity of modified trypsin-digested Cry9C proteins to different insects when compared with the Cry9C(R164A) trypsin-digested protein (mutant 'F313A': the Cry9C(R164A) trypsin-digested protein wherein also the phenylalanine at position 313 is replaced by alanine; 'down(2x)': mutant protein with a significantly lower toxicity (LC50 value about 2 times higher than the control protein), 'up (2x)':
mutant with a significantly higher toxicity (LC50 value about two times lower than that of the control protein), '-': no difference in toxicity found):
mutant H. virescens O. nubilalis D. grandiosella F313A down {2x) - -P3i6A - - up (2x) A317V - - up (2x) N318A down (2-3x) - -V319A - - up (3x) L321 A up (2x) - up (2x) R323A down (3x) - -W325A down (4-5x) down (2x) down (2-3x) P329A up (2x) - -Y330A - down (1.5x) up (2x) V362A down (3-4x) - -S364A - up (2x) -D368A down (2-3x) - -Y369A - - up (2x) R418A down ( 16x) down (2x) -A420V down (12x) - -L421 A - down (2x) -1422A - - up (2x) F480A down (5x) - down (40x) mutant H. virescens O. nubilalis D. grandiosella Q481 A down (3x) - -N483A - - down (2x) Q484A - - down (20x}
A485V down (3x) down (2x) down (20x) S487A down (2x) - down (20x) 1488A down (2x) up (2-3x) down (5x}
N490A - - down (20x) A491 V - - down (3x) SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT:
(A) NAME: PLANT GENETIC SYSTEMS N.V.
(B) STREET: Jozef Plateaustraat 22 (C) CITY: Gent (E) COUNTRY: Belgium (F) POSTAL CODE (ZIP): B-9000 (G) TELEPHONE: (32)(9)2358411 (H) TELEFAX: (32)(9)2231923 (ii) TITLE OF INVENTION: Improved Bacillus thuringiensis toxin (iii) NUMBER OF SEQUENCES: 4 (iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) (vi) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/884,389 (B) FILING DATE: 27-JUN-1997 (2) INFORMATION FOR SEQ ID N0: 1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4344 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (ix) FEATURE:
(A) NAME/KEY: -(B) LOCATION:668..4141 (D) OTHER INFORMATION:/note= "coding sequence"
(xi) SEQUENCE
DESCRIPTION:
SEQ
ID NO:
1:
TCAAGTATCAGGTTTGTTTTGTTTTGTATATGAGTAAGAACCGAAGGTTTGTP.AAAAAGA480 GATTGAGAGT AAAGATATAT
ATATATAAAT ACAATAAAGA
GATATGATAT GAACATGCAC
TAGATTTATA GTATAGGAGG
CCCATTGTGG
' GTGTCCATCA GATGACGATG TGAGGTATCC TTTGGCAAGT GACCCAAATG 780 CAGCGTTACA
ATTCTTATAT
CTGTTGTTGG
TTTATCAATT
TCATGCGACA
CACTTGCAAG
ATTGGTTGGC
CTTTAGACCT
CATTACTGTC
CTCTTTTTGG
AATTGGAACT
ATCGTTTAAG
TGACTTTAGT
CAACGGGATC
CACCAGCTAA
AGCTCGAAAA
TCAGCAGTAA
TACGCCGTAG
CCACAAGAGC
TAGATTTTCG
GAGGCTTGTT
CAAATGATGA
TTACCTTTTT
CTGGATCTAT AGCTAATGCA
GGAAGTGTAC CTACTTATGT
ACCTTAATAA TACGATTACC
CCAAATAGAA TTACACAATT
CACCTGTTTC GGGTACTACG
GTCTTAAAAG GTCCAGGATT
GAAGAACAAC TAATGGCACA
TTTGGAACGT TAAGAGTAAC
AACAATATCG CCTAAGAGTT
CGTTTTGCCT CAACAGGAAA
1~AAAAAGGTAAAATGAATAGAACCCCCTACTGGTAGAAGGACCGATAGGGGGTTCTTACA 4260 TGAAAAAATGTAGCTGTTTACTAAGGTGTATAAAAAACAGCATATCTGATAGAAP.AAAGT4320 (2) INFORMATION FOR SEQ ID N0: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1157 amino acids (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
Met Asn Arg Asn Asn Gln Asn Glu Tyr Glu Ile Ile Asp Ala Pro His Cys Gly Cys Pro Ser Asp Asp Asp Val Arg Tyr Pro Leu Ala Ser Asp Pro Asn Ala Ala Leu Gln Asn Met Asn Tyr Lys Asp Tyr Leu Gln Met Thr Asp Glu Asp Tyr Thr Asp Ser Tyr Ile Asn Pro Ser Leu Ser Ile Ser Gly Arg Asp Ala Val Gln Thr Ala Leu Thr Val Val Gly Arg Ile Leu Gly Ala Leu Gly Val Pro Phe Ser Gly Gln Ile Val Ser Phe Tyr Gln Phe Leu Leu Asn Thr Leu Trp Pro Val Asn Asp Thr Ala Ile Trp Glu Ala Phe Met Arg Gln Val Glu Glu Leu Val Asn Gln Gln Ile Thr Glu Phe Ala Arg Asn Gln Ala Leu Ala Arg Leu Gln Gly Leu Gly Asp Ser Phe Asn Val Tyr Gln Arg Ser Leu Gln Asn Trp Leu Ala Asp Arg Asn Asp Thr Arg Asn Leu Ser Val Val Arg Ala Gln Phe Ile Ala Leu Asp Leu Asp Phe Val Asn Ala Ile Pro Leu Phe Ala Val Asn Gly Gln Gln Val Pro Leu Leu Ser Val Tyr Ala Gln Ala Val Asn Leu His Leu Leu Leu Leu Lys Asp Ala Ser Leu Phe Gly Glu Gly Trp Gly Phe Thr Gln Gly Glu Ile Ser Thr Tyr Tyr Asp Arg Gln Leu Glu Leu Thr Ala Lys Tyr Thr Asn Tyr Cys Glu Thr Trp Tyr Asn Thr Gly Leu Asp Arg Leu Arg Gly Thr Asn Thr Glu Ser Trp Leu Arg Tyr His Gln Phe Arg Arg Glu Met Thr Leu Val Val Leu Asp Val Val Ala Leu Phe Pro Tyr 275 280 285 _ Tyr Asp Val Arg Leu Tyr Pro Thr Gly Ser Asn Pro Gln Leu Thr Arg Glu Val Tyr Thr Asp Pro Ile Val Phe Asn Pro Pro Ala Asn Val Gly Leu Cys Arg Arg Trp Gly Thr Asn Pro Tyr Asn Thr Phe Ser Glu Leu Glu Asn Ala Phe Ile Arg Pro Pro His Leu Phe Asp Arg Leu Asn Ser Leu Thr Ile Ser Ser Asn Arg Phe Pro Val Ser Ser Asn Phe Met Asp Tyr Trp Ser Gly His Thr Leu Arg Arg Ser Tyr Leu Asn Asp Ser Ala Val Gln Glu Asp Ser Tyr Gly Leu Ile Thr Thr Thr Arg Ala Thr Ile Asn Pro Gly Val Asp Gly Thr Asn Arg Ile Glu Ser Thr Ala Val Asp Phe Arg Ser Ala Leu Ile Gly Ile Tyr Gly Val Asn Arg Ala Ser Phe Val Pro Gly Gly Leu Phe Asn Gly Thr Thr Ser Pro Ala Asn Gly Gly Cys Arg Asp Leu Tyr Asp Thr Asn Asp Glu Leu Pro Pro Asp Glu Ser Thr Gly Ser Ser Thr His Arg Leu Ser His Val Thr Phe Phe Ser Phe Gln Thr Asn Gln Ala Gly Ser Ile Ala Asn Ala Gly Ser Val Pro Thr Tyr Val Trp Thr Arg Arg Asp Val Asp Leu Asn Asn Thr Ile Thr Pro Asn Arg Ile Thr Gln Leu Pro Leu Val Lys Ala Ser Ala Pro Val Ser Gly Thr Thr Val Leu Lys Gly Pro Gly Phe Thr Gly Gly Gly Ile Leu Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu Arg Val Thr Val Asn Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Va1 Arg Phe Ala Ser Thr Gly Asn Phe Ser hle Arg Val Leu Arg Gly Gly Val Ser Ile Gly Asp Val Arg Leu Gly Ser Thr Met Asn Arg Gly Gln Glu Leu Thr Tyr Glu Ser Phe Phe Thr Arg Glu Phe Thr Thr Thr Gly Pro Phe Asn Pro Pro Phe Thr Phe Thr Gln Ala Gln Glu Ile Leu Thr Val Asn Ala Glu Gly Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Arg Ile Glu Ile Val Pro Val Asn Pro Ala Arg Glu Ala Glu Glu Asp Leu Glu A1a Ala Lys Lys Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gln Val Asn Val Thr Asp Tyr Gln Val Asp Gln Ala Ala Asn Leu Val Ser Cys Leu Ser Asp Glu Gln Tyr Gly His Asp Lys Lys Met Leu Leu Glu Ala Val Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gln Asp Pro Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe Phe Lys Gly Arg Ala Leu Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile Tyr Gln Lys Val Asp Ala Ser Val Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp Gly Phe Val Lys Ser Ser Gln Asp Leu Glu Ile Asp Leu Ile His His His Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser Asp Thr Tyr Ser Asp Gly Ser Cys Ser Gly Ile Asn Arg Cys Asp Glu Gln His Gln Val Asp Met Gln Leu Asp Ala Glu His His Pro Met Asp Cys Cys Glu Ala Ala Gln Thr His Glu Phe Ser Ser Tyr Ile Asn Thr Gly Asp Leu Asn Ala Ser Val Asp Gln Gly Ile Trp Val Val Leu Lys Val Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu Val Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg Asp Asn Ala Lys Trp Asn Ala Glu Leu Gly Arg Lys Arg Ala Glu Ile Asp Arg Val Tyr Leu Ala Ala Lys Gln Ala Ile Asn His Leu Phe Val Asp Tyr Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly Leu Ala Glu Ile Asn Glu Ala Ser Asn Leu Val Glu Ser Ile Ser Gly Val Tyr Ser Asp Thr Leu Leu Gln Ile Pro Gly Ile Asn Tyr Glu Ile Tyr Thr Glu Leu Ser Asp Arg Leu Gln Gln Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val Gln Asn Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Thr Thr Met Asp Ala Ser Val Gln Gln Asp Gly Asn Met His Phe Leu Val Leu Ser His Trp Asp Ala Gln Val Ser Gln Gln Leu Arg Val Asn Pro Asn Cys Lys Tyr Val Leu Arg Val Thr Ala Arg Lys Val Gly Gly Gly Asp Gly Tyr Val Thr Ile Arg Asp Gly Ala His His Gln Glu Thr Leu Thr Phe Asn Ala Cys Asp Tyr Asp Val Asn Gly Thr Tyr Val Asn Asp Asn Ser Tyr Ile Thr Glu Glu Val Val Phe Tyr Pro Glu Thr Lys His Met Trp Val Glu Val Ser Glu Ser Glu Gly Ser Phe Tyr Ile Asp Ser Ile Glu Phe Ile Glu Thr Gln Glu (2) INFORMATION FOR SEQ ID N0: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1897 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = "synthetic DNA"
(ix) FEATURE:
(A) NAME/KEY: -(B) LOCATION:13..1890 (D) OTHER INFORMATION:/note = ~"coding sequence"
(xi) SEQUENCE DESCRIPTION:
SEQ ID NO: 3:
ATGACCGACG
GCGACGTGCG
CCTGGGCAGC
(2) INFORMATION FOR SEQ ID N0: 4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 625 amino acids (B) TYPE: amino acid (C) STRANDEDNESS:
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID N0: 4:
Met Ala Asp Tyr Leu Gln Met Thr Asp Glu Asp Tyr Thr Asp Ser Tyr Ile Asn Pro Ser Leu Ser Ile Ser Gly Arg Asp Ala Val Gln Thr Ala Leu Thr Val Val Gly Arg Ile Leu Gly Ala Leu Gly Val Pro Phe Ser Gly Gln Ile Val Ser Phe Tyr Gln Phe Leu Leu Asn Thr Leu Trp Pro Val Asn Asp Thr Ala Ile Trp Glu Ala Phe Met Arg Gln Val Glu Glu Leu Val Asn Gln Gln Ile Thr Glu Phe Ala Arg Asn Gln Ala Leu Ala Arg Leu Gln Gly Leu Gly Asp Ser Phe Asn Val Tyr Gln Arg Ser Leu Gln Asn Trp Leu Ala Asp Arg Asn Asp Thr Lys Asn Leu Ser Val Val Arg Ala Gln Phe Ile Ala Leu Asp Leu Asp Phe Val Asn Ala Ile Pro Leu Phe Ala Val Asn Gly Gln Gln Val Pro Leu Leu Ser Val Tyr Ala Gln Ala Val Asn Leu His Leu Leu Leu Leu Lys Asp Ala Ser Leu Phe Gly Glu Gly Trp Gly Phe Thr Gln Gly Glu Ile Ser Thr Tyr Tyr Asp Arg Gln Leu Glu Leu Thr Ala Lys Tyr Thr Asn Tyr Cys Glu Thr Trp W0.99/00407 PCT/EP98/04033 Tyr Asn Thr Gly Leu Asp Arg Leu Arg Gly Thr Asn Thr Glu Ser Trp Leu Arg Tyr His Gln Phe Arg Arg Glu Met Thr Leu Val Val Leu Asp Val Val Ala Leu Phe Pro Tyr Tyr Asp Val Arg Leu Tyr Pro Thr Gly Ser Asn Pro Gln Leu Thr Arg Glu Val Tyr Thr Asp Pro Ile Val Phe Asn Pro Pro Ala Asn Val Gly Leu Cys Arg Arg Trp Gly Thr Asn Pro Tyr Asn Thr Phe Ser Glu Leu Glu Asn Ala Phe Ile Arg Pro Pro His Leu Phe Asp Arg Leu Asn Ser Leu Thr Ile Ser Ser Asn Arg Phe Pro Val Ser Ser Asn Phe Met Asp Tyr Trp Ser Gly His Thr Leu Arg Arg Ser Tyr Leu Asn Asp Ser Ala Val Gln Glu Asp Ser Tyr Gly Leu Ile Thr Thr Thr Arg Ala Thr Ile Asn Pro Gly Val Asp Gly Thr Asn Arg Ile Glu Ser Thr Ala Val Asp Phe Arg Ser Ala Leu Ile Gly Ile Tyr Gly Val Asn Arg Ala Ser Phe Val Pro Gly Gly Leu Phe Asn Gly Thr Thr Ser Pro Ala Asn Gly Gly Cys Arg Asp Leu Tyr Asp Thr Asn Asp Glu Leu Pro Pro Asp Glu Ser Thr Gly Ser Ser Thr His Arg Leu Sex His Val Thr Phe Phe Ser Phe Gln Thr Asn Gln Ala Gly Ser Ile Ala Asn Ala Gly Ser Val Pro Thr Tyr Val Trp Thr Arg Arg Asp Val Asp Leu Asn Asn Thr Ile Thr Pro Asn Arg Ile Thr Gln Leu Pro Leu Val Lys Ala Ser Ala Pro Val Ser Gly Thr Thr Val Leu Lys Gly Pro Gly Phe Thr Gly Gly Gly Ile Leu Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu Arg Val Thr Val Asn Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Val Arg Phe Ala Ser Thr Gly Asn Phe Ser Ile Arg Val Leu Arg Gly Gly Val Ser Ile Gly Asp Val Arg Leu Gly Ser Thr Met Asn Arg Gly Gln Glu Leu Thr Tyr Glu Ser Phe Phe Thr Arg Glu Phe Thr Thr Thr Gly Pro Phe Asn Pro Pro Phe Thr Phe Thr Gln Ala Gln Glu Ile Leu Thr Val Asn Ala Glu Gly Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Arg Ile Glu Ile Val Pro Val Asn Pro Ala Arg Glu Ala Glu Glu Asp
Claims (16)
1. A modified Cry9C protein with an improved toxicity to an insect species, comprising the amino acid sequence of SEQ ID No. 2 or an insecticidally-effective fragment thereof, wherein at least one amino acid in one of the following regions in SEQ ID No. 2 is replaced by another amino acid: 313-334, 358-369, 418-425, 480-492.
2. A modified Cry9C protein with an improved toxicity to an insect species, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid 658, wherein at least one of the amino acids at the following positions in SEQ ID No. 2 have been replaced by another amino acid:
313, 316, 317, 318, 319, 321, 323, 325, 329, 330, 368, 369, 418, 420, 421, 422, 480, 481, 483, 484, 485, 487, 488, 490 and 491.
313, 316, 317, 318, 319, 321, 323, 325, 329, 330, 368, 369, 418, 420, 421, 422, 480, 481, 483, 484, 485, 487, 488, 490 and 491.
3. The modified Cry9C protein of claim 1 wherein said at least one amino acid position is position 316, 317, 319, 321, 329, 330, 369, 422, or 488 in SEQ ID
No. 2.
No. 2.
4. The modified Cry9C protein of claim 1 with improved toxicity to Ostrinia nubilalis, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein at least the amino acids at positions 364 and 488 in SEQ ID No. 2 are replaced by other amino acids.
5. The modified Cry9C protein of claim 1 with improved toxicity to Heliothis virescens, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein the amino acid at position 321 or position 329 in SEQ ID No 2, is replaced by another amino acid.
6. The modified Cry9C protein of claim 1 with improved toxicity to Diatraea grandiosella, comprising the amino acid sequence of SEQ ID No. 2 from amino acid position 1 or 44 to amino acid position 658, wherein the amino acid at any or all of amino acid positions 316, 317, 319, 321, 330, 369, or 422 in SEQ ID No. 2 is replaced by another amino acid.
7. The modified Cry9C protein of any one of claims 1 to 6 wherein the arginine at position 164 in SEQ ID No. 2 is replaced by another amino acid.
8. The modified Cry9C protein of any one of claims 1 to 6 wherein said at least one amino acid position is replaced by alanine.
9. A DNA sequence encoding the protein of any one of claims 1 to 6.
10. A DNA sequence encoding the protein of claim 7 or 8.
11. A plant, comprising the DNA of claim 9 or 10.
12. A seed, comprising the DNA of claim 9 or 10.
13. The plant of claim 11 which is selected from the group consisting of:
corn, cotton, rice, oilseed rape, cauliflower, broccoli, soybean, tomato, tobacco, potato, eggplant, beet, oat, pepper, gladiolus, dahlia, chrysanthemum, sorghum, and garden peas.
corn, cotton, rice, oilseed rape, cauliflower, broccoli, soybean, tomato, tobacco, potato, eggplant, beet, oat, pepper, gladiolus, dahlia, chrysanthemum, sorghum, and garden peas.
14. A method for controlling insects feeding on a plant, comprising expressing the protein of any one of claims 1 to 6 in a plant.
15. A method for controlling insects feeding on a plant, comprising growing the plant of Claim 11.
16. A method of obtaining a seed comprising the DNA of Claim 9 or 10 comprising inserting said DNA into the genome of a plant and harvesting the seed from said plant.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US88438997A | 1997-06-27 | 1997-06-27 | |
US08/884,389 | 1997-06-27 | ||
PCT/EP1998/004033 WO1999000407A2 (en) | 1997-06-27 | 1998-06-25 | Improved bacillus thuringiensis toxin |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2290718A1 true CA2290718A1 (en) | 1999-01-07 |
Family
ID=25384517
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002290718A Abandoned CA2290718A1 (en) | 1997-06-27 | 1998-06-25 | Improved bacillus thuringiensis toxin |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP0989998A1 (en) |
AU (1) | AU741600B2 (en) |
CA (1) | CA2290718A1 (en) |
WO (1) | WO1999000407A2 (en) |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6570005B1 (en) | 1996-07-01 | 2003-05-27 | Mycogen Corporation | Toxins active against pests |
US6369213B1 (en) | 1996-07-01 | 2002-04-09 | Mycogen Corporation | Toxins active against pests |
WO2001021821A2 (en) * | 1999-09-17 | 2001-03-29 | Aventis Cropscience N.V. | Insect-resistant rice plants |
JP2003518930A (en) | 1999-12-28 | 2003-06-17 | バイエル・バイオサイエンス・エヌ・ヴェー | Insecticidal protein from Bacillus thuringiensis |
EP1307098A4 (en) * | 2000-08-11 | 2004-06-16 | Univ California | METHODS FOR BLOCKING RESISTANCE TO Bt TOXINS IN INSECTS AND NEMATODES |
US7230167B2 (en) | 2001-08-31 | 2007-06-12 | Syngenta Participations Ag | Modified Cry3A toxins and nucleic acid sequences coding therefor |
DK1694846T3 (en) | 2003-12-10 | 2014-11-17 | Novozymes As | Cell with enhanced secretion mediated by MrgA protein or a homolog |
WO2005066202A2 (en) | 2003-12-22 | 2005-07-21 | E.I. Du Pont De Nemours And Company | Bacillus cry9 family members |
US7361813B2 (en) | 2004-03-25 | 2008-04-22 | Syngenta Participations Ag | Corn event MIR604 |
JP4899180B2 (en) * | 2004-12-22 | 2012-03-21 | 独立行政法人農業・食品産業技術総合研究機構 | Primer set for nucleic acid test, test kit and test method using the same |
US9522937B2 (en) | 2007-03-28 | 2016-12-20 | Syngenta Participations Ag | Insecticidal proteins |
CN103588865B (en) | 2007-03-28 | 2016-09-07 | 先正达参股股份有限公司 | The protein of desinsection |
BRPI0924153A2 (en) * | 2009-01-23 | 2016-05-24 | Pioneer Hi Bred Int | isolated nucleic acid molecule, DNA construct, host cell, transgenic plant, transformed seed, pesticide-isolated isolated polypeptide, composition and method for controlling an insect pest population, for exterminating an insect pest, for producing an active polypeptide pesticide and to protect a plant from a pest |
DE102015113908B4 (en) | 2015-08-21 | 2023-05-04 | Truma Gerätetechnik GmbH & Co. KG | level gauge |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0317511A3 (en) * | 1987-11-18 | 1991-10-16 | Ciba-Geigy Ag | Insecticidal cotton plant cells |
WO1990006999A1 (en) * | 1988-12-12 | 1990-06-28 | E.I. Du Pont De Nemours And Company | New strain of bacillus thuringiensis |
HU226143B1 (en) * | 1993-04-09 | 2008-05-28 | Plant Genetic Systems Nv | New bacillus thuringiensis strains and their insecticidal proteins |
-
1998
- 1998-06-25 EP EP98939581A patent/EP0989998A1/en not_active Withdrawn
- 1998-06-25 CA CA002290718A patent/CA2290718A1/en not_active Abandoned
- 1998-06-25 AU AU88044/98A patent/AU741600B2/en not_active Ceased
- 1998-06-25 WO PCT/EP1998/004033 patent/WO1999000407A2/en not_active Application Discontinuation
Also Published As
Publication number | Publication date |
---|---|
EP0989998A1 (en) | 2000-04-05 |
WO1999000407A2 (en) | 1999-01-07 |
AU8804498A (en) | 1999-01-19 |
WO1999000407A3 (en) | 1999-05-14 |
AU741600B2 (en) | 2001-12-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1594966B1 (en) | Delta-endotoxin genes and methods for their use | |
AU2008312468B2 (en) | AXMI-066 and AXMI-076: delta-endotoxin proteins and methods for their use | |
US8829279B2 (en) | Family of pesticidal proteins and methods for their use | |
US5659123A (en) | Diabrotica toxins | |
AU2002252974A1 (en) | Bacillus thuringiensis insecticidal proteins | |
CA2646471A1 (en) | Novel genes encoding insecticidal proteins | |
AU741600B2 (en) | Improved bacillus thuringiensis toxin | |
CA2480319C (en) | Chimeric delta endotoxin protein of cry1ea and cry1ca | |
US20100168387A1 (en) | Chimeric CrylE Delta Endotoxin and Methods of Controlling Insects | |
AU2012258422B2 (en) | Novel genes encoding insecticidal proteins | |
AU2013203135B2 (en) | Delta-endotoxin genes and methods for their use | |
NZ561959A (en) | Delta-endotoxin genes of bacillus thuringiensis and methods for their use as a pesticide | |
CA2924415A1 (en) | Novel genes encoding insecticidal proteins | |
AU2013200703A1 (en) | Delta-endotoxin genes and methods for their use |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |