CN116410277A - Novel insect-resistant gene and application thereof - Google Patents
Novel insect-resistant gene and application thereof Download PDFInfo
- Publication number
- CN116410277A CN116410277A CN202111663267.5A CN202111663267A CN116410277A CN 116410277 A CN116410277 A CN 116410277A CN 202111663267 A CN202111663267 A CN 202111663267A CN 116410277 A CN116410277 A CN 116410277A
- Authority
- CN
- China
- Prior art keywords
- leu
- protein
- ser
- thr
- asn
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 71
- 241000238631 Hexapoda Species 0.000 title claims abstract description 19
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 58
- 150000007523 nucleic acids Chemical class 0.000 claims description 22
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 20
- 108020004707 nucleic acids Proteins 0.000 claims description 20
- 102000039446 nucleic acids Human genes 0.000 claims description 20
- 239000002773 nucleotide Substances 0.000 claims description 19
- 125000003729 nucleotide group Chemical group 0.000 claims description 19
- 150000001413 amino acids Chemical class 0.000 claims description 16
- 230000035772 mutation Effects 0.000 claims description 12
- 240000008042 Zea mays Species 0.000 claims description 8
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 8
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 8
- 235000005822 corn Nutrition 0.000 claims description 8
- 239000004218 Orcein Substances 0.000 claims description 2
- 102220528606 Ribonuclease P/MRP protein subunit POP5_S99D_mutation Human genes 0.000 claims 1
- 102200115358 rs121918322 Human genes 0.000 claims 1
- 102220014523 rs397517157 Human genes 0.000 claims 1
- 230000000749 insecticidal effect Effects 0.000 abstract description 25
- 238000010353 genetic engineering Methods 0.000 abstract description 3
- 241000196324 Embryophyta Species 0.000 description 62
- 235000018102 proteins Nutrition 0.000 description 49
- 210000004027 cell Anatomy 0.000 description 31
- 238000009396 hybridization Methods 0.000 description 20
- 235000001014 amino acid Nutrition 0.000 description 18
- 229940024606 amino acid Drugs 0.000 description 15
- 238000000034 method Methods 0.000 description 15
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 14
- 239000000523 sample Substances 0.000 description 13
- 241000346285 Ostrinia furnacalis Species 0.000 description 11
- 230000009261 transgenic effect Effects 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 9
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 8
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 8
- 239000007788 liquid Substances 0.000 description 8
- 238000005406 washing Methods 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 108091033319 polynucleotide Proteins 0.000 description 7
- 102000040430 polynucleotide Human genes 0.000 description 7
- 239000002157 polynucleotide Substances 0.000 description 7
- 238000006467 substitution reaction Methods 0.000 description 7
- 239000013604 expression vector Substances 0.000 description 6
- 238000003786 synthesis reaction Methods 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 108090000765 processed proteins & peptides Proteins 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 239000000872 buffer Substances 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 241000193388 Bacillus thuringiensis Species 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 3
- 206010034133 Pathogen resistance Diseases 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 229940097012 bacillus thuringiensis Drugs 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 235000013399 edible fruits Nutrition 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 210000002257 embryonic structure Anatomy 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 210000001938 protoplast Anatomy 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 101150102464 Cry1 gene Proteins 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 2
- 206010020649 Hyperkeratosis Diseases 0.000 description 2
- 241000209510 Liliopsida Species 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- 108010021466 Mutant Proteins Proteins 0.000 description 2
- 102000008300 Mutant Proteins Human genes 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 2
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 2
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 2
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 229910001415 sodium ion Inorganic materials 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 1
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 108700003918 Bacillus Thuringiensis insecticidal crystal Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- JPIJQSOTBSSVTP-GBXIJSLDSA-N D-threonic acid Chemical compound OC[C@@H](O)[C@H](O)C(O)=O JPIJQSOTBSSVTP-GBXIJSLDSA-N 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 241000672609 Escherichia coli BL21 Species 0.000 description 1
- MQANCSUBSBJNLU-KKUMJFAQSA-N Gln-Arg-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQANCSUBSBJNLU-KKUMJFAQSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- DITJVHONFRJKJW-BPUTZDHNSA-N Gln-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DITJVHONFRJKJW-BPUTZDHNSA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- CBOVGULVQSVMPT-CIUDSAMLSA-N Glu-Pro-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CBOVGULVQSVMPT-CIUDSAMLSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- MWXBCJKQRQFVOO-DCAQKATOSA-N His-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N MWXBCJKQRQFVOO-DCAQKATOSA-N 0.000 description 1
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- RENBRDSDKPSRIH-HJWJTTGWSA-N Ile-Phe-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O RENBRDSDKPSRIH-HJWJTTGWSA-N 0.000 description 1
- 240000001549 Ipomoea eriocarpa Species 0.000 description 1
- 235000005146 Ipomoea eriocarpa Nutrition 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 239000012880 LB liquid culture medium Substances 0.000 description 1
- 231100000111 LD50 Toxicity 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- NROQVSYLPRLJIP-PMVMPFDFSA-N Lys-Trp-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NROQVSYLPRLJIP-PMVMPFDFSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- DBOMZJOESVYERT-GUBZILKMSA-N Met-Asn-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N DBOMZJOESVYERT-GUBZILKMSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 241001147397 Ostrinia Species 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- QEFHBVDWKFFKQI-PMVMPFDFSA-N Phe-His-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QEFHBVDWKFFKQI-PMVMPFDFSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 1
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 1
- SGQSAIFDESQBRA-IHPCNDPISA-N Trp-Tyr-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SGQSAIFDESQBRA-IHPCNDPISA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 230000009418 agronomic effect Effects 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 230000006229 amino acid addition Effects 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 230000011681 asexual reproduction Effects 0.000 description 1
- 238000013465 asexual reproduction Methods 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 230000010310 bacterial transformation Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 101150065438 cry1Ab gene Proteins 0.000 description 1
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical class NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 239000003517 fume Substances 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 230000004077 genetic alteration Effects 0.000 description 1
- 231100000118 genetic alteration Toxicity 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000008723 osmotic stress Effects 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 239000012460 protein solution Substances 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 230000001568 sexual effect Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H1/00—Processes for modifying genotypes ; Plants characterised by associated natural traits
- A01H1/12—Processes for modifying agronomic input traits, e.g. crop yield
- A01H1/122—Processes for modifying agronomic input traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- A01H1/1245—Processes for modifying agronomic input traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, e.g. pathogen, pest or disease resistance
- A01H1/127—Processes for modifying agronomic input traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, e.g. pathogen, pest or disease resistance for insect resistance
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H6/00—Angiosperms, i.e. flowering plants, characterised by their botanic taxonomy
- A01H6/46—Gramineae or Poaceae, e.g. ryegrass, rice, wheat or maize
- A01H6/4684—Zea mays [maize]
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New or modified breeds of animals
- A01K67/033—Rearing or breeding invertebrates; New breeds of invertebrates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8286—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for insect resistance
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Environmental Sciences (AREA)
- Botany (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Developmental Biology & Embryology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Animal Behavior & Ethology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Cell Biology (AREA)
- Pest Control & Pesticides (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Animal Husbandry (AREA)
- Biodiversity & Conservation Biology (AREA)
- Gastroenterology & Hepatology (AREA)
- Insects & Arthropods (AREA)
- Natural Medicines & Medicinal Plants (AREA)
- Physiology (AREA)
- Peptides Or Proteins (AREA)
Abstract
The invention discloses a novel insect-resistant protein and application thereof, and belongs to the field of genetic engineering. The invention provides a novel insecticidal protein and a coding gene thereof, and application thereof in cultivation of insect-resistant crops.
Description
Technical Field
The invention relates to a novel insect-resistant gene and application thereof, belonging to the field of genetic engineering.
Background
Among the Cry Bt insect-resistant genes that have been identified, cry1I proteins are unique, e.g., they are typically silent genes in Bacillus thuringiensis strains, but can be expressed in E.coli cultures as a protoxin of about 81 kDa, a unique molecular mass in Cry1 proteins. In addition, different Cry1 type I proteins have been identified as active against both lepidopteran and coleopteran pests. More importantly, insect resistance to Cry1A proteins that develop resistance does not exhibit cross resistance to Cry1I proteins (Zhao C, jurat-Funtes J L, abdelgaffar H M, et al identification of a New Cry1I-Type Gene as a Candidate for Gene Pyramiding in Corn To Control Ostrinia Species Larvae [ J ]. Appl Environ Microbiol,2015,81 (11): 3699-705.).
The Cry1Ab has good corn borer insecticidal activity, and a plurality of transgenic insect-resistant corn transformants are also developed by using the Cry1Ab (such as MON810 and DBN 9936). However, with the gradual expansion of the industrialized planting of transgenic insect-resistant corn in China, the drug resistance of corn borers to Cry1Ab proteins appears sooner or later. Superimposing new insect-resistant proteins with different insecticidal mechanisms is a better solution in a comprehensive sense. Studies have shown that Cry1Ie has no cross resistance to Cry1Ab, can be superimposed on this protein, and delays the development of corn borer resistance (Xu L, wang Z, zhang J, et al Cross-resistance of Cry1Ab-selected Asian corn borer to other Cry toxins [ J ]. Journal of Applied Entomology,2010,134 (5): 429-438)
However, the insecticidal activity of Cry1Ie on corn borers is still poorer than that of Cry1Ab, so that transgenic corn developed by utilizing Cry1Ie has a certain insecticidal activity, but the resistance effect is not ideal, and the requirement of commercial planting is difficult to be met. The transformation of the toxic region of the insecticidal protein can enhance the insecticidal activity of the protein, so that the insecticidal activity of the corn borer is better as long as the Cry1Ie protein is transformed, and the novel protein which has no interaction resistance with Cry1Ab is applied to the development of insect-resistant corn, and has good commercial application prospect.
In order to solve the problems, the invention carries out mutation on a nucleic acid molecule encoding Cry1Ie1 protein by a continuous error-prone PCR method, expresses the mutant into protein, then screens the mutant protein with higher insecticidal activity by a corn borer biological test, analyzes the nucleotide sequence of the encoded protein, and finds out the mutation site.
Disclosure of Invention
The invention provides an amino acid variation site of a protein, which is characterized in that the site is positioned at any one or more of the following sites of a sequence shown in SEQ ID NO. 1: 1) I82; 2) S99; 3) D113; 4) K147; 5) E182; 6) N214; 7) And D233.
The invention also provides a protein which is characterized by comprising an amino acid sequence shown in SEQ ID NO.1 and having the following mutation: 1) I82v+s99n+l111i+k147g+n214S; 2) D233N; 3) E182K; 4) D113s+e182V; 5) I680s+s99d; 6) D233Y.
The invention also provides a nucleic acid, which is characterized in that the nucleic acid codes for the protein.
In some embodiments, the nucleotide sequence of the nucleic acid is set forth in any one of SEQ ID NO.3-SEQ ID NO. 8.
The invention also provides application of the protein and the nucleic acid in cultivation of insect-resistant crops.
In some embodiments, the crop described in the above application is corn.
The invention has the beneficial effects that: through random mutation and insecticidal activity screening, the invention obtains 6 novel insecticidal proteins with high insecticidal activity, and can be used for cultivating insect-resistant crops.
Detailed Description
The following definitions and methods are provided to better define the present application and to guide those of ordinary skill in the art in the practice of the present application. Unless otherwise indicated, terms are to be construed according to conventional usage by those of ordinary skill in the relevant art. All patent documents, academic papers, industry standards, and other publications cited herein are incorporated by reference in their entirety.
As used herein, a "plant" is any plant, including whole plants, plant cells, plant organelles, plant protoplasts, plant cell tissue cultures from which plants can regenerate, plant callus tissue, whole plant cells in plants or plant parts, such as embryos, pollen, ovules, seeds, leaves, flowers, branches, fruits, stems, roots, root tips, anthers, and the like. Unless otherwise indicated, nucleic acids are written in the 5 'to 3' direction from left to right; the amino acid sequence is written in the amino to carboxyl direction from left to right. Amino acids may be represented herein by their commonly known three-letter symbols or by the single-letter symbols recommended by the IUPAC-IUB biochemical nomenclature committee. Likewise, nucleotides may be referred to by commonly accepted single letter codes. The numerical range includes the numbers defining the range. As used herein, "nucleic acid" includes reference to deoxyribonucleotide or ribonucleotide polymers in either single-or double-stranded form, and unless otherwise limited, includes known analogs (e.g., peptide nucleic acids) having the basic properties of natural nucleotides that hybridize to single-stranded nucleic acids in a manner similar to naturally occurring nucleotides. As used herein, the term "encode" or "encoded" when used in the context of a particular nucleic acid, means that the nucleic acid contains the necessary information to direct translation of the nucleotide sequence into the particular protein. The information encoding the protein is represented using codons. As used herein, reference to a "full-length sequence" of a particular polynucleotide or protein encoded thereby refers to an entire nucleic acid sequence or an entire amino acid sequence having a natural (non-synthetic) endogenous sequence. The full length polynucleotide encodes the full length, catalytically active form of the particular protein. The terms "polypeptide", "polypeptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The term is used for amino acid polymers in which one or more amino acid residues are artificial chemical analogs of the corresponding naturally occurring amino acid. The term is also used for naturally occurring amino acid polymers. The terms "residue" or "amino acid" are used interchangeably herein to refer to an amino acid that is incorporated into a protein, polypeptide, or peptide (collectively, "protein"). Amino acids may be naturally occurring amino acids, and unless otherwise limited, may include known analogs of natural amino acids, which analogs may function in a similar manner to naturally occurring amino acids.
The term "trait" refers to a physiological, morphological, biochemical or physical characteristic of a plant or a particular plant material or cell. In some cases, this property is visible to the human eye, such as seed or plant size, or can be measured by biochemical techniques, such as detecting the protein, starch or oil content of the seed or leaf, or by observing metabolic or physiological processes, for example by measuring tolerance to water deprivation or specific salts or sugar or nitrogen concentrations, or by observing the expression level of one or more genes, or by agronomic observations such as osmotic stress tolerance or yield.
"transgenic" refers to any cell, cell line, callus, tissue, plant part or plant whose genome has been altered by the presence of a heterologous nucleic acid, such as a recombinant DNA construct. The term "transgene" as used herein includes those initial transgenic events as well as those produced from the initial transgenic events by sexual hybridization or asexual reproduction, and does not encompass genomic (chromosomal or extrachromosomal) changes by conventional plant breeding methods or by naturally occurring events such as random fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non-recombinant transposition, or spontaneous mutation.
In this application, the terms "comprises," "comprising," or variations thereof are to be understood to encompass other elements, numbers, or steps in addition to those described. "subject plant" or "subject plant cell" refers to a plant or plant cell in which genetic engineering has been effected, or a progeny cell of a plant or cell so engineered, which progeny cell comprises the engineering. "control" or "control plants" provide a reference point for measuring phenotypic changes in a subject plant.
Negative or control plants can include, for example: (a) Wild-type plants or cells, i.e., plants or cells having the same genotype as the genetically engineered starting material, which genetic alteration produces the subject plant or cell; (b) A plant or plant cell having the same genotype as the starting material but which has been transformed with an empty construct (i.e., with a construct that has no known effect on the trait of interest, such as a construct comprising a marker gene); (c) A plant or plant cell that is a non-transformed isolate of the subject plant or plant cell; (d) A plant or plant cell genetically identical to the test plant or plant cell but not exposed to conditions or stimuli that induce expression of the gene of interest; or (e) the subject plant or plant cell itself, under conditions in which the gene of interest is not expressed.
Those skilled in the art will readily recognize that advances in molecular biology, such as site-specific and random mutagenesis, polymerase chain reaction methods, and protein engineering techniques, provide a wide range of suitable tools and procedures for engineering or engineering amino acid sequences and potentially genetic sequences of proteins of agricultural interest.
In some embodiments, the nucleotide sequences of the present application may be altered to make conservative amino acid substitutions. The principles and examples of conservative amino acid substitutions are described further below. In certain embodiments, the nucleotide sequences of the present application may be substituted according to the disclosed monocot codon bias without altering the amino acid sequence, e.g., codons encoding the same amino acid sequence may be substituted with monocot-preferred codons without altering the amino acid sequence encoded by the nucleotide sequence. In some embodiments, a portion of the nucleotide sequence herein is replaced with a different codon encoding the same amino acid sequence, such that the amino acid sequence encoded thereby is not changed while the nucleotide sequence is changed. Conservative variants include those sequences that encode the amino acid sequence of one of the proteins of an embodiment due to the degeneracy of the genetic code. In some embodiments, a portion of the nucleotide sequences herein are replaced according to monocot preference codons. Those skilled in the art will recognize that amino acid additions and/or substitutions are generally based on the relative similarity of amino acid side chain substituents, e.g., hydrophobicity, charge, size, etc., of the substituents. Exemplary amino acid substituents having various of the aforementioned contemplated properties are well known to those skilled in the art and include arginine and lysine; glutamic acid and aspartic acid; serine and threonic acid; glutamine and asparagine; and valine, leucine and isoleucine. Guidelines for suitable amino acid substitutions that do not affect the biological activity of the protein of interest can be found in the model of Dayhoff et al (1978) Atlas of Protein Sequence and Structure (protein sequence and structure atlas) (Natl. Biomed. Res. Foundation, washington, D.C.), incorporated herein by reference. Conservative substitutions such as substitution of one amino acid for another with similar properties may be made. Identification of sequence identity includes hybridization techniques. For example, all or part of a known nucleotide sequence is used as a probe for selective hybridization with other corresponding nucleotide sequences present in a cloned genomic DNA fragment or population of cDNA fragments (i.e., a genomic library or cDNA library) from a selected organism. The hybridization probes may be genomic DNA fragments, cDNA fragments, RNA fragments, or other oligonucleotides, and may be labeled with a detectable group such as 32P or other detectable marker. Thus, for example, hybridization probes can be prepared by labeling synthetic oligonucleotides based on the sequences of the embodiments. Methods for preparing hybridization probes and constructing cDNA and genomic libraries are generally known in the art. Hybridization of the sequences may be performed under stringent conditions. As used herein, the term "stringent conditions" or "stringent hybridization conditions" refers to conditions under which a probe will hybridize to its target sequence to a detectably greater extent (e.g., at least 2-fold, 5-fold, or 10-fold over background) than to other sequences. Stringent conditions are sequence-dependent and will be different in different circumstances. By controlling the hybridization stringency and/or controlling the washing conditions, target sequences 100% complementary to the probe can be identified (homologous probe method). Alternatively, stringent conditions can be adjusted to allow for some sequence mismatches in order to detect lower similarity (heterologous probe method). Typically, the probe is less than about 1000 or 500 nucleotides in length. Typically, stringent conditions are those in which the salt concentration is less than about 1.5M Na ion, typically about 0.01M to 1.0M Na ion concentration (or other salt) at a pH of 7.0 to 8.3, and the temperature conditions are: when used with short probes (e.g., 10 to 50 nucleotides), at least about 30 ℃; when used with long probes (e.g., greater than 50 nucleotides), at least about 60 ℃. Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization at 37 ℃ with 30% to 35% formamide buffer, 1M NaCl, 1% sds (sodium dodecyl sulfate), washing in 1 x to 2 x SSC (20 x SSC = 3.0M NaCl/0.3M trisodium citrate) at 50 ℃ to 55 ℃. Exemplary moderately stringent conditions include hybridization in 40% to 45% formamide, 1.0M NaCl, 1% SDS at 37℃and washing in 0.5 XSSC to 1 XSSC at 55℃to 60 ℃. Exemplary high stringency conditions include hybridization in 50% formamide, 1M NaCl, 1% sds at 37 ℃ and a final wash in 0.1 x SSC at 60 ℃ to 65 ℃ for at least about 20 minutes. Optionally, the wash buffer may comprise about 0.1% to about 1% sds. The duration of hybridization is typically less than about 24 hours, typically from about 4 hours to about 12 hours. Specificity generally depends on post-hybridization washing, the key factors being the ionic strength and temperature of the final wash solution. The Tm (thermodynamic melting point) of DNA-DNA hybrids can be approximated from the formula Meinkoth and Wahl (1984) Anal. Biochem. 138:267-284: tm=81.5 ℃ +16.6 (log) +0.41 (% GC) -0.61 (% formamide) -500/L; where M is the molar concentration of monovalent cations,% GC is the percentage of guanosine and cytosine nucleotides in the DNA,% formamide is the percentage of formamide in the hybridization solution, and L is the base pair length of the hybrid. Tm is the temperature (at a defined ionic strength and pH) at which 50% of the complementary target sequence hybridizes to a perfectly matched probe. Washing is typically performed at least until equilibrium is reached and a low hybridization background level is reached, such as 2 hours, 1 hour, or 30 minutes. Each 1% mismatch corresponds to a decrease in Tm of about 1 ℃; thus, tm, hybridization, and/or wash conditions can be adjusted to hybridize to sequences of desired identity. For example, if sequences with ≡90% identity are desired, the Tm can be reduced by 10 ℃. Typically, stringent conditions are selected to be about 5 ℃ lower than the Tm for the specific sequence and its complement at a defined ionic strength and pH. However, under very stringent conditions, hybridization and/or washing may be performed at 4℃below the Tm; hybridization and/or washing may be performed at 6 ℃ below the Tm under moderately stringent conditions; hybridization and/or washing can be performed at 11℃below the Tm under low stringency conditions.
In some embodiments, fragments of the nucleotide sequence and the amino acid sequence encoded thereby are also included. As used herein, the term "fragment" refers to a portion of the nucleotide sequence of a polynucleotide of an embodiment or a portion of the amino acid sequence of a polypeptide. Fragments of a nucleotide sequence may encode protein fragments that retain the biological activity of the native or corresponding full-length protein and thus have protein activity. Mutant proteins include biologically active fragments of a native protein that comprise contiguous amino acid residues that retain the biological activity of the native protein. Some embodiments also include a transformed plant cell or transgenic plant comprising the nucleotide sequence of at least one embodiment. In some embodiments, the plant is transformed with an expression vector comprising the nucleotide sequence of at least one embodiment and operably linked thereto a promoter that drives expression in a plant cell. Transformed plant cells and transgenic plants refer to plant cells or plants comprising a heterologous polynucleotide within the genome. In general, the heterologous polynucleotide is stably integrated within the genome of the transformed plant cell or transgenic plant, such that the polynucleotide is delivered to the offspring. The heterologous polynucleotide may be integrated into the genome, either alone or as part of an expression vector. In some embodiments, the plants contemplated herein include plant cells, plant protoplasts, plant cell tissue cultures from which plants can be regenerated, plant calli, plant clumps, and plant cells, which are whole plants or parts of plants, such as embryos, pollen, ovules, seeds, leaves, flowers, branches, fruits, nuts, ears, cobs, hulls, stalks, roots, root tips, anthers, and the like. The present application also includes plant cells, protoplasts, tissues, calli, embryos and flowers, stems, fruits, leaves and roots derived from the transgenic plants of the present application or progeny thereof, and thus at least partially comprising the nucleotide sequences of the present application.
The following examples are illustrative of the invention and are not intended to limit the scope of the invention. Modifications and substitutions to methods, procedures, or conditions of the present invention without departing from the spirit and nature of the invention are intended to be within the scope of the present application. Examples follow conventional experimental conditions, such as the molecular cloning laboratory Manual of Sambrook et al (Sambrook J & Russell DW, molecular cloning: a laboratory manual, 2001), or conditions recommended by the manufacturer's instructions, unless otherwise indicated. Unless otherwise indicated, all chemical reagents used in the examples were conventional commercial reagents, and the technical means used in the examples were conventional means well known to those skilled in the art.
EXAMPLE 1 mutagenesis of target protein
The invention uses Cry1Ie1 protein as a template to carry out amino acid mutation transformation. It is believed that domain I of Bt proteins is involved in the formation of intestinal tracts in insects and determines toxicity, and domains II and III determine specific binding of proteins to receptors. The amino acids 1-648 of Cry1Ie1 protein (the sequence is shown as SEQ ID NO. 1) are the core insecticidal region comprising structural domains I-III, and in order to improve the insecticidal activity of Cry1Ie1 protein, the modified region is mainly concentrated in the structural domain I region, namely the amino acids 58-285.
The sequence shown in SEQ ID No.1 was designed to encode a nucleic acid sequence of this type (http:// www.friendbio.com/codon. Html. In the following in-line tool) in which codons were set to E.coli (K12 strain) preference and XhoI and HindIII cleavage sites were avoided. A nucleic acid molecule encoding the sequence shown as SEQ ID NO.1 (the sequence is shown as SEQ ID NO. 2) is obtained. Further, the sequence shown in SEQ ID NO.2 was mutated using a sequential error-prone PCR (sequential error-prone PCR) strategy to obtain a mutated nucleic acid molecule.
A total of 158 mutant nucleic acid molecules were obtained, which together with the unmutated nucleic acid molecules were cloned between the sites of restriction enzymes XhoI and HindIII in the vector pET28a expression vector, respectively, to obtain a protein expression vector. The vector is transferred into an escherichia coli BL21 cell line and protein expression is carried out. The method comprises the following specific steps:
inoculating a single colony to 0.5mL of LB liquid culture medium, culturing at 37 ℃ for 4 hours until the culture medium is turbid, adding IPTG (Isopropyl-beta-D-thiohinging) into 100uL of bacterial liquid to a final concentration of 0.8mM, simultaneously taking 100uL of bacterial liquid as negative control, continuously culturing for 4 hours, adding 25uL of loading buffer solution into 100uL of bacterial liquid for sample preparation electrophoresis, and comparing according to the negative control and the result induced by adding IPTG, and judging whether the expression exists. The remaining 20uL of the expression was inoculated into 2mL of LB liquid medium and cultured at 37℃for 12 to 16 hours as a seed liquid, the seed liquid was inoculated into 250mL of LB liquid medium again to OD600 = 0.5 to 0.6, then IPTG (isopopyl-beta-D-thiogaside) was added to a concentration of 0.8mM, and the culture was continued under the same conditions for 4 hours. The culture medium was centrifuged at 5000g for 10 minutes to pellet E.coli cells, and the supernatant was discarded to collect the pellet. The precipitate was sonicated with 30mL of 20mM Tris-50mM NaCl buffer. After centrifugation, the supernatant was examined for the presence of recombinant proteins.
A total of 141 recombinant proteins were obtained, and part of the expression vector failed to obtain soluble proteins, possibly due to the effects of normal expression or folding of the proteins after mutation.
EXAMPLE 2 insecticidal Activity test of muteins
141 recombinant proteins obtained in example 1 were tested for insecticidal activity. The method comprises the following steps:
50. Mu.L of each insecticidal protein was applied to the surface of a 24-well plate to which about 1mL of artificial feed had been added, and corn borer (Ostrinia furnacalis) newborn first-instar larvae were raised for insecticidal activity determination. The insecticidal rate was counted after 7 days of feeding. Tris-HCl buffer was used as a blank control, pET28a empty vector product as a negative control, cry1Ab protein as a positive control.
The results show that 132 of the 141 recombinant proteins had insecticidal activity comparable to or weaker than that of unmutated Cry1Ie1, and that 9 recombinant proteins had significantly improved insecticidal activity compared to unmutated Cry1Ie 1.
These 9 recombinant proteins were further subjected to a detailed insecticidal activity test. The testing method comprises the following steps:
the biological assay is carried out by adopting a surface smearing method, firstly, about 1mL of non-solidified artificial feed (about 0.5 g) is added into a 24-hole plate, the feed is paved on the bottom of the hole plate by slight shaking, protein solutions (25 mu L/hole) with different concentrations are added after the feed is solidified, the liquid medicine is evenly paved on the surface of the feed by slight shaking after the addition, and the feed is naturally dried in a fume hood for 1h. The experiment was set up with 5 gradient concentrations (0, 0.5, 10, 25, 50. Mu.g/g) and a blank (buffer), each treatment was inoculated with 24 first larvae (incubation time 2-12 h), 3 replicates were set up, placed at 25.+ -. 2 ℃ and photoperiod 14:10 (L: D) h, incubated in a chamber with 50-70% relative humidity, and mortality was investigated after 7 days. The tail of the larva is touched by a writing brush, the larva is regarded as dead, and the larva which does not develop 2 years is also regarded as dead.
Mortality and corrective mortality were calculated according to the following formulas, and LC50 values were calculated using graphpad.
Mortality (%) = (number of insects-number of surviving insects+number of insects whose age is still below 2 years)/number of insects×100 (formula 1)
The test results show that 6 recombinant proteins have better insecticidal activity. Sequencing the inserted nucleic acid molecular region on the expression vector for expressing the 6 proteins to obtain nucleotide sequences (shown as SEQ ID NO. 3-8), determining the protein amino acid sequence according to the nucleotide sequences, and comparing the protein amino acid sequence with the amino acid sequence before mutation to obtain mutation sites. The mutation sites of the above 6 mutations and the insecticidal activity after the mutation are shown in Table 1.
Table 1 insecticidal Activity of muteins
". Times." indicates a significant difference (α=0.05) relative to the unmutated protein control. 1: in μg/g.
The 6 recombinant proteins can be expressed in crops such as corn, and can be used for cultivating transgenic crops resistant to corn borers, and can also be used for preparing novel pesticides to prevent and treat corn borers.
While the invention has been described in detail in the general context and with reference to specific embodiments thereof, it will be apparent to one skilled in the art that modifications and improvements can be made thereto. Accordingly, such modifications or improvements may be made without departing from the spirit of the invention and are intended to be within the scope of the invention as claimed.
Sequence listing
<110> Wuhan morning glory decoding technology Co.Ltd
<120> novel insect-resistant gene and use thereof
<130> 1
<160> 8
<170> SIPOSequenceListing 1.0
<210> 1
<211> 648
<212> PRT
<213> Bacillus thuringiensis
<400> 1
Met Lys Leu Lys Asn Pro Asp Lys His Gln Ser Leu Ser Ser Asn Ala
1 5 10 15
Lys Val Asp Lys Ile Ala Thr Asp Ser Leu Lys Asn Glu Thr Asp Ile
20 25 30
Glu Leu Lys Asn Ile Asn His Glu Asp Phe Leu Arg Met Ser Glu His
35 40 45
Glu Ser Ile Asp Pro Phe Val Ser Ala Ser Thr Ile Gln Thr Gly Ile
50 55 60
Gly Ile Ala Gly Lys Ile Leu Gly Thr Leu Gly Val Pro Phe Ala Gly
65 70 75 80
Gln Ile Ala Ser Leu Tyr Ser Phe Ile Leu Gly Glu Leu Trp Pro Lys
85 90 95
Gly Lys Ser Gln Trp Glu Ile Phe Met Glu His Val Glu Glu Leu Ile
100 105 110
Asp Gln Lys Ile Ser Thr Tyr Ala Arg Asn Ile Ala Leu Ala Asp Leu
115 120 125
Lys Gly Leu Gly Asp Ala Leu Ala Val Tyr His Glu Ser Leu Glu Ser
130 135 140
Trp Ile Lys Asn Arg Asn Asn Ala Arg Ala Thr Ser Val Val Lys Ser
145 150 155 160
Gln Tyr Ile Ala Leu Glu Leu Leu Phe Val Gln Lys Leu Pro Ser Phe
165 170 175
Ala Val Ser Gly Glu Glu Val Pro Leu Leu Pro Ile Tyr Ala Gln Ala
180 185 190
Ala Asn Leu His Leu Leu Leu Leu Arg Asp Ala Ser Val Phe Gly Lys
195 200 205
Glu Trp Gly Leu Ser Asn Ser Gln Ile Ser Thr Phe Tyr Asn Arg Gln
210 215 220
Val Glu Arg Thr Ser Asp Tyr Ser Asp His Cys Val Lys Trp Tyr Ser
225 230 235 240
Thr Gly Leu Asn Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Val Arg
245 250 255
Tyr Asn Gln Phe Arg Lys Asp Met Thr Leu Met Val Leu Asp Leu Ile
260 265 270
Ala Leu Phe Pro Ser Tyr Asp Thr Leu Val Tyr Pro Ile Lys Thr Thr
275 280 285
Ser Gln Leu Thr Arg Glu Val Tyr Thr Asp Ala Ile Gly Thr Val His
290 295 300
Pro Asn Ala Ser Phe Ala Ser Thr Thr Trp Tyr Asn Asn Asn Ala Pro
305 310 315 320
Ser Phe Ser Ala Ile Glu Ser Ala Val Val Arg Asn Pro His Leu Leu
325 330 335
Asp Phe Leu Glu Gln Val Thr Ile Tyr Ser Leu Leu Ser Arg Trp Ser
340 345 350
Asn Thr Gln Tyr Met Asn Met Trp Gly Gly His Arg Leu Glu Phe Arg
355 360 365
Thr Ile Gly Gly Val Leu Asn Thr Ser Thr Gln Gly Ser Thr Asn Thr
370 375 380
Ser Ile Asn Pro Val Thr Leu Pro Phe Thr Ser Arg Asp Val Tyr Arg
385 390 395 400
Thr Glu Ser Leu Ala Gly Leu Asn Leu Phe Leu Thr Gln Pro Val Asn
405 410 415
Gly Val Pro Arg Val Asp Phe His Trp Lys Phe Ala Thr Leu Pro Ile
420 425 430
Ala Ser Asp Asn Phe Tyr Tyr Leu Gly Tyr Ala Gly Val Gly Thr Gln
435 440 445
Leu Gln Asp Ser Glu Asn Glu Leu Pro Pro Glu Thr Thr Gly Gln Pro
450 455 460
Asn Tyr Glu Ser Tyr Ser His Arg Leu Ser His Ile Gly Leu Ile Ser
465 470 475 480
Ala Ser His Val Lys Ala Leu Val Tyr Ser Trp Thr His Arg Ser Ala
485 490 495
Asp Arg Thr Asn Thr Ile Glu Pro Asn Ser Ile Thr Gln Ile Pro Leu
500 505 510
Val Lys Ala Phe Asn Leu Ser Ser Gly Ala Ala Val Val Arg Gly Pro
515 520 525
Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr Asn Thr Gly Thr Phe
530 535 540
Gly Asp Ile Arg Val Asn Ile Asn Pro Pro Phe Ala Gln Arg Tyr Arg
545 550 555 560
Val Arg Ile Arg Tyr Ala Ser Thr Thr Asp Leu Gln Phe His Thr Ser
565 570 575
Ile Asn Gly Lys Ala Ile Asn Gln Gly Asn Phe Ser Ala Thr Met Asn
580 585 590
Arg Gly Glu Asp Leu Asp Tyr Lys Thr Phe Arg Thr Val Gly Phe Thr
595 600 605
Thr Pro Phe Ser Phe Ser Asp Val Gln Ser Thr Phe Thr Ile Gly Ala
610 615 620
Trp Asn Phe Ser Ser Gly Asn Glu Val Tyr Ile Asp Arg Ile Glu Phe
625 630 635 640
Val Pro Val Glu Val Thr Tyr Glu
645
<210> 2
<211> 1944
<212> DNA
<213> Bacillus thuringiensis
<400> 2
atgaaactga aaaacccgga caaacaccag tctctgtctt ctaacgctaa agttgacaaa 60
atcgctaccg actctctgaa aaacgaaacc gacatcgaac tgaaaaacat caaccacgaa 120
gacttcctgc gtatgtctga acacgaatct atcgacccgt tcgtttctgc ttctaccatc 180
cagaccggta tcggtatcgc tggtaaaatc ctgggtaccc tgggtgttcc gttcgctggt 240
cagatcgctt ctctgtactc tttcatcctg ggtgaactgt ggccgaaagg taaatctcag 300
tgggaaatct tcatggaaca cgttgaagaa ctcatcgacc agaaaatctc tacctacgct 360
cgtaacatcg ctctggctga cctgaaaggt ctgggtgacg ctctggctgt ttaccacgaa 420
tctctggaat cttggatcaa aaaccgtaac aacgctcgtg ctacctctgt tgttaaatct 480
cagtacatcg ctctggaact gctgttcgtt cagaaactgc cgtctttcgc tgtttctggt 540
gaagaagttc cgctgctgcc gatctacgct caggctgcta acctgcacct gctgctgctg 600
cgtgacgctt ctgttttcgg taaagaatgg ggtctgtcta actctcagat ctctaccttc 660
tacaaccgtc aggttgaacg tacctctgac tactctgacc actgcgttaa atggtactct 720
accggtctga acaacctgcg tggtaccaac gctgaatctt gggttcgtta caaccagttc 780
cgtaaagaca tgaccctgat ggttctggac ctgatcgctc tgttcccgtc ttacgacacc 840
ctggtttacc cgatcaaaac cacctctcag ctgacccgtg aagtttacac cgacgctatc 900
ggtaccgttc acccaaatgc tagcttcgct tctaccactt ggtacaacaa caacgctccg 960
tctttctctg ctatcgaatc tgctgttgtt cgtaacccgc acctgctgga cttcctggaa 1020
caggttacca tctactctct gctgtctcgt tggtctaaca cccagtacat gaacatgtgg 1080
ggtggtcacc gtctggaatt ccgtaccatc ggtggtgttc tgaacacctc tacccagggt 1140
tctaccaaca cctctatcaa cccggttacc ctgccgttca cctctcgtga cgtttaccgt 1200
accgaatctc tggctggtct gaacctgttc ctgacccagc cggttaacgg tgttccgcgt 1260
gttgacttcc actggaaatt cgctaccctg ccgatcgctt ctgacaactt ctactacctg 1320
ggttacgctg gtgttggtac ccagctgcag gactctgaaa acgaactgcc gccggaaacc 1380
accggtcagc cgaactacga atcttactct caccgtctgt ctcacatcgg tctgatctct 1440
gcttctcacg ttaaagctct ggtttactct tggacccacc gttctgctga ccgtaccaac 1500
accatcgaac cgaactctat cacccagatc ccgctggtta aagcgttcaa cctgtcttct 1560
ggtgctgctg ttgttcgtgg tccgggtttc accggtggtg acatcctgcg tcgtaccaac 1620
accggtacct tcggtgacat ccgtgttaac atcaacccgc cgttcgctca gcgttaccgt 1680
gttcgtatcc gttacgcttc taccaccgac ctgcagttcc acacctctat caacggtaaa 1740
gctatcaacc agggtaactt ctctgctacc atgaaccgtg gtgaagacct ggactacaaa 1800
accttccgta ccgttggttt caccaccccg ttctctttct ctgacgttca gtctaccttc 1860
accatcggtg cttggaactt ctcttctggt aacgaagttt acatcgaccg tatcgaattc 1920
gttccggttg aagttaccta cgaa 1944
<210> 3
<211> 1944
<212> DNA
<213> Synthesis (unown)
<400> 3
atgaaactga aaaacccgga caaacaccag tctctgtctt ctaacgctaa agttgacaaa 60
atcgctaccg actctctgaa aaacgaaacc gacatcgaac tgaaaaacat caaccacgaa 120
gacttcctgc gtatgtctga acacgaatct atcgacccgt tcgtttctgc ttctaccatc 180
cagaccggta tcggtatcgc tggtaaaatc ctgggtaccc tgggtgttcc gttcgctggt 240
caggtcgctt ctctgtactc tttcatcctg ggtgaactgt ggccgaaagg taaaaatcag 300
tgggaaatct tcatggaaca cgttgaagaa atcatcgacc agaaaatctc tacctacgct 360
cgtaacatcg ctctggctga cctgaaaggt ctgggtgacg ctctggctgt ttaccacgaa 420
tctctggaat cttggatcgg aaaccgtaac aacgctcgtg ctacctctgt tgttaaatct 480
cagtacatcg ctctggaact gctgttcgtt cagaaactgc cgtctttcgc tgtttctggt 540
gaagaagttc cgctgctgcc gatctacgct caggctgcta acctgcacct gctgctgctg 600
cgtgacgctt ctgttttcgg taaagaatgg ggtctgtcta gctctcagat ctctaccttc 660
tacaaccgtc aggttgaacg tacctctgac tactctgacc actgcgttaa atggtactct 720
accggtctga acaacctgcg tggtaccaac gctgaatctt gggttcgtta caaccagttc 780
cgtaaagaca tgaccctgat ggttctggac ctgatcgctc tgttcccgtc ttacgacacc 840
ctggtttacc cgatcaaaac cacctctcag ctgacccgtg aagtttacac cgacgctatc 900
ggtaccgttc acccaaatgc tagcttcgct tctaccactt ggtacaacaa caacgctccg 960
tctttctctg ctatcgaatc tgctgttgtt cgtaacccgc acctgctgga cttcctggaa 1020
caggttacca tctactctct gctgtctcgt tggtctaaca cccagtacat gaacatgtgg 1080
ggtggtcacc gtctggaatt ccgtaccatc ggtggtgttc tgaacacctc tacccagggt 1140
tctaccaaca cctctatcaa cccggttacc ctgccgttca cctctcgtga cgtttaccgt 1200
accgaatctc tggctggtct gaacctgttc ctgacccagc cggttaacgg tgttccgcgt 1260
gttgacttcc actggaaatt cgctaccctg ccgatcgctt ctgacaactt ctactacctg 1320
ggttacgctg gtgttggtac ccagctgcag gactctgaaa acgaactgcc gccggaaacc 1380
accggtcagc cgaactacga atcttactct caccgtctgt ctcacatcgg tctgatctct 1440
gcttctcacg ttaaagctct ggtttactct tggacccacc gttctgctga ccgtaccaac 1500
accatcgaac cgaactctat cacccagatc ccgctggtta aagcgttcaa cctgtcttct 1560
ggtgctgctg ttgttcgtgg tccgggtttc accggtggtg acatcctgcg tcgtaccaac 1620
accggtacct tcggtgacat ccgtgttaac atcaacccgc cgttcgctca gcgttaccgt 1680
gttcgtatcc gttacgcttc taccaccgac ctgcagttcc acacctctat caacggtaaa 1740
gctatcaacc agggtaactt ctctgctacc atgaaccgtg gtgaagacct ggactacaaa 1800
accttccgta ccgttggttt caccaccccg ttctctttct ctgacgttca gtctaccttc 1860
accatcggtg cttggaactt ctcttctggt aacgaagttt acatcgaccg tatcgaattc 1920
gttccggttg aagttaccta cgaa 1944
<210> 4
<211> 1944
<212> DNA
<213> Synthesis (unown)
<400> 4
atgaaactga aaaacccgga caaacaccag tctctgtctt ctaacgctaa agttgacaaa 60
atcgctaccg actctctgaa aaacgaaacc gacatcgaac tgaaaaacat caaccacgaa 120
gacttcctgc gtatgtctga acacgaatct atcgacccgt tcgtttctgc ttctaccatc 180
cagaccggta tcggtatcgc tggtaaaatc ctgggtaccc tgggtgttcc gttcgctggt 240
cagatcgctt ctctgtactc tttcatcctg ggtgaactgt ggccgaaagg taaatctcag 300
tgggaaatct tcatggaaca cgttgaagaa ctcatcgacc agaaaatctc tacctacgct 360
cgtaacatcg ctctggctga cctgaaaggt ctgggtgacg ctctggctgt ttaccacgaa 420
tctctggaat cttggatcaa aaaccgtaac aacgctcgtg ctacctctgt tgttaaatct 480
cagtacatcg ctctggaact gctgttcgtt cagaaactgc cgtctttcgc tgtttctggt 540
gaagaagttc cgctgctgcc gatctacgct caggctgcta acctgcacct gctgctgctg 600
cgtgacgctt ctgttttcgg taaagaatgg ggtctgtcta actctcagat ctctaccttc 660
tacaaccgtc aggttgaacg tacctctgac tactctaacc actgcgttaa atggtactct 720
accggtctga acaacctgcg tggtaccaac gctgaatctt gggttcgtta caaccagttc 780
cgtaaagaca tgaccctgat ggttctggac ctgatcgctc tgttcccgtc ttacgacacc 840
ctggtttacc cgatcaaaac cacctctcag ctgacccgtg aagtttacac cgacgctatc 900
ggtaccgttc acccaaatgc tagcttcgct tctaccactt ggtacaacaa caacgctccg 960
tctttctctg ctatcgaatc tgctgttgtt cgtaacccgc acctgctgga cttcctggaa 1020
caggttacca tctactctct gctgtctcgt tggtctaaca cccagtacat gaacatgtgg 1080
ggtggtcacc gtctggaatt ccgtaccatc ggtggtgttc tgaacacctc tacccagggt 1140
tctaccaaca cctctatcaa cccggttacc ctgccgttca cctctcgtga cgtttaccgt 1200
accgaatctc tggctggtct gaacctgttc ctgacccagc cggttaacgg tgttccgcgt 1260
gttgacttcc actggaaatt cgctaccctg ccgatcgctt ctgacaactt ctactacctg 1320
ggttacgctg gtgttggtac ccagctgcag gactctgaaa acgaactgcc gccggaaacc 1380
accggtcagc cgaactacga atcttactct caccgtctgt ctcacatcgg tctgatctct 1440
gcttctcacg ttaaagctct ggtttactct tggacccacc gttctgctga ccgtaccaac 1500
accatcgaac cgaactctat cacccagatc ccgctggtta aagcgttcaa cctgtcttct 1560
ggtgctgctg ttgttcgtgg tccgggtttc accggtggtg acatcctgcg tcgtaccaac 1620
accggtacct tcggtgacat ccgtgttaac atcaacccgc cgttcgctca gcgttaccgt 1680
gttcgtatcc gttacgcttc taccaccgac ctgcagttcc acacctctat caacggtaaa 1740
gctatcaacc agggtaactt ctctgctacc atgaaccgtg gtgaagacct ggactacaaa 1800
accttccgta ccgttggttt caccaccccg ttctctttct ctgacgttca gtctaccttc 1860
accatcggtg cttggaactt ctcttctggt aacgaagttt acatcgaccg tatcgaattc 1920
gttccggttg aagttaccta cgaa 1944
<210> 5
<211> 1944
<212> DNA
<213> Synthesis (unown)
<400> 5
atgaaactga aaaacccgga caaacaccag tctctgtctt ctaacgctaa agttgacaaa 60
atcgctaccg actctctgaa aaacgaaacc gacatcgaac tgaaaaacat caaccacgaa 120
gacttcctgc gtatgtctga acacgaatct atcgacccgt tcgtttctgc ttctaccatc 180
cagaccggta tcggtatcgc tggtaaaatc ctgggtaccc tgggtgttcc gttcgctggt 240
cagatcgctt ctctgtactc tttcatcctg ggtgaactgt ggccgaaagg taaatctcag 300
tgggaaatct tcatggaaca cgttgaagaa ctcatcgacc agaaaatctc tacctacgct 360
cgtaacatcg ctctggctga cctgaaaggt ctgggtgacg ctctggctgt ttaccacgaa 420
tctctggaat cttggatcaa aaaccgtaac aacgctcgtg ctacctctgt tgttaaatct 480
cagtacatcg ctctggaact gctgttcgtt cagaaactgc cgtctttcgc tgtttctggt 540
gaaaaagttc cgctgctgcc gatctacgct caggctgcta acctgcacct gctgctgctg 600
cgtgacgctt ctgttttcgg taaagaatgg ggtctgtcta actctcagat ctctaccttc 660
tacaaccgtc aggttgaacg tacctctgac tactctgacc actgcgttaa atggtactct 720
accggtctga acaacctgcg tggtaccaac gctgaatctt gggttcgtta caaccagttc 780
cgtaaagaca tgaccctgat ggttctggac ctgatcgctc tgttcccgtc ttacgacacc 840
ctggtttacc cgatcaaaac cacctctcag ctgacccgtg aagtttacac cgacgctatc 900
ggtaccgttc acccaaatgc tagcttcgct tctaccactt ggtacaacaa caacgctccg 960
tctttctctg ctatcgaatc tgctgttgtt cgtaacccgc acctgctgga cttcctggaa 1020
caggttacca tctactctct gctgtctcgt tggtctaaca cccagtacat gaacatgtgg 1080
ggtggtcacc gtctggaatt ccgtaccatc ggtggtgttc tgaacacctc tacccagggt 1140
tctaccaaca cctctatcaa cccggttacc ctgccgttca cctctcgtga cgtttaccgt 1200
accgaatctc tggctggtct gaacctgttc ctgacccagc cggttaacgg tgttccgcgt 1260
gttgacttcc actggaaatt cgctaccctg ccgatcgctt ctgacaactt ctactacctg 1320
ggttacgctg gtgttggtac ccagctgcag gactctgaaa acgaactgcc gccggaaacc 1380
accggtcagc cgaactacga atcttactct caccgtctgt ctcacatcgg tctgatctct 1440
gcttctcacg ttaaagctct ggtttactct tggacccacc gttctgctga ccgtaccaac 1500
accatcgaac cgaactctat cacccagatc ccgctggtta aagcgttcaa cctgtcttct 1560
ggtgctgctg ttgttcgtgg tccgggtttc accggtggtg acatcctgcg tcgtaccaac 1620
accggtacct tcggtgacat ccgtgttaac atcaacccgc cgttcgctca gcgttaccgt 1680
gttcgtatcc gttacgcttc taccaccgac ctgcagttcc acacctctat caacggtaaa 1740
gctatcaacc agggtaactt ctctgctacc atgaaccgtg gtgaagacct ggactacaaa 1800
accttccgta ccgttggttt caccaccccg ttctctttct ctgacgttca gtctaccttc 1860
accatcggtg cttggaactt ctcttctggt aacgaagttt acatcgaccg tatcgaattc 1920
gttccggttg aagttaccta cgaa 1944
<210> 6
<211> 1944
<212> DNA
<213> Synthesis (unown)
<400> 6
atgaaactga aaaacccgga caaacaccag tctctgtctt ctaacgctaa agttgacaaa 60
atcgctaccg actctctgaa aaacgaaacc gacatcgaac tgaaaaacat caaccacgaa 120
gacttcctgc gtatgtctga acacgaatct atcgacccgt tcgtttctgc ttctaccatc 180
cagaccggta tcggtatcgc tggtaaaatc ctgggtaccc tgggtgttcc gttcgctggt 240
cagatcgctt ctctgtactc tttcatcctg ggtgaactgt ggccgaaagg taaatctcag 300
tgggaaatct tcatggaaca cgttgaagaa ctcatcagcc agaaaatctc tacctacgct 360
cgtaacatcg ctctggctga cctgaaaggt ctgggtgacg ctctggctgt ttaccacgaa 420
tctctggaat cttggatcaa aaaccgtaac aacgctcgtg ctacctctgt tgttaaatct 480
cagtacatcg ctctggaact gctgttcgtt cagaaactgc cgtctttcgc tgtttctggt 540
gaagtagttc cgctgctgcc gatctacgct caggctgcta acctgcacct gctgctgctg 600
cgtgacgctt ctgttttcgg taaagaatgg ggtctgtcta actctcagat ctctaccttc 660
tacaaccgtc aggttgaacg tacctctgac tactctgacc actgcgttaa atggtactct 720
accggtctga acaacctgcg tggtaccaac gctgaatctt gggttcgtta caaccagttc 780
cgtaaagaca tgaccctgat ggttctggac ctgatcgctc tgttcccgtc ttacgacacc 840
ctggtttacc cgatcaaaac cacctctcag ctgacccgtg aagtttacac cgacgctatc 900
ggtaccgttc acccaaatgc tagcttcgct tctaccactt ggtacaacaa caacgctccg 960
tctttctctg ctatcgaatc tgctgttgtt cgtaacccgc acctgctgga cttcctggaa 1020
caggttacca tctactctct gctgtctcgt tggtctaaca cccagtacat gaacatgtgg 1080
ggtggtcacc gtctggaatt ccgtaccatc ggtggtgttc tgaacacctc tacccagggt 1140
tctaccaaca cctctatcaa cccggttacc ctgccgttca cctctcgtga cgtttaccgt 1200
accgaatctc tggctggtct gaacctgttc ctgacccagc cggttaacgg tgttccgcgt 1260
gttgacttcc actggaaatt cgctaccctg ccgatcgctt ctgacaactt ctactacctg 1320
ggttacgctg gtgttggtac ccagctgcag gactctgaaa acgaactgcc gccggaaacc 1380
accggtcagc cgaactacga atcttactct caccgtctgt ctcacatcgg tctgatctct 1440
gcttctcacg ttaaagctct ggtttactct tggacccacc gttctgctga ccgtaccaac 1500
accatcgaac cgaactctat cacccagatc ccgctggtta aagcgttcaa cctgtcttct 1560
ggtgctgctg ttgttcgtgg tccgggtttc accggtggtg acatcctgcg tcgtaccaac 1620
accggtacct tcggtgacat ccgtgttaac atcaacccgc cgttcgctca gcgttaccgt 1680
gttcgtatcc gttacgcttc taccaccgac ctgcagttcc acacctctat caacggtaaa 1740
gctatcaacc agggtaactt ctctgctacc atgaaccgtg gtgaagacct ggactacaaa 1800
accttccgta ccgttggttt caccaccccg ttctctttct ctgacgttca gtctaccttc 1860
accatcggtg cttggaactt ctcttctggt aacgaagttt acatcgaccg tatcgaattc 1920
gttccggttg aagttaccta cgaa 1944
<210> 7
<211> 1944
<212> DNA
<213> Synthesis (unown)
<400> 7
atgaaactga aaaacccgga caaacaccag tctctgtctt ctaacgctaa agttgacaaa 60
atcgctaccg actctctgaa aaacgaaacc gacatcgaac tgaaaaacat caaccacgaa 120
gacttcctgc gtatgtctga acacgaatct atcgacccgt tcgtttctgc ttctaccatc 180
cagaccggta tcggtatcgc tggtaaaatc ctgggtaccc tgggtgttcc gttcgctggt 240
cagagcgctt ctctgtactc tttcatcctg ggtgaactgt ggccgaaagg taaagatcag 300
tgggaaatct tcatggaaca cgttgaagaa ctcatcgacc agaaaatctc tacctacgct 360
cgtaacatcg ctctggctga cctgaaaggt ctgggtgacg ctctggctgt ttaccacgaa 420
tctctggaat cttggatcaa aaaccgtaac aacgctcgtg ctacctctgt tgttaaatct 480
cagtacatcg ctctggaact gctgttcgtt cagaaactgc cgtctttcgc tgtttctggt 540
gaagaagttc cgctgctgcc gatctacgct caggctgcta acctgcacct gctgctgctg 600
cgtgacgctt ctgttttcgg taaagaatgg ggtctgtcta actctcagat ctctaccttc 660
tacaaccgtc aggttgaacg tacctctgac tactctgacc actgcgttaa atggtactct 720
accggtctga acaacctgcg tggtaccaac gctgaatctt gggttcgtta caaccagttc 780
cgtaaagaca tgaccctgat ggttctggac ctgatcgctc tgttcccgtc ttacgacacc 840
ctggtttacc cgatcaaaac cacctctcag ctgacccgtg aagtttacac cgacgctatc 900
ggtaccgttc acccaaatgc tagcttcgct tctaccactt ggtacaacaa caacgctccg 960
tctttctctg ctatcgaatc tgctgttgtt cgtaacccgc acctgctgga cttcctggaa 1020
caggttacca tctactctct gctgtctcgt tggtctaaca cccagtacat gaacatgtgg 1080
ggtggtcacc gtctggaatt ccgtaccatc ggtggtgttc tgaacacctc tacccagggt 1140
tctaccaaca cctctatcaa cccggttacc ctgccgttca cctctcgtga cgtttaccgt 1200
accgaatctc tggctggtct gaacctgttc ctgacccagc cggttaacgg tgttccgcgt 1260
gttgacttcc actggaaatt cgctaccctg ccgatcgctt ctgacaactt ctactacctg 1320
ggttacgctg gtgttggtac ccagctgcag gactctgaaa acgaactgcc gccggaaacc 1380
accggtcagc cgaactacga atcttactct caccgtctgt ctcacatcgg tctgatctct 1440
gcttctcacg ttaaagctct ggtttactct tggacccacc gttctgctga ccgtaccaac 1500
accatcgaac cgaactctat cacccagatc ccgctggtta aagcgttcaa cctgtcttct 1560
ggtgctgctg ttgttcgtgg tccgggtttc accggtggtg acatcctgcg tcgtaccaac 1620
accggtacct tcggtgacat ccgtgttaac atcaacccgc cgttcgctca gcgttaccgt 1680
gttcgtatcc gttacgcttc taccaccgac ctgcagttcc acacctctat caacggtaaa 1740
gctatcaacc agggtaactt ctctgctacc atgaaccgtg gtgaagacct ggactacaaa 1800
accttccgta ccgttggttt caccaccccg ttctctttct ctgacgttca gtctaccttc 1860
accatcggtg cttggaactt ctcttctggt aacgaagttt acatcgaccg tatcgaattc 1920
gttccggttg aagttaccta cgaa 1944
<210> 8
<211> 1944
<212> DNA
<213> Synthesis (unown)
<400> 8
atgaaactga aaaacccgga caaacaccag tctctgtctt ctaacgctaa agttgacaaa 60
atcgctaccg actctctgaa aaacgaaacc gacatcgaac tgaaaaacat caaccacgaa 120
gacttcctgc gtatgtctga acacgaatct atcgacccgt tcgtttctgc ttctaccatc 180
cagaccggta tcggtatcgc tggtaaaatc ctgggtaccc tgggtgttcc gttcgctggt 240
cagatcgctt ctctgtactc tttcatcctg ggtgaactgt ggccgaaagg taaatctcag 300
tgggaaatct tcatggaaca cgttgaagaa ctcatcgacc agaaaatctc tacctacgct 360
cgtaacatcg ctctggctga cctgaaaggt ctgggtgacg ctctggctgt ttaccacgaa 420
tctctggaat cttggatcaa aaaccgtaac aacgctcgtg ctacctctgt tgttaaatct 480
cagtacatcg ctctggaact gctgttcgtt cagaaactgc cgtctttcgc tgtttctggt 540
gaagaagttc cgctgctgcc gatctacgct caggctgcta acctgcacct gctgctgctg 600
cgtgacgctt ctgttttcgg taaagaatgg ggtctgtcta actctcagat ctctaccttc 660
tacaaccgtc aggttgaacg tacctctgac tactctyacc actgcgttaa atggtactct 720
accggtctga acaacctgcg tggtaccaac gctgaatctt gggttcgtta caaccagttc 780
cgtaaagaca tgaccctgat ggttctggac ctgatcgctc tgttcccgtc ttacgacacc 840
ctggtttacc cgatcaaaac cacctctcag ctgacccgtg aagtttacac cgacgctatc 900
ggtaccgttc acccaaatgc tagcttcgct tctaccactt ggtacaacaa caacgctccg 960
tctttctctg ctatcgaatc tgctgttgtt cgtaacccgc acctgctgga cttcctggaa 1020
caggttacca tctactctct gctgtctcgt tggtctaaca cccagtacat gaacatgtgg 1080
ggtggtcacc gtctggaatt ccgtaccatc ggtggtgttc tgaacacctc tacccagggt 1140
tctaccaaca cctctatcaa cccggttacc ctgccgttca cctctcgtga cgtttaccgt 1200
accgaatctc tggctggtct gaacctgttc ctgacccagc cggttaacgg tgttccgcgt 1260
gttgacttcc actggaaatt cgctaccctg ccgatcgctt ctgacaactt ctactacctg 1320
ggttacgctg gtgttggtac ccagctgcag gactctgaaa acgaactgcc gccggaaacc 1380
accggtcagc cgaactacga atcttactct caccgtctgt ctcacatcgg tctgatctct 1440
gcttctcacg ttaaagctct ggtttactct tggacccacc gttctgctga ccgtaccaac 1500
accatcgaac cgaactctat cacccagatc ccgctggtta aagcgttcaa cctgtcttct 1560
ggtgctgctg ttgttcgtgg tccgggtttc accggtggtg acatcctgcg tcgtaccaac 1620
accggtacct tcggtgacat ccgtgttaac atcaacccgc cgttcgctca gcgttaccgt 1680
gttcgtatcc gttacgcttc taccaccgac ctgcagttcc acacctctat caacggtaaa 1740
gctatcaacc agggtaactt ctctgctacc atgaaccgtg gtgaagacct ggactacaaa 1800
accttccgta ccgttggttt caccaccccg ttctctttct ctgacgttca gtctaccttc 1860
accatcggtg cttggaactt ctcttctggt aacgaagttt acatcgaccg tatcgaattc 1920
gttccggttg aagttaccta cgaa 1944
Claims (6)
1. An amino acid variation site of a protein, characterized in that the site is located at any one or more of the following sites in the sequence shown in SEQ ID NO. 1:
1)I82;
2)S99;
3)D113;
4)K147;
5)E182;
6)N214;
7)D233。
2. a protein comprising the amino acid sequence of SEQ ID No.1 and having the following mutations:
1)I82V+S99N+L111I+K147G+N214S;
2)D233N;
3)E182K;
4)D113S+E182V;
5)I82S+S99D
6)D233Y。
3. a nucleic acid encoding the protein of claim 2.
4. A nucleic acid according to claim 3, wherein the nucleotide sequence of the nucleic acid is as set forth in any one of SEQ ID No.3 to SEQ ID No. 8.
5. Use of a protein according to claim 2 and a nucleic acid according to claim 3 to claim 4 for the cultivation of insect-resistant crops.
6. The use of claim 5, wherein the crop is corn.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111663267.5A CN116410277A (en) | 2021-12-31 | 2021-12-31 | Novel insect-resistant gene and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111663267.5A CN116410277A (en) | 2021-12-31 | 2021-12-31 | Novel insect-resistant gene and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116410277A true CN116410277A (en) | 2023-07-11 |
Family
ID=87049984
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111663267.5A Pending CN116410277A (en) | 2021-12-31 | 2021-12-31 | Novel insect-resistant gene and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116410277A (en) |
-
2021
- 2021-12-31 CN CN202111663267.5A patent/CN116410277A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN116514936B (en) | Insect-resistant protein and preparation method and application thereof | |
CN116284280B (en) | Insect-resistant protein and preparation method and application thereof | |
CN101583720A (en) | Plants having enhanced yield-related traits and a method for method for making the same | |
CN101868544A (en) | Plants having increased yield-related traits and a method for making the same | |
CA2475467C (en) | Gene conferring resistance to phytophthora infestans (late-blight) in solanaceae | |
CN115612695B (en) | Application of GhGPX5 and GhGPX13 genes in improving salt stress tolerance of plants | |
CN114369147B (en) | Application of BFNE gene in tomato plant type improvement and biological yield improvement | |
CN116947986A (en) | Insect-resistant protein and preparation method and application thereof | |
CN112812161B (en) | Application of protein IbMYC2 in regulation and control of plant drought resistance | |
AU2004210748B2 (en) | Gene resistant to Aphis gossypii | |
CN116396368A (en) | Insect-resistant protein and preparation method and application thereof | |
CN112724213B (en) | Sweet potato anthocyanin synthesis and stress resistance related protein IbMYB4, and coding gene and application thereof | |
US7915485B2 (en) | Nod-factor perception | |
CN111826364B (en) | Disease and pest resistance related gene and application thereof | |
CN112708603B (en) | Application of rice ARE2 gene in plant nitrogen metabolism regulation | |
CN116410277A (en) | Novel insect-resistant gene and application thereof | |
CN110358774B (en) | Gene, protein, gene expression cassette, expression vector, host cell, method and application for controlling rice flowering time | |
CN113388018B (en) | Application of cynodon dactylon CdWRKY2 protein and coding gene thereof in regulation and control of plant salt tolerance | |
CN114656540B (en) | Application of protein CYCA3-1 in improving saline-alkali resistance of corn | |
CN113462661B (en) | SIZ1 protein separated from corn, encoding gene thereof and application thereof in variety improvement | |
CN113416747B (en) | Method for creating temperature-sensitive male sterile plant | |
CN108841839B (en) | Application of protein TabZIP60 in regulation and control of nitrogen absorption of plants | |
WO2024121150A1 (en) | Methods of modulating immune function in plants | |
CN115368445A (en) | Method for increasing seed size or increasing seed weight in plants | |
CN114717243A (en) | GRMZM2G063882 gene, encoding protein, biological material and application thereof in plant drought resistance |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |