CN107109418B - 用于控制植物有害生物的组合物和方法 - Google Patents
用于控制植物有害生物的组合物和方法 Download PDFInfo
- Publication number
- CN107109418B CN107109418B CN201580067474.0A CN201580067474A CN107109418B CN 107109418 B CN107109418 B CN 107109418B CN 201580067474 A CN201580067474 A CN 201580067474A CN 107109418 B CN107109418 B CN 107109418B
- Authority
- CN
- China
- Prior art keywords
- leu
- thr
- ser
- val
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims description 69
- 241000607479 Yersinia pestis Species 0.000 title abstract description 70
- 239000000203 mixture Substances 0.000 title description 57
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 385
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 291
- 241000238631 Hexapoda Species 0.000 claims abstract description 113
- 230000014509 gene expression Effects 0.000 claims description 111
- 150000007523 nucleic acids Chemical class 0.000 claims description 91
- 239000002773 nucleotide Substances 0.000 claims description 78
- 125000003729 nucleotide group Chemical group 0.000 claims description 78
- 102000039446 nucleic acids Human genes 0.000 claims description 75
- 108020004707 nucleic acids Proteins 0.000 claims description 75
- 230000009261 transgenic effect Effects 0.000 claims description 69
- 231100000331 toxic Toxicity 0.000 claims description 39
- 230000002588 toxic effect Effects 0.000 claims description 39
- 239000013598 vector Substances 0.000 claims description 38
- 108020004705 Codon Proteins 0.000 claims description 20
- 241000894006 Bacteria Species 0.000 claims description 15
- 230000001580 bacterial effect Effects 0.000 claims description 12
- 241000218475 Agrotis segetum Species 0.000 claims 1
- 241000282376 Panthera tigris Species 0.000 claims 1
- 230000000749 insecticidal effect Effects 0.000 abstract description 39
- 241000193388 Bacillus thuringiensis Species 0.000 abstract description 26
- 229940097012 bacillus thuringiensis Drugs 0.000 abstract description 25
- 241000196324 Embryophyta Species 0.000 description 285
- 235000018102 proteins Nutrition 0.000 description 282
- 108091033319 polynucleotide Proteins 0.000 description 122
- 239000002157 polynucleotide Substances 0.000 description 122
- 102000040430 polynucleotide Human genes 0.000 description 122
- 210000004027 cell Anatomy 0.000 description 117
- 125000003275 alpha amino acid group Chemical group 0.000 description 71
- 108091028043 Nucleic acid sequence Proteins 0.000 description 55
- 108020004414 DNA Proteins 0.000 description 54
- 235000001014 amino acid Nutrition 0.000 description 40
- 230000009466 transformation Effects 0.000 description 39
- 241000566547 Agrotis ipsilon Species 0.000 description 32
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 30
- 240000008042 Zea mays Species 0.000 description 29
- 229940024606 amino acid Drugs 0.000 description 29
- 150000001413 amino acids Chemical class 0.000 description 29
- 210000001519 tissue Anatomy 0.000 description 29
- 108090000765 processed proteins & peptides Proteins 0.000 description 27
- 102000004196 processed proteins & peptides Human genes 0.000 description 27
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 26
- 241000346285 Ostrinia furnacalis Species 0.000 description 25
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 25
- 235000009973 maize Nutrition 0.000 description 25
- 229920001184 polypeptide Polymers 0.000 description 25
- 239000003795 chemical substances by application Substances 0.000 description 23
- 238000009396 hybridization Methods 0.000 description 23
- 102000004190 Enzymes Human genes 0.000 description 19
- 108090000790 Enzymes Proteins 0.000 description 19
- 229940088598 enzyme Drugs 0.000 description 19
- 108010089804 glycyl-threonine Proteins 0.000 description 19
- 241000589158 Agrobacterium Species 0.000 description 18
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 18
- 108010005233 alanylglutamic acid Proteins 0.000 description 18
- 230000000361 pesticidal effect Effects 0.000 description 18
- 230000001105 regulatory effect Effects 0.000 description 18
- 108010061238 threonyl-glycine Proteins 0.000 description 18
- 241001147381 Helicoverpa armigera Species 0.000 description 17
- 239000000047 product Substances 0.000 description 17
- 230000001276 controlling effect Effects 0.000 description 16
- 239000004009 herbicide Substances 0.000 description 16
- 238000012360 testing method Methods 0.000 description 16
- 108010073969 valyllysine Proteins 0.000 description 16
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 15
- 241000426497 Chilo suppressalis Species 0.000 description 15
- 241000255967 Helicoverpa zea Species 0.000 description 15
- 241001147398 Ostrinia nubilalis Species 0.000 description 15
- 230000000694 effects Effects 0.000 description 15
- 239000002609 medium Substances 0.000 description 15
- 210000002706 plastid Anatomy 0.000 description 15
- -1 rRNA Proteins 0.000 description 15
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 14
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 14
- 108010068265 aspartyltyrosine Proteins 0.000 description 14
- 238000003556 assay Methods 0.000 description 14
- 238000004519 manufacturing process Methods 0.000 description 14
- 230000001404 mediated effect Effects 0.000 description 14
- 108010051242 phenylalanylserine Proteins 0.000 description 14
- 239000000126 substance Substances 0.000 description 14
- 108700012359 toxins Proteins 0.000 description 14
- 241000209510 Liliopsida Species 0.000 description 13
- 241000256251 Spodoptera frugiperda Species 0.000 description 13
- 230000015572 biosynthetic process Effects 0.000 description 13
- 239000003550 marker Substances 0.000 description 13
- 239000000523 sample Substances 0.000 description 13
- 238000006467 substitution reaction Methods 0.000 description 13
- 241001367803 Chrysodeixis includens Species 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 12
- 108010048818 seryl-histidine Proteins 0.000 description 12
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 12
- 239000003053 toxin Substances 0.000 description 12
- 231100000765 toxin Toxicity 0.000 description 12
- 108091026890 Coding region Proteins 0.000 description 11
- 238000003786 synthesis reaction Methods 0.000 description 11
- 230000008685 targeting Effects 0.000 description 11
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 10
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 10
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 10
- 108010044940 alanylglutamine Proteins 0.000 description 10
- 108010047857 aspartylglycine Proteins 0.000 description 10
- 241001233957 eudicotyledons Species 0.000 description 10
- 238000009472 formulation Methods 0.000 description 10
- 108010037850 glycylvaline Proteins 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 238000003752 polymerase chain reaction Methods 0.000 description 10
- 241000894007 species Species 0.000 description 10
- 241000353522 Earias insulana Species 0.000 description 9
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 9
- 108091005804 Peptidases Proteins 0.000 description 9
- 239000004365 Protease Substances 0.000 description 9
- 108090000637 alpha-Amylases Proteins 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 9
- 229920001817 Agar Polymers 0.000 description 8
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 8
- ZFSIGJMSVGZVGP-DHATWTDPSA-N Arg-Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCN=C(N)N)[C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZFSIGJMSVGZVGP-DHATWTDPSA-N 0.000 description 8
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 8
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 8
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 8
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 8
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 8
- 241000193830 Bacillus <bacterium> Species 0.000 description 8
- 241000008892 Cnaphalocrocis patnalis Species 0.000 description 8
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 8
- 241000879145 Diatraea grandiosella Species 0.000 description 8
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 8
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 8
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 8
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 8
- 235000010469 Glycine max Nutrition 0.000 description 8
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 8
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 8
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 8
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 8
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 8
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 8
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 8
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 8
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 8
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 8
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 8
- 108010079005 RDV peptide Proteins 0.000 description 8
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 8
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 8
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 8
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 8
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 8
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 8
- 239000008272 agar Substances 0.000 description 8
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 8
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 8
- 108010062796 arginyllysine Proteins 0.000 description 8
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 8
- 238000004166 bioassay Methods 0.000 description 8
- 101150086784 cry gene Proteins 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 8
- 230000002068 genetic effect Effects 0.000 description 8
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 8
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 8
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 8
- 108010084389 glycyltryptophan Proteins 0.000 description 8
- 108010025306 histidylleucine Proteins 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 239000000575 pesticide Substances 0.000 description 8
- 108010029020 prolylglycine Proteins 0.000 description 8
- 238000012163 sequencing technique Methods 0.000 description 8
- 108010020532 tyrosyl-proline Proteins 0.000 description 8
- 108010076441 Ala-His-His Proteins 0.000 description 7
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 description 7
- 241000588724 Escherichia coli Species 0.000 description 7
- 244000068988 Glycine max Species 0.000 description 7
- 108091022912 Mannose-6-Phosphate Isomerase Proteins 0.000 description 7
- 102000048193 Mannose-6-phosphate isomerases Human genes 0.000 description 7
- 229920002472 Starch Polymers 0.000 description 7
- 238000007792 addition Methods 0.000 description 7
- 230000003321 amplification Effects 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 230000000295 complement effect Effects 0.000 description 7
- 235000019621 digestibility Nutrition 0.000 description 7
- 210000002257 embryonic structure Anatomy 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 230000007613 environmental effect Effects 0.000 description 7
- 230000002496 gastric effect Effects 0.000 description 7
- 239000000499 gel Substances 0.000 description 7
- 238000000338 in vitro Methods 0.000 description 7
- 238000003199 nucleic acid amplification method Methods 0.000 description 7
- 210000001938 protoplast Anatomy 0.000 description 7
- 238000011160 research Methods 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 235000019698 starch Nutrition 0.000 description 7
- 239000008107 starch Substances 0.000 description 7
- 238000012546 transfer Methods 0.000 description 7
- 239000005660 Abamectin Substances 0.000 description 6
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 6
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 6
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 6
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 6
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 6
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 6
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 6
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 6
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 6
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 6
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 6
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 6
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 6
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 6
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 6
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 6
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 6
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 6
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 6
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 6
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 6
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 6
- MKWFGXSFLYNTKC-XIRDDKMYSA-N His-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N MKWFGXSFLYNTKC-XIRDDKMYSA-N 0.000 description 6
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 6
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 6
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 6
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 6
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 6
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 6
- 102000057297 Pepsin A Human genes 0.000 description 6
- 108090000284 Pepsin A Proteins 0.000 description 6
- 102000035195 Peptidases Human genes 0.000 description 6
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 6
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 6
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 6
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 6
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 6
- 241000563489 Sesamia inferens Species 0.000 description 6
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 6
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 6
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 6
- 108700019146 Transgenes Proteins 0.000 description 6
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 6
- 108090000848 Ubiquitin Proteins 0.000 description 6
- 102000044159 Ubiquitin Human genes 0.000 description 6
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 6
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 6
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 6
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 6
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 6
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 6
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 6
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 239000011324 bead Substances 0.000 description 6
- 238000004113 cell culture Methods 0.000 description 6
- 244000038559 crop plants Species 0.000 description 6
- 239000012530 fluid Substances 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 108010010147 glycylglutamine Proteins 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 230000002363 herbicidal effect Effects 0.000 description 6
- 238000003780 insertion Methods 0.000 description 6
- 230000037431 insertion Effects 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 6
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 6
- 108010000761 leucylarginine Proteins 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 239000002245 particle Substances 0.000 description 6
- 229940111202 pepsin Drugs 0.000 description 6
- 108010080629 tryptophan-leucine Proteins 0.000 description 6
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 6
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid Chemical compound CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 5
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 5
- 101150078024 CRY2 gene Proteins 0.000 description 5
- 108010008885 Cellulose 1,4-beta-Cellobiosidase Proteins 0.000 description 5
- 101150102464 Cry1 gene Proteins 0.000 description 5
- 239000005561 Glufosinate Substances 0.000 description 5
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 5
- 244000046052 Phaseolus vulgaris Species 0.000 description 5
- 108010076504 Protein Sorting Signals Proteins 0.000 description 5
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 5
- 102000004139 alpha-Amylases Human genes 0.000 description 5
- 230000000890 antigenic effect Effects 0.000 description 5
- 210000002421 cell wall Anatomy 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 5
- CWFOCCVIPCEQCK-UHFFFAOYSA-N chlorfenapyr Chemical compound BrC1=C(C(F)(F)F)N(COCC)C(C=2C=CC(Cl)=CC=2)=C1C#N CWFOCCVIPCEQCK-UHFFFAOYSA-N 0.000 description 5
- 210000003763 chloroplast Anatomy 0.000 description 5
- 230000001186 cumulative effect Effects 0.000 description 5
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 5
- NYPJDWWKZLNGGM-UHFFFAOYSA-N fenvalerate Chemical compound C=1C=C(Cl)C=CC=1C(C(C)C)C(=O)OC(C#N)C(C=1)=CC=CC=1OC1=CC=CC=C1 NYPJDWWKZLNGGM-UHFFFAOYSA-N 0.000 description 5
- 230000002538 fungal effect Effects 0.000 description 5
- 230000002401 inhibitory effect Effects 0.000 description 5
- 230000000670 limiting effect Effects 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 5
- 230000000813 microbial effect Effects 0.000 description 5
- 244000005700 microbiome Species 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 239000002689 soil Substances 0.000 description 5
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 5
- CAAMSDWKXXPUJR-UHFFFAOYSA-N 3,5-dihydro-4H-imidazol-4-one Chemical class O=C1CNC=N1 CAAMSDWKXXPUJR-UHFFFAOYSA-N 0.000 description 4
- QUTYKIXIUDQOLK-PRJMDXOYSA-N 5-O-(1-carboxyvinyl)-3-phosphoshikimic acid Chemical compound O[C@H]1[C@H](OC(=C)C(O)=O)CC(C(O)=O)=C[C@H]1OP(O)(O)=O QUTYKIXIUDQOLK-PRJMDXOYSA-N 0.000 description 4
- 108010000700 Acetolactate synthase Proteins 0.000 description 4
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 4
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 4
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 4
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 4
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 4
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 4
- JWUZOJXDJDEQEM-ZLIFDBKOSA-N Ala-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 JWUZOJXDJDEQEM-ZLIFDBKOSA-N 0.000 description 4
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 4
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 4
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 4
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 4
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 4
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 4
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 4
- QIWYWCYNUMJBTC-CIUDSAMLSA-N Arg-Cys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIWYWCYNUMJBTC-CIUDSAMLSA-N 0.000 description 4
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 4
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 4
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 4
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 4
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 4
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 4
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 4
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 description 4
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 4
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 4
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 4
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 4
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 4
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 4
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 4
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 4
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 4
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 4
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 4
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 4
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 4
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 4
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 4
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 4
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 4
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 4
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 4
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 4
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 4
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 4
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 4
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 4
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 4
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 4
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 4
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 4
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 4
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 4
- 108010083946 Asp-Tyr-Leu-Lys Proteins 0.000 description 4
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 4
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 4
- 102100032487 Beta-mannosidase Human genes 0.000 description 4
- 229920000742 Cotton Polymers 0.000 description 4
- URDUGPGPLNXXES-WHFBIAKZSA-N Cys-Gly-Cys Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O URDUGPGPLNXXES-WHFBIAKZSA-N 0.000 description 4
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 4
- JUUMIGUJJRFQQR-KKUMJFAQSA-N Cys-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O JUUMIGUJJRFQQR-KKUMJFAQSA-N 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- ULGZDMOVFRHVEP-RWJQBGPGSA-N Erythromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=O)[C@H](C)C[C@@](C)(O)[C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 ULGZDMOVFRHVEP-RWJQBGPGSA-N 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 4
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 4
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 4
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 4
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 4
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 4
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 4
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 4
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 4
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 4
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 4
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 4
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 4
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 4
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 4
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 4
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 4
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 4
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 4
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 4
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 4
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 4
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 4
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 4
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 4
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 4
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 4
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 4
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 4
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 4
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 4
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 4
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 4
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 4
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 4
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 4
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 4
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 4
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 4
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 4
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 4
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 4
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 4
- 239000005562 Glyphosate Substances 0.000 description 4
- 244000020551 Helianthus annuus Species 0.000 description 4
- 235000003222 Helianthus annuus Nutrition 0.000 description 4
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 4
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 4
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 4
- JBSLJUPMTYLLFH-MELADBBJSA-N His-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O JBSLJUPMTYLLFH-MELADBBJSA-N 0.000 description 4
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 4
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 4
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 4
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 4
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 4
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 4
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 4
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 4
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 4
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 4
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 4
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 4
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 4
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 4
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 4
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 4
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 4
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 4
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 4
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 4
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 4
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 4
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 4
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 4
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 4
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 4
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 4
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 4
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 4
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 4
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 4
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 4
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 4
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 4
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 4
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 4
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 4
- DZTDEZSHBVRUCQ-FXQIFTODSA-N Met-Asp-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DZTDEZSHBVRUCQ-FXQIFTODSA-N 0.000 description 4
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 4
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 4
- 241000244206 Nematoda Species 0.000 description 4
- 241000208125 Nicotiana Species 0.000 description 4
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 4
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 4
- 240000007594 Oryza sativa Species 0.000 description 4
- 235000007164 Oryza sativa Nutrition 0.000 description 4
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 4
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 4
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 4
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 4
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 4
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 4
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 4
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 4
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 4
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 4
- 108700001094 Plant Genes Proteins 0.000 description 4
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 4
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 4
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 4
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 4
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 4
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 4
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 4
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 4
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 4
- 241000098281 Scirpophaga innotata Species 0.000 description 4
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 4
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 4
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 4
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 4
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 4
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 4
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 4
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 4
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 4
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 4
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 4
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 4
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 4
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 4
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 4
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 4
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 4
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 4
- 244000061456 Solanum tuberosum Species 0.000 description 4
- 235000002595 Solanum tuberosum Nutrition 0.000 description 4
- 244000062793 Sorghum vulgare Species 0.000 description 4
- 229940100389 Sulfonylurea Drugs 0.000 description 4
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 4
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 4
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 4
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 4
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 4
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 4
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 4
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 4
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 4
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 4
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 4
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 4
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 4
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 4
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 4
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 4
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 4
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 4
- QFCQNHITJPRQTB-IEGACIPQSA-N Thr-Lys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O QFCQNHITJPRQTB-IEGACIPQSA-N 0.000 description 4
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 4
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 4
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 4
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 4
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 4
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 4
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 4
- 241000499912 Trichoderma reesei Species 0.000 description 4
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 4
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 4
- XKTWZYNTLXITCY-QRTARXTBSA-N Trp-Val-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 XKTWZYNTLXITCY-QRTARXTBSA-N 0.000 description 4
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 4
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 4
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 4
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 4
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 4
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 4
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 4
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 4
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 4
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 4
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 4
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 4
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 4
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 4
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 4
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 4
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 4
- CCEVJBJLPRNAFH-BVSLBCMMSA-N Tyr-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N CCEVJBJLPRNAFH-BVSLBCMMSA-N 0.000 description 4
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 4
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 4
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 4
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 4
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 4
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 4
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 4
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 4
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 4
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 4
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 4
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Chemical class Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 239000000427 antigen Substances 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- RRZXIRBKKLTSOM-XPNPUAGNSA-N avermectin B1a Chemical compound C1=C[C@H](C)[C@@H]([C@@H](C)CC)O[C@]11O[C@H](C\C=C(C)\[C@@H](O[C@@H]2O[C@@H](C)[C@H](O[C@@H]3O[C@@H](C)[C@H](O)[C@@H](OC)C3)[C@@H](OC)C2)[C@@H](C)\C=C\C=C/2[C@]3([C@H](C(=O)O4)C=C(C)[C@@H](O)[C@H]3OC\2)O)C[C@H]4C1 RRZXIRBKKLTSOM-XPNPUAGNSA-N 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 108010055059 beta-Mannosidase Proteins 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- 229960005091 chloramphenicol Drugs 0.000 description 4
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 239000013078 crystal Substances 0.000 description 4
- 108010004073 cysteinylcysteine Proteins 0.000 description 4
- 230000006378 damage Effects 0.000 description 4
- 230000034994 death Effects 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 235000013399 edible fruits Nutrition 0.000 description 4
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 4
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 4
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 4
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 4
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 229940097068 glyphosate Drugs 0.000 description 4
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 108010028295 histidylhistidine Proteins 0.000 description 4
- 238000002744 homologous recombination Methods 0.000 description 4
- 230000006801 homologous recombination Effects 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 230000002147 killing effect Effects 0.000 description 4
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 4
- 239000002853 nucleic acid probe Substances 0.000 description 4
- 210000000056 organ Anatomy 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 238000012552 review Methods 0.000 description 4
- 235000009566 rice Nutrition 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 230000028070 sporulation Effects 0.000 description 4
- 230000000451 tissue damage Effects 0.000 description 4
- 231100000827 tissue damage Toxicity 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 230000001131 transforming effect Effects 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 3
- GUAHPAJOXVYFON-ZETCQYMHSA-N (8S)-8-amino-7-oxononanoic acid zwitterion Chemical compound C[C@H](N)C(=O)CCCCCC(O)=O GUAHPAJOXVYFON-ZETCQYMHSA-N 0.000 description 3
- HOKKPVIRMVDYPB-UVTDQMKNSA-N (Z)-thiacloprid Chemical compound C1=NC(Cl)=CC=C1CN1C(=N/C#N)/SCC1 HOKKPVIRMVDYPB-UVTDQMKNSA-N 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 3
- 108091093088 Amplicon Proteins 0.000 description 3
- 241000625764 Anticarsia gemmatalis Species 0.000 description 3
- BRCVLJZIIFBSPF-ZLUOBGJFSA-N Asn-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BRCVLJZIIFBSPF-ZLUOBGJFSA-N 0.000 description 3
- 241000219310 Beta vulgaris subsp. vulgaris Species 0.000 description 3
- 102100026189 Beta-galactosidase Human genes 0.000 description 3
- 108010084185 Cellulases Proteins 0.000 description 3
- 102000005575 Cellulases Human genes 0.000 description 3
- 241000098289 Cnaphalocrocis medinalis Species 0.000 description 3
- 108700010070 Codon Usage Proteins 0.000 description 3
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 102100022624 Glucoamylase Human genes 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- 240000005979 Hordeum vulgare Species 0.000 description 3
- 235000007340 Hordeum vulgare Nutrition 0.000 description 3
- 102000004157 Hydrolases Human genes 0.000 description 3
- 108090000604 Hydrolases Proteins 0.000 description 3
- 206010020751 Hypersensitivity Diseases 0.000 description 3
- 108010028688 Isoamylase Proteins 0.000 description 3
- 241000588748 Klebsiella Species 0.000 description 3
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 3
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 3
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 3
- UXJHNUBJSQQIOC-SZMVWBNQSA-N Met-Trp-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O UXJHNUBJSQQIOC-SZMVWBNQSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- 241000606860 Pasteurella Species 0.000 description 3
- 241001148062 Photorhabdus Species 0.000 description 3
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 3
- 240000000111 Saccharum officinarum Species 0.000 description 3
- 235000007201 Saccharum officinarum Nutrition 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 240000003768 Solanum lycopersicum Species 0.000 description 3
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 3
- 235000021536 Sugar beet Nutrition 0.000 description 3
- 239000005940 Thiacloprid Substances 0.000 description 3
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 3
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 3
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 3
- 235000021307 Triticum Nutrition 0.000 description 3
- 244000098338 Triticum aestivum Species 0.000 description 3
- 241000607757 Xenorhabdus Species 0.000 description 3
- 229940024171 alpha-amylase Drugs 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- JFDZBHWFFUWGJE-UHFFFAOYSA-N benzonitrile Chemical compound N#CC1=CC=CC=C1 JFDZBHWFFUWGJE-UHFFFAOYSA-N 0.000 description 3
- 108010005774 beta-Galactosidase Proteins 0.000 description 3
- 230000000975 bioactive effect Effects 0.000 description 3
- 238000009395 breeding Methods 0.000 description 3
- 230000001488 breeding effect Effects 0.000 description 3
- 230000015556 catabolic process Effects 0.000 description 3
- 239000003593 chromogenic compound Substances 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 230000001351 cycling effect Effects 0.000 description 3
- 230000000593 degrading effect Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000000855 fermentation Methods 0.000 description 3
- 230000004151 fermentation Effects 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- 230000037406 food intake Effects 0.000 description 3
- 210000001035 gastrointestinal tract Anatomy 0.000 description 3
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 3
- 102000005396 glutamine synthetase Human genes 0.000 description 3
- 108020002326 glutamine synthetase Proteins 0.000 description 3
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 3
- 238000003306 harvesting Methods 0.000 description 3
- 238000010348 incorporation Methods 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 3
- 229960000310 isoleucine Drugs 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 210000004379 membrane Anatomy 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 230000036961 partial effect Effects 0.000 description 3
- 238000004161 plant tissue culture Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 102000005962 receptors Human genes 0.000 description 3
- 230000006798 recombination Effects 0.000 description 3
- 238000005215 recombination Methods 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 238000005507 spraying Methods 0.000 description 3
- YROXIXLRRCOBKF-UHFFFAOYSA-N sulfonylurea Chemical class OC(=N)N=S(=O)=O YROXIXLRRCOBKF-UHFFFAOYSA-N 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 210000005253 yeast cell Anatomy 0.000 description 3
- OGNSCSPNOLGXSM-UHFFFAOYSA-N (+/-)-DABA Natural products NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 2
- PGOOBECODWQEAB-UHFFFAOYSA-N (E)-clothianidin Chemical compound [O-][N+](=O)\N=C(/NC)NCC1=CN=C(Cl)S1 PGOOBECODWQEAB-UHFFFAOYSA-N 0.000 description 2
- KIAPWMKFHIKQOZ-UHFFFAOYSA-N 2-[[(4-fluorophenyl)-oxomethyl]amino]benzoic acid methyl ester Chemical compound COC(=O)C1=CC=CC=C1NC(=O)C1=CC=C(F)C=C1 KIAPWMKFHIKQOZ-UHFFFAOYSA-N 0.000 description 2
- UPMXNNIRAGDFEH-UHFFFAOYSA-N 3,5-dibromo-4-hydroxybenzonitrile Chemical compound OC1=C(Br)C=C(C#N)C=C1Br UPMXNNIRAGDFEH-UHFFFAOYSA-N 0.000 description 2
- ZOCSXAVNDGMNBV-UHFFFAOYSA-N 5-amino-1-[2,6-dichloro-4-(trifluoromethyl)phenyl]-4-[(trifluoromethyl)sulfinyl]-1H-pyrazole-3-carbonitrile Chemical compound NC1=C(S(=O)C(F)(F)F)C(C#N)=NN1C1=C(Cl)C=C(C(F)(F)F)C=C1Cl ZOCSXAVNDGMNBV-UHFFFAOYSA-N 0.000 description 2
- IBSREHMXUMOFBB-JFUDTMANSA-N 5u8924t11h Chemical compound O1[C@@H](C)[C@H](O)[C@@H](OC)C[C@@H]1O[C@@H]1[C@@H](OC)C[C@H](O[C@@H]2C(=C/C[C@@H]3C[C@@H](C[C@@]4(O3)C=C[C@H](C)[C@@H](C(C)C)O4)OC(=O)[C@@H]3C=C(C)[C@@H](O)[C@H]4OC\C([C@@]34O)=C/C=C/[C@@H]2C)/C)O[C@H]1C.C1=C[C@H](C)[C@@H]([C@@H](C)CC)O[C@]11O[C@H](C\C=C(C)\[C@@H](O[C@@H]2O[C@@H](C)[C@H](O[C@@H]3O[C@@H](C)[C@H](O)[C@@H](OC)C3)[C@@H](OC)C2)[C@@H](C)\C=C\C=C/2[C@]3([C@H](C(=O)O4)C=C(C)[C@@H](O)[C@H]3OC\2)O)C[C@H]4C1 IBSREHMXUMOFBB-JFUDTMANSA-N 0.000 description 2
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 description 2
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 description 2
- 229920000936 Agarose Polymers 0.000 description 2
- 241000218473 Agrotis Species 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 2
- YWWATNIVMOCSAV-UBHSHLNASA-N Ala-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YWWATNIVMOCSAV-UBHSHLNASA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- HFBFSOAKPUZCCO-ZLUOBGJFSA-N Ala-Cys-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HFBFSOAKPUZCCO-ZLUOBGJFSA-N 0.000 description 2
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 2
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 2
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 2
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 2
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 2
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 2
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 2
- 108020005544 Antisense RNA Proteins 0.000 description 2
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 2
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 2
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 2
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 2
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 2
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 2
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 2
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 2
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 2
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 2
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 2
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 2
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 2
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 2
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 2
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 2
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 2
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 2
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 2
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 2
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 2
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 2
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 2
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 2
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 2
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 2
- YQNBILXAUIAUCF-CIUDSAMLSA-N Asn-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N YQNBILXAUIAUCF-CIUDSAMLSA-N 0.000 description 2
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 2
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 2
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 2
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 2
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 2
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 2
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- OFQPMRDJVWLMNJ-CIUDSAMLSA-N Asn-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N OFQPMRDJVWLMNJ-CIUDSAMLSA-N 0.000 description 2
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 2
- YUUIAUXBNOHFRJ-IHRRRGAJSA-N Asn-Phe-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O YUUIAUXBNOHFRJ-IHRRRGAJSA-N 0.000 description 2
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 2
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 2
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 2
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 2
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 2
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 2
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 2
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 2
- YNQMEIJEWSHOEO-SRVKXCTJSA-N Asn-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YNQMEIJEWSHOEO-SRVKXCTJSA-N 0.000 description 2
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 2
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 2
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 2
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 2
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 2
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- TZBJAXGYGSIUHQ-XUXIUFHCSA-N Asp-Leu-Leu-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O TZBJAXGYGSIUHQ-XUXIUFHCSA-N 0.000 description 2
- HSGOFISJLFDMBJ-CIUDSAMLSA-N Asp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N HSGOFISJLFDMBJ-CIUDSAMLSA-N 0.000 description 2
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 2
- FOXXZZGDIAQPQI-XKNYDFJKSA-N Asp-Pro-Ser-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FOXXZZGDIAQPQI-XKNYDFJKSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 2
- OZBXOELNJBSJOA-UBHSHLNASA-N Asp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OZBXOELNJBSJOA-UBHSHLNASA-N 0.000 description 2
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 2
- YODBPLSWNJMZOJ-BPUTZDHNSA-N Asp-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N YODBPLSWNJMZOJ-BPUTZDHNSA-N 0.000 description 2
- MRYDJCIIVRXVGG-QEJZJMRPSA-N Asp-Trp-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O MRYDJCIIVRXVGG-QEJZJMRPSA-N 0.000 description 2
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 241000228212 Aspergillus Species 0.000 description 2
- 241001513093 Aspergillus awamori Species 0.000 description 2
- 241000228245 Aspergillus niger Species 0.000 description 2
- 240000006439 Aspergillus oryzae Species 0.000 description 2
- 235000007319 Avena orientalis Nutrition 0.000 description 2
- 244000075850 Avena orientalis Species 0.000 description 2
- 239000005884 Beta-Cyfluthrin Substances 0.000 description 2
- 108010018763 Biotin carboxylase Proteins 0.000 description 2
- 240000002791 Brassica napus Species 0.000 description 2
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 description 2
- 240000003259 Brassica oleracea var. botrytis Species 0.000 description 2
- 241000555281 Brevibacillus Species 0.000 description 2
- 241000193417 Brevibacillus laterosporus Species 0.000 description 2
- 239000005489 Bromoxynil Substances 0.000 description 2
- JFLRKDZMHNBDQS-UCQUSYKYSA-N CC[C@H]1CCC[C@@H]([C@H](C(=O)C2=C[C@H]3[C@@H]4C[C@@H](C[C@H]4C(=C[C@H]3[C@@H]2CC(=O)O1)C)O[C@H]5[C@@H]([C@@H]([C@H]([C@@H](O5)C)OC)OC)OC)C)O[C@H]6CC[C@@H]([C@H](O6)C)N(C)C.CC[C@H]1CCC[C@@H]([C@H](C(=O)C2=C[C@H]3[C@@H]4C[C@@H](C[C@H]4C=C[C@H]3C2CC(=O)O1)O[C@H]5[C@@H]([C@@H]([C@H]([C@@H](O5)C)OC)OC)OC)C)O[C@H]6CC[C@@H]([C@H](O6)C)N(C)C Chemical compound CC[C@H]1CCC[C@@H]([C@H](C(=O)C2=C[C@H]3[C@@H]4C[C@@H](C[C@H]4C(=C[C@H]3[C@@H]2CC(=O)O1)C)O[C@H]5[C@@H]([C@@H]([C@H]([C@@H](O5)C)OC)OC)OC)C)O[C@H]6CC[C@@H]([C@H](O6)C)N(C)C.CC[C@H]1CCC[C@@H]([C@H](C(=O)C2=C[C@H]3[C@@H]4C[C@@H](C[C@H]4C=C[C@H]3C2CC(=O)O1)O[C@H]5[C@@H]([C@@H]([C@H]([C@@H](O5)C)OC)OC)OC)C)O[C@H]6CC[C@@H]([C@H](O6)C)N(C)C JFLRKDZMHNBDQS-UCQUSYKYSA-N 0.000 description 2
- 235000002566 Capsicum Nutrition 0.000 description 2
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 2
- 241000426499 Chilo Species 0.000 description 2
- 241000193403 Clostridium Species 0.000 description 2
- 239000005888 Clothianidin Substances 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 239000005946 Cypermethrin Substances 0.000 description 2
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 2
- OIMUAKUQOUEPCZ-WHFBIAKZSA-N Cys-Asn-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIMUAKUQOUEPCZ-WHFBIAKZSA-N 0.000 description 2
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 2
- XRTISHJEPHMBJG-SRVKXCTJSA-N Cys-Asp-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XRTISHJEPHMBJG-SRVKXCTJSA-N 0.000 description 2
- BPHKULHWEIUDOB-FXQIFTODSA-N Cys-Gln-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BPHKULHWEIUDOB-FXQIFTODSA-N 0.000 description 2
- UDPSLLFHOLGXBY-FXQIFTODSA-N Cys-Glu-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDPSLLFHOLGXBY-FXQIFTODSA-N 0.000 description 2
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 2
- JAHCWGSVNZXHRR-SVSWQMSJSA-N Cys-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CS)N JAHCWGSVNZXHRR-SVSWQMSJSA-N 0.000 description 2
- NDUPDOJHUQKPAG-UHFFFAOYSA-N Dalapon Chemical compound CC(Cl)(Cl)C(O)=O NDUPDOJHUQKPAG-UHFFFAOYSA-N 0.000 description 2
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 2
- 241000588698 Erwinia Species 0.000 description 2
- 241000588722 Escherichia Species 0.000 description 2
- 108090000371 Esterases Proteins 0.000 description 2
- FNELVJVBIYMIMC-UHFFFAOYSA-N Ethiprole Chemical compound N1=C(C#N)C(S(=O)CC)=C(N)N1C1=C(Cl)C=C(C(F)(F)F)C=C1Cl FNELVJVBIYMIMC-UHFFFAOYSA-N 0.000 description 2
- 239000005657 Fenpyroximate Substances 0.000 description 2
- 239000005899 Fipronil Substances 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 2
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 2
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 2
- DXMPMSWUZVNBSG-QEJZJMRPSA-N Gln-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N DXMPMSWUZVNBSG-QEJZJMRPSA-N 0.000 description 2
- OIIIRRTWYLCQNW-ACZMJKKPSA-N Gln-Cys-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O OIIIRRTWYLCQNW-ACZMJKKPSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 2
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 2
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 2
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 2
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 2
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 2
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 2
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 2
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 2
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 2
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 2
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 2
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 2
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 2
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 2
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 2
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 2
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 2
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 2
- FLQAKQOBSPFGKG-CIUDSAMLSA-N Glu-Cys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLQAKQOBSPFGKG-CIUDSAMLSA-N 0.000 description 2
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 2
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 2
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 2
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 2
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 2
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 2
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 2
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 2
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- QVXWAFZDWRLXTI-NWLDYVSISA-N Glu-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QVXWAFZDWRLXTI-NWLDYVSISA-N 0.000 description 2
- ZNOHKCPYDAYYDA-BPUTZDHNSA-N Glu-Trp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNOHKCPYDAYYDA-BPUTZDHNSA-N 0.000 description 2
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- 108050008938 Glucoamylases Proteins 0.000 description 2
- 108010060309 Glucuronidase Proteins 0.000 description 2
- 102000053187 Glucuronidase Human genes 0.000 description 2
- NEDQVOQDDBCRGG-UHFFFAOYSA-N Gly Gly Thr Tyr Chemical compound NCC(=O)NCC(=O)NC(C(O)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 NEDQVOQDDBCRGG-UHFFFAOYSA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 2
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 2
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 2
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 2
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 2
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 2
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 2
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 2
- 241000219146 Gossypium Species 0.000 description 2
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 2
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 2
- MWXBCJKQRQFVOO-DCAQKATOSA-N His-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N MWXBCJKQRQFVOO-DCAQKATOSA-N 0.000 description 2
- IDQNVIWPPWAFSY-AVGNSLFASA-N His-His-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O IDQNVIWPPWAFSY-AVGNSLFASA-N 0.000 description 2
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 2
- CTEMYIWDSVICKS-WDSOQIARSA-N His-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N CTEMYIWDSVICKS-WDSOQIARSA-N 0.000 description 2
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 2
- VDHOMPFVSABJKU-ULQDDVLXSA-N His-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N VDHOMPFVSABJKU-ULQDDVLXSA-N 0.000 description 2
- DQZCEKQPSOBNMJ-NKIYYHGXSA-N His-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DQZCEKQPSOBNMJ-NKIYYHGXSA-N 0.000 description 2
- FOCSWPCHUDVNLP-PMVMPFDFSA-N His-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC4=CN=CN4)N FOCSWPCHUDVNLP-PMVMPFDFSA-N 0.000 description 2
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 2
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 2
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 2
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 2
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 2
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 2
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 2
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 2
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 2
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 2
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 2
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 2
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 2
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 2
- 239000005906 Imidacloprid Substances 0.000 description 2
- 239000005907 Indoxacarb Substances 0.000 description 2
- 101100288095 Klebsiella pneumoniae neo gene Proteins 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- 241000186660 Lactobacillus Species 0.000 description 2
- 108090001090 Lectins Proteins 0.000 description 2
- 102000004856 Lectins Human genes 0.000 description 2
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- BAJIJEGGUYXZGC-CIUDSAMLSA-N Leu-Asn-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BAJIJEGGUYXZGC-CIUDSAMLSA-N 0.000 description 2
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 2
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 2
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 2
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- 241000192132 Leuconostoc Species 0.000 description 2
- 108090001060 Lipase Proteins 0.000 description 2
- 239000004367 Lipase Substances 0.000 description 2
- 102000004882 Lipase Human genes 0.000 description 2
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 2
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 2
- YXTKSLRSRXKXNV-IHRRRGAJSA-N Lys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N YXTKSLRSRXKXNV-IHRRRGAJSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 2
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 2
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 2
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 2
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 2
- 241000193386 Lysinibacillus sphaericus Species 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- XUMBMVFBXHLACL-UHFFFAOYSA-N Melanin Chemical compound O=C1C(=O)C(C2=CNC3=C(C(C(=O)C4=C32)=O)C)=C2C4=CNC2=C1C XUMBMVFBXHLACL-UHFFFAOYSA-N 0.000 description 2
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 2
- IIPHCNKHEZYSNE-DCAQKATOSA-N Met-Arg-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O IIPHCNKHEZYSNE-DCAQKATOSA-N 0.000 description 2
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 2
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 2
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 2
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 2
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 2
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- 239000005916 Methomyl Substances 0.000 description 2
- 241000235395 Mucor Species 0.000 description 2
- 244000111261 Mucuna pruriens Species 0.000 description 2
- 235000008540 Mucuna pruriens var utilis Nutrition 0.000 description 2
- 108010021466 Mutant Proteins Proteins 0.000 description 2
- 102000008300 Mutant Proteins Human genes 0.000 description 2
- 241001477931 Mythimna unipuncta Species 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- SEQKRHFRPICQDD-UHFFFAOYSA-N N-tris(hydroxymethyl)methylglycine Chemical compound OCC(CO)(CO)[NH2+]CC([O-])=O SEQKRHFRPICQDD-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108010033272 Nitrilase Proteins 0.000 description 2
- 108091005461 Nucleic proteins Chemical group 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 239000005950 Oxamyl Substances 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 206010034133 Pathogen resistance Diseases 0.000 description 2
- 239000006002 Pepper Substances 0.000 description 2
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 2
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 2
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 2
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 2
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 2
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 2
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 2
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 2
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 2
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 2
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 2
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 2
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 2
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 2
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 2
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 2
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 2
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 2
- 235000016761 Piper aduncum Nutrition 0.000 description 2
- 240000003889 Piper guineense Species 0.000 description 2
- 235000017804 Piper guineense Nutrition 0.000 description 2
- 235000008184 Piper nigrum Nutrition 0.000 description 2
- 235000010582 Pisum sativum Nutrition 0.000 description 2
- 240000004713 Pisum sativum Species 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 2
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 2
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 2
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 2
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 2
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 2
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 2
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 2
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 2
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 239000005925 Pymetrozine Substances 0.000 description 2
- 239000005927 Pyriproxyfen Substances 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 241000589180 Rhizobium Species 0.000 description 2
- 241000235403 Rhizomucor miehei Species 0.000 description 2
- 241000235527 Rhizopus Species 0.000 description 2
- 241000190932 Rhodopseudomonas Species 0.000 description 2
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 2
- 241001620634 Roger Species 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- 206010070834 Sensitisation Diseases 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 2
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 2
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- CTRHXXXHUJTTRZ-ZLUOBGJFSA-N Ser-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O CTRHXXXHUJTTRZ-ZLUOBGJFSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 2
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 2
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 2
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 2
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 2
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 2
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 2
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 2
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 2
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 2
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 2
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 2
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 2
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 2
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 2
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 2
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 2
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 2
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 241000607720 Serratia Species 0.000 description 2
- 241000931987 Sesamia Species 0.000 description 2
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 2
- 239000005930 Spinosad Substances 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- 108700005078 Synthetic Genes Proteins 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 2
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 2
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 2
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 2
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 2
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 2
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 2
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 2
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 2
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 2
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 2
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 2
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 2
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 2
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 2
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 2
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 2
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 2
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 2
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 108010022394 Threonine synthase Proteins 0.000 description 2
- 241000223259 Trichoderma Species 0.000 description 2
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 2
- VTHNLRXALGUDBS-BPUTZDHNSA-N Trp-Gln-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VTHNLRXALGUDBS-BPUTZDHNSA-N 0.000 description 2
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 2
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 2
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 2
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 2
- OJKVFAWXPGCJMF-BPUTZDHNSA-N Trp-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CO)C(=O)O OJKVFAWXPGCJMF-BPUTZDHNSA-N 0.000 description 2
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 2
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 2
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 2
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 2
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 2
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 2
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 2
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 2
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 2
- FMOSEWZYZPMJAL-KKUMJFAQSA-N Tyr-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N FMOSEWZYZPMJAL-KKUMJFAQSA-N 0.000 description 2
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 2
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 2
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 2
- YYZPVPJCOGGQPC-JYJNAYRXSA-N Tyr-His-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYZPVPJCOGGQPC-JYJNAYRXSA-N 0.000 description 2
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 2
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 2
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 2
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 2
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 2
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- VSYROIRKNBCULO-BWAGICSOSA-N Tyr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O VSYROIRKNBCULO-BWAGICSOSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- QRCBQDPRKMYTMB-IHPCNDPISA-N Tyr-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N QRCBQDPRKMYTMB-IHPCNDPISA-N 0.000 description 2
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 2
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 2
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 2
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 2
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 2
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 2
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 2
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 2
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 2
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 2
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 2
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 2
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 2
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 2
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 2
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 2
- OEVFFOBAXHBXKM-HSHDSVGOSA-N Val-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N)O OEVFFOBAXHBXKM-HSHDSVGOSA-N 0.000 description 2
- VBTFUDNTMCHPII-UHFFFAOYSA-N Val-Trp-Tyr Natural products C=1NC2=CC=CC=C2C=1CC(NC(=O)C(N)C(C)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 VBTFUDNTMCHPII-UHFFFAOYSA-N 0.000 description 2
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 2
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 2
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 2
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 241000589634 Xanthomonas Species 0.000 description 2
- QQODLKZGRKWIFG-RUTXASTPSA-N [(R)-cyano-(4-fluoro-3-phenoxyphenyl)methyl] (1S)-3-(2,2-dichloroethenyl)-2,2-dimethylcyclopropane-1-carboxylate Chemical compound CC1(C)C(C=C(Cl)Cl)[C@@H]1C(=O)O[C@@H](C#N)C1=CC=C(F)C(OC=2C=CC=CC=2)=C1 QQODLKZGRKWIFG-RUTXASTPSA-N 0.000 description 2
- 229950008167 abamectin Drugs 0.000 description 2
- 108010093941 acetylxylan esterase Proteins 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 208000026935 allergic disease Diseases 0.000 description 2
- 230000007815 allergy Effects 0.000 description 2
- 108010030291 alpha-Galactosidase Proteins 0.000 description 2
- 108010061314 alpha-L-Fucosidase Proteins 0.000 description 2
- 108010044879 alpha-L-rhamnosidase Proteins 0.000 description 2
- 108010012864 alpha-Mannosidase Proteins 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 230000000692 anti-sense effect Effects 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 108010019077 beta-Amylase Proteins 0.000 description 2
- 108010047754 beta-Glucosidase Proteins 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 235000013339 cereals Nutrition 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- SBPBAQFWLVIOKP-UHFFFAOYSA-N chlorpyrifos Chemical compound CCOP(=S)(OCC)OC1=NC(Cl)=C(Cl)C=C1Cl SBPBAQFWLVIOKP-UHFFFAOYSA-N 0.000 description 2
- 239000011248 coating agent Substances 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 239000003184 complementary RNA Substances 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 108010005400 cutinase Proteins 0.000 description 2
- JHIVVAPYMSGYDF-UHFFFAOYSA-N cyclohexanone Chemical compound O=C1CCCCC1 JHIVVAPYMSGYDF-UHFFFAOYSA-N 0.000 description 2
- 229960001591 cyfluthrin Drugs 0.000 description 2
- QQODLKZGRKWIFG-QSFXBCCZSA-N cyfluthrin Chemical compound CC1(C)[C@@H](C=C(Cl)Cl)[C@H]1C(=O)O[C@@H](C#N)C1=CC=C(F)C(OC=2C=CC=CC=2)=C1 QQODLKZGRKWIFG-QSFXBCCZSA-N 0.000 description 2
- ZXQYGBMAQZUVMI-UNOMPAQXSA-N cyhalothrin Chemical compound CC1(C)C(\C=C(/Cl)C(F)(F)F)C1C(=O)OC(C#N)C1=CC=CC(OC=2C=CC=CC=2)=C1 ZXQYGBMAQZUVMI-UNOMPAQXSA-N 0.000 description 2
- 229960005424 cypermethrin Drugs 0.000 description 2
- KAATUXNTWXVJKI-UHFFFAOYSA-N cypermethrin Chemical compound CC1(C)C(C=C(Cl)Cl)C1C(=O)OC(C#N)C1=CC=CC(OC=2C=CC=CC=2)=C1 KAATUXNTWXVJKI-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 102000004419 dihydrofolate reductase Human genes 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 210000005069 ears Anatomy 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- RDYMFSUJUZBWLH-SVWSLYAFSA-N endosulfan Chemical compound C([C@@H]12)OS(=O)OC[C@@H]1[C@]1(Cl)C(Cl)=C(Cl)[C@@]2(Cl)C1(Cl)Cl RDYMFSUJUZBWLH-SVWSLYAFSA-N 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 229960003276 erythromycin Drugs 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- XQUXKZZNEFRCAW-UHFFFAOYSA-N fenpropathrin Chemical compound CC1(C)C(C)(C)C1C(=O)OC(C#N)C1=CC=CC(OC=2C=CC=CC=2)=C1 XQUXKZZNEFRCAW-UHFFFAOYSA-N 0.000 description 2
- YYJNOYZRYGDPNH-MFKUBSTISA-N fenpyroximate Chemical compound C=1C=C(C(=O)OC(C)(C)C)C=CC=1CO/N=C/C=1C(C)=NN(C)C=1OC1=CC=CC=C1 YYJNOYZRYGDPNH-MFKUBSTISA-N 0.000 description 2
- 229940013764 fipronil Drugs 0.000 description 2
- 239000013568 food allergen Substances 0.000 description 2
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 2
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 239000008187 granular material Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 108010002430 hemicellulase Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 229940056881 imidacloprid Drugs 0.000 description 2
- YWTYJOPNNQFBPC-UHFFFAOYSA-N imidacloprid Chemical compound [O-][N+](=O)\N=C1/NCCN1CC1=CC=C(Cl)N=C1 YWTYJOPNNQFBPC-UHFFFAOYSA-N 0.000 description 2
- VBCVPMMZEGZULK-NRFANRHFSA-N indoxacarb Chemical compound C([C@@]1(OC2)C(=O)OC)C3=CC(Cl)=CC=C3C1=NN2C(=O)N(C(=O)OC)C1=CC=C(OC(F)(F)F)C=C1 VBCVPMMZEGZULK-NRFANRHFSA-N 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 238000009655 industrial fermentation Methods 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 238000001764 infiltration Methods 0.000 description 2
- 230000008595 infiltration Effects 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 229940039696 lactobacillus Drugs 0.000 description 2
- 230000009571 larval growth Effects 0.000 description 2
- 239000002523 lectin Substances 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 238000007834 ligase chain reaction Methods 0.000 description 2
- 235000019421 lipase Nutrition 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 210000001161 mammalian embryo Anatomy 0.000 description 2
- 238000007726 management method Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- SXTAYKAGBXMACB-UHFFFAOYSA-N methionine sulfoximine Chemical compound CS(=N)(=O)CCC(N)C(O)=O SXTAYKAGBXMACB-UHFFFAOYSA-N 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 108010034507 methionyltryptophan Proteins 0.000 description 2
- UHXUZOCRWCRNSJ-QPJJXVBHSA-N methomyl Chemical compound CNC(=O)O\N=C(/C)SC UHXUZOCRWCRNSJ-QPJJXVBHSA-N 0.000 description 2
- 229960000485 methotrexate Drugs 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- YTYGAJLZOJPJGH-UHFFFAOYSA-N noviflumuron Chemical compound FC1=C(Cl)C(OC(F)(F)C(C(F)(F)F)F)=C(Cl)C=C1NC(=O)NC(=O)C1=C(F)C=CC=C1F YTYGAJLZOJPJGH-UHFFFAOYSA-N 0.000 description 2
- 238000007899 nucleic acid hybridization Methods 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- KZAUOCCYDRDERY-UHFFFAOYSA-N oxamyl Chemical compound CNC(=O)ON=C(SC)C(=O)N(C)C KZAUOCCYDRDERY-UHFFFAOYSA-N 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 230000029553 photosynthesis Effects 0.000 description 2
- 238000010672 photosynthesis Methods 0.000 description 2
- 230000008654 plant damage Effects 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 238000002331 protein detection Methods 0.000 description 2
- QHMTXANCGGJZRX-WUXMJOGZSA-N pymetrozine Chemical compound C1C(C)=NNC(=O)N1\N=C\C1=CC=CN=C1 QHMTXANCGGJZRX-WUXMJOGZSA-N 0.000 description 2
- NHDHVHZZCFYRSB-UHFFFAOYSA-N pyriproxyfen Chemical compound C=1C=CC=NC=1OC(C)COC(C=C1)=CC=C1OC1=CC=CC=C1 NHDHVHZZCFYRSB-UHFFFAOYSA-N 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 230000008439 repair process Effects 0.000 description 2
- 230000033458 reproduction Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 230000008313 sensitization Effects 0.000 description 2
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 239000013605 shuttle vector Substances 0.000 description 2
- SQGYOTSLMSWVJD-UHFFFAOYSA-N silver(1+) nitrate Chemical compound [Ag+].[O-]N(=O)=O SQGYOTSLMSWVJD-UHFFFAOYSA-N 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 229910001415 sodium ion Inorganic materials 0.000 description 2
- 229960000268 spectinomycin Drugs 0.000 description 2
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 2
- 229940014213 spinosad Drugs 0.000 description 2
- GOLXNESZZPUPJE-UHFFFAOYSA-N spiromesifen Chemical compound CC1=CC(C)=CC(C)=C1C(C(O1)=O)=C(OC(=O)CC(C)(C)C)C11CCCC1 GOLXNESZZPUPJE-UHFFFAOYSA-N 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- XLNZEKHULJKQBA-UHFFFAOYSA-N terbufos Chemical compound CCOP(=S)(OCC)SCSC(C)(C)C XLNZEKHULJKQBA-UHFFFAOYSA-N 0.000 description 2
- BAKXBZPQTXCKRR-UHFFFAOYSA-N thiodicarb Chemical compound CSC(C)=NOC(=O)NSNC(=O)ON=C(C)SC BAKXBZPQTXCKRR-UHFFFAOYSA-N 0.000 description 2
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 108010003885 valyl-prolyl-glycyl-glycine Proteins 0.000 description 2
- 235000013311 vegetables Nutrition 0.000 description 2
- 238000011179 visual inspection Methods 0.000 description 2
- ZXQYGBMAQZUVMI-RDDWSQKMSA-N (1S)-cis-(alphaR)-cyhalothrin Chemical compound CC1(C)[C@H](\C=C(/Cl)C(F)(F)F)[C@@H]1C(=O)O[C@@H](C#N)C1=CC=CC(OC=2C=CC=CC=2)=C1 ZXQYGBMAQZUVMI-RDDWSQKMSA-N 0.000 description 1
- AAQFSZFQCXLMNT-ACMTZBLWSA-N (3s)-3-amino-4-[[(2s)-1-methoxy-1-oxo-3-phenylpropan-2-yl]amino]-4-oxobutanoic acid;hydrochloride Chemical compound Cl.OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)OC)CC1=CC=CC=C1 AAQFSZFQCXLMNT-ACMTZBLWSA-N 0.000 description 1
- FCHBECOAGZMTFE-ZEQKJWHPSA-N (6r,7r)-3-[[2-[[4-(dimethylamino)phenyl]diazenyl]pyridin-1-ium-1-yl]methyl]-8-oxo-7-[(2-thiophen-2-ylacetyl)amino]-5-thia-1-azabicyclo[4.2.0]oct-2-ene-2-carboxylate Chemical compound C1=CC(N(C)C)=CC=C1N=NC1=CC=CC=[N+]1CC1=C(C([O-])=O)N2C(=O)[C@@H](NC(=O)CC=3SC=CC=3)[C@H]2SC1 FCHBECOAGZMTFE-ZEQKJWHPSA-N 0.000 description 1
- WCXDHFDTOYPNIE-RIYZIHGNSA-N (E)-acetamiprid Chemical compound N#C/N=C(\C)N(C)CC1=CC=C(Cl)N=C1 WCXDHFDTOYPNIE-RIYZIHGNSA-N 0.000 description 1
- XGWIJUOSCAQSSV-XHDPSFHLSA-N (S,S)-hexythiazox Chemical compound S([C@H]([C@@H]1C)C=2C=CC(Cl)=CC=2)C(=O)N1C(=O)NC1CCCCC1 XGWIJUOSCAQSSV-XHDPSFHLSA-N 0.000 description 1
- ZFHGXWPMULPQSE-SZGBIDFHSA-N (Z)-(1S)-cis-tefluthrin Chemical compound FC1=C(F)C(C)=C(F)C(F)=C1COC(=O)[C@@H]1C(C)(C)[C@@H]1\C=C(/Cl)C(F)(F)F ZFHGXWPMULPQSE-SZGBIDFHSA-N 0.000 description 1
- JIHQDMXYYFUGFV-UHFFFAOYSA-N 1,3,5-triazine Chemical compound C1=NC=NC=N1 JIHQDMXYYFUGFV-UHFFFAOYSA-N 0.000 description 1
- NFGXHKASABOEEW-UHFFFAOYSA-N 1-methylethyl 11-methoxy-3,7,11-trimethyl-2,4-dodecadienoate Chemical compound COC(C)(C)CCCC(C)CC=CC(C)=CC(=O)OC(C)C NFGXHKASABOEEW-UHFFFAOYSA-N 0.000 description 1
- 101150072531 10 gene Proteins 0.000 description 1
- 108020004465 16S ribosomal RNA Proteins 0.000 description 1
- NDUPDOJHUQKPAG-UHFFFAOYSA-M 2,2-Dichloropropanoate Chemical compound CC(Cl)(Cl)C([O-])=O NDUPDOJHUQKPAG-UHFFFAOYSA-M 0.000 description 1
- GOCUAJYOYBLQRH-UHFFFAOYSA-N 2-(4-{[3-chloro-5-(trifluoromethyl)pyridin-2-yl]oxy}phenoxy)propanoic acid Chemical compound C1=CC(OC(C)C(O)=O)=CC=C1OC1=NC=C(C(F)(F)F)C=C1Cl GOCUAJYOYBLQRH-UHFFFAOYSA-N 0.000 description 1
- SXERGJJQSKIUIC-UHFFFAOYSA-N 2-Phenoxypropionic acid Chemical compound OC(=O)C(C)OC1=CC=CC=C1 SXERGJJQSKIUIC-UHFFFAOYSA-N 0.000 description 1
- BOTNFCTYKJBUMU-UHFFFAOYSA-N 2-[4-(2-methylpropyl)piperazin-4-ium-1-yl]-2-oxoacetate Chemical compound CC(C)C[NH+]1CCN(C(=O)C([O-])=O)CC1 BOTNFCTYKJBUMU-UHFFFAOYSA-N 0.000 description 1
- CFBILACNYSPRPM-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[[1,3-dihydroxy-2-(hydroxymethyl)propan-2-yl]amino]acetic acid Chemical compound OCC(N)(CO)CO.OCC(CO)(CO)NCC(O)=O CFBILACNYSPRPM-UHFFFAOYSA-N 0.000 description 1
- 102100027328 2-hydroxyacyl-CoA lyase 2 Human genes 0.000 description 1
- YZEUHQHUFTYLPH-UHFFFAOYSA-N 2-nitroimidazole Chemical compound [O-][N+](=O)C1=NC=CN1 YZEUHQHUFTYLPH-UHFFFAOYSA-N 0.000 description 1
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- XJFIKRXIJXAJGH-UHFFFAOYSA-N 5-chloro-1,3-dihydroimidazo[4,5-b]pyridin-2-one Chemical group ClC1=CC=C2NC(=O)NC2=N1 XJFIKRXIJXAJGH-UHFFFAOYSA-N 0.000 description 1
- HUNCSWANZMJLPM-UHFFFAOYSA-N 5-methyltryptophan Chemical compound CC1=CC=C2NC=C(CC(N)C(O)=O)C2=C1 HUNCSWANZMJLPM-UHFFFAOYSA-N 0.000 description 1
- 108010011619 6-Phytase Proteins 0.000 description 1
- 241001524388 Abrupta Species 0.000 description 1
- 239000005875 Acetamiprid Substances 0.000 description 1
- 241000589220 Acetobacter Species 0.000 description 1
- 101710103719 Acetolactate synthase large subunit Proteins 0.000 description 1
- 101710182467 Acetolactate synthase large subunit IlvB1 Proteins 0.000 description 1
- 101710171176 Acetolactate synthase large subunit IlvG Proteins 0.000 description 1
- 101710176702 Acetolactate synthase small subunit Proteins 0.000 description 1
- 101710147947 Acetolactate synthase small subunit 1, chloroplastic Proteins 0.000 description 1
- 101710095712 Acetolactate synthase, mitochondrial Proteins 0.000 description 1
- 241000589291 Acinetobacter Species 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 108010000239 Aequorin Proteins 0.000 description 1
- 241000588986 Alcaligenes Species 0.000 description 1
- 101710187578 Alcohol dehydrogenase 1 Proteins 0.000 description 1
- 102100034035 Alcohol dehydrogenase 1A Human genes 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 244000291564 Allium cepa Species 0.000 description 1
- 235000002732 Allium cepa var. cepa Nutrition 0.000 description 1
- 240000002234 Allium sativum Species 0.000 description 1
- 244000144730 Amygdalus persica Species 0.000 description 1
- 244000099147 Ananas comosus Species 0.000 description 1
- 235000007119 Ananas comosus Nutrition 0.000 description 1
- 208000031295 Animal disease Diseases 0.000 description 1
- 108010037870 Anthranilate Synthase Proteins 0.000 description 1
- 240000007087 Apium graveolens Species 0.000 description 1
- 235000015849 Apium graveolens Dulce Group Nutrition 0.000 description 1
- 235000010591 Appio Nutrition 0.000 description 1
- 241000511859 Aproaerema anthyllidella Species 0.000 description 1
- 108091023037 Aptamer Proteins 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 101710152845 Arabinogalactan endo-beta-1,4-galactanase Proteins 0.000 description 1
- 235000017060 Arachis glabrata Nutrition 0.000 description 1
- 244000105624 Arachis hypogaea Species 0.000 description 1
- 235000010777 Arachis hypogaea Nutrition 0.000 description 1
- 235000018262 Arachis monticola Nutrition 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 244000003416 Asparagus officinalis Species 0.000 description 1
- 235000005340 Asparagus officinalis Nutrition 0.000 description 1
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 1
- 235000000832 Ayote Nutrition 0.000 description 1
- 239000005878 Azadirachtin Substances 0.000 description 1
- 101100497223 Bacillus thuringiensis cry1Ag gene Proteins 0.000 description 1
- 108700003860 Bacterial Genes Proteins 0.000 description 1
- 108020004256 Beta-lactamase Proteins 0.000 description 1
- 239000005653 Bifenazate Substances 0.000 description 1
- 239000005874 Bifenthrin Substances 0.000 description 1
- 241000167854 Bourreria succulenta Species 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- 235000006008 Brassica napus var napus Nutrition 0.000 description 1
- 240000007124 Brassica oleracea Species 0.000 description 1
- 235000003899 Brassica oleracea var acephala Nutrition 0.000 description 1
- 235000011301 Brassica oleracea var capitata Nutrition 0.000 description 1
- 235000017647 Brassica oleracea var italica Nutrition 0.000 description 1
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 1
- 235000000540 Brassica rapa subsp rapa Nutrition 0.000 description 1
- 101000981883 Brevibacillus parabrevis ATP-dependent tryptophan/phenylalanine/tyrosine adenylase Proteins 0.000 description 1
- 101000981889 Brevibacillus parabrevis Linear gramicidin-PCP reductase Proteins 0.000 description 1
- 235000004936 Bromus mango Nutrition 0.000 description 1
- 239000005885 Buprofezin Substances 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 235000009467 Carica papaya Nutrition 0.000 description 1
- 240000006432 Carica papaya Species 0.000 description 1
- 241001466804 Carnivora Species 0.000 description 1
- 235000003255 Carthamus tinctorius Nutrition 0.000 description 1
- 244000020518 Carthamus tinctorius Species 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 229930186147 Cephalosporin Natural products 0.000 description 1
- 241001480010 Cestrum Species 0.000 description 1
- 108010004539 Chalcone isomerase Proteins 0.000 description 1
- 241000195597 Chlamydomonas reinhardtii Species 0.000 description 1
- ZAMOUSCENKQFHK-UHFFFAOYSA-N Chlorine atom Chemical compound [Cl] ZAMOUSCENKQFHK-UHFFFAOYSA-N 0.000 description 1
- 239000005945 Chlorpyrifos-methyl Substances 0.000 description 1
- 108010089254 Cholesterol oxidase Proteins 0.000 description 1
- 101000906861 Chondromyces crocatus ATP-dependent tyrosine adenylase Proteins 0.000 description 1
- 108091060290 Chromatid Proteins 0.000 description 1
- 241000123346 Chrysosporium Species 0.000 description 1
- 235000007542 Cichorium intybus Nutrition 0.000 description 1
- 244000298479 Cichorium intybus Species 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000218631 Coniferophyta Species 0.000 description 1
- 241000219112 Cucumis Species 0.000 description 1
- 235000015510 Cucumis melo subsp melo Nutrition 0.000 description 1
- 240000008067 Cucumis sativus Species 0.000 description 1
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 1
- 235000009854 Cucurbita moschata Nutrition 0.000 description 1
- 240000001980 Cucurbita pepo Species 0.000 description 1
- 235000009804 Cucurbita pepo subsp pepo Nutrition 0.000 description 1
- 241000219130 Cucurbita pepo subsp. pepo Species 0.000 description 1
- 235000003954 Cucurbita pepo var melopepo Nutrition 0.000 description 1
- 229920000858 Cyclodextrin Polymers 0.000 description 1
- 108010025880 Cyclomaltodextrin glucanotransferase Proteins 0.000 description 1
- 235000017788 Cydonia oblonga Nutrition 0.000 description 1
- 239000005891 Cyromazine Substances 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 241000289763 Dasygaster padockina Species 0.000 description 1
- 235000002767 Daucus carota Nutrition 0.000 description 1
- 244000000626 Daucus carota Species 0.000 description 1
- 239000005892 Deltamethrin Substances 0.000 description 1
- 241000122106 Diatraea saccharalis Species 0.000 description 1
- 239000005893 Diflubenzuron Substances 0.000 description 1
- 239000005947 Dimethoate Substances 0.000 description 1
- 108010028143 Dioxygenases Proteins 0.000 description 1
- 102000016680 Dioxygenases Human genes 0.000 description 1
- 208000035240 Disease Resistance Diseases 0.000 description 1
- AHMIDUVKSGCHAU-UHFFFAOYSA-N Dopaquinone Natural products OC(=O)C(N)CC1=CC(=O)C(=O)C=C1 AHMIDUVKSGCHAU-UHFFFAOYSA-N 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 108010001817 Endo-1,4-beta Xylanases Proteins 0.000 description 1
- 101710147028 Endo-beta-1,4-galactanase Proteins 0.000 description 1
- 102100023882 Endoribonuclease ZC3H12A Human genes 0.000 description 1
- 101710112715 Endoribonuclease ZC3H12A Proteins 0.000 description 1
- 241000701832 Enterobacteria phage T3 Species 0.000 description 1
- 102100023164 Epididymis-specific alpha-mannosidase Human genes 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 239000005895 Esfenvalerate Substances 0.000 description 1
- 108700039887 Essential Genes Proteins 0.000 description 1
- QTANTQQOYSUMLC-UHFFFAOYSA-O Ethidium cation Chemical compound C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 QTANTQQOYSUMLC-UHFFFAOYSA-O 0.000 description 1
- 239000005958 Fenamiphos (aka phenamiphos) Substances 0.000 description 1
- 239000005656 Fenazaquin Substances 0.000 description 1
- HMIBKHHNXANVHR-UHFFFAOYSA-N Fenothiocarb Chemical compound CN(C)C(=O)SCCCCOC1=CC=CC=C1 HMIBKHHNXANVHR-UHFFFAOYSA-N 0.000 description 1
- 239000005898 Fenoxycarb Substances 0.000 description 1
- PNVJTZOFSHSLTO-UHFFFAOYSA-N Fenthion Chemical compound COP(=S)(OC)OC1=CC=C(SC)C(C)=C1 PNVJTZOFSHSLTO-UHFFFAOYSA-N 0.000 description 1
- 239000005900 Flonicamid Substances 0.000 description 1
- 235000016623 Fragaria vesca Nutrition 0.000 description 1
- 240000009088 Fragaria x ananassa Species 0.000 description 1
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- 239000004366 Glucose oxidase Substances 0.000 description 1
- 108010015776 Glucose oxidase Proteins 0.000 description 1
- 108010056771 Glucosidases Proteins 0.000 description 1
- 102000004366 Glucosidases Human genes 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 101710115777 Glycine-rich cell wall structural protein 2 Proteins 0.000 description 1
- 101710168683 Glycine-rich protein 1 Proteins 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 102000002812 Heat-Shock Proteins Human genes 0.000 description 1
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 1
- 241000256257 Heliothis Species 0.000 description 1
- 241000256244 Heliothis virescens Species 0.000 description 1
- 241000258937 Hemiptera Species 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- 239000005661 Hexythiazox Substances 0.000 description 1
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 1
- LPBWRHRHEIYAIP-KKUMJFAQSA-N His-Tyr-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LPBWRHRHEIYAIP-KKUMJFAQSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- 241000257303 Hymenoptera Species 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- 206010061217 Infestation Diseases 0.000 description 1
- IMQLKJBTEOYOSI-GPIVLXJGSA-N Inositol-hexakisphosphate Chemical compound OP(O)(=O)O[C@H]1[C@H](OP(O)(O)=O)[C@@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@@H]1OP(O)(O)=O IMQLKJBTEOYOSI-GPIVLXJGSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 244000017020 Ipomoea batatas Species 0.000 description 1
- 235000002678 Ipomoea batatas Nutrition 0.000 description 1
- 241001495069 Ischnocera Species 0.000 description 1
- 241000588744 Klebsiella pneumoniae subsp. ozaenae Species 0.000 description 1
- WTDRDQBEARUVNC-UHFFFAOYSA-N L-Dopa Natural products OC(=O)C(N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- AHMIDUVKSGCHAU-LURJTMIESA-N L-dopaquinone Chemical compound [O-]C(=O)[C@@H]([NH3+])CC1=CC(=O)C(=O)C=C1 AHMIDUVKSGCHAU-LURJTMIESA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 108010029541 Laccase Proteins 0.000 description 1
- 235000003228 Lactuca sativa Nutrition 0.000 description 1
- 240000008415 Lactuca sativa Species 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 239000005912 Lufenuron Substances 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 102100033448 Lysosomal alpha-glucosidase Human genes 0.000 description 1
- 241000710118 Maize chlorotic mottle virus Species 0.000 description 1
- 239000005949 Malathion Substances 0.000 description 1
- 235000011430 Malus pumila Nutrition 0.000 description 1
- 235000015103 Malus silvestris Nutrition 0.000 description 1
- 235000014826 Mangifera indica Nutrition 0.000 description 1
- 240000007228 Mangifera indica Species 0.000 description 1
- 240000004658 Medicago sativa Species 0.000 description 1
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 1
- 239000005917 Methoxyfenozide Substances 0.000 description 1
- 241000863391 Methylophilus Species 0.000 description 1
- 101150054907 Mrps12 gene Proteins 0.000 description 1
- 240000005561 Musa balbisiana Species 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- 241001460678 Napo <wasp> Species 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 238000002944 PCR assay Methods 0.000 description 1
- 101150014068 PPIP5K1 gene Proteins 0.000 description 1
- 102100026367 Pancreatic alpha-amylase Human genes 0.000 description 1
- 241001668579 Pasteuria Species 0.000 description 1
- 101710091688 Patatin Proteins 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- 244000025272 Persea americana Species 0.000 description 1
- 235000008673 Persea americana Nutrition 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- 239000005921 Phosmet Substances 0.000 description 1
- 108091000041 Phosphoenolpyruvate Carboxylase Proteins 0.000 description 1
- 241001516577 Phylloxera Species 0.000 description 1
- IMQLKJBTEOYOSI-UHFFFAOYSA-N Phytic acid Natural products OP(O)(=O)OC1C(OP(O)(O)=O)C(OP(O)(O)=O)C(OP(O)(O)=O)C(OP(O)(O)=O)C1OP(O)(O)=O IMQLKJBTEOYOSI-UHFFFAOYSA-N 0.000 description 1
- 239000005923 Pirimicarb Substances 0.000 description 1
- 108010064851 Plant Proteins Proteins 0.000 description 1
- 241001257016 Platyphylla Species 0.000 description 1
- 108010059820 Polygalacturonase Proteins 0.000 description 1
- 101710196435 Probable acetolactate synthase large subunit Proteins 0.000 description 1
- 101710181764 Probable acetolactate synthase small subunit Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108020001991 Protoporphyrinogen Oxidase Proteins 0.000 description 1
- 102000005135 Protoporphyrinogen oxidase Human genes 0.000 description 1
- 235000009827 Prunus armeniaca Nutrition 0.000 description 1
- 244000018633 Prunus armeniaca Species 0.000 description 1
- 235000006029 Prunus persica var nucipersica Nutrition 0.000 description 1
- 235000006040 Prunus persica var persica Nutrition 0.000 description 1
- 244000017714 Prunus persica var. nucipersica Species 0.000 description 1
- 101710104000 Putative acetolactate synthase small subunit Proteins 0.000 description 1
- 239000005663 Pyridaben Substances 0.000 description 1
- 239000005926 Pyridalyl Substances 0.000 description 1
- 235000014443 Pyrus communis Nutrition 0.000 description 1
- 108010066717 Q beta Replicase Proteins 0.000 description 1
- 244000088415 Raphanus sativus Species 0.000 description 1
- 235000006140 Raphanus sativus var sativus Nutrition 0.000 description 1
- 101100120298 Rattus norvegicus Flot1 gene Proteins 0.000 description 1
- 101100412401 Rattus norvegicus Reg3a gene Proteins 0.000 description 1
- 101100412403 Rattus norvegicus Reg3b gene Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 241001092459 Rubus Species 0.000 description 1
- 235000017848 Rubus fruticosus Nutrition 0.000 description 1
- 240000007651 Rubus glaucus Species 0.000 description 1
- 235000011034 Rubus glaucus Nutrition 0.000 description 1
- 235000009122 Rubus idaeus Nutrition 0.000 description 1
- 101100199945 Schizosaccharomyces pombe (strain 972 / ATCC 24843) rps1201 gene Proteins 0.000 description 1
- 241000209056 Secale Species 0.000 description 1
- 235000007238 Secale cereale Nutrition 0.000 description 1
- CSPPKDPQLUUTND-NBVRZTHBSA-N Sethoxydim Chemical compound CCO\N=C(/CCC)C1=C(O)CC(CC(C)SCC)CC1=O CSPPKDPQLUUTND-NBVRZTHBSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 102000039471 Small Nuclear RNA Human genes 0.000 description 1
- 108020004688 Small Nuclear RNA Proteins 0.000 description 1
- 235000002597 Solanum melongena Nutrition 0.000 description 1
- 244000061458 Solanum melongena Species 0.000 description 1
- 241000592344 Spermatophyta Species 0.000 description 1
- 235000009337 Spinacia oleracea Nutrition 0.000 description 1
- 244000300264 Spinacia oleracea Species 0.000 description 1
- 239000005665 Spiromesifen Substances 0.000 description 1
- 235000009184 Spondias indica Nutrition 0.000 description 1
- 240000006694 Stellaria media Species 0.000 description 1
- 108010043934 Sucrose synthase Proteins 0.000 description 1
- 101710156615 Sucrose synthase 1 Proteins 0.000 description 1
- UZMAPBJVXOGOFT-UHFFFAOYSA-N Syringetin Natural products COC1=C(O)C(OC)=CC(C2=C(C(=O)C3=C(O)C=C(O)C=C3O2)O)=C1 UZMAPBJVXOGOFT-UHFFFAOYSA-N 0.000 description 1
- 239000005937 Tebufenozide Substances 0.000 description 1
- 239000005658 Tebufenpyrad Substances 0.000 description 1
- 239000005939 Tefluthrin Substances 0.000 description 1
- 239000005941 Thiamethoxam Substances 0.000 description 1
- FOCVUCIESVLUNU-UHFFFAOYSA-N Thiotepa Chemical compound C1CN1P(N1CC1)(=S)N1CC1 FOCVUCIESVLUNU-UHFFFAOYSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- 241000656145 Thyrsites atun Species 0.000 description 1
- 239000007997 Tricine buffer Substances 0.000 description 1
- 239000005857 Trifloxystrobin Substances 0.000 description 1
- 239000005942 Triflumuron Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- 102000003425 Tyrosinase Human genes 0.000 description 1
- 108060008724 Tyrosinase Proteins 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- 235000009754 Vitis X bourquina Nutrition 0.000 description 1
- 235000012333 Vitis X labruscana Nutrition 0.000 description 1
- 240000006365 Vitis vinifera Species 0.000 description 1
- 235000014787 Vitis vinifera Nutrition 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 229920002494 Zein Polymers 0.000 description 1
- INISTDXBRIBGOC-CGAIIQECSA-N [cyano-(3-phenoxyphenyl)methyl] (2s)-2-[2-chloro-4-(trifluoromethyl)anilino]-3-methylbutanoate Chemical compound N([C@@H](C(C)C)C(=O)OC(C#N)C=1C=C(OC=2C=CC=CC=2)C=CC=1)C1=CC=C(C(F)(F)F)C=C1Cl INISTDXBRIBGOC-CGAIIQECSA-N 0.000 description 1
- 101150067314 aadA gene Proteins 0.000 description 1
- 230000036579 abiotic stress Effects 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000000895 acaricidal effect Effects 0.000 description 1
- 239000000642 acaricide Substances 0.000 description 1
- YASYVMFAVPKPKE-UHFFFAOYSA-N acephate Chemical compound COP(=O)(SC)NC(C)=O YASYVMFAVPKPKE-UHFFFAOYSA-N 0.000 description 1
- 231100000460 acute oral toxicity Toxicity 0.000 description 1
- 239000000853 adhesive Substances 0.000 description 1
- 230000001070 adhesive effect Effects 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000009418 agronomic effect Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 239000013566 allergen Substances 0.000 description 1
- 239000000956 alloy Substances 0.000 description 1
- 229910045601 alloy Inorganic materials 0.000 description 1
- 102000005840 alpha-Galactosidase Human genes 0.000 description 1
- 102000016679 alpha-Glucosidases Human genes 0.000 description 1
- 108010028144 alpha-Glucosidases Proteins 0.000 description 1
- 102000012086 alpha-L-Fucosidase Human genes 0.000 description 1
- 102000019199 alpha-Mannosidase Human genes 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 229960002587 amitraz Drugs 0.000 description 1
- QXAITBQSYVNQDR-ZIOPAAQOSA-N amitraz Chemical compound C=1C=C(C)C=C(C)C=1/N=C/N(C)\C=N\C1=CC=C(C)C=C1C QXAITBQSYVNQDR-ZIOPAAQOSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229940025131 amylases Drugs 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 239000004410 anthocyanin Substances 0.000 description 1
- 229930002877 anthocyanin Natural products 0.000 description 1
- 235000010208 anthocyanin Nutrition 0.000 description 1
- 150000004636 anthocyanins Chemical class 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 210000000628 antibody-producing cell Anatomy 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- VEHPJKVTJQSSKL-UHFFFAOYSA-N azadirachtin Natural products O1C2(C)C(C3(C=COC3O3)O)CC3C21C1(C)C(O)C(OCC2(OC(C)=O)C(CC3OC(=O)C(C)=CC)OC(C)=O)C2C32COC(C(=O)OC)(O)C12 VEHPJKVTJQSSKL-UHFFFAOYSA-N 0.000 description 1
- FTNJWQUOZFUQQJ-IRYYUVNJSA-N azadirachtin A Natural products C([C@@H]([C@]1(C=CO[C@H]1O1)O)[C@]2(C)O3)[C@H]1[C@]23[C@]1(C)[C@H](O)[C@H](OC[C@@]2([C@@H](C[C@@H]3OC(=O)C(\C)=C/C)OC(C)=O)C(=O)OC)[C@@H]2[C@]32CO[C@@](C(=O)OC)(O)[C@@H]12 FTNJWQUOZFUQQJ-IRYYUVNJSA-N 0.000 description 1
- FTNJWQUOZFUQQJ-NDAWSKJSSA-N azadirachtin A Chemical compound C([C@@H]([C@]1(C=CO[C@H]1O1)O)[C@]2(C)O3)[C@H]1[C@]23[C@]1(C)[C@H](O)[C@H](OC[C@@]2([C@@H](C[C@@H]3OC(=O)C(\C)=C\C)OC(C)=O)C(=O)OC)[C@@H]2[C@]32CO[C@@](C(=O)OC)(O)[C@@H]12 FTNJWQUOZFUQQJ-NDAWSKJSSA-N 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- XEGGRYVFLWGFHI-UHFFFAOYSA-N bendiocarb Chemical compound CNC(=O)OC1=CC=CC2=C1OC(C)(C)O2 XEGGRYVFLWGFHI-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 108010048056 beta-1,3-exoglucanase Proteins 0.000 description 1
- 102000006995 beta-Glucosidase Human genes 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- VHLKTXFWDRXILV-UHFFFAOYSA-N bifenazate Chemical compound C1=C(NNC(=O)OC(C)C)C(OC)=CC=C1C1=CC=CC=C1 VHLKTXFWDRXILV-UHFFFAOYSA-N 0.000 description 1
- OMFRMAHOUUJSGP-IRHGGOMRSA-N bifenthrin Chemical compound C1=CC=C(C=2C=CC=CC=2)C(C)=C1COC(=O)[C@@H]1[C@H](\C=C(/Cl)C(F)(F)F)C1(C)C OMFRMAHOUUJSGP-IRHGGOMRSA-N 0.000 description 1
- 238000010364 biochemical engineering Methods 0.000 description 1
- 239000003124 biologic agent Substances 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000007321 biological mechanism Effects 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- VEMKTZHHVJILDY-UXHICEINSA-N bioresmethrin Chemical compound CC1(C)[C@H](C=C(C)C)[C@H]1C(=O)OCC1=COC(CC=2C=CC=CC=2)=C1 VEMKTZHHVJILDY-UXHICEINSA-N 0.000 description 1
- 235000021029 blackberry Nutrition 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- PRLVTUNWOQKEAI-VKAVYKQESA-N buprofezin Chemical compound O=C1N(C(C)C)\C(=N\C(C)(C)C)SCN1C1=CC=CC=C1 PRLVTUNWOQKEAI-VKAVYKQESA-N 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 101150039352 can gene Proteins 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 150000004657 carbamic acid derivatives Chemical class 0.000 description 1
- 235000013877 carbamide Nutrition 0.000 description 1
- DUEPRVBVGDRKAG-UHFFFAOYSA-N carbofuran Chemical compound CNC(=O)OC1=CC=CC2=C1OC(C)(C)C2 DUEPRVBVGDRKAG-UHFFFAOYSA-N 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 101150052795 cbh-1 gene Proteins 0.000 description 1
- 229960004261 cefotaxime Drugs 0.000 description 1
- AZZMGZXNTDTSME-JUZDKLSSSA-M cefotaxime sodium Chemical compound [Na+].N([C@@H]1C(N2C(=C(COC(C)=O)CS[C@@H]21)C([O-])=O)=O)C(=O)\C(=N/OC)C1=CSC(N)=N1 AZZMGZXNTDTSME-JUZDKLSSSA-M 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229940124587 cephalosporin Drugs 0.000 description 1
- 108010080434 cephalosporin-C deacetylase Proteins 0.000 description 1
- 150000001780 cephalosporins Chemical class 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 235000019693 cherries Nutrition 0.000 description 1
- UISUNVFOGSJSKD-UHFFFAOYSA-N chlorfluazuron Chemical compound FC1=CC=CC(F)=C1C(=O)NC(=O)NC(C=C1Cl)=CC(Cl)=C1OC1=NC=C(C(F)(F)F)C=C1Cl UISUNVFOGSJSKD-UHFFFAOYSA-N 0.000 description 1
- 210000004756 chromatid Anatomy 0.000 description 1
- 210000001726 chromosome structure Anatomy 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 239000000084 colloidal system Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000000306 component Substances 0.000 description 1
- 238000010205 computational analysis Methods 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 238000009402 cross-breeding Methods 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- OILAIQUEIWYQPH-UHFFFAOYSA-N cyclohexane-1,2-dione Chemical compound O=C1CCCCC1=O OILAIQUEIWYQPH-UHFFFAOYSA-N 0.000 description 1
- LVQDKIWDGQRHTE-UHFFFAOYSA-N cyromazine Chemical compound NC1=NC(N)=NC(NC2CC2)=N1 LVQDKIWDGQRHTE-UHFFFAOYSA-N 0.000 description 1
- 229950000775 cyromazine Drugs 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- GVJHHUAWPYXKBD-UHFFFAOYSA-N d-alpha-tocopherol Natural products OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 1
- 235000013365 dairy product Nutrition 0.000 description 1
- 229960002483 decamethrin Drugs 0.000 description 1
- OWZREIFADZCYQD-NSHGMRRFSA-N deltamethrin Chemical compound CC1(C)[C@@H](C=C(Br)Br)[C@H]1C(=O)O[C@H](C#N)C1=CC=CC(OC=2C=CC=CC=2)=C1 OWZREIFADZCYQD-NSHGMRRFSA-N 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 238000001784 detoxification Methods 0.000 description 1
- FHIVAFMUCKRCQO-UHFFFAOYSA-N diazinon Chemical compound CCOP(=S)(OCC)OC1=CC(C)=NC(C(C)C)=N1 FHIVAFMUCKRCQO-UHFFFAOYSA-N 0.000 description 1
- OEBRKCOSUFCWJD-UHFFFAOYSA-N dichlorvos Chemical compound COP(=O)(OC)OC=C(Cl)Cl OEBRKCOSUFCWJD-UHFFFAOYSA-N 0.000 description 1
- 229950001327 dichlorvos Drugs 0.000 description 1
- UOAMTSKGCBMZTC-UHFFFAOYSA-N dicofol Chemical compound C=1C=C(Cl)C=CC=1C(C(Cl)(Cl)Cl)(O)C1=CC=C(Cl)C=C1 UOAMTSKGCBMZTC-UHFFFAOYSA-N 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 235000020788 dietary exposure Nutrition 0.000 description 1
- 235000021245 dietary protein Nutrition 0.000 description 1
- JXSJBGJIGXNWCI-UHFFFAOYSA-N diethyl 2-[(dimethoxyphosphorothioyl)thio]succinate Chemical compound CCOC(=O)CC(SP(=S)(OC)OC)C(=O)OCC JXSJBGJIGXNWCI-UHFFFAOYSA-N 0.000 description 1
- QQQYTWIFVNKMRW-UHFFFAOYSA-N diflubenzuron Chemical compound FC1=CC=CC(F)=C1C(=O)NC(=O)NC1=CC=C(Cl)C=C1 QQQYTWIFVNKMRW-UHFFFAOYSA-N 0.000 description 1
- 229940019503 diflubenzuron Drugs 0.000 description 1
- KCFYHBSOLOXZIF-UHFFFAOYSA-N dihydrochrysin Natural products COC1=C(O)C(OC)=CC(C2OC3=CC(O)=CC(O)=C3C(=O)C2)=C1 KCFYHBSOLOXZIF-UHFFFAOYSA-N 0.000 description 1
- MCWXGJITAZMZEV-UHFFFAOYSA-N dimethoate Chemical compound CNC(=O)CSP(=S)(OC)OC MCWXGJITAZMZEV-UHFFFAOYSA-N 0.000 description 1
- MHUWZNTUIIFHAS-CLFAGFIQSA-N dioleoyl phosphatidic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC(COP(O)(O)=O)OC(=O)CCCCCCC\C=C/CCCCCCCC MHUWZNTUIIFHAS-CLFAGFIQSA-N 0.000 description 1
- 238000007598 dipping method Methods 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 239000002270 dispersing agent Substances 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 238000010410 dusting Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000000408 embryogenic effect Effects 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 108010050200 endo-1,4-beta-D-mannanase Proteins 0.000 description 1
- YERABYSOHUZTPQ-UHFFFAOYSA-P endo-1,4-beta-Xylanase Chemical compound C=1C=CC=CC=1C[N+](CC)(CC)CCCNC(C(C=1)=O)=CC(=O)C=1NCCC[N+](CC)(CC)CC1=CC=CC=C1 YERABYSOHUZTPQ-UHFFFAOYSA-P 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- NYPJDWWKZLNGGM-RPWUZVMVSA-N esfenvalerate Chemical compound C=1C([C@@H](C#N)OC(=O)[C@@H](C(C)C)C=2C=CC(Cl)=CC=2)=CC=CC=1OC1=CC=CC=C1 NYPJDWWKZLNGGM-RPWUZVMVSA-N 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 108010038658 exo-1,4-beta-D-xylosidase Proteins 0.000 description 1
- 108010093305 exopolygalacturonase Proteins 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 238000009313 farming Methods 0.000 description 1
- ZCJPOPBZHLUFHF-UHFFFAOYSA-N fenamiphos Chemical compound CCOP(=O)(NC(C)C)OC1=CC=C(SC)C(C)=C1 ZCJPOPBZHLUFHF-UHFFFAOYSA-N 0.000 description 1
- DMYHGDXADUDKCQ-UHFFFAOYSA-N fenazaquin Chemical compound C1=CC(C(C)(C)C)=CC=C1CCOC1=NC=NC2=CC=CC=C12 DMYHGDXADUDKCQ-UHFFFAOYSA-N 0.000 description 1
- HJUFTIJOISQSKQ-UHFFFAOYSA-N fenoxycarb Chemical compound C1=CC(OCCNC(=O)OCC)=CC=C1OC1=CC=CC=C1 HJUFTIJOISQSKQ-UHFFFAOYSA-N 0.000 description 1
- 108010041969 feruloyl esterase Proteins 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- RLQJEEJISHYWON-UHFFFAOYSA-N flonicamid Chemical compound FC(F)(F)C1=CC=NC=C1C(=O)NCC#N RLQJEEJISHYWON-UHFFFAOYSA-N 0.000 description 1
- GBIHOLCMZGAKNG-CGAIIQECSA-N flucythrinate Chemical compound O=C([C@@H](C(C)C)C=1C=CC(OC(F)F)=CC=1)OC(C#N)C(C=1)=CC=CC=1OC1=CC=CC=C1 GBIHOLCMZGAKNG-CGAIIQECSA-N 0.000 description 1
- RYLHNOVXKPXDIP-UHFFFAOYSA-N flufenoxuron Chemical compound C=1C=C(NC(=O)NC(=O)C=2C(=CC=CC=2F)F)C(F)=CC=1OC1=CC=C(C(F)(F)F)C=C1Cl RYLHNOVXKPXDIP-UHFFFAOYSA-N 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 235000004611 garlic Nutrition 0.000 description 1
- 210000001156 gastric mucosa Anatomy 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 230000037442 genomic alteration Effects 0.000 description 1
- 102000034238 globular proteins Human genes 0.000 description 1
- 108091005896 globular proteins Proteins 0.000 description 1
- 229940116332 glucose oxidase Drugs 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 239000005090 green fluorescent protein Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 239000003630 growth substance Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000002573 hemicellulolytic effect Effects 0.000 description 1
- 229920000669 heparin Polymers 0.000 description 1
- 229960002897 heparin Drugs 0.000 description 1
- RGNPBRKPHBKNKX-UHFFFAOYSA-N hexaflumuron Chemical compound C1=C(Cl)C(OC(F)(F)C(F)F)=C(Cl)C=C1NC(=O)NC(=O)C1=C(F)C=CC=C1F RGNPBRKPHBKNKX-UHFFFAOYSA-N 0.000 description 1
- 231100000171 higher toxicity Toxicity 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 230000016784 immunoglobulin production Effects 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 239000011147 inorganic material Substances 0.000 description 1
- 239000002917 insecticide Substances 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 108010090785 inulinase Proteins 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 125000000741 isoleucyl group Chemical group [H]N([H])C(C(C([H])([H])[H])C([H])([H])C([H])([H])[H])C(=O)O* 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 229930014550 juvenile hormone Chemical class 0.000 description 1
- 239000002949 juvenile hormone Chemical class 0.000 description 1
- 150000003633 juvenile hormone derivatives Chemical class 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 239000005910 lambda-Cyhalothrin Substances 0.000 description 1
- 108010005131 levanase Proteins 0.000 description 1
- 229960004502 levodopa Drugs 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 108010062085 ligninase Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 229960000521 lufenuron Drugs 0.000 description 1
- PWPJGUXAGUPAHP-UHFFFAOYSA-N lufenuron Chemical compound C1=C(Cl)C(OC(F)(F)C(C(F)(F)F)F)=CC(Cl)=C1NC(=O)NC(=O)C1=C(F)C=CC=C1F PWPJGUXAGUPAHP-UHFFFAOYSA-N 0.000 description 1
- 239000003120 macrolide antibiotic agent Substances 0.000 description 1
- 229940041033 macrolides Drugs 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 229960000453 malathion Drugs 0.000 description 1
- 108010083942 mannopine synthase Proteins 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- NNKVPIKMPCQWCG-UHFFFAOYSA-N methamidophos Chemical compound COP(N)(=O)SC NNKVPIKMPCQWCG-UHFFFAOYSA-N 0.000 description 1
- MEBQXILRKZHVCX-UHFFFAOYSA-N methidathion Chemical compound COC1=NN(CSP(=S)(OC)OC)C(=O)S1 MEBQXILRKZHVCX-UHFFFAOYSA-N 0.000 description 1
- 229930002897 methoprene Natural products 0.000 description 1
- 229950003442 methoprene Drugs 0.000 description 1
- QCAWEPFNJXQPAN-UHFFFAOYSA-N methoxyfenozide Chemical compound COC1=CC=CC(C(=O)NN(C(=O)C=2C=C(C)C=C(C)C=2)C(C)(C)C)=C1C QCAWEPFNJXQPAN-UHFFFAOYSA-N 0.000 description 1
- KBHDSWIXRODKSZ-UHFFFAOYSA-N methyl 5-chloro-2-(trifluoromethylsulfonylamino)benzoate Chemical compound COC(=O)C1=CC(Cl)=CC=C1NS(=O)(=O)C(F)(F)F KBHDSWIXRODKSZ-UHFFFAOYSA-N 0.000 description 1
- NDNKHWUXXOFHTD-UHFFFAOYSA-N metizoline Chemical compound CC=1SC2=CC=CC=C2C=1CC1=NCCN1 NDNKHWUXXOFHTD-UHFFFAOYSA-N 0.000 description 1
- 229960002939 metizoline Drugs 0.000 description 1
- 229960001952 metrifonate Drugs 0.000 description 1
- 230000002906 microbiologic effect Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 235000019713 millet Nutrition 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- KRTSDMXIXPKRQR-AATRIKPKSA-N monocrotophos Chemical compound CNC(=O)\C=C(/C)OP(=O)(OC)OC KRTSDMXIXPKRQR-AATRIKPKSA-N 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- 239000002105 nanoparticle Substances 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- NJPPVKZQTLUDBO-UHFFFAOYSA-N novaluron Chemical compound C1=C(Cl)C(OC(F)(F)C(OC(F)(F)F)F)=CC=C1NC(=O)NC(=O)C1=C(F)C=CC=C1F NJPPVKZQTLUDBO-UHFFFAOYSA-N 0.000 description 1
- 235000021049 nutrient content Nutrition 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- YCIMNLLNPGFGHC-UHFFFAOYSA-N o-dihydroxy-benzene Natural products OC1=CC=CC=C1O YCIMNLLNPGFGHC-UHFFFAOYSA-N 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 239000011368 organic material Substances 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- LCCNCVORNKJIRZ-UHFFFAOYSA-N parathion Chemical compound CCOP(=S)(OCC)OC1=CC=C([N+]([O-])=O)C=C1 LCCNCVORNKJIRZ-UHFFFAOYSA-N 0.000 description 1
- RLBIQVVOMOPOHC-UHFFFAOYSA-N parathion-methyl Chemical compound COP(=S)(OC)OC1=CC=C([N+]([O-])=O)C=C1 RLBIQVVOMOPOHC-UHFFFAOYSA-N 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 235000020232 peanut Nutrition 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 229960000490 permethrin Drugs 0.000 description 1
- RLLPVAHGXHCWKJ-UHFFFAOYSA-N permethrin Chemical compound CC1(C)C(C=C(Cl)Cl)C1C(=O)OCC1=CC=CC(OC=2C=CC=CC=2)=C1 RLLPVAHGXHCWKJ-UHFFFAOYSA-N 0.000 description 1
- 210000002824 peroxisome Anatomy 0.000 description 1
- NONJJLVGHLVQQM-JHXYUMNGSA-N phenethicillin Chemical compound N([C@@H]1C(N2[C@H](C(C)(C)S[C@@H]21)C(O)=O)=O)C(=O)C(C)OC1=CC=CC=C1 NONJJLVGHLVQQM-JHXYUMNGSA-N 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- BULVZWIRKLYCBC-UHFFFAOYSA-N phorate Chemical compound CCOP(=S)(OCC)SCSCC BULVZWIRKLYCBC-UHFFFAOYSA-N 0.000 description 1
- LMNZTLDVJIUSHT-UHFFFAOYSA-N phosmet Chemical compound C1=CC=C2C(=O)N(CSP(=S)(OC)OC)C(=O)C2=C1 LMNZTLDVJIUSHT-UHFFFAOYSA-N 0.000 description 1
- RGCLLPNLLBQHPF-HJWRWDBZSA-N phosphamidon Chemical compound CCN(CC)C(=O)C(\Cl)=C(/C)OP(=O)(OC)OC RGCLLPNLLBQHPF-HJWRWDBZSA-N 0.000 description 1
- 235000002949 phytic acid Nutrition 0.000 description 1
- 239000000467 phytic acid Substances 0.000 description 1
- 229940068041 phytic acid Drugs 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- YFGYUFNIOHWBOB-UHFFFAOYSA-N pirimicarb Chemical compound CN(C)C(=O)OC1=NC(N(C)C)=NC(C)=C1C YFGYUFNIOHWBOB-UHFFFAOYSA-N 0.000 description 1
- 235000021118 plant-derived protein Nutrition 0.000 description 1
- 101150082349 pmi gene Proteins 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 150000004804 polysaccharides Chemical class 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- QYMMJNLHFKGANY-UHFFFAOYSA-N profenofos Chemical compound CCCSP(=O)(OCC)OC1=CC=C(Br)C=C1Cl QYMMJNLHFKGANY-UHFFFAOYSA-N 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- ZYHMJXZULPZUED-UHFFFAOYSA-N propargite Chemical compound C1=CC(C(C)(C)C)=CC=C1OC1C(OS(=O)OCC#C)CCCC1 ZYHMJXZULPZUED-UHFFFAOYSA-N 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 235000021251 pulses Nutrition 0.000 description 1
- 235000015136 pumpkin Nutrition 0.000 description 1
- DWFZBUWUXWZWKD-UHFFFAOYSA-N pyridaben Chemical compound C1=CC(C(C)(C)C)=CC=C1CSC1=C(Cl)C(=O)N(C(C)(C)C)N=C1 DWFZBUWUXWZWKD-UHFFFAOYSA-N 0.000 description 1
- AEHJMNVBLRLZKK-UHFFFAOYSA-N pyridalyl Chemical group N1=CC(C(F)(F)F)=CC=C1OCCCOC1=C(Cl)C=C(OCC=C(Cl)Cl)C=C1Cl AEHJMNVBLRLZKK-UHFFFAOYSA-N 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000012502 risk assessment Methods 0.000 description 1
- 229940080817 rotenone Drugs 0.000 description 1
- JUVIOZPCNVVQFO-UHFFFAOYSA-N rotenone Natural products O1C2=C3CC(C(C)=C)OC3=CC=C2C(=O)C2C1COC1=C2C=C(OC)C(OC)=C1 JUVIOZPCNVVQFO-UHFFFAOYSA-N 0.000 description 1
- 101150015537 rps12 gene Proteins 0.000 description 1
- 101150098466 rpsL gene Proteins 0.000 description 1
- HFHDHCJBZVLPGP-UHFFFAOYSA-N schardinger α-dextrin Chemical compound O1C(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(O)C2O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC2C(O)C(O)C1OC2CO HFHDHCJBZVLPGP-UHFFFAOYSA-N 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 239000013049 sediment Substances 0.000 description 1
- 230000014639 sexual reproduction Effects 0.000 description 1
- HBMJWWWQQXIZIP-UHFFFAOYSA-N silicon carbide Chemical compound [Si+]#[C-] HBMJWWWQQXIZIP-UHFFFAOYSA-N 0.000 description 1
- 229910010271 silicon carbide Inorganic materials 0.000 description 1
- 229910001961 silver nitrate Inorganic materials 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 238000002791 soaking Methods 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 235000017557 sodium bicarbonate Nutrition 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 239000003195 sodium channel blocking agent Substances 0.000 description 1
- 244000000000 soil microbiome Species 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000009331 sowing Methods 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- SEEPANYCNGTZFQ-UHFFFAOYSA-N sulfadiazine Chemical compound C1=CC(N)=CC=C1S(=O)(=O)NC1=NC=CC=N1 SEEPANYCNGTZFQ-UHFFFAOYSA-N 0.000 description 1
- 229960004306 sulfadiazine Drugs 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- QYPNKSZPJQQLRK-UHFFFAOYSA-N tebufenozide Chemical compound C1=CC(CC)=CC=C1C(=O)NN(C(C)(C)C)C(=O)C1=CC(C)=CC(C)=C1 QYPNKSZPJQQLRK-UHFFFAOYSA-N 0.000 description 1
- ZZYSLNWGKKDOML-UHFFFAOYSA-N tebufenpyrad Chemical compound CCC1=NN(C)C(C(=O)NCC=2C=CC(=CC=2)C(C)(C)C)=C1Cl ZZYSLNWGKKDOML-UHFFFAOYSA-N 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- NWWZPOKUUAIXIW-FLIBITNWSA-N thiamethoxam Chemical compound [O-][N+](=O)\N=C/1N(C)COCN\1CC1=CN=C(Cl)S1 NWWZPOKUUAIXIW-FLIBITNWSA-N 0.000 description 1
- 229960001196 thiotepa Drugs 0.000 description 1
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 1
- 229960001295 tocopherol Drugs 0.000 description 1
- 229930003799 tocopherol Natural products 0.000 description 1
- 235000010384 tocopherol Nutrition 0.000 description 1
- 239000011732 tocopherol Substances 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- YWSCPYYRJXKUDB-KAKFPZCNSA-N tralomethrin Chemical compound CC1(C)[C@@H](C(Br)C(Br)(Br)Br)[C@H]1C(=O)O[C@H](C#N)C1=CC=CC(OC=2C=CC=CC=2)=C1 YWSCPYYRJXKUDB-KAKFPZCNSA-N 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- NKNFWVNSBIXGLL-UHFFFAOYSA-N triazamate Chemical compound CCOC(=O)CSC1=NC(C(C)(C)C)=NN1C(=O)N(C)C NKNFWVNSBIXGLL-UHFFFAOYSA-N 0.000 description 1
- 150000003918 triazines Chemical class 0.000 description 1
- YWBFPKPWMSWWEA-UHFFFAOYSA-O triazolopyrimidine Chemical compound BrC1=CC=CC(C=2N=C3N=CN[N+]3=C(NCC=3C=CN=CC=3)C=2)=C1 YWBFPKPWMSWWEA-UHFFFAOYSA-O 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- NFACJZMKEDPNKN-UHFFFAOYSA-N trichlorfon Chemical compound COP(=O)(OC)C(O)C(Cl)(Cl)Cl NFACJZMKEDPNKN-UHFFFAOYSA-N 0.000 description 1
- ONCZDRURRATYFI-TVJDWZFNSA-N trifloxystrobin Chemical compound CO\N=C(\C(=O)OC)C1=CC=CC=C1CO\N=C(/C)C1=CC=CC(C(F)(F)F)=C1 ONCZDRURRATYFI-TVJDWZFNSA-N 0.000 description 1
- XAIPTRIXGHTTNT-UHFFFAOYSA-N triflumuron Chemical compound C1=CC(OC(F)(F)F)=CC=C1NC(=O)NC(=O)C1=CC=CC=C1Cl XAIPTRIXGHTTNT-UHFFFAOYSA-N 0.000 description 1
- PIEPQKCYPFFYMG-UHFFFAOYSA-N tris acetate Chemical compound CC(O)=O.OCC(N)(CO)CO PIEPQKCYPFFYMG-UHFFFAOYSA-N 0.000 description 1
- 101150019416 trpA gene Proteins 0.000 description 1
- 150000003672 ureas Chemical class 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 230000009105 vegetative growth Effects 0.000 description 1
- 101150085703 vir gene Proteins 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 230000001018 virulence Effects 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
- 229920001221 xylan Polymers 0.000 description 1
- 150000004823 xylans Chemical class 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
- 239000005019 zein Substances 0.000 description 1
- 229940093612 zein Drugs 0.000 description 1
- GVJHHUAWPYXKBD-IEOSBIPESA-N α-tocopherol Chemical compound OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-IEOSBIPESA-N 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H5/00—Angiosperms, i.e. flowering plants, characterised by their plant parts; Angiosperms characterised otherwise than by their botanic taxonomy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8286—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for insect resistance
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H5/00—Angiosperms, i.e. flowering plants, characterised by their plant parts; Angiosperms characterised otherwise than by their botanic taxonomy
- A01H5/10—Seeds
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01N—PRESERVATION OF BODIES OF HUMANS OR ANIMALS OR PLANTS OR PARTS THEREOF; BIOCIDES, e.g. AS DISINFECTANTS, AS PESTICIDES OR AS HERBICIDES; PEST REPELLANTS OR ATTRACTANTS; PLANT GROWTH REGULATORS
- A01N37/00—Biocides, pest repellants or attractants, or plant growth regulators containing organic compounds containing a carbon atom having three bonds to hetero atoms with at the most two bonds to halogen, e.g. carboxylic acids
- A01N37/44—Biocides, pest repellants or attractants, or plant growth regulators containing organic compounds containing a carbon atom having three bonds to hetero atoms with at the most two bonds to halogen, e.g. carboxylic acids containing at least one carboxylic group or a thio analogue, or a derivative thereof, and a nitrogen atom attached to the same carbon skeleton by a single or double bond, this nitrogen atom not being a member of a derivative or of a thio analogue of a carboxylic group, e.g. amino-carboxylic acids
- A01N37/46—N-acyl derivatives
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01N—PRESERVATION OF BODIES OF HUMANS OR ANIMALS OR PLANTS OR PARTS THEREOF; BIOCIDES, e.g. AS DISINFECTANTS, AS PESTICIDES OR AS HERBICIDES; PEST REPELLANTS OR ATTRACTANTS; PLANT GROWTH REGULATORS
- A01N63/00—Biocides, pest repellants or attractants, or plant growth regulators containing microorganisms, viruses, microbial fungi, animals or substances produced by, or obtained from, microorganisms, viruses, microbial fungi or animals, e.g. enzymes or fermentates
- A01N63/50—Isolated enzymes; Isolated proteins
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07H—SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
- C07H21/00—Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/32—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
- C07K14/325—Bacillus thuringiensis crystal peptides, i.e. delta-endotoxins
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Plant Pathology (AREA)
- Pest Control & Pesticides (AREA)
- Environmental Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Dentistry (AREA)
- Agronomy & Crop Science (AREA)
- Cell Biology (AREA)
- Physics & Mathematics (AREA)
- Insects & Arthropods (AREA)
- Crystallography & Structural Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Physiology (AREA)
- Botany (AREA)
- Developmental Biology & Embryology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Agricultural Chemicals And Associated Chemicals (AREA)
- Peptides Or Proteins (AREA)
Abstract
披露了从苏云金芽孢杆菌中分离的新颖的杀昆虫蛋白,这些杀昆虫蛋白针对鳞翅目昆虫有害生物是有活性的。编码这些杀昆虫蛋白的DNA可以用于转化各种原核和真核生物以表达这些杀昆虫蛋白。这些重组生物可以用于控制在不同环境中的鳞翅目昆虫。
Description
电子提交的序列表参考
该序列表的官方副本是通过EFS-Web以ASCII格式的序列表以2014年12月5日生成的名为“80668-US-L-ORG-NAT-1_SeqList.txt”的文件进行电子提交的,并且该序列表的大小为135千字节并且与本说明书同时提交。包含在该ASCII格式文件中的序列表是本说明书的一部分,并且通过引用以其全文结合在此。
技术领域
本发明涉及杀有害生物蛋白和编码它们的核酸分子,连同用于控制植物有害生物的组合物和方法。
背景技术
苏云金芽孢杆菌(Bt)是一种革兰氏阳性的孢子形成的土壤细菌,其特征在于它产生晶体包含体的能力,这些晶体包含体对于某些目和种的植物有害生物(包括昆虫)是特异地有毒性的,但是对于植物和其他非目标生物是无害的。出于这个原因,包含苏云金芽孢杆菌菌株或它们的杀昆虫蛋白的组合物可以用作环境上可接受的杀昆虫剂以控制农业昆虫有害生物或多种人或动物疾病的昆虫载体。
来自苏云金芽孢杆菌的晶体(Cry)蛋白主要针对鳞翅目的、双翅目的、以及鞘翅目的幼虫具有有力的杀昆虫活性。这些蛋白质还已经显示针对以下目的有害生物的活性:膜翅目、同翅目、毛虱目、食毛目、以及壁虱目,连同其他的无脊椎动物目,例如线虫动物门、扁形动物门、以及肉足鞭毛亚门(Feitelson,J.1993.The Bacillus Thuringiensis familytree.In Advanced Engineered Pesticides.Marcel Dekker,Inc.,New York,N.Y.[Feitelson,J,1993,在前沿的工程化的杀有害生物剂中的苏云金芽孢杆菌家族树,马塞尔德克尔公司,纽约,纽约州])。这些蛋白质最初主要基于它们的杀虫活性而被分类为CryI至CryVI。主要的类别是鳞翅目特异性(I)、鳞翅目和双翅目特异性(II)、鞘翅目特异性(III)、双翅目特异性(IV)、以及线虫特异性(V)和(VI)。这些蛋白质进一步被分类为子族,在各个家族内的更高相关的蛋白质指定了区分的字母,例如CryIA、CryIB、CryIC等。在各个区分内的甚至更紧密相关的蛋白质被给定名称,例如CryIC(a)、CryIC(b)等。术语“Cry毒素”以及“δ-内毒素”与术语“Cry蛋白”已可互换地使用。对于Cry蛋白和基因的当前新命名法基于氨基酸序列同源性而不是昆虫靶标特异性(Crickmore et al.(1998)Microbiol.Mol.Biol.Rev.62:807-813[Crickmore等人(1998)微生物分子生物学评论,62:807-813])。在这个更可接受的分类中,每种毒素被指定唯一的名称,该名称合并了初级等级(阿拉伯数字)、二级等级(大写字母)、三级等级(小写字母)、以及四级等级(另一个阿拉伯数字)。在当前分类中,在初级等级中罗马数字已经换为阿拉伯数字。例如,在旧命名法下的“CryIA(a)”现在在当前命名法下是“Cry1Aa”。
Cry蛋白是在Bt的孢子形成阶段期间以晶态形式积聚为原毒素的球状蛋白质分子。在被有害生物摄取后,这些晶体典型地被溶解以释放原毒素,原毒素大小的范围可以为,例如,对于鳞翅目活性的Cry蛋白为从130-140kDa,并且对于鞘翅目活性的Cry蛋白为60-80kDa。原毒素经由靶有害生物中的肠蛋白酶转变为成熟的毒素片段(大约60-70kDa N末端区域)。这些蛋白质中许多对于特异性目标昆虫是相当有毒性的,但是对于植物和其他非目标生物是无害的。
Cry蛋白通常具有五个保守序列结构域、以及三个保守结构性结构域(参见例如,de Maagd et al.(2001)Trends Genetics 17:193-199[de Maagd等人(2001)遗传学趋势,17:193-199])。第一保守结构性结构域(称作结构域I)典型地由七个α螺旋组成并且参与膜插入以及孔形成。结构域II典型地由三个安排为希腊钥匙构型的β片层组成,并且结构域III典型地由两个处于“果冻卷”(‘jelly-roll’)构造的反平行的β片层组成(de Maagd etal.,2001,supra[de Maagd等人,2001,同上])。结构域II和III参与受体识别和结合,并且因此被考虑为毒素特异性的决定物。
众多商业上有价值的植物(包括普通的农作物)易受植物有害生物(包括昆虫和线虫有害生物)的攻击的影响,导致作物产量和品质的实质性降低。例如,植物有害生物是在全世界重要农作物损失中的主要因素。由于非哺乳动物有害生物(包括昆虫)的侵染,仅在美国每年就损失大约80亿美元。除了农作物中的损失之外,昆虫有害生物对于菜农和果农,对于观赏性花卉的生产商,以及对于家庭花匠也是负担。
昆虫有害生物主要是通过密集使用化学杀有害生物剂来控制,这些化学杀有害生物剂通过抑制昆虫成长、预防昆虫摄食或繁殖、或者导致死亡而有效。生物性有害生物控制剂,例如表达杀有害生物毒素(如Cry蛋白)的苏云金芽孢杆菌(Bacillus thuringiensis)菌株,也已经应用至作物植物中,产生了令人满意的结果,提供化学杀有害生物剂的替代物或补充物。已经分离了编码这些Cry蛋白的一些的基因并且它们在异源性宿主(例如转基因植物)中的表达已经显示出提供了另一种用于控制经济上重要的昆虫有害生物的手段。
因此可以达到良好的昆虫控制,但是某些化学品有时也能影响非目标有益昆虫,并且某些生物制剂具有非常窄的活性谱。此外,某些化学和生物控制方法的继续使用增加了昆虫有害生物对此类控制措施产生抗性的机会。通过各种抗性管理实践已部分地缓和了这种状况,但仍需要找到新型并有效的有害生物控制剂,这些有害生物控制剂为农民提供经济利益并且是环境可接受的。特别需要的是靶向更广谱的经济上重要的昆虫有害生物并有效控制昆虫品系的控制剂,这些昆虫品系对现有的昆虫控制剂是有抗性或可以变得有抗性。
发明内容
鉴于这些需求,本发明的目的是通过提供可以用来控制多种植物有害生物的新颖基因和杀有害生物蛋白来提供新的有害生物控制剂。
本发明提供了用于赋予细菌、植物、植物细胞、组织以及种子的杀有害生物活性的组合物以及方法。具体地,提供了包含从苏云金芽孢杆菌(Bt)中分离的编码Cry蛋白的新颖多核苷酸的嵌合基因以及基本上与其一致的序列,这些序列的表达导致具有对经济上重要的昆虫有害生物(特别是侵染植物的昆虫有害生物)的毒性的蛋白质。本发明进一步涉及由这些核酸序列的表达生成的新颖的Cry1蛋白,并且涉及含有这些Cry蛋白的组合物以及配制品,它们通过抑制昆虫有害生物的生存、生长以及繁殖或者限制昆虫相关的对作物植物的损害或损失的能力对昆虫是有毒的。本发明的Cry蛋白包括天然Cry蛋白以及具有一个或多个氨基酸置换、添加或缺失的突变型Cry蛋白。突变型Cry蛋白的实例包括但不限于被突变为具有比其天然Cry蛋白对应物更宽的活性谱的那些,或经突变以引入表位来产生从天然蛋白中差异性地识别出经突变的蛋白的抗体的那些。本发明的新颖Cry蛋白对昆虫有害生物是高活性的。例如,本发明的Cry蛋白可以用于控制一种或多种经济上重要的昆虫有害生物,例如黑色地老虎(black cutworm)(小地老虎(Agrotis ipsilon))、欧洲玉米蛀虫(European corn borer)(欧洲玉米螟(Ostrinia nubilalis))、秋黏虫(fall armyworm)(草地贪夜蛾(Spodoptera frugiperda))、玉米穗蛾(corn earworm)(玉米穗虫(Helicoverpa zea))、甘蔗螟(sugarcane borer)(小蔗螟(Diatraea saccharalis)),绒毛豆毛虫(velvetbean caterpillar)(黎豆夜蛾(Anticarsia gemmatalis))、大豆夜蛾(soybean looper)(大豆尺蠖(Chrysodeixis includens)),西南玉米蛀虫(southwestcorn borer)(西南玉米螟(Diatraea grandiosella))、西部豆切根虫(western beancutworm)(西部豆夜蛾(Richia albicosta))、烟夜蛾(tobacco budworm)(烟芽夜蛾(Heliothis virescens))、亚洲玉米蛀虫(Asian corn borer)(亚洲玉米螟(Ostriniafurnacalis))、棉螟蛉(cotton bollworm)(棉铃虫(Helicoverpa armigera))、条纹蛀茎虫(striped stem borer)(二化螟(Chilo suppressalis))、粉蛀茎虫(pink stem borer)(非洲大螟(Sesamia calamistis))、水稻卷叶螟(rice leaffolder)(稻纵卷叶螟(Cnaphalocrocis medinalis))等。
本发明还提供了合成的多核苷酸,其编码本发明的Cry蛋白,并已经进行一个或多个密码子优化用于在转基因生物如细菌和植物中表达。
本发明进一步涉及表达盒和重组载体,其包含编码本发明的Cry蛋白的多核苷酸。本发明还提供了包含嵌合基因、或表达盒或重组载体的经转化的细菌、植物、植物细胞、组织、以及种子,该嵌合基因或表达盒或重组载体包含编码本发明的Cry蛋白的多核苷酸。
本发明还涉及使用这些多核苷酸的方法,例如在DNA构建体或嵌合基因或表达盒或重组载体中用于在生物(包括微生物和植物)中进行转化和表达。核苷酸或氨基酸序列可以是已经被设计用于在生物(包括但不限于微生物或植物)中表达的合成序列,或者已经被设计用于在制备杂交的、具有增强的杀有害生物活性的毒素的合成序列。本发明进一步涉及制备这些Cry蛋白的方法以及例如在微生物中使用这些核酸序列以控制昆虫或在转基因植物中使用这些核酸序列以赋予对抗昆虫损害的保护的方法,并且涉及使用这些Cry蛋白、以及包含这些Cry蛋白的组合物以及配制品的方法,例如将Cry蛋白或组合物或配制品施用至昆虫侵染的区域,或施用它们以预防性地处理易受昆虫影响的区域或植物从而赋予针对昆虫有害生物的保护。核苷酸或氨基酸序列可以是已经被设计用于在生物(包括但不限于微生物或植物)中表达的合成序列。
本发明的组合物以及方法对于对昆虫有毒的生物(确切地为细菌和植物)的产生是有用的。这些生物以及从它们衍生的组合物用于农业目的是所希望的。本发明的组合物对于产生改变的或改进的具有杀有害生物活性的Cry蛋白,或对于检测在产物或生物中的Cry蛋白或核酸的存在也是有用的。
参考以下详细说明书和权利要求书,本发明的这些和其他特征、方面、以及优点将变得更好理解。
序列表中的序列简述
SEQ ID NO:1代表对BT-0044蛋白进行编码的核苷酸序列。
SEQ ID NO:2代表对BT-0051蛋白进行编码的核苷酸序列。
SEQ ID NO:3代表对BT-0068蛋白进行编码的核苷酸序列。
SEQ ID NO:4代表对BT-0128蛋白进行编码的核苷酸序列。
SEQ ID NO:5代表对BT-0044蛋白进行编码的密码子优化序列。
SEQ ID NO:6代表对BT-0051蛋白进行编码的密码子优化序列。
SEQ ID NO:7代表对BT-0068蛋白进行编码的密码子优化序列。
SEQ ID NO:8代表对BT-0128蛋白进行编码的密码子优化序列。
SEQ ID NO:9代表对突变型BT-0044蛋白进行编码的核苷酸序列。
SEQ ID NO:10代表对突变型BT-0051蛋白进行编码的核苷酸序列。
SEQ ID NO:11代表对突变型BT-0068蛋白进行编码的核苷酸序列。
SEQ ID NO:12代表对突变型BT-0128蛋白进行编码的核苷酸序列。
SEQ ID NO:13代表BT-0044蛋白的氨基酸序列。
SEQ ID NO:14代表BT-0051蛋白的氨基酸序列。
SEQ ID NO:15代表BT-0068蛋白的氨基酸序列。
SEQ ID NO:16代表BT-0128蛋白的氨基酸序列。
SEQ ID NO:17代表突变型BT-0044蛋白的氨基酸序列。
SEQ ID NO:18代表突变型BT-0051蛋白的氨基酸序列。
SEQ ID NO:19代表突变型BT-0068蛋白的氨基酸序列。
SEQ ID NO:20代表突变型BT-0128蛋白的氨基酸序列。
SEQ ID NO:21-26代表在本发明中有用的引物。
具体实施方式
本说明不旨在是本发明以其而实施的所有不同方式,或可以加入本发明中的所有特征的详细目录。例如,关于一个实施例所说明的特征可以结合入其他实施例中,并且关于具体实施例所说明的特征可以从那个实施例删除。因此,本发明考虑了,在本发明的一些实施例中,可以排除或省略在此陈述的任何特征或特征的组合。另外,鉴于本披露内容,对在此建议的不同实施例的众多变体以及附加对于本领域技术人员是显而易见的,这不脱离本发明。因此,以下说明旨在阐述本发明的一些具体实施例,并且并没有穷尽地叙述其所有排列、组合和变化。
除非另外定义,否则所有在此使用的技术和科学术语具有与本发明所属领域的普通技术人员通常所理解的相同的意思。在此的本发明的说明中使用的术语仅仅是出于描述具体实施例的目的并且不旨在限制本发明。还应当理解的是在此使用的术语仅仅是出于描述具体实施例的目的并且不旨在限制本发明的范围。
定义
当在此和所附权利要求书中使用时,单数形式“一个/种(a)”、“和(and)”和“该(the)”包括复数指代物,除非上下文另外明确地指示。因此,例如,提及“一种植物”是提及一种或多种植物并且包括本领域技术人员已知的其等效物等等。如在此使用的,词语“或(or)”意指具体清单的任何一个成员并且还包括这个清单的成员的任何组合(即,还包括“和”)。
术语“约”在此用于意指大约、大致、约或在...左右。当术语“约”结合数值范围来使用时,它通过将边界延伸至高于以及低于所阐明的数值来限定这个范围。通常,术语“约”在此用于将数值限定至以20%的变化,优选10%上下(更高或更低)地高于以及低于规定值。关于温度,术语“约”意指±1℃,优选±0.5℃。当术语“约”被用于本发明的上下文中(例如与温度或分子量值组合)时,精确值(即,无“约”)是优选的。
本发明的毒性Cry蛋白的“活性”意指该毒性蛋白作为口服活性的昆虫控制剂发挥作用,具有毒性作用、或者能够干扰或阻止昆虫摄食,这可能引起或者可能不引起昆虫的死亡。当本发明的毒性蛋白被递送至昆虫时,这种结果典型地是该昆虫的死亡,或者该昆虫不以使该毒性蛋白可供该昆虫利用的来源为食。
如在此所使用的,术语“扩增的”意指使用至少一种核酸分子作为模板,构建核苷酸分子的多个拷贝或与该核酸分子互补的多个拷贝。扩增系统包括聚合酶链式反应(PCR)系统、连接酶链式反应(LCR)系统、基于核酸序列的扩增(NASBA,Cangene公司,密西索加,安大略省)、Q-β复制酶系统、基于转录的扩增系统(TAS)以及链置换扩增(SDA)。参见例如Diagnostic Molecular Microbiology:Principles and Applications,PERSING et al.,Ed.,American Society for Microbiology,Washington,D.C.(1993)[诊断分子微生物学:原理与应用,PERSING等人编著,美国微生物学会,华盛顿哥伦比亚特区(1993)]。扩增产物被称为“扩增子”。
如在此使用的术语“嵌合构建体”或“嵌合基因”或“嵌合多核苷酸”或“嵌合核酸”(或类似术语)是指如下构建体或分子,该构建体或分子包含被组装进单个核酸分子中的不同来源的两个或更多个多核苷酸。术语“嵌合构建体”、“嵌合基因”、“嵌合多核苷酸或“嵌合核酸”是指如下任何构建体或分子,其含有但不限于(1)多核苷酸(例如,DNA),包括在自然界中没有被发现在一起的调节多核苷酸和编码多核苷酸(即,构建体中的至少一个多核苷酸相对于它的其他多核苷酸中的至少一个是异源的),或(2)编码不是天然毗邻的蛋白质部分的多核苷酸,或(3)不是天然毗邻的启动子部分。另外,嵌合构建体、嵌合基因、嵌合多核苷酸或嵌合核酸可以包括衍生自不同来源的调节多核苷酸和编码多核苷酸,或包括衍生自相同来源、但以与在自然界中所发现的不同的方式进行安排的调节多核苷酸和编码多核苷酸。在本发明的一些实施例中,嵌合构建体、嵌合基因、嵌合多核苷酸或嵌合核酸包含表达盒,该表达盒包含在调节多核苷酸的控制下、具体地在植物或细菌中具有功能性的调节多核苷酸的控制下的本发明的多核苷酸。
“编码序列”是转录成RNA(如mRNA、rRNA、tRNA、snRNA、正义RNA或反义RNA)的核酸序列。优选地,RNA进而在生物中被翻译以产生蛋白质。
如在此使用的,“密码子优化的”序列意指重组的、转基因的、或合成的多核苷酸的核苷酸序列,其中这些密码子被选择以反映宿主细胞或生物可以具有的特定的密码子偏好性。这典型地是以这样一种方式来完成,该方式是为了保持由密码子优化的核苷酸序列所编码的多肽的氨基酸序列。在某些实施例中,重组DNA构建体的DNA序列包括已经针对该构建体有待在其中进行表达的细胞(例如,动物、植物、或真菌细胞)进行了密码子优化的序列。例如,有待在植物细胞中表达的构建体可以使它的全部或部分序列(例如,第一基因抑制元件或基因表达元件)进行密码子优化用于在植物中表达。参见例如美国专利号6,121,014,通过引用结合在此。
“控制”昆虫意指通过毒性作用抑制昆虫有害生物存活、生长、摄食、和/或繁殖的能力,或者限制与昆虫有关的作物植物损害或损失,或者保护在昆虫有害生物存在的条件下生长时的作物的产量潜力。“控制”昆虫可以是或可以不是意指杀死昆虫,尽管它优选意指杀死昆虫。
术语“包含(comprises和/或comprising)”当在本说明书中使用时,指明所列举特征、整体、步骤、操作、元件、和/或部件的存在,但是不排除一种或多种其他特征、整体、步骤、操作、元件、部件、和/或其组的存在或添加。
如在此使用的,连接词“基本上由...组成”(以及语法变体)意指,权利要求书的范围有待被解读为涵盖权利要求书中所列举的指定材料或步骤以及不实质上改变所要求的发明的一个或多个基本和新颖特征的那些。因此,当用于本发明的权利要求中时,术语“基本上由...组成”并不旨在被解释为等同于“包含(comprising)”。
在本发明的上下文中,“对应于(corresponding to或corresponds to)”意指当变体Cry蛋白的氨基酸序列与彼此比对时,“对应于”在该变体或同系物蛋白中某些枚举的位置的氨基酸是与参考蛋白中的这些位置比对的那些,但在相对于本发明的特定参考氨基酸序列而言的这些精确的数字位置中是不必要的。例如,如果SEQ ID NO:13是参考序列并且与SEQ ID NO:15比对的话,SEQ ID NO:15的Asn4“对应于”SEQ ID NO:13的Asn6。
“递送(deliver)”组合物或毒性蛋白意指该组合物或毒性蛋白与昆虫接触,产生毒性作用和对该昆虫的控制。该组合物或毒性蛋白可以按照许多公认的方式进行递送,例如,通过昆虫摄取经口或通过经由转基因植物表达、一种或多种配制的蛋白质组合物、一种或多种可喷洒的蛋白质组合物、饵基、或任何其他的领域公认的蛋白质递送系统与昆虫接触。
术语“结构域”是指沿着进化相关蛋白的序列的比对在特定位置处保守的一组氨基酸。虽然其他位置上的氨基酸可在同系物之间有所不同,但是在特定位置处高度保守的氨基酸指示在蛋白质的结构、稳定性或功能中很可能是必需的氨基酸。通过其在蛋白质同系物家族的经比对序列中的高度保守性进行鉴别,其可用作鉴别物(identifier),用来确定所讨论的任何多肽是否属于先前鉴别的多肽家族。
“有效的昆虫控制量”意指毒性蛋白的浓度,它通过毒性作用抑制昆虫存活、生长、摄食和/或繁殖的能力,或者限制与昆虫有关的作物植物损害或损失,或者保护在昆虫有害生物存在的条件下生长时的作物的产量潜力。“有效的昆虫控制量”可以是或可以不是意指杀死昆虫,尽管它优选意指杀死昆虫。
如在此使用的“表达盒”意指能够在适当的宿主细胞中指导至少一种感兴趣的多核苷酸的表达的核酸序列,包含可操作地连接至该感兴趣的多核苷酸的启动子,该多核苷酸可操作地连接至终止信号。“表达盒”还典型地包含正确翻译感兴趣的多核苷酸所需的另外的多核苷酸。该表达盒还可以包含在感兴趣的多核苷酸的直接表达中不是必需的但是由于用于从表达载体去除该表达盒的方便限制位点而存在的其他多核苷酸。包含一个或多个感兴趣的核苷酸序列的表达盒可以是嵌合的,意味着它的组分中的至少一种相对于它的其他组分中的至少一种是异源的。该表达盒还可以是天然存在的但已经是以对于异源表达有用的重组形式而获得的表达盒。然而,典型地,该表达盒相对于该宿主是异源的,即在该表达盒中的感兴趣的多核苷酸不是天然存在于该宿主细胞中的,并且必须已经通过转化过程或育种过程引入到该宿主细胞或该宿主细胞的祖先中。该表达盒中的一个或多个感兴趣的多核苷酸的表达通常是在启动子的控制下。在多细胞生物(如植物)的情况下,启动子还可能对于特定组织、或器官、或者发育阶段是特异性的或优先的。当被转化进植物中时,表达盒或其片段也可被称为“插入的多核苷酸”或者“插入多核苷酸”。
“基因”在此定义为由多核苷酸组成的遗传单位,该遗传单位占据染色体或质粒上特定位置并且含有用于生物中的具体特征或性状的遗传指令。
“肠蛋白酶”是在昆虫的消化道中天然发现的蛋白酶。这种蛋白酶通常参与被摄取的蛋白质的消化。
当提及基因或核酸使用时,术语“异源”是指编码因子的基因不是在其天然环境中(即,已经通过人工改变)。例如,异源基因可以包括自一个物种引入到另一个物种的基因。异源基因还可以包括对生物来说是天然的基因,该基因已经以一些方式(例如,突变;以多个拷贝添加;连接至非天然启动子或增强子多核苷酸等)被改变。异源基因可以进一步包含植物基因多核苷酸,其包含植物基因的cDNA形式;这些cDNA可以以正义方向(以产生mRNA)或反义方向(以产生反义RNA转录本,其与mRNA转录本是互补的)被表达。在本发明的一个方面中,异源基因区别于内源性植物基因在于该异源基因多核苷酸被典型地连接至包含调节元件如启动子的多核苷酸上,未发现这些多核苷酸与由该异源基因编码的蛋白质的基因或与该染色体中的植物基因多核苷酸天然相关联,或者与在自然界中未发现的染色体的部分(例如,在基因座中表达的基因,其中该基因未正常表达)相关联。另外,“异源”多核苷酸是指不与将多核苷酸引入其中的宿主细胞天然地相关联的多核苷酸,包括天然存在的多核苷酸的非天然存在的多拷贝。
“同源重组”是在相同的多核苷酸的区域中成对染色体的两个DNA分子或染色单体之间的DNA片段的交换(“交叉”)。“重组事件”在此被理解为意指减数分裂交叉。
当核酸序列编码了多肽(该多肽与由参考核酸序列所编码的多肽具有相同的氨基酸序列)时,这种核苷酸序列与这种参考核酸序列“同类编码”。
术语“分离的”核酸分子、多核苷酸或毒素是不再存在于其天然环境中的核酸分子、多核苷酸或毒性蛋白。本发明的分离的核酸分子、多核苷酸或毒素可以按照纯化的形式存在,或者可以存在于重组宿主中,例如转基因细菌细胞或转基因植物中。
“核酸分子”是可以从任何来源中分离的单链或双链DNA或RNA。在本发明的上下文中,核酸分子优选地是DNA区段。
“可操作地连接”是指在单一核酸片段上多核苷酸的关联,这样使得一者的功能影响另一者的功能。例如,当启动子能够影响编码多核苷酸或功能RNA的表达时(即,该编码多核苷酸或功能RNA处于该启动子的转录控制之下),则该启动子与该编码多核苷酸或功能RNA是可操作地连接的。正义方向或者反义方向的编码多核苷酸能够与调节多核苷酸可操作地连接。
如在此使用的“杀有害生物”、“杀昆虫”等是指本发明的Cry蛋白控制有害生物的能力或者可以控制如在此所定义的有害生物的Cry蛋白的量。因此,杀有害生物Cry蛋白可以杀死或抑制有害生物(例如,昆虫有害生物)存活、生长、摄食、和/或繁殖的能力。
“植物”是在发育的任何阶段的任何植物,特别是种子植物。
“植物细胞”是植物的结构和生理单位,包含原生质体和细胞壁。植物细胞可以处于分离的单细胞或培养细胞的形式,或者是作为较高级的组织单位(例如,植物组织、植物器官、或整株植物)的一部分。
“植物细胞培养物”意指植物单元(例如像,原生质体、细胞培养物细胞、植物组织中的细胞、花粉、花粉管、胚珠、胚囊、接合子以及处于不同发育阶段的胚)的培养物。
“植物材料”是指叶、茎、根、花或花的部分、果实、花粉、卵细胞、接合子、种子、切条、细胞或组织培养物、或植物的任何其他部分或产物。
“植物器官”是植物的独特而明显的已结构化并且分化的部分,如根、茎、叶、花蕾或胚。
如在此使用的“植物组织”意指组织化成结构和功能单元的一组植物细胞。包括植物中或培养物中的任何植物组织。这个术语包括但不限于全株植物、植物器官、植物种子、组织培养物以及被组织化成结构和/或功能单元的任何植物细胞群组。这个术语与如以上列出的或由该定义以其他方式涵盖的任何具体类型的植物组织的联合应用或单独应用并不旨在排除任何其他类型的植物组织。
“多核苷酸”是指由共价键合于链中的许多核苷酸单体构成的聚合物。此类“多核苷酸”包括DNA、RNA、经修饰的寡核苷酸(例如,包含对于生物RNA或DNA不典型的碱基的寡核苷酸,如2'-O-甲基化寡核苷酸)等。在一些实施例中,核酸或多核苷酸可以是单链的、双链的、多链的或其组合。除非另外指示,否则本发明的具体核酸或多核苷酸任选地包含或编码除明确指示的任何多核苷酸之外的互补多核苷酸。
“感兴趣的多核苷酸”是指如下任何多核苷酸,当其转移至生物(例如,植物)中时赋予该生物所希望的特征,如抗生素抗性、病毒抗性、昆虫抗性、疾病抗性、或对其他有害生物的抗性、除草剂耐受性、改进的营养价值、工业过程中改进的性能、商业上有价值的酶或代谢物的生产、或者改变的繁殖能力。
术语“启动子”是指如下多核苷酸,通常在它的编码多核苷酸的上游(5'),它通过提供对正确转录所需的RNA聚合酶以及其他因子的识别来控制该编码多核苷酸的表达。
“原生质体”是分离的植物细胞,没有细胞壁或仅具有部分的细胞壁。
如在此使用的,术语“重组”是指核酸分子(例如,DNA或RNA)和/或蛋白质和/或生物的如下形式,该形式通常不会在自然界中发现并且正因为如此通过人类干预来产生。如在此使用的,“重组核酸分子”是包含多核苷酸组合的核酸分子,这些多核苷酸不会天然地一起存在并且是人类干预的结果,例如,由至少两种彼此异源的多核苷酸的组合组成的核酸分子,和/或人工合成的并且包含偏离通常存在于自然界中的多核苷酸的多核苷酸的核酸分子,和/或包含人工掺入至宿主细胞的基因组DNA中的转基因和该宿主细胞基因组的相关侧翼DNA的核酸分子。重组核酸分子的实例是由将转基因插入至植物的基因组DNA中产生的DNA分子,其可以最终导致该生物中的重组RNA和/或蛋白质分子的表达。如在此使用的,“重组植物”是通常不会在自然界中存在的植物,是人类干预的结果,并且含有掺入至其基因组中的转基因和/或异源核酸分子。由于此类基因组改变,重组植物明显不同于相关的野生型植物。
“调节元件”是指参与控制核苷酸序列的表达的序列。调节元件包含可操作地连接至感兴趣的核苷酸序列的启动子以及终止信号。它们还典型地涵盖正确翻译该核苷酸序列所需的序列。
在两个核酸或蛋白质序列的背景下,术语“一致的”或“基本上一致的”是指当针对最大对应性进行比较和比对时具有至少60%、优选80%、更优选90%、甚至更优选95%、并且最优选至少99%核苷酸或氨基酸残基一致性的两个或更多个序列或子序列,如使用以下序列比较算法之一或通过目测检查所测量的。优选地,基本上的一致性存在于这些序列的整个长度为至少约50个残基的区域中,更优选地在整个至少约100个残基的区域中,并且最优选地这些序列在至少约150个残基中是基本上一致的。在尤其优选的实施例中,这些序列在编码区的全部长度上基本上是一致的。此外,基本上一致的核酸或蛋白质序列基本上执行相同的功能。
对于序列比较,典型地,一个序列充当与测试序列进行比较的参考序列。当使用序列比较算法时,将测试序列和参考序列输入到计算机中(若有必要,指定子序列坐标),并且指定序列算法程序的参数。然后,该序列比较算法基于所指定的程序参数来计算这个或这些测试序列相对于该参考序列的序列一致性百分比。
用于比较的序列的最佳比对可以按照以下方式进行,例如通过Smith&Waterman,Adv.Appl.Math.2:482(1981)[Smith和Waterman,应用数学进展,2:482(1981)]的局部同源性算法、通过Needleman&Wunsch,J.Mol.Biol.48:443(1970)[Needleman和Wunsch,分子生物学杂志,48:443(1970)]的同源比对算法、通过Pearson&Lipman,Proc.Nat'l.AcadSci.USA 85:2444(1988)[Pearson和Lipman,美国国家科学院院刊,85:2444(1988)]的相似性方法的搜索,通过这些算法的计算机化实施(威斯康星州遗传学分析软件包(WisconsinGenetics Software Package),遗传学计算机组(Genetics Computer Group),科学街575号(575Science Dr.),麦迪逊,威斯康星州中的GAP、BESTFIT、FASTA、和TFASTA),或通过目测检查(总体上参见Ausubel et al.,infra[Ausubel等人,下文])。
适合于确定序列一致性百分比以及序列相似性的算法的一个实例是BLAST算法,它描述于以下文献中:Altschul et al.,J.Mol.Biol.215:403-410(1990)[Altschul等人,分子生物学杂志,215:403-410(1990)]。执行BLAST分析的软件是通过国家生物技术信息中心(the National Center for Biotechnology Information,美国国家医学图书馆(U.S.National Library of Medicine),洛克维尔大道8600号(8600Rockville Pike),贝塞斯达,马里兰州20894美国)可供公众使用的。这种算法涉及首先通过识别查询序列中具有长度W的短字码而识别得分高的序列对(HSP),这些得分高的序列对当与数据库序列中具有相同长度的字码(word)进行比对时匹配或满足一些正值阈值的得分T。T被称为邻近字码得分阈(Altschul et al.,1990[Altschul等人,1990])。这些初始的邻近字码命中充当种子用于起始搜索以发现含有它们的较长的HSP。然后,将这些字码命中在两个方向上沿着每个序列延伸直到累积的比对得分可以得到增加。对于核苷酸序列,使用参数M(对于一对匹配残基的奖赏得分;总是>0)和N(对于错配残基的罚分;总是<0)来计算累积的得分。对于氨基酸序列,使用评分矩阵来计算累积得分。当累积的比对得分从它的最大达到值降低了数量X;由于累积一个或多个负得分的残基比对使累积得分趋于0或0以下;或者到达任一序列的末端时,停止这些字码命中在每个方向上的延伸。BLAST算法的参数W、T、以及X决定了该比对的灵敏度与速度。BLASTN程序(对核苷酸序列来说)使用字长(W)为11、期望值(E)为10、截止值(cutoff)为100、M=5、N=-4、以及两条链的比较作为默认值。对于氨基酸序列,BLASTP程序使用字长(W)为3、期望值(E)为10、以及BLOSUM62评分矩阵作为默认值(参见Henikoff&Henikoff,Proc.Natl.Acad Sci.USA89:10915(1989)[Henikoff和Henikoff,美国国家科学院院刊,89:10915(1989)])。
除计算序列一致性百分比之外,BLAST算法还执行两个序列之间的相似性统计分析(参见例如,Karlin&Altschul,Proc.Nat'l.Acad.Sci.USA 90:5873-5787(1993)[Karlin和Altschul,美国国家科学院院刊,90:5873-5787(1993)])。由BLAST算法提供的相似性的一种量度是最小概率总和(P(N)),它提供了在两个核苷酸或氨基酸序列之间会偶然发生匹配的概率的指示。例如,若在测试核酸序列与参考核酸序列的比较中最小概率总和小于约0.1、更优选地小于约0.01、并且最优选地小于约0.001,则该测试核酸序列被认为是与该参考序列相类似的。
两个核酸序列基本上一致的另一指示是这两个分子在严格条件下彼此杂交。短语“特异性杂交”是指分子在严格条件下仅与特定的核苷酸序列结合、双链化或杂交,这是在该序列存在于复合混合物(例如,总细胞的)DNA或RNA中时进行的。“实质上结合”是指在探针核酸与靶核酸之间的互补杂交,并且涵盖少量错配,这些错配可以通过降低杂交介质的严格来容纳,以实现靶核酸序列的所希望的检测。
在核酸杂交实验(如DNA杂交和RNA杂交)的背景下,“严格杂交条件”和“严格杂交洗涤条件”是序列依赖性的,并且在不同的环境参数下是不同的。较长的序列特定地在较高的温度下杂交。对核酸杂交的广泛指导见于以下文献中:Tijssen(1993)LaboratoryTechniques in Biochemistry and Molecular Biology-Hybridization with NucleicAcid Probes part I chapter 2"Overview of principles of hybridization and thestrategy of nucleic acid probe assays"Elsevier,New York[Tijssen(1993)生物化学和分子生物学实验室技术-使用核酸探针的杂交第2章第I部分“杂交原理和核酸探针检验策略综述”,爱思唯尔,纽约]。通常,高严格杂交和洗涤条件在限定的离子强度和pH下被选定为比特定序列的热熔点(Tm)低约5℃。典型地,在“严格条件”下,探针将会与它的靶子序列进行杂交,但不会与其他序列杂交。
Tm是50%的靶序列与完全匹配的探针进行杂交时的温度(在限定的离子强度和pH下)。极严格条件被选定为等于具体探针的Tm。对于互补核酸(它们在DNA或RNA印迹中在滤器上具有超过100个互补的残基)的杂交的严格杂交条件的实例是在42℃下、具有1mg肝素的50%甲酰胺、将杂交进行过夜。高严格洗涤条件的实例是0.15M NaCl在72℃下持续约15分钟。严格洗涤条件的实例是0.2x SSC洗涤在65℃下持续15分钟(参见,Sambrook,下文,对于SSC缓冲剂的说明)。通常,高严格洗涤之前会先进行低严格洗涤,以去除背景探针信号。对于例如超过100个核苷酸的双链体的示例性中等严格洗涤是1x SSC在45℃下持续15分钟。对于例如超过100个核苷酸的双链体的低严格洗涤的实例是4-6x SSC在40℃下持续15分钟。对于短探针(例如,约10至50个核苷酸),严格条件典型地涉及小于约1.0M的Na离子的盐浓度,典型地在pH 7.0至8.3下约0.01至1.0M的Na离子浓度(或其他盐),并且温度典型地是至少约30℃。通过加入去稳定剂例如甲酰胺也可以实现严格条件。一般而言,相比于不相关的探针,在特定的杂交测定中观察到高出2倍(或更高)的信噪比就表明检测到特异性杂交。如果在严格条件下彼此不杂交的核酸所编码的蛋白质是基本上一致的,则它们仍然是基本上一致的。例如,当使用遗传密码允许的最大程度的密码子简并而生成核酸的拷贝时,则发生这种情况。
以下是杂交/洗涤条件组的实例,这些条件可被用来克隆与本发明的参考核苷酸序列基本上一致的同源核苷酸序列:参考核苷酸序列在以下条件下优选地与该参考核苷酸序列杂交:在7%十二烷基硫酸钠(SDS)、0.5M NaPO4、1mM EDTA中在50℃下,并且在2x SSC、0.1%SDS中在50℃下洗涤;更令人希望的是在7%十二烷基硫酸钠(SDS)、0.5M NaPO4、1mMEDTA中在50℃下,并且在1x SSC、0.1%SDS中在50℃下洗涤;仍更令人希望的是在7%十二烷基硫酸钠(SDS)、0.5M NaPO4、1mM EDTA中在50℃下,并且在0.5x SSC、0.1%SDS中在50℃下洗涤;优选地在7%十二烷基硫酸钠(SDS)、0.5M NaPO4、1mM EDTA中在50℃下,并且在0.1x SSC、0.1%SDS中在50℃下洗涤;更优选地在7%十二烷基硫酸钠(SDS)、0.5M NaPO4、1mM EDTA中在50℃下,并且在0.1x SSC、0.1%SDS中在65℃下洗涤。
两个核酸序列或蛋白质基本上一致的另一个指示是由第一核酸编码的蛋白质与由第二核酸编码的蛋白质进行免疫性交联反应或与其特异性结合。因此,蛋白质典型地是与第二蛋白质基本上一致的,例如其中这两种蛋白质仅区别于保守性置换。
“合成的”是指如下核苷酸序列,该核苷酸序列包含在天然序列中不存在的碱基和/或结构特征。例如,编码本发明的Cry蛋白的人工序列(其更类似于双子叶植物和/或单子叶植物基因的G+C含量和正常密码子分布)被表述为合成的。
“转化”是用于将异源核酸引入到宿主细胞或生物的方法。具体地,“转化”意指DNA分子稳定整合到感兴趣生物的基因组中。
“转化的/转基因的/重组的”是指引入了异源核酸分子的宿主生物,例如细菌或植物。该核酸分子可以被稳定地整合到宿主的基因组中,或者该核酸分子还可以作为染色体外分子存在。这样的染色体外分子能够自主复制。转化的细胞、组织或植物应当理解为不仅涵盖转化过程的终产物,而且涵盖其转基因子代。“非转化的”、“非转基因的”、或“非重组的”宿主是指不含有该异源的核酸分子的野生型生物,例如细菌或植物。
通过以下标准缩写由核苷酸的碱基来指示这些核苷酸:腺嘌呤(A)、胞嘧啶(C)、胸腺嘧啶(T)、以及鸟嘌呤(G)。氨基酸同样是由以下标准缩写来指示:丙氨酸(Ala;A)、精氨酸(Arg;R)、天冬酰胺(Asn;N)、天冬氨酸(Asp;D)、半胱氨酸(Cys;C)、谷氨酰胺(Gln;Q)、谷氨酸(Glu;E)、甘氨酸(Gly;G)、组氨酸(His;H)、异亮氨酸(Ile;I)、亮氨酸(Leu;L)、赖氨酸(Lys;K)、甲硫氨酸(Met;M)、苯丙氨酸(Phe;F)、脯氨酸(Pro;P)、丝氨酸(Ser;S)、苏氨酸(Thr;T)、色氨酸(Trp;W)、酪氨酸(Tyr;Y)、以及缬氨酸(Val;V)。
本发明提供了用于控制有害的植物有害生物的组合物以及方法。具体地,本发明涉及对植物有害生物有毒的Cry蛋白,并且涉及包含编码这些Cry蛋白的核苷酸序列的多核苷酸,并且涉及制备和使用这些多核苷酸和Cry蛋白控制植物有害生物的方法。
因此,在一些实施例中,提供了如下嵌合基因,该嵌合基因包含可操作地连接至多核苷酸的异源启动子,该多核苷酸包含编码对于至少黑色地老虎(小地老虎)有毒的蛋白质的核苷酸序列,其中该核苷酸序列(a)与SEQ ID NO:1-4中任一项具有至少80%(例如80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.1%、99.2%、99.3%、99.4%、99.5%、99.6%、99.7%、99.8%、99.9%)到至少99%(99%、99.1%、99.2%、99.3%、99.4%、99.5%、99.6%、99.7%、99.8%、99.9%)序列一致性;或者(b)编码包含氨基酸序列的蛋白质,该氨基酸序列与SEQID NO:13-16中任一项具有至少80%(例如80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.1%、99.2%、99.3%、99.4%、99.5%、99.6%、99.7%、99.8%、99.9%)到至少99%(99%、99.1%、99.2%、99.3%、99.4%、99.5%、99.6%、99.7%、99.8%、99.9%)序列一致性;或者(c)是(a)或(b)的合成序列,该合成序列已经进行密码子优化用于在转基因生物中表达。
在其他实施例中,该异源启动子是植物可表达型启动子。例如但不限于,该植物可表达型启动子可以选自下组,该组由以下各项组成:泛素、cmp、玉米TrpA、噬菌体T3基因95'UTR、玉米蔗糖合成酶1、玉米醇脱氢酶1、玉米捕光复合物、玉米热休克蛋白、豌豆小亚基RuBP羧化酶、Ti质粒甘露碱合酶、Ti质粒胭脂碱合酶、矮牵牛花查尔酮异构酶、大豆富甘氨酸蛋白1、马铃薯糖蛋白(Potato patatin)、凝集素、CaMV 35S以及S-E9小亚基RuBP羧化酶启动子。
在另外的实施例中,由嵌合基因编码的蛋白质另外对选自下组的一个或多个昆虫种类是有毒的,该组由以下各项组成:欧洲玉米蛀虫(欧洲玉米螟)、秋黏虫(草地贪夜蛾)、玉米穗蛾(玉米穗虫)、甘蔗螟(小蔗螟),绒毛豆毛虫(黎豆夜蛾)、大豆夜蛾(大豆尺蠖),西南玉米蛀虫(西南玉米螟)、西部豆切根虫(西部豆夜蛾)、烟夜蛾(烟芽夜蛾)、亚洲玉米蛀虫(亚洲玉米螟)、棉螟蛉(棉铃虫)、条纹蛀茎虫(二化螟)、粉蛀茎虫(非洲大螟)以及水稻卷叶螟(稻纵卷叶螟)。
在另外的实施例中,该多核苷酸包含如下核苷酸序列,该核苷酸序列与SEQ IDNO:1具有至少80%到至少99%序列一致性,或者与SEQ ID NO:2具有至少80%到至少99%序列一致性,或者与SEQ ID NO:3具有至少80%到至少99%序列一致性,或者与SEQ ID NO:4具有至少80%到至少99%序列一致性。
在其他实施例中,该多核苷酸包含编码如下蛋白质的核苷酸序列,该蛋白质包含与SEQ ID NO:13-16中任一项具有至少80%到至少99%序列一致性的氨基酸序列。
在仍其他实施例中,该氨基酸序列与SEQ ID NO:13具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性。
在另外的实施例中,该氨基酸序列与SEQ ID NO:14具有至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性。
在仍另外的实施例中,该氨基酸序列与SEQ ID NO:15具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性。
在其他实施例中,该氨基酸序列与SEQ ID NO:16具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性。
在一些实施例中,本发明的嵌合基因包含如下多核苷酸,该多核苷酸包含如下核苷酸序列的合成序列,该核苷酸序列与SEQ ID NO:5-12中任一项具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%一致性,其中该合成序列已经进行密码子优化用于在转基因生物中表达。在其他实施例中,本发明的嵌合基因包含如下核酸分子,该核酸分子包含编码如下蛋白质的核苷酸序列的合成序列,该蛋白质包含如下氨基酸序列,该氨基酸序列与SEQ ID NO:13-20中任一项具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性,其中该合成序列已经进行密码子优化用于在转基因生物中表达。在另外的实施例中,该转基因生物是转基因细菌或转基因植物。
在一些实施例中,本发明提供了如下合成多核苷酸,该合成多核苷酸包含以下项、基本上由以下项组成、或由以下项组成:编码对于至少黑色地老虎(小地老虎)有活性的蛋白质的核苷酸序列,其中该核苷酸序列与SEQ ID NO:5-12中任一项具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性。
在其他实施例中,本发明提供了如下合成多核苷酸,该合成多核苷酸包含以下项、基本上由以下项组成、或由以下项组成:编码对于至少黑色地老虎(小地老虎)有活性的蛋白质的核苷酸序列,其中该核苷酸序列编码如下氨基酸序列,该氨基酸序列与SEQ ID NO:13-20中任一项具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性。
在一些实施例中,本发明提供了如下合成多核苷酸,该合成多核苷酸包含以下项、基本上由以下项组成、或由以下项组成:如下核苷酸序列,该核苷酸序列进行至少一个密码子优化用于在转基因生物中表达并编码对于至少黑色地老虎(小地老虎)和玉米穗蛾(玉米穗虫)有毒的蛋白质,其中该蛋白质包含与SEQ ID NO:13具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性的氨基酸序列,并且在对应于SEQ ID NO:13的氨基酸位置40-44的位置处的氨基酸序列是NLNSC。在另外的实施例中,该多核苷酸包含以下项、基本上由以下项组成、或由以下项组成:SEQ ID NO:5或SEQ ID NO:9。在另外的实施例中,该氨基酸序列包含以下项、基本上由以下项组成、或由以下项组成:SEQ ID NO:13或SEQ ID NO:17。
根据一些实施例,本发明提供了对于至少黑色地老虎(小地老虎)有毒的分离的蛋白质,其中该蛋白质包含以下项、基本上由以下项组成、或由以下项组成:(a)与SEQ ID NO:13-20中任一项代表的氨基酸序列具有至少80%序列一致性到至少99%序列一致性的氨基酸序列;或者(b)由如下核苷酸序列编码的氨基酸序列,该核苷酸序列与由SEQ ID NO:5-12中任一项代表的核苷酸序列具有至少80%序列一致性到至少99%序列一致性。
在其他实施例中,该分离的蛋白质包含以下项、基本上由以下项组成、或由以下项组成:与SEQ ID NO:13-16中任一项具有至少80%到至少99%序列一致性的氨基酸序列。在仍其他实施例中,该氨基酸序列与SEQ ID NO:13具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性。
在其他实施例中,该氨基酸序列与SEQ ID NO:14具有至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性。
在另外的实施例中,该氨基酸序列与SEQ ID NO:15具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性。
在仍另外的实施例中,该氨基酸序列与SEQ ID NO:16具有至少80%、或至少81%、或至少82%、或至少83%、或至少84%、或至少85%、或至少86%、或至少87%、或至少88%、或至少89%、或至少90%、或至少91%、或至少92%、或至少94%、或至少94%、或至少95%、或至少96%、或至少97%、或至少98%、或至少99%、或至少99.1%、或至少99.2%、或至少99.3%、或至少99.4%、或至少99.5%、或至少99.6%、或至少99.7%、或至少99.8%、或至少99.9%序列一致性。
在一些实施例中,该氨基酸序列包含以下项、基本上由以下项组成、或由以下项组成:SEQ ID NO:13-20中任一项。
本发明的响应于通过天然的或突变型BT-0044、BT-0051、BT-0068和BT-0128等或相关蛋白的免疫激发而产生的抗体可以使用生产多克隆抗血清的标准免疫学技术来生产,并且如果需要的话,无限增殖经免疫宿主的抗体产生细胞用作单克隆抗体生产源。用于生产任何感兴趣物质的抗体的技术是熟知的,例如如在以下文献中:Harlow and Lane(1988)[Harlow和Lane(1988)]和Goding(1986)[Goding(1986)]。本发明涵盖杀昆虫蛋白,其与针对本发明的杀昆虫Cry蛋白中的一种或多种产生的抗体交叉反应。
本发明中生产的这些抗体在用于确定生物样品中天然或突变型BT-0044、BT-0051、BT-0068和BT-0128或相关蛋白的量或存在的免疫测定中也是有用的。此类测定在质量控制生产含有本发明的毒性蛋白中的一种或多种或相关毒性蛋白的组合物中也是有用的。此外,这些抗体可以用来评估本发明的蛋白中的一种或多种或相关蛋白的重组生产的效力,连同针对编码本发明的蛋白中的一种或多种的核苷酸序列或相关蛋白编码序列的存在而筛选表达文库的效力。抗体还作为亲和配体用于纯化和/或分离本发明的蛋白中的任何一种或多种和相关蛋白是有用的。本发明的蛋白和含有相关抗原表位的蛋白可以通过在优选的宿主细胞中过度表达编码全部或部分本发明的蛋白或相关蛋白的序列的全长或部分长度来获得。
应当认识到,可以通过不同的方法来改变编码本发明的Cry蛋白的DNA序列,并且这些改变可以产生编码如下蛋白质的DNA序列,这些蛋白质具有不同于由本发明的天然Cry蛋白所编码的氨基酸序列。这种蛋白质可以按照不同的方式进行改变,包括SEQ ID NO:13-16中任一项的一个或多个氨基酸的氨基酸置换、缺失、截短、以及插入,包括多达约2、约3、约4、约5、约6、约7、约8、约9、约10、约15、约20、约25、约30、约35、约40、约45、约50、约55、约60、约65、约70、约75、约80、约85、约90、约100、约105、约110、约115、约120、约125、约130、约135、约140、约145、约150、约155个、或更多个氨基酸的置换、缺失或插入。用于这样的操作的方法在本领域中通常是已知的。例如,通过在编码该蛋白质的多核苷酸中的突变可以制备天然Cry蛋白的氨基酸序列变体。这还可以通过几种诱变形式之一和/或在定向进化中来完成。在一些方面中,在该氨基酸序列中所编码的改变将实质上不影响该蛋白质的功能。此类变体将具有所希望的杀昆虫活性。在本发明的一个实施例中,由SEQ ID NO:1-4代表的核苷酸序列被改变,以在编码的蛋白质中引入氨基酸置换。在一些实施例中,所得到的突变型蛋白是由合成的突变型多核苷酸编码的,该多核苷酸包含由SEQ ID NO:9-12中任一项代表的核苷酸序列。在其他实施例中,这些突变型蛋白包含以下项、基本上由以下项组成、或由以下项组成:由SEQ ID NO:17-20中任一项代表的氨基酸序列。
应当理解的是可以通过使用此类技术改善杀昆虫蛋白对本发明的这些组合物赋予杀昆虫活性的能力。例如,可以在如下宿主细胞中表达Cry蛋白,这些宿主细胞在DNA复制过程中显示出高比率的的碱基错掺入,如XL-l Red(Stratagene公司,拉荷亚,加利福尼亚州)。在此类菌株中繁殖之后,可以分离出DNA(例如通过制备质粒DNA,或通过由PCR进行扩增并且将得到的PCR片段克隆到载体中),在非诱变菌株中培养这些Cry蛋白突变体,并且鉴定具有杀昆虫活性的经突变的基因,例如通过进行对杀昆虫活性进行测试的测定。通常,在摄食测定中混合并使用了该蛋白质。参见例如Marrone et al.(1985)J.of EconomicEntomology 78:290-293[Marrone等人(1985)经济昆虫学杂志,78:290-293]。此类测定可以包括使植物与一种或多种有害生物接触,并且确定该植物存活和/或引起这些有害生物死亡的能力。导致毒性提高的突变的实例见于Schnepf et al.(1998)Microbiol.Mol.Biol.Rev.62:775-806[Schnepf等人(1998)微生物分子生物学综述,62:775-806]。
可替代地,可以在氨基或羧基的末端上对本发明的氨基酸序列进行改变,而实质上不影响活性。这可以包括通过现代分子方法所引入的插入、缺失、或改变,这些方法是如PCR,包括PCR扩增,这些PCR扩增借助于将编码氨基酸的序列包含到在PCR扩增中所使用的寡核苷酸之中而改变或延长该蛋白质编码序列。可替代地,所加入的蛋白质序列可以包括完整的蛋白质编码序列,例如在本领域内通常用于产生蛋白质融合物的那些序列。此类融合蛋白常常用于(1)增加感兴趣的蛋白质的表达;(2)引入结合结构域、酶活性、或表位以促进蛋白质纯化、蛋白质检测、或本领域已知的其他实验用途;(3)将蛋白质的分泌或翻译靶向亚细胞器,例如革兰氏阴性菌的壁膜间隙,或真核细胞的内质网,后者常常导致蛋白质的糖基化。
本发明的Cry蛋白还可以被突变以引入表位来产生识别该经突变蛋白的抗体。因此,在一些实施例中,本发明提供了经突变的Cry蛋白,其中在天然Cry蛋白中的氨基酸置换产生了具有抗原区域的突变型Cry蛋白,该抗原区域允许该突变型Cry蛋白在蛋白质检测分析中区别于该天然Cry蛋白。在其他实施例中,本发明提供了本发明的经突变Cry蛋白,其中氨基酸序列包含对应于SEQ ID NO:6的氨基酸342-354的区域中的氨基酸置换。在其他实施例中,该氨基酸序列包含在SEQ ID NO:6的位置342、343、344、345、346、347、348、349、350、351、352、353或354处的氨基酸置换。在仍其他实施例中,该氨基酸序列包含在对应于SEQID NO:6的氨基酸350、351和354的氨基酸位置处的氨基酸置换。在另外的实施例中,该氨基酸序列包含在对应于SEQ ID NO:6的氨基酸350、351和354的氨基酸位置处的氨基酸置换。在仍另外的实施例中,对应于位置350的氨基酸被异亮氨酸(I)置换,对应于位置351的氨基酸被谷氨酰胺(Q)置换,且对应于位置354的氨基酸被丝氨酸(S)置换。在其他实施例中,SEQID NO:6的位置350处的亮氨酸(L)被异亮氨酸(I)置换,在SEQ ID NO:6的位置351处的天冬酰胺(N)被谷氨酰胺(Q)置换,且在SEQ ID NO:6的位置354处的苏氨酸(T)被丝氨酸(S)置换。在其他实施例中,该天然Cry蛋白包含由SEQ ID NO:13-16中任一项代表的氨基酸序列。在仍其他实施例中,该天然Cry蛋白包含由SEQ ID NO:6代表的氨基酸序列,并且该突变型蛋白包含由SEQ ID NO:18代表的氨基酸序列。
在一些实施例中,本发明提供了特异性识别本发明的突变型Cry蛋白的表位的抗体,其中该表位包含在对应于SEQ ID NO:6的氨基酸342-354的氨基酸中具有一处或多处置换的氨基酸序列。在其他实施例中,该表位包含在SEQ ID NO:6的氨基酸342-354中具有一处或多处置换的氨基酸序列。在仍其他实施例中,该表位包含SEQ ID NO:18的氨基酸342-354。
在一些实施例中,本发明提供了制备如下抗体的方法,所述抗体从衍生出经突变的Cry蛋白的天然Cry蛋白中差异性地识别出经突变的Cry蛋白,该方法包括以下步骤:在天然Cry蛋白的抗原环中置换氨基酸;并且产生特异性识别茎突变Cry蛋白的经突变抗原环但不识别该天然Cry蛋白的抗体。在一个实施例中,该抗原环在天然Cry蛋白的结构域I的外部的非保守区中被鉴定出。在另一个实施例中,该抗原环不是参与Cry蛋白的昆虫肠受体识别或参与Cry蛋白的蛋白酶活化的环。在另一个实施例中,该抗原环包含对应于SEQ ID NO:6的氨基酸341-354的氨基酸序列。在又另一个实施例中,该抗原环包含SEQ ID NO:6的氨基酸342-354。
本发明的变体核苷酸和氨基酸序列还涵盖了由诱变和引起重组的程序(如DNA改组)所衍生的序列。使用此类程序,可以将一个或多个不同的毒性蛋白编码区用来创造出新的具有所希望特性的毒性蛋白。用这种方式,从相关序列多核苷酸的群体产生重组多核苷酸文库,这些相关序列多核苷酸包含如下序列区域,这些序列区域具有基本的序列一致性并且可以在体外或体内进行同源重组。例如,使用这种方法,可以将编码感兴趣的结构域的序列基序在本发明的杀有害生物基因与其他已知的杀有害生物基因之间进行改组,以获得编码如下蛋白质的新基因,该蛋白质具有改进的感兴趣的特性,例如增加的杀昆虫活性。用于此种DNA改组的策略在本领域中是已知的。参见例如,Stemmer(1994)Proc.Natl.Acad.Sci.USA 91:10747-10751[Stemmer(1994)美国国家科学院院刊,91:10747-10751];Stemmer(1994)Nature 370:389-391[Stemmer(1994)自然,370:389-391];Crameri et al.(1997)Nature Biotech.15:436-438[Crameri等人(1997)自然生物技术,15:436-438];Moore et al.(1997)J.Mol.Biol.272:336-347[Moore等人(1997)分子生物学杂志,272:336-347];Zhang et al.(1997)Proc.Natl.Acad.Sci.USA 94:4504-4509[Zhang等人(1997)美国国家科学院院刊,94:4504-4509];Crameri et al.(1998)Nature391:288-291[Crameri等人(1998)自然,391:288-291];以及美国专利号5,605,793和5,837,458。
结构域交换或改组是用于产生本发明的经改变的Cry蛋白的另一种机制。可以在Cry蛋白之间交换结构域,从而产生具有改进的杀有害生物活性或目标谱的杂合或嵌合毒性蛋白。用于产生重组蛋白和测试它们的杀有害生物活性的方法在本领域是熟知的(参见例如,Naimov et al.(2001)Appl.Environ.Microbiol.67:5328-5330[Naimov等人(2001)应用与环境微生物学,67:5328-5330];de Maagd et al.(1996)Appl.Environ.Microbiol.62:1537-1543[de Maagd等人(1996)应用与环境微生物学,62:1537-1543];Ge et al.(1991)J.Biol.Chem.266:17954-17958[Ge等人(1996)生物化学杂志,266:17954-17958];Schnepf et al.(1990)J.Biol.Chem.265:20923-20930[Schnepf等人(1990)生物化学杂志,265:20923-20930];Rang et al.(1999)Appl.Environ.Microbiol.65:2918-2925[Rang等人(1999)应用与环境微生物学,65:2918-2925])。
在一些实施例中,本发明提供了如下重组载体,该重组载体包含本发明的多核苷酸、核酸分子、表达盒或嵌合基因。在其他实施例中,该载体被进一步限定为质粒、粘粒、噬菌粒、人工染色体、噬菌体或病毒载体。用于在植物和其他生物的转化中使用的某些载体在本领域是已知的。
因此,本发明的一些实施例针对被设计成表达本发明的多核苷酸和核酸分子的表达盒。如在此使用的,“表达盒”是指如下核酸分子,该核酸分子具有至少一个操作性地连接到感兴趣的核苷酸序列上的控制序列。以这种方式,例如,可操作地连接至待表达的核苷酸序列的植物启动子可以在表达盒中提供,用于在植物、植物部分和/或植物细胞中进行表达。
包含感兴趣的核苷酸序列的表达盒可以是嵌合的,意味着它的组分中的至少一种相对于它的其他组分中的至少一种是异源的。表达盒还可以是天然存在的但已经是以对于异源表达有用的重组形式而获得的表达盒。然而,典型地,该表达盒相对于该宿主是异源的,即该表达盒的特定核酸序列不是天然存在于该宿主细胞中的,并且必须已经通过转化事件引入到该宿主细胞或该宿主细胞的祖先中。
除可操作地连接至本发明的核苷酸序列的启动子之外,本发明的表达盒还可以包括其他调节序列。如在此使用的,“调节序列”意指位于编码序列的上游(5'非编码序列)、内部或下游(3'非编码序列)并且影响相关编码序列的转录、RNA加工或稳定性、或翻译的核苷酸序列。调节序列包括但不限于增强子、内含子、翻译前导序列、终止信号、以及多腺苷酸化信号序列。
在一些实施例中,本发明的表达盒还可以包括对其他所希望的性状进行编码的核苷酸序列。此类核苷酸序列可以与核苷酸序列的任何组合叠加,以产生具有所希望的表型的植物、植物部分或植物细胞。叠加的组合可以通过任何方法来产生,包括但不限于,通过任何常规的方法学的杂交育种植物或通过遗传转化(即,分子叠加)。如果是通过遗传转化这些植物来进行叠加的,感兴趣的核苷酸序列可以在任何时间并且以任何次序进行组合。例如,包含一种或多种所希望的性状的转基因植物可以用作通过后续转化而引入另外的性状的靶标。另外的核苷酸序列可以在共转化方案中与由表达盒的任何组合提供的本发明的核苷酸序列、核酸分子、核酸构建体、和/或组合物同时引入。例如,如果将引入两个核苷酸序列,则它们可以掺入在分开的盒(反式)中或可以掺入在相同的盒(顺式)上。多核苷酸的表达可以通过相同的启动子或通过不同的启动子来驱动。应进一步认识到多核苷酸可以使用位点特异性重组系统在所希望的基因组位置处叠加。参见例如,国际专利申请公开号WO99/25821;WO 99/25854;WO 99/25840;WO 99/25855以及WO 99/25853。
表达盒还可以包括一种或多种多肽的编码序列,这一种或多种多肽用于主要受益者是种子公司、栽培者或谷物加工者的农艺性状。感兴趣的多肽可以是由感兴趣的核苷酸序列编码的任何多肽。适合用于在植物中产生的感兴趣的多肽的非限制性实例包括产生农艺学重要性状的那些多肽,这些性状是如除草剂抗性(有时也称为“除草剂耐受性”)、病毒抗性、细菌病原体抗性、昆虫抗性、线虫抗性、和/或真菌抗性。参见例如,美国专利号5,569,823;5,304,730;5,495,071;6,329,504;以及6,337,431。多肽还可以是提高植物活力或产量(包括允许植物在不同的温度、土壤条件以及日光和沉淀水平下生长的性状)的多肽,或是允许对展现感兴趣性状(例如,选择性标记、种皮颜色等)的植物进行鉴定的多肽。不同的感兴趣的多肽,以及用于将这些多肽引入植物的方法描述于例如,美国专利号4,761,373、4,769,061、4,810,648、4,940,835、4,975,374、5,013,659、5,162,602、5,276,268、5,304,730、5,495,071、5,554,798、5,561,236、5,569,823、5,767,366、5,879,903、5,928,937、6,084,155、6,329,504以及6,337,431;以及美国专利公开号2001/0016956中。还参见,万维网上的lifesci.sussex.ac.uk/home/Neil_Crickmore/Bt/。
赋予对抑制生长点或分生组织的除草剂(例如咪唑啉酮或磺酰脲)的抗性/耐受性的多核苷酸也可以适用于本发明的一些实施例中。对于突变型ALS和AHAS酶在这一分类号中的示例性多核苷酸如描述于例如,美国专利号5,767,366和5,928,937中。美国专利号4,761,373和5,013,659针对抵抗不同的咪唑啉酮或磺酰脲除草剂的植物。美国专利号4,975,374涉及含有如下核酸的植物细胞和植物,所述核酸编码突变型谷氨酰胺合成酶(GS),所述突变型谷氨酰胺合成酶抵抗已知抑制GS的除草剂(例如,草胺膦和甲硫氨酸磺基肟(methionine sulfoximine))的抑制作用。美国专利号5,162,602披露了抵抗环己二酮和芳氧苯氧丙酸除草剂的抑制作用的植物。该抗性由改变的乙酰辅酶A羧化酶(ACCase)赋予。
赋予对草甘膦抗性的由核苷酸序列编码的多肽也适用于本发明。参见例如,美国专利号4,940,835和美国专利号4,769,061。美国专利号5,554,798披露了抗草甘膦的转基因玉蜀黍植物,所述抗性由改变的5-烯醇丙酮莽草酸-3-磷酸(EPSP)合酶基因赋予。
编码对磷酰基化合物(例如草铵膦或草胺膦、以及吡啶氧丙酸或苯氧丙酸以及环己酮)的抗性的多核苷酸也是适合的。参见欧洲专利申请号0 242246。还参见美国专利号5,879,903、5,276,268和5,561,236。
其他合适的多核苷酸包括编码对抑制光合作用的除草剂(例如三嗪和苯基氰(腈水解酶))的抗性的那些,参见美国专利号4,810,648。编码用于除草剂抗性的另外的合适多核苷酸包括编码对2,2-二氯丙酸、烯禾啶、吡氟氯禾灵、咪唑啉酮除草剂、磺酰脲除草剂、三唑并嘧啶除草剂、均三嗪除草剂以及溴草腈的抗性的那些。同样适合的是赋予对原卟啉原氧化酶的抗性或者提供增强的对植物疾病的抗性、增强的对不利环境条件(非生物胁迫)的耐受性(这些条件包括但不限于干旱、极冷、极热、或极端的土壤盐度或极端的酸度或碱度)、以及在植物构造或发育中的改变(包括发育时间方面的变化)的多核苷酸。参见例如,美国专利公开号2001/0016956和美国专利号6,084,155。
另外的合适的多核苷酸包括对杀有害生物(例如杀昆虫)多肽进行编码的那些。这些多肽可以按足以控制例如昆虫有害生物的量(即,昆虫控制量)进行生产。应认识到在植物中对控制昆虫或其他有害生物必要的杀有害生物多肽的生产量可以变化,这取决于栽培品种、有害生物的类型、环境因素等。有用于另外的昆虫或有害生物抗性的多核苷酸例如包括如下核苷酸序列,它们编码芽孢杆菌属(Bacillus)生物中鉴定到的毒素。已经克隆了包含编码来自几个亚种的苏云金芽孢杆菌(Bt)杀昆虫蛋白的核苷酸序列的多核苷酸,并且已经发现这些重组克隆对鳞翅目、双翅目和鞘翅目昆虫幼虫是有毒的。此类Bt杀昆虫蛋白的实例包括以下Cry蛋白,例如Cry1Aa、Cry1Ab、Cry1Ac、Cry1B、Cry1C、Cry1D、Cry1Ea、Cry1Fa、Cry3A、Cry9A、Cry9B、Cry9C等,连同营养期杀昆虫蛋白例如Vip1、Vip2、Vip3等。Bt来源的蛋白质的完整清单可以在万维网在苏塞克斯大学(University of Sussex)维护的苏云金芽孢杆菌毒素命名法数据库中找到(还参见,Crickmore et al.(1998)Microbiol.Mol.Biol.Rev.62:807-813[Crickmore等人(1998)微生物分子生物学综述,62:807-813])。
适合在植物中产生的多肽进一步包括改进或通过其他方式有助于收获的植物和/或植物部分转化成为商业上有用的产品(包括例如增加的或改变的碳水化合物含量和/或分布、改进的发酵特性、增加的油含量、增加的蛋白含量、改进的消化率、以及增加的营养成分含量(例如,增加的植物甾醇含量、增加的生育酚含量、增加的甾烷醇含量和/或增加的维生素含量))的那些。感兴趣的多肽还包括例如在收获的作物中导致或促成不需要的成分,例如植酸、或降解糖的酶类的含量降低的那些。“导致(resulting in)”或“促成(contributing to)”是指这种感兴趣的多肽可以直接或间接地促成感兴趣的性状的存在(例如,通过异源纤维素酶的使用来增加纤维素降解)。
在一个实施例中,多肽有助于改进的食品或饲料的可消化性。木聚糖酶是半纤维素分解酶,这些酶改善了植物细胞壁的分解,这导致动物更好地利用这些植物营养素。这导致了改进的生长率和饲料转化。同样,可以减小含有木聚糖的饲料的粘度。在植物细胞内异源产生木聚糖酶也可以促进木质纤维素转化成工业加工中的可发酵糖。
来自真菌和细菌微生物的多种木聚糖酶已经得到鉴别和表征(参见例如,美国专利号5,437,992;Coughlin et al.(1993)“Proceedings of the Second TRICELSymposium on Trichoderma reesei Cellulases and Other Hydrolases”Espoo[Coughlin等人(1993)“里氏木霉纤维素酶和其他水解酶的第二TRICEL研讨会论文集”,埃斯波];Souminen and Reinikainen,eds.(1993)Foundation for Biotechnical andIndustrial Fermentation Research 8:125-135[Souminen和Reinikainen编著(1993)生物技术和工业发酵研究基金会,8:125-135];美国专利公开号2005/0208178;以及PCT公开号WO03/16654)。具体地说,在里氏木霉(T.reesei)中已经鉴别出三种特异性木聚糖酶(XYL-I、XYL-II、和XYL-III)(Tenkanen et al.(1992)Enzyme Microb.Technol.14:566[Tenkanen等人(1992)酶与微生物技术,14:566];Torronen et al.(1992)Bio/Technology10:1461[Torronen等人(1992)生物/技术,10:1461];以及Xu et al.(1998)Appl.Microbiol.Biotechnol.49:718[Xu等人(1998)应用微生物与生物技术,49:718])。
在另一个实施例中,对于本发明有用的多肽可以是多糖降解酶。产生这样的酶的本发明的植物对于产生例如用于生物加工的发酵原料会是有用的。在一些实施例中、可用于发酵过程的酶包括α淀粉酶、蛋白酶、支链淀粉酶、异淀粉酶、纤维素酶、半纤维素酶、木聚糖酶、环糊精糖基转移酶、脂肪酶、植酸酶、漆酶、氧化酶、酯酶、角质酶、颗粒淀粉水解酶以及其他葡糖淀粉酶。
多糖降解酶包括:淀粉降解酶,例如α-淀粉酶(EC 3.2.1.1)、葡糖醛酸酶(E.C.3.2.1.131);外-1,4-α-D葡聚糖酶,例如淀粉糖化酶和葡糖淀粉酶(EC 3.2.1.3)、β-淀粉酶(EC 3.2.1.2)、α-糖苷酶(EC 3.2.1.20)以及其他外-淀粉酶;淀粉脱支酶,例如a)异淀粉酶(EC 3.2.1.68)、支链淀粉酶(EC 3.2.1.41)等;b)纤维素酶,例如外-1,4-3-纤维二糖水解酶(EC3.2.1.91)、外-1,3-β-D-葡聚糖酶(EC 3.2.1.39)、β-糖苷酶(EC3.2.1.21);c)L-阿拉伯糖酶,例如内-1,5-α-L-阿拉伯糖酶(EC3.2.1.99)、α-阿拉伯糖苷酶(EC3.2.1.55)等;d)半乳聚糖酶,例如内-1,4-β-D-半乳聚糖酶(EC 3.2.1.89)、内-1,3-β-D-半乳聚糖酶(EC3.2.1.90)、α-半乳糖苷酶(EC 3.2.1.22)、β-半乳糖苷酶(EC 3.2.1.23)等;e)甘露聚糖酶,例如内-1,4-β-D-甘露聚糖酶(EC 3.2.1.78)、β-甘露糖苷酶(EC 3.2.1.25)、α-甘露糖苷酶(EC 3.2.1.24)等;f)木聚糖酶,例如内-1,4-β-木聚糖酶(EC 3.2.1.8)、β-D-木糖苷酶(EC 3.2.1.37)、1,3-β-D-木聚糖酶等;以及g)其他酶,例如α-L-岩藻糖苷酶(EC3.2.1.51)、α-L-鼠李糖苷酶(EC 3.2.1.40)、果聚糖酶(EC 3.2.1.65)、菊粉酶(EC3.2.1.7)等。在一个实施例中,α-淀粉酶是描述于美国专利号8,093,453中的合成α-淀粉酶Amy797E,将该专利通过引用以其全文结合在此。
可以与本发明一起使用的另外的酶包括蛋白酶,如真菌和细菌蛋白酶。真菌蛋白酶包括但不限于从曲霉属(Aspergillus)、木霉属(Trichoderma)、毛霉属(Mucor)和根霉属(Rhizopus),如黑曲霉(A.niger)、泡盛曲霉(A.awamori)、米曲霉(A.oryzae)和米黑毛霉(M.miehei)获得的那些。在一些实施例中,本发明的多肽可以是纤维二糖水解酶(CBH)(EC3.2.1.91)。在一个实施例中,该纤维二糖水解酶可以是CBH1或CBH2。
与本发明一起使用的其他酶包括但不限于半纤维素酶,如甘露聚糖酶和阿拉伯呋喃糖苷酶(EC 3.2.1.55);木质素酶;脂肪酶(例如,E.C.3.1.1.3)、葡糖氧化酶、果胶酶、木聚糖酶、转葡糖苷酶、α1.6葡糖苷酶(例如,E.C.3.2.1.20);酯酶,如阿魏酸酯酶(EC3.1.1.73)和乙酰基木聚糖酯酶(EC 3.1.1.72);以及角质酶(例如E.C.3.1.1.74)。
在一些实施例中,本发明提供了如下转基因非人类宿主细胞,该细胞包含本发明的多核苷酸、核酸分子、嵌合基因、表达盒或重组载体。转基因非人类宿主细胞可以包括但不限于植物细胞、酵母细胞、细菌细胞或昆虫细胞。因此,在一些实施例中,本发明提供了选自以下属的细菌细胞:芽孢杆菌属(Bacillus)、短芽孢杆菌属(Brevibacillus)、梭菌属(Clostridium)、致病杆菌属(Xenorhabdus)、发光杆菌属(Photorhabdus)、巴斯德氏芽菌属(Pasteuria)、埃希氏菌属(Escherichia)、假单胞菌属(Pseudomonas)、欧文氏菌属(Erwinia)、沙雷氏菌属(Serratia)、克雷伯菌属(Klebsiella)、沙门氏菌属(Salmonella)、巴氏杆菌属(Pasteurella)、黄单胞菌属(Xanthomonas)、链霉菌属(Streptomyces)、根瘤菌属(Rhizobium)、红假单胞菌属(Rhodopseudomonas)、嗜甲基菌属(Methylophilius)、农杆菌属(Agrobacterium)、醋杆菌属(Acetobacter)、乳杆菌属(Lactobacillus)、节杆菌属(Arthrobacter)、固氮菌属(Azotobacter)、明串珠菌属(Leuconostoc)或产碱杆菌属(Alcaligenes)。因此,例如,作为生物昆虫控制剂,本发明的Cry蛋白可以通过在细菌细胞中表达编码本发明的Cry蛋白的嵌合基因而产生。例如,在一个实施例中,提供了包含本发明的嵌合基因的苏云金芽孢杆菌细胞。
在另外的实施例中,本发明提供了作为双子叶植物细胞或单子叶植物细胞的植物细胞。在另外的实施例中,该双子叶植物细胞选自下组,该组由以下各项组成:大豆细胞、向日葵细胞、番茄细胞、芸苔属作物细胞、棉花细胞、糖用甜菜细胞以及烟草细胞。在另外的实施例中,该单子叶植物细胞选自下组,该组由以下各项组成:大麦细胞、玉蜀黍细胞、燕麦细胞、水稻细胞、高粱细胞、甘蔗细胞以及小麦细胞。在一些实施例中,本发明提供了多个双子叶植物细胞或单子叶植物细胞,这些细胞表达由本发明的嵌合基因编码的本发明的毒性蛋白。在其他实施例中,将该多个细胞并列以形成质外体并且使其在自然光照中生长。
在本发明的另一个实施例中,在高等生物(例如,植物)中表达本发明的毒性蛋白。在这种情况下,表达有效量的毒性蛋白的转基因植物保护自身免受植物有害生物如昆虫有害生物的伤害。在昆虫开始以这种转基因植物为食时,它也摄取了这种已表达的毒素。这可以妨碍昆虫进一步咬食植物组织或者甚至可以伤害或杀死昆虫。本发明的多核苷酸被插入表达盒中,然后该表达盒被稳定地整合到植物的基因组中。在另一个实施例中,该多核苷酸被包括在非致病性自我复制病毒中。根据本发明转化的植物可以是单子叶植物或双子叶植物,并且包括但不限于玉米(玉蜀黍)、大豆、水稻、小麦、大麦、黑麦、燕麦、高粱、粟、向日葵、红花、糖用甜菜、棉花、甘蔗、油菜、苜蓿、烟草、花生、蔬菜(包括甘薯、豆类、豌豆、菊苣、莴苣、甘蓝、花椰菜、西兰花、芜菁、胡萝卜、茄子、黄瓜、萝卜、菠菜、马铃薯、番茄、芦笋、洋葱、大蒜、瓜类、胡椒、芹菜、南瓜、西葫芦、绿皮西葫芦)、水果(包括苹果、梨、榅桲、李、樱桃、桃、蜜桃、杏、草莓、葡萄、覆盆子、黑莓、菠萝、鳄梨、番木瓜、芒果、香蕉)和特种植物如拟南芥以及木本植物如针叶树和落叶树。优选地,本发明的植物是作物植物,如玉蜀黍、高粱、小麦、向日葵、番茄、十字花科植物、胡椒、马铃薯、棉花、水稻、大豆、糖用甜菜、甘蔗、烟草、大麦、油菜等。
一旦所希望的多核苷酸已经被转化进特定的植物种类中,便可以使用传统的育种技术将其在该种类中繁殖或将其转移到相同种类的其他品种中,特别是包括商业品种。
在转基因植物中表达本发明的多核苷酸,由此导致在这些转基因植物中相应Cry蛋白的生物合成。以此方式,产生在存在昆虫压力下具有增强的产量保护的转基因植物。用于它们在转基因植物中的表达,本发明的核苷酸序列可能需要修饰和优化。尽管在许多情况下,来自微生物有机体的基因能够在植物中高水平表达而无需修饰,在转基因植物中的低表达可能是由于微生物核苷酸序列的缘故,这些序列具有在植物中并不优选的密码子。在本领域中已知,活生物具有特定的密码子使用偏好,而且在本发明中所描述的这些核苷酸序列的密码子可以被改变以符合植物偏好,同时维持由其编码的氨基酸。此外,在植物(例如玉米植物)中高表达最好是由如下编码序列实现的,这些编码序列具有至少约35%、或至少约45%、或至少约50%、或至少约60%的GC含量。具有低GC含量的微生物核苷酸序列在植物中也许表达欠佳,这是由于存在着可能使信息不稳定的ATTTA基序,以及可导致不恰当的多腺苷酸化的AATAAA基序。尽管某些基因序列可以在单子叶植物和双子叶植物种类两者中充分表达,但是可以对序列进行修饰以便迎合单子叶植物或双子叶植物的特定密码子偏好以及GC含量偏好,因为这些偏好已经被证明是不同的(Murray et al.Nucl.AcidsRes.17:477-498(1989)[Murray等人,核酸研究,17:477-498(1989)])。此外,针对不正常剪接位点的存在来对这些核苷酸序列进行筛选,这些位点可能导致信息平截(messagetruncation)。使用描述于例如美国专利号5,625,136、5,500,365和6,013,523中的方法,使用熟知的定点诱变、PCR以及合成基因构建技术对在这些核苷酸序列之内所有需要做出的变化(如以上所描述的那些)进行改变。
在一些实施例中,本发明提供了根据披露于美国专利号5,625,136中的程序制备的合成基因,将该专利通过引用结合在此。在这个操作中,使用了玉蜀黍偏好的密码子,即最频繁地编码玉蜀黍中的氨基酸的单一密码子。针对特定的氨基酸的玉蜀黍偏好的密码子可源自例如来自玉蜀黍的已知基因序列。例如,针对来自玉蜀黍植物的28个基因的玉蜀黍密码子使用发现于以下文献中:Murray et al.,Nucleic Acids Research 17:477-498(1989)[Murray等人,核酸研究,17:477-498(1989)],将其披露内容通过引用结合在此。本发明的确切示例的用玉蜀黍优化密码子制备的合成序列由SEQ ID NO:13-20中的任一项代表。以此方式,这些核苷酸序列可以进行优化用于在任何植物中表达。应认识到,核苷酸序列的全部或任何部分可以是优化的或合成的。也就是说,多核苷酸可以包含作为部分天然序列和部分合成优化序列的核苷酸序列。
为了有效的翻译起始,可能需要修饰与起始甲硫氨酸相邻的序列。例如,它们可以通过包含已知在植物中有效的序列而被修饰。Joshi已经提出了针对植物的适当的共有序列(NAR 15:6643-6653(1987)),并且Clonetech提出了另一种共有翻译起始子(1993/1994目录,第210页)。这些共有序列适于与本发明的核苷酸序列一起使用。将这些序列掺入至包含核苷酸序列的构建体中,达到ATG并且包括ATG(同时保持不修饰第二氨基酸),或者可替代地达到ATG后的GTC并且包括ATG后的GTC(具有修饰该转基因的第二氨基酸的可能性)。
本发明的新颖cry蛋白编码序列(作为它们的天然序列或作为如上所述的合成序列)可以可操作地融合至用于在植物中表达的多种启动子(包括组成型、诱导型、时序性调节的、发育调节的、化学调节的、组织优选的以及组织特异性启动子)以制备重组DNA分子(即,嵌合基因)。启动子的选择将取决于表达的时间和空间需要而变化,并且还取决于目标种类而变化。因此,本发明的核苷酸序列在叶、柄(stalk)或茎(stem)、穗、花序(例如穗状花序、圆锥花序、穗轴等)、根、和/或籽苗中的表达是优选的。然而在许多情况下,寻求针对多于一种类型昆虫有害生物的保护,并且因此在多个组织中的表达是令人希望的。尽管已经显示来自双子叶植物的很多启动子在单子叶植物中是可操作的并且反之亦然,但理想的是选择双子叶植物启动子用于在双子叶植物中表达,并且选择单子叶植物启动子用于在单子叶植物中表达。然而,对所选择的启动子的起源并没有限制,足够的是它们在驱动核苷酸序列在所希望的细胞中的表达中是操作性的。
在本发明中有用的组成型启动子的实例包括CaMV 35S和19S启动子(Fraley等人,美国专利号5,352,605,通过引用结合在此)。此外,启动子是从肌动蛋白基因的任一种衍生的,这些肌动蛋白基因在大多数细胞类型中被表达。由McElroy等人(Mol.Gen.Genet.231:150-160(1991)[分子遗传学与普通遗传学,231:150-160(1991)])描述的启动子表达盒可以易于被修饰用于新颖的毒素基因的表达并且特别适用于在单子叶植物宿主中使用。又另一种组成型启动子是从泛素衍生的,泛素是在许多细胞类型中积聚的已知的另一种基因产物。泛素启动子已经从一些物种中克隆用于在转基因植物中使用,例如在向日葵(Binet etal.,1991.Plant Science 79:87-94[Binet等人,1991,植物科学,79:87-94])、玉蜀黍(Christensen et al.,1989.Plant Molec.Biol.12:619-632[Christensen等人,1989,植物分子生物学,12:619-632])、以及拟南芥属(Norris et al.1993.Plant Molec.Biol.21:895-906[Norris等人,1993,植物分子生物学,21:895-906])中使用。玉蜀黍泛素启动子已经在转基因单子叶植物系统中得到发展,并且它的序列以及构建用于单子叶植物转化的载体披露于专利公开EP 0 342 926中。泛素启动子适用于新颖的毒素基因在转基因植物(尤其是单子叶植物)中的表达。
对于在植物(特别是玉蜀黍)中表达本发明的新颖cry蛋白编码序列有用的组织特异性或组织优先启动子是指导在根、髓、叶或花粉中的表达的那些。此类启动子披露于美国专利号5625136中,通过引用以其全文结合在此。在本发明中有用的其他组织特异性启动子包括棉花二磷酸核酮糖羧化酶(rubisco)启动子,披露于美国专利号6,040,504中;水稻蔗糖合成酶启动子,披露于美国专利号5,604,121中;以及夜香树黄化曲叶病毒启动子,披露于美国专利号7,166,770中,所有这些专利都通过引用以其全文结合。对于指导新颖毒素基因在植物中的表达有用的化学诱导型启动子披露于美国专利号5,614,395中(通过引用以其全文结合在此)。
本发明的核苷酸序列也可以在被化学调节的启动子的调节下进行表达。这使得本发明的Cry蛋白能够仅在用诱导化学品对作物植物进行处理时被合成。用于基因表达的化学诱导的此类技术的实例详述于公开申请EP 0 332104和美国专利号5,614,395中。在一个实施例中,该化学调节的启动子是烟草PR-1a启动子。
另一类在本发明中有用的启动子是创伤可诱导的启动子。已经描述了数量众多的在创伤部位并且还在植物病原菌感染的部位表达的启动子。理想的是,这样的启动子在昆虫入侵的部位应该仅有局部活性,并且以此方式这些杀昆虫蛋白仅在需要合成这些杀昆虫蛋白的细胞中积聚以杀死入侵的昆虫有害生物。这类启动子的实例包括由以下文献所描述的那些:Stanford et al.Mol.Gen.Genet.215:200-208(1989)[Stanford等人,分子遗传学与普通遗传学,215:200-208(1989)];Xu et al.Plant Molec.Biol.22:573-588(1993)[Xu等人,植物分子生物学,22:573-588(1993)];Logemann et al.Plant Cell 1:151-158(1989)[Logemann等人,植物细胞,1:151-158(1989)];Rohrmeier&Lehle,PlantMolec.Biol.22:783-792(1993)[Rohrmeier和Lehle,植物分子生物,22:783-792(1993)];Firek et al.Plant Molec.Biol.22:129-142(1993)[Firek等人,植物分子生物学,22:129-142(1993)]以及Warner et al.Plant J.3:191-201(1993)[Warner等人,植物杂志,3:191-201(1993)]。
导致在本发明中有用的组织特异性表达模式的启动子的非限制性实例包括绿色组织特异性的、根特异性的、茎特异性的和/或花特异性的。适用于在绿色组织中表达的启动子包括调节参与光合作用的基因的许多启动子,并且这些中的许多已经从单子叶植物和双子叶植物两者中得以克隆。一种这样的启动子是来自磷酸烯醇羧化酶基因的玉米PEPC启动子(Hudspeth&Grula,Plant Molec.Biol.12:579-589(1989)[Hudspeth和Grula,植物分子生物学,12:579-589(1989)])。另一种用于根特异性表达的启动子是由以下文献描述的启动子:de Framond(FEBS 290:103-106(1991)[de Framond(FEBS 290:103-106(1991)]或美国专利号5,466,785。另一种在本发明中有用的启动子是描述于美国专利号5,625,136中的茎特异性启动子,它天然地驱动玉蜀黍trpA基因的表达。
除了选择适合的启动子之外,用于在植物中表达杀昆虫毒素的构建体还需要适当的可操作地连接在异源核苷酸序列下游的转录终止子。一些此类的终止子是可获得的并且在本领域中是已知的(例如来自CaMV的tml,来自rbcS的E9)。任何已知在植物中发挥功能的可供使用的终止子均可以在本发明的上下文下使用。
可以将数量众多的其他序列掺入本发明所描述的表达盒中。这些序列包括已经显示出增强表达的序列,如内含子序列(例如,来自Adhl和bronzel)以及病毒的前导序列(例如,来自TMV、MCMV、和AMV)。
本发明的核苷酸序列在植物中针对不同的细胞定位的靶向表达可能是更优选的。在一些情况下,在胞质溶胶中的定位可能是令人希望的,而在其他情况下,在某个亚细胞器中的定位可能是优选的。用于靶向例如植物中的基因产物的任何机构都可以用于实践本发明,并且已知此类机构存在于植物中并且已经相当详细地表征了控制这些机构的功能的序列。已经表征了导致将基因产物靶向其他细胞区室的序列。氨基末端序列可以负责将感兴趣的蛋白质靶向任何细胞区室,如植物的液泡、线粒体、过氧化物酶体、蛋白体、内质网、叶绿体、淀粉颗粒、淀粉体、质外体或细胞壁(例如Unger et.al.Plant Molec.Biol.13:411-418(1989)[Unger等人,植物分子生物学,13:411-418(1989)];Rogers et.al.(1985)Proc.Natl.Acad.Sci.USA 82:6512-651[Rogers等人(1985)美国国家科学院院刊,82:6512-651];美国专利号7,102,057;WO 2005/096704,将其全部通过引用而特此结合)。任选地,信号序列可以是来自waxy的N-末端信号序列、来自γ-玉米蛋白的N-末端信号序列、淀粉结合结构域、C-末端淀粉结合结构域、将成熟蛋白引入叶绿体的叶绿体靶向序列(Comaiet.al.(1988)J.Biol.Chem.263:15104-15109[Comai等人(1988)生物化学杂志,263:15104-15109];van den Broeck,et.al.(1985)Nature 313:358-363[van den Broeck等人(1985)自然,313:358-363];美国专利号5,639,949)或来自糊粉细胞的分泌信号序列(Koehler&Ho,Plant Cell 2:769-783(1990)[Koehler和Ho,植物细胞,2:769-783(1990)])。另外,与羧基末端序列结合的氨基末端序列负责基因产物的液泡靶向(Shinshiet.al.(1990)Plant Molec.Biol.14:357-368[Shinshi等人(1990)植物分子生物学,14:357-368])。在一个实施例中,所选择的信号序列包括已知的切割位点,并且构建的融合体考虑了在一个或多个切割位点之后的需要切割的任何氨基酸。在一些情况下,这个要求可以通过在切割位点与转基因ATG之间添加小数目的氨基酸,或可替代地置换转基因序列内的一些氨基酸来满足。这些构建技术在本领域是熟知的并且同样适用于任何细胞区室。
应认识到,用于细胞靶向的上述机构不仅可以与其同源启动子结合使用,还可以与异源启动子结合使用,从而在启动子的转录调节下实现特定的细胞靶向目标,该启动子具有不同于自其衍生靶向信号的启动子的表达谱。
植物转化
用于转化植物的程序在本领域中是熟知且常规的并且普遍描述于文献中。用于植物转化的方法的非限制性实例包括通过以下各项的转化:细菌介导的核酸递送(例如,经由农杆菌)、病毒介导的核酸递送、碳化硅或核酸须晶介导的核酸递送、脂质体介导的核酸递送、微注射、微粒轰击、磷酸钙介导的转化、环糊精介导的转化、电穿孔、纳米粒子介导的转化、超声处理、渗入、PEG介导的核酸吸收、以及使得核酸引入到植物细胞中的任何其他电学的、化学的、物理的(机械的)和/或生物的机制,包括其任何组合。对于本领域已知的不同植物转化方法的一般指导包括以下文献:Miki等人(“Procedures for Introducing ForeignDNA into Plants”in Methods in Plant Molecular Biology and Biotechnology,Glick,B.R.and Thompson,J.E.,Eds.(CRC Press,Inc.,Boca Raton,1993),pages 67-88[在Glick,B.R.和Thompson,J.E.编辑的植物分子生物学与生物技术方法中的“用于将外来DNA引入植物中的程序”(CRC出版公司,波卡拉顿,1993),第67-88页])和Rakowoczy-Trojanowska(Cell.Mol.Biol.Lett.7:849-858(2002)[细胞与分子生物学快报,7:849-858(2002)])。
对于农杆菌介导的转化,二元载体或携带至少一个T-DNA边界序列的载体是适合的,而对于直接基因转移(例如,微粒轰击等),任何载体都是适合的,并且仅含有感兴趣的构建体的线性DNA可以是优选的。在直接基因转移的情况下,可以使用以单个DNA种类的转化或共转化(Schocher et al.,Biotechnology 4:1093-1096(1986)[Schocher等人,生物技术,4:1093-1096(1986)])。对于直接基因转移以及农杆菌介导的转移二者,转化通常(但不是必需的)用如下选择性标记进行,该选择性标记可以是正向选择(磷甘露糖异构酶),提供对抗生素(卡那霉素、潮霉素或甲氨蝶呤)或除草剂(草甘膦或草丁膦)的抗性。然而,选择性标记的选择对于本发明并不是至关重要的。
农杆菌介导的转化是用于转化植物(特别是双子叶植物)的常用方法,这是由于其高转化效率并且由于其在许多不同物种中的广泛实用性。农杆菌介导的转化典型地涉及将携带感兴趣的外来DNA的二元载体转移至适当的农杆菌菌株,这可能取决于由宿主农杆菌菌株或者在共同存在的Ti质粒上或染色体地携带的vir基因的互补体(Uknes et al.(1993)Plant Cell 5:159-169[Uknes等人(1993)植物细胞,5:159-169])。将该重组二元载体转移至农杆菌可以使用携带该重组二元载体的大肠杆菌、辅助大肠杆菌菌株(该辅助菌株携带能够将该重组二元载体移动到目标农杆菌菌株中的质粒)通过三亲本交配程序实现。可替代地,可以通过核酸转化将该重组二元载体转移至农杆菌中(&Willmitzer(1988)Nucleic Acids Res.16:9877[和Willmitzer(1988)核酸研究,16:9877])。
通过重组农杆菌进行的植物转化通常涉及该农杆菌与来自该植物的外植体的共培养,并且遵循本领域熟知的方法。在携带位于这些二元质粒T-DNA边界之间的抗生素或除草剂抗性标记的选择培养基上对转化的组织进行再生。
如先前所讨论的,另一种用于转化植物、植物部分和植物细胞的方法涉及在植物组织和细胞上推进惰性或生物活性的粒子。参见例如,美国专利号4,945,050、5,036,006和5,100,792。通常,这种方法涉及在有效于穿透该细胞的外表面并提供掺入在其内部中的条件下在植物细胞处推进惰性或生物活性的粒子。当使用惰性粒子时,可以通过用含有感兴趣的核酸的载体包被这些粒子而将该载体引入该细胞中。可替代地,一个或多个细胞可以被该载体围绕使得该载体通过该粒子的激发而被带入该细胞中。也可以将生物活性的粒子(例如,干酵母细胞、干细菌或噬菌体,各自含有一种或多种被试图引入的核酸)推进到植物组织中。
在另一个实施例中,本发明的多核苷酸可以被直接转化进质体基因组中。质体转化的主要优点在于质体通常能够表达细菌基因而无需实质性的修饰,而且质体能够在单一启动子的控制下表达多个开放阅读框。在以下文献中广泛描述了质体转化技术:美国专利号5,451,513、5,545,817和5,545,818;PCT申请号WO 95/16783;以及McBride et al.(1994)Proc.Nati.Acad.Sci.USA 91,7301-7305[McBride等人(1994)美国国家科学院院刊,91,7301-7305]。基本的叶绿体转化技术涉及将位于选择性标记侧翼的经克隆的质体DNA区连同感兴趣的基因一起引入适合的靶组织中,这是例如使用生物射弹(biolistic)或原生质体转化(例如,氯化钙或PEG介导的转化)来进行的。这些1至1.5kb的侧翼区(被命名为靶向序列)促进了与质体基因组的同源重组,并且因而允许置换或修饰原质体(plastome)的特定区域。最初,可以将叶绿体16S rRNA和rps12基因(赋予针对大观霉素和/或链霉素的抗性)的点突变用作供转化用的选择性标记(Svab,Z.,Hajdukiewicz,P.,andMaliga,P.(1990)Proc.Natl.Acad.Sci.USA 87,8526-8530[Svab,Z.、Hajdukiewicz,P.和Maliga,P.(1990)美国国家科学院院刊,87,8526-8530];Staub,J.M.,and Maliga,P.(1992)Plant Cell 4,39-45[Staub,J.M.和Maliga,P.(1992)植物细胞,4,39-45])。在这些标记之间克隆位点的存在允许建立质体靶向载体用于外来基因的引入(Staub,J.M.,andMaliga,P.(1993)EMBO J.12,601-606[Staub,J.M.和Maliga,P.(1993)欧洲分子生物学杂志,12,601-606])。转化效率的实质性增加可以通过用显性的选择性标记(对大观霉素解毒酶氨基糖苷-3'-腺苷转移酶进行编码的细菌aadA基因)置换隐性的rRNA或r蛋白抗生素抗性基因而获得(Svab,Z.,and Maliga,P.(1993)Proc.Natl.Acad.Sci.USA 90,913-917[Svab,Z.和Maliga,P.(1993)美国国家科学院院刊,90,913-917])。先前,这种标记已经被成功地用于莱茵衣藻(Chlamydomonas reinhardtii)这种绿藻的质体基因组的高频率转化(Goldschmidt-Clermont,M.(1991)Nucl.Acids Res.19:4083-4089[Goldschmidt-Clermont,M.(1991)核酸研究,19:4083-4089])。有用于质体转化的其他选择性标记在本领域是已知的,并且被包括在本发明的范围之内。典型地,转化之后需要大约15-20个细胞分裂循环以便达到同质状态。质体表达(其中基因通过同源重组被插入到在每个植物细胞中存在的所有数千个环状质体基因组的拷贝中)利用了超过核表达的基因的庞大的拷贝数目的优点,以便允许能够很容易超过总的可溶性植物蛋白的10%的表达水平。在一个实施例中,可以将本发明的多核苷酸插入质体靶向载体中并转化进所希望的植物宿主的质体基因组中。因此,可以获得与含有本发明的核苷酸序列的质体基因组同型的植物,这些植物能够高表达该多核苷酸。
选择转化的转基因植物、植物细胞和/或植物组织培养物的方法在本领域中是常规的,并且可以用于在此提供的本发明的方法中。例如,本发明的重组载体还可以包括包含用于选择性标记的核苷酸序列的表达盒,该选择性标记可以用于选择转化的植物、植物部分和/或植物细胞。如在此使用的,“选择性标记(selectable marker)”意指如下核苷酸序列,当该核苷酸序列表达时向表达该标记的植物、植物部分和/或植物细胞赋予不同的表型,并且因此允许此类转化的植物、植物部分和/或植物细胞与不具有该标记的那些区别开来。这样的核苷酸序列可以编码可选择的或可筛选的标记,这取决于该标记是否赋予可以通过化学手段而被选择的性状,如通过使用选择剂(例如,抗生素、除草剂等),或者取决于该标记是否仅是人们可以通过观察或测试而鉴别的性状,如通过筛选(例如,R基因座性状)。当然,适合的选择性标记的许多实例在本领域中是已知的并且可以用于在此描述的表达盒中。
选择性标记的实例包括但不限于编码neo或nptII的核苷酸序列,它赋予对卡那霉素、G418等的抗性(Potrykus et al.(1985)Mol.Gen.Genet.199:183-188[Potrykus等人(1985)分子遗传学与普通遗传学,199:183-188]);编码bar的核苷酸序列,它赋予对草丁膦的抗性;编码改变的5-烯醇丙酮莽草酸-3-磷酸(EPSP)合酶的核苷酸序列,它赋予对草甘膦的抗性(Hinchee et al.(1988)Biotech.6:915-922[Hinchee等人(1988)生物技术,6:915-922]);编码腈水解酶(如来自臭鼻克雷伯菌(Klebsiella ozaenae)的bxn)的核苷酸序列,它赋予对溴草腈的抗性(Stalker et al.(1988)Science242:419-423[Stalker等人(1988)科学,242:419-423]);编码改变的乙酰乳酸合酶(ALS)的核苷酸序列,它赋予对咪唑啉酮、磺酰脲或其他ALS-抑制化学品的抗性(欧洲专利申请号154204);编码甲氨蝶呤-抗性的二氢叶酸还原酶(DHFR)的核苷酸序列(Thillet et al.(1988)J.Biol.Chem.263:12500-12508[Thillet等人(1988)生物化学杂志,263:12500-12508]);编码茅草枯脱卤素酶的核苷酸序列,它赋予对茅草枯的抗性;编码甘露糖-6-磷酸异构酶(也称为磷酸甘露糖异构酶(PMI))的核苷酸序列,它赋予代谢甘露糖的能力(美国专利号5,767,378和5,994,629);编码改变的邻氨基苯甲酸盐合酶的核苷酸序列,它赋予对5-甲基色氨酸的抗性;和/或编码hph的核苷酸序列,它赋予对潮霉素的抗性。本领域技术人员能够选择用于在本发明的表达盒中使用的适合的选择性标记。
另外的选择性标记包括但不限于编码β-葡糖醛酸酶的核苷酸序列或编码对于多种显色底物已知的酶的uidA(GUS);编码在植物组织中对花色苷色素(红色)进行调节的产物的R基因座核苷酸序列(Dellaporta et al.,“Molecular cloning of the maize R-njallele by transposon-tagging with Ac”263-282In:Chromosome Structure andFunction:Impact of New Concepts,18th Stadler Genetics Symposium(Gustafson&Appels eds.,Plenum Press1988)[Dellaporta等人,“染色体结构与功能:新概念的影响中的通过Ac转座子标签技术对玉蜀黍R-nj等位基因的分子克隆”,第18届斯特德莱遗传学专题讨论会(Gustafson和Appels编著,Plenum出版社,1988)]);编码β-内酰胺酶的核苷酸序列,对于β-内酰胺酶而言多种显色底物是已知的(例如,PADAC,一种显色头孢菌素)(Sutcliffe(1978)Proc.Natl.Acad.Sci.USA 75:3737-3741[Sutcliffe(1978)美国国家科学院院刊,75:3737-3741]);编码xylE的核苷酸序列,xylE编码儿茶酚双加氧酶(Zukowskyet al.(1983)Proc.Natl.Acad.Sci.USA 80:1101-1105[Zukowsky等人(1983)美国国家科学院院刊,80:1101-1105]);编码酪氨酸酶的核苷酸序列,酪氨酸酶能够氧化酪氨酸成为DOPA和多巴醌,其进而缩合形成黑色素(Katz et al.(1983)J.Gen.Microbiol.129:2703-2714[Katz等人(1983)普通微生物学杂志,129:2703-2714]);编码β-半乳糖苷酶的核苷酸序列,对于β-半乳糖苷酶而言存在显色底物;编码荧光素酶(lux)的核苷酸序列,荧光素酶允许生物发光检测(Ow et al.(1986)Science 234:856-859[Ow等人(1986)科学,234:856-859]);编码水母发光蛋白的核苷酸序列,水母发光蛋白可以在钙敏感的生物发光检测中采用(Prasher et al.(1985)Biochem.Biophys.Res.Comm.126:1259-1268[Prasher等人(1985)生物化学与生物物理学研究通讯,126:1259-1268]);或编码绿色荧光蛋白的核苷酸序列(Niedz et al.(1995)Plant Cell Reports 14:403-406[Niedz等人(1995)植物细胞报道,14:403-406])。本领域技术人员能够选择用于在本发明的表达盒中使用的适合的选择性标记。
此外,如本领域中所熟知的,完整的转基因植物可以使用多种已知技术中的任何技术从转化的植物细胞、植物组织培养物和/或培养的原生质体再生而来。例如在以下文献中描述了从植物细胞、植物组织培养物和/或培养的原生质体进行的植物再生:Evans etal.(Handbook of Plant Cell Cultures,Vol.1,MacMilan Publishing Co.New York(1983))[Evans等人(植物细胞培养手册,第1卷,麦克米兰出版公司,纽约(1983))];和Vasil I.R.(ed.)(Cell Culture and Somatic Cell Genetics of Plants,Acad.Press,Orlando,Vol.I(1984),and Vol.II(1986))[Vasil I.R.(编著)(植物的细胞培养和体细胞遗传学,学术出版社,奥兰多,第I卷(1984)和第II卷(1986))]。
另外,工程化进以上所述的本发明的转基因种子和植物、植物部分和/或植物细胞中的遗传特性可以通过有性生殖或营养生长来传递,并且因此可以在子代植物中维持并传代。通常,维持和传代利用了被开发以适合特定目的(如收获、播种或耕作)的已知农业方法。
因此,可以按本领域熟知的任何数目的方法(如上所述的)将多核苷酸引入该植物、植物部分和/或植物细胞中。因此,没有依赖用于将一种或多种多核苷酸引入植物中的具体方法,相反可以使用允许将该一种或多种多核苷酸稳定地整合到该植物的基因组中的任何方法。在有待引入一种以上多核苷酸的情况下,这些对应的多核苷酸可以作为单一核酸分子的一部分、或者作为分开的核酸分子而进行组装,并且可以位于相同的或不同的核酸分子上。因此,可以在单一转化事件中、在分开的转化事件中、或者例如作为育种方案的一部分在植物中,将这些多核苷酸引入感兴趣的细胞中。
本发明的另外的实施例包括从本发明的转基因植物和/或其部分产生的收获产物以及从所述收获产物产生的加工产物。收获产物可以是如在此描述的全株或任何植物部分。因此,在一些实施例中,收获产物的非限制性实例包括种子、果实、花或其部分(例如,花药、柱头等)、叶、茎等。在其他实施例中,加工产物包括但不限于从收获的本发明的种子或其他植物部分产生的细粉、粗粉、油、淀粉、谷物等,其中所述种子或其他植物部分包含本发明的核酸分子/多核苷酸/核苷酸序列。
在其他实施例中,本发明提供了来自本发明的转基因种子和/或转基因植物的提取物,其中该提取物包含本发明的核酸分子、多核苷酸、核苷酸序列或毒性蛋白。可以根据本领域熟知的程序制备来自植物或植物部分的提取物(参见,de la Torre et al.,Food,Agric.Environ.2(1):84-89(2004)[de la Torre等人,食品农业与环境,2(1):84-89(2004)];Guidet,Nucleic Acids Res.22(9):1772-1773(1994)[Guidet,核酸研究,22(9):1772-1773(1994)];Lipton et al.,Food Agric.Immun.12:153-164(2000)[Lipton等人,食品农业通讯,12:153-164(2000)])。
杀昆虫组合物
在一些实施例中,本发明提供了杀昆虫组合物,该组合物包含农业上可接受的载体中的本发明的Cry蛋白。如在此使用的,“农业上可接受的载体”可以包括与该活性组分结合以有助于它施用于植物或其部分的天然或合成、有机或无机材料。农业上可接受的载体的实例包括但不限于粉剂、尘剂、丸剂、颗粒剂、喷雾剂、乳剂、胶体以及溶液。农业上可接受的载体进一步包括但不限于可用于农业配制品中的惰性组分、分散剂、表面活性剂、佐剂、增粘剂、粘着剂、粘合剂或其组合。此类组合物可以按使杀昆虫蛋白或其他有害生物控制剂与这些有害生物接触的任何方式施用。因此,可以将这些组合物施用于植物或植物部分的表面,包括种子、叶、花、茎、块茎、根等。另一种农业上可接受的载体可以是转基因植物或植物部分。
在另外的实施例中,该杀昆虫组合物包含本发明的转基因细菌细胞,其中该细菌细胞包含本发明的嵌合基因。例如,这样的杀昆虫组合物可以通过脱水、冷冻干燥、均化、萃取、过滤、离心、沉降或浓缩包含本发明的多核苷酸的苏云金芽孢杆菌细胞的培养物来制备。在另外的实施例中,该组合物包含按重量计从约1%至约99%的本发明的Cry蛋白。
本发明的Cry蛋白可以与其他有害生物控制剂组合使用,以增加有害生物目标范围或用于预防和/或管理昆虫抗性。因此,在一些实施例中,本发明提供了控制一种或多种植物有害生物的组合物,其中该组合物包含本发明的第一Cry蛋白和不同于该第一Cry蛋白的第二有害生物控制剂。在其他实施例中,该组合物是用于局部施用至植物的配制品。在仍其他实施例中,该组合物是转基因植物。在另外的实施例中,该组合物是局部施用至转基因植物的配制品的组合。在一个实施例中,当该转基因植物包含第二有害生物控制剂时,该配制品包含本发明的第一Cry蛋白。在另一个实施例中,当该转基因植物包含本发明的第一Cry蛋白时,该配制品包括第二有害生物控制剂。
在一些实施例中,该第二有害生物控制剂可以是选自下组的试剂,该组由以下各项组成:化学杀有害生物剂、苏云金芽孢杆菌(Bt)杀昆虫蛋白、致病杆菌属杀昆虫蛋白、发光杆菌属杀昆虫蛋白、侧孢短芽孢杆菌(Brevibacillus laterosporus)杀昆虫蛋白、球形芽孢杆菌(Bacillus sphaericus)杀昆虫蛋白、蛋白酶抑制剂(丝氨酸和半胱氨酸类型两者)、凝集素、α-淀粉酶、过氧化物酶以及胆固醇氧化酶。
在其他实施例中,该第二有害生物控制剂是选自下组的化学杀有害生物剂,该组由以下各项组成:拟除虫菊酯、氨基甲酸酯、新烟碱、神经元钠通道阻断剂、杀昆虫大环内酯、γ-氨基丁酸(GABA)拮抗剂、杀昆虫脲以及保幼激素模拟物。在另一个实施例中,该化学杀有害生物剂选自下组,该组由以下各项组成:阿巴美丁、乙酰甲胺磷、啶虫脒、磺胺螨酯(amidoflumet)(S-1955)、除虫菌素(avermectin)、印楝素、甲基谷硫磷、联苯菊酯、联苯肼酯(binfenazate)、噻嗪酮、克百威、溴虫腈、定虫隆、毒死蜱、甲基毒死蜱、环虫酰肼、噻虫胺、氟氯氰菊酯、β-氟氯氰菊酯、三氯氟氰菊酯、λ-三氯氟氰菊酯、氯氰菊酯、灭蝇胺、溴氰菊酯、杀螨隆、二嗪磷、除虫脲、乐果、苯虫醚、甲氨基阿维菌素、硫丹、高氰戊菊酯、乙虫腈、苯硫威(fenothicarb)、苯氧威、甲氰菊酯、唑螨酯、氰戊菊酯、氟虫腈、氟啶虫酰胺、氟氰戊菊酯、τ-氟胺氰菊酯、嘧虫胺(UR-50701)、氟虫脲、地虫硫磷、氯虫酰肼、氟铃脲、吡虫啉、茚虫威、异柳磷、虱螨脲、马拉硫磷、聚乙醛、甲胺磷、杀扑磷、灭多威、烯虫酯、甲氧氯、久效磷、甲氧虫酰肼、噻虫醛(nithiazin)、双苯氟脲、多氟脲(XDE-007)、杀线威、对硫磷、甲基对硫磷、氯菊酯、甲拌磷、伏杀磷、亚胺硫磷、磷胺、抗蚜威、丙溴磷、吡蚜酮、啶虫丙醚、蚊蝇醚、鱼藤酮、多杀菌素、螺甲螨酯(spiromesifin)(BSN 2060)、硫丙磷、虫酰肼、伏虫隆、七氟菊酯、特丁硫磷、杀虫畏、噻虫啉、噻虫嗪、硫双威、杀虫双(thiosultap-sodium)、四溴菊酯、敌百虫和杀铃脲、涕灭威、杀线威、苯线磷、双甲脒、灭螨猛、乙酯杀螨醇、三环锡、三氯杀螨醇、除螨灵、依杀螨、喹螨醚、苯丁锡、甲氰菊酯、唑螨酯、噻螨酮、克螨特、哒螨灵以及吡螨胺。在另一个实施例中,该化学杀有害生物剂选自下组,该组由以下各项组成:氯氰菊酯、三氯氟氰菊酯、氟氯氰菊酯和β-氟氯氰菊酯、高氰戊菊酯、氰戊菊酯、四溴菊酯、苯硫威、灭多威、杀线威、硫双威、噻虫胺、吡虫啉、噻虫啉、茚虫威、多杀菌素、阿巴美丁、除虫菌素(avermectin)、苯虫醚、硫丹、乙虫腈、氟虫腈、氟虫脲、杀铃脲、苯虫醚、蚊蝇醚、吡蚜酮以及双甲脒。
在另外的实施例中,该第二有害生物控制剂可以是任何数目的苏云金芽孢杆菌杀昆虫蛋白中的一种或多种,包括但不限于Cry蛋白、营养期杀昆虫蛋白(VIP)以及任何前述杀昆虫蛋白的杀昆虫嵌合体。在其他实施例中,该第二有害生物控制剂是选自下组的Cry蛋白,该组由以下各项组成:Cry1Aa、Cry1Ab、Cry1Ac、Cry1Ad、Cry1Ae、Cry1Af、Cry1Ag、Cry1Ah、Cry1Ai、Cry1Aj、Cry1Ba、Cry1Bb、Cry1Bc、Cry1Bd、Cry1Be、Cry1Bf、Cry1Bg、Cry1Bh、Cry1Bi、Cry1Ca、Cry1Cb、Cry1Da、Cry1Db、Cry1Dc、Cry1Dd、Cry1Ea、Cry1Eb、Cry1Fa、Cry1Fb、Cry1Ga、Cry1Gb、Cry1Gc、Cry1Ha、Cry1Hb、Cry1Hc、Cry1Ia、Cry1Ib、Cry1Ic、Cry1Id、Cry1Ie、Cry1If、Cry1Ig、Cry1Ja、Cry1Jb、Cry1Jc、Cry1Jd、Cry1Ka、Cry1La、Cry1Ma、Cry1Na、Cry1Nb、Cry2Aa、Cry2Ab、Cry2Ac、Cry2Ad、Cry2Ae、Cry2Af、Cry2Ag、Cry2Ah、Cry2Ai、Cry2Aj、Cry2Ak、Cry2Al、Cry2Ba、Cry3Aa、Cry3Ba、Cry3Bb、Cry3Ca、Cry4Aa、Cry4Ba、Cry4Ca、Cry4Cb、Cry4Cc、Cry5Aa、Cry5Ab、Cry5Ac、Cry5Ad、Cry5Ba、Cry5Ca、Cry5Da、Cry5Ea、Cry6Aa、Cry6Ba、Cry7Aa、Cry7Ab、Cry7Ac、Cry7Ba、Cry7Bb、Cry7Ca、Cry7Cb、Cry7Da、Cry7Ea、Cry7Fa、Cry7Fb、Cry7Ga、Cry7Gb、Cry7Gc、Cry7Gd、Cry7Ha、Cry7Ia、Cry7Ja、Cry7Ka、Cry7Kb、Cry7La、Cry8Aa、Cry8Ab、Cry8Ac、Cry8Ad、Cry8Ba、Cry8Bb、Cry8Bc、Cry8Ca、Cry8Da、Cry8Db、Cry8Ea、Cry8Fa、Cry8Ga、Cry8Ha、Cry8Ia、Cry8Ib、Cry8Ja、Cry8Ka、Cry8Kb、Cry8La、Cry8Ma、Cry8Na、Cry8Pa、Cry8Qa、Cry8Ra、Cry8Sa、Cry8Ta、Cry9Aa、Cry9Ba、Cry9Bb、Cry9Ca、Cry9Da、Cry9Db、Cry9Dc、Cry9Ea、Cry9Eb、Cry9Ec、Cry9Ed、Cry9Ee、Cry9Fa、Cry9Ga、Cry10Aa、Cry11Aa、Cry11Ba、Cry11Bb、Cry12Aa、Cry13Aa、Cry14Aa、Cry14Ab、Cry15Aa、Cry16Aa、Cry17Aa、Cry18Aa、Cry18Ba、Cry18Ca、Cry19Aa、Cry19Ba、Cry19Ca、Cry20Aa、Cry20Ba、Cry21Aa、Cry21Ba、Cry21Ca、Cry21Da、Cry21Ea、Cry21Fa、Cry21Ga、Cry21Ha、Cry22Aa、Cry22Ab、Cry22Ba、Cry22Bb、Cry23Aa、Cry24Aa、Cry24Ba、Cry24Ca、Cry25Aa、Cry26Aa、Cry27Aa、Cry28Aa、Cry29Aa、Cry29Ba、Cry30Aa、Cry30Ba、Cry30Ca、Cry30Da、Cry30Db、Cry30Ea、Cry30Fa、Cry30Ga、Cry31Aa、Cry31Ab、Cry31Ac、Cry31Ad、Cry32Aa、Cry32Ab、Cry32Ba、Cry32Ca、Cry32Cb、Cry32Da、Cry32Ea、Cry32Eb、Cry32Fa、Cry32Ga、Cry32Ha、Cry32Hb、Cry32Ia、Cry32Ja、Cry32Ka、Cry32La、Cry32Ma、Cry32Mb、Cry32Na、Cry32Oa、Cry32Pa、Cry32Qa、Cry32Ra、Cry32Sa、Cry32Ta、Cry32Ua、Cry33Aa、Cry34Aa、Cry34Ab、Cry34Ac、Cry34Ba、Cry35Aa、Cry35Ab、Cry35Ac、Cry35Ba、Cry36Aa、Cry37Aa、Cry38Aa、Cry39Aa、Cry40Aa、Cry40Ba、Cry40Ca、Cry40Da、Cry41Aa、Cry41Ab、Cry41Ba、Cry42Aa、Cry43Aa、Cry43Ba、Cry43Ca、Cry43Cb、Cry43Cc、Cry44Aa、Cry45Aa、Cry46Aa、Cry46Ab、Cry47Aa、Cry48Aa、Cry48Ab、Cry49Aa、Cry49Ab、Cry50Aa、Cry50Ba、Cry51Aa、Cry52Aa、Cry52Ba、Cry53Aa、Cry53Ab、Cry54Aa、Cry54Ab、Cry54Ba、Cry55Aa、Cry56Aa、Cry57Aa、Cry57Ab、Cry58Aa、Cry59Aa、Cry59Ba、Cry60Aa、Cry60Ba、Cry61Aa、Cry62Aa、Cry63Aa、Cry64Aa、Cry65Aa、Cry66Aa、Cry67Aa、Cry68Aa、Cry69Aa、Cry69Ab、Cry70Aa、Cry70Ba、Cry70Bb、Cry71Aa、Cry72Aa以及Cry73Aa。
在另外的实施例中,该第二有害生物控制剂是选自下组的Vip3营养期杀昆虫蛋白,该组由以下各项组成:Vip3Aa1、Vip3Aa2、Vip3Aa3、Vip3Aa4、Vip3Aa5、Vip3Aa6、Vip3Aa7、Vip3Aa8、Vip3Aa9、Vip3Aa10、Vip3Aa11、Vip3Aa12、Vip3Aa13、Vip3Aa14、Vip3Aa15、Vip3Aa16、Vip3Aa17、Vip3Aa18、Vip3Aa19、Vip3Aa20、Vip3Aa21、Vip3Aa22、Vip3Aa2、Vip3Aa24、Vip3Aa25、Vip3Aa26、Vip3Aa27、Vip3Aa28、Vip3Aa29、Vip3Aa30、Vip3Aa31、Vip3Aa32、Vip3Aa33、Vip3Aa34、Vip3Aa35、Vip3Aa36、Vip3Aa37、Vip3Aa38、Vip3Aa39、Vip3Aa40、Vip3Aa41、Vip3Aa42、Vip3Aa43、Vip3Aa44、Vip3Ab1、Vip3Ab2、Vip3Ac1、Vip3Ad1、Vip3Ad2、Vip3Ae1、Vip3Af1、Vip3Af2、Vip3Af3、Vip3Ag1、Vip3Ag2、Vip3Ag3HM117633、Vip3Ag4、Vip3Ag5、Vip3Ah1、Vip3Ba1、Vip3Ba2、Vip3Bb1、Vip3Bb2以及Vip3Bb3。
在仍另外的实施例中,在转基因植物中共表达本发明的第一Cry蛋白和该第二有害生物控制剂。可以通过将植物遗传工程化以含有并表达所有的必需基因来实现一种以上杀有害生物成分在同一个转基因植物中的共表达。可替代地,可以将植物(亲本1)遗传工程化,用于本发明的Cry蛋白的表达。可以将第二植物(亲本2)遗传工程化,用于第二有害生物控制剂的表达。通过将亲本1与亲本2杂交,获得了表达被引入至亲本1和亲本2中的所有基因的子代植物。
在另外的实施例中,提供了产生对于至少黑色地老虎(小地老虎)有毒的蛋白质的方法,该方法包括培养包含本发明的多核苷酸或嵌合基因或核酸分子或重组载体的转基因非人类宿主细胞,在该宿主产生对于至少黑色地老虎(小地老虎)有毒的蛋白质的条件下。在一些实施例中,该转基因非人类宿主细胞是植物细胞。在一个实施例中,该植物细胞是玉蜀黍细胞。在其他实施例中,植物细胞或玉蜀黍细胞在其下生长的条件包括自然光照。在其他实施例中,该转基因非人类宿主细胞是细菌细胞。在仍其他实施例中,该转基因非人类宿主细胞是酵母细胞。
在其他实施例中,所产生的蛋白质对至少一种另外的昆虫有杀昆虫活性,其中该另外的昆虫选自下组,该组由以下各项组成:欧洲玉米蛀虫(欧洲玉米螟)、秋黏虫(草地贪夜蛾)、玉米穗蛾(玉米穗虫)、甘蔗螟(小蔗螟),绒毛豆毛虫(黎豆夜蛾)、大豆夜蛾(大豆尺蠖),西南玉米蛀虫(西南玉米螟)、西部豆切根虫(西部豆夜蛾)、烟夜蛾(烟芽夜蛾)、亚洲玉米蛀虫(亚洲玉米螟)、棉螟蛉(棉铃虫)、条纹蛀茎虫(二化螟)、粉蛀茎虫(非洲大螟)或水稻卷叶螟(稻纵卷叶螟)及其任何组合。
在其他实施例中,该嵌合基因包含SEQ ID NO:1-4中的任一项。在仍其他实施例中,所产生的蛋白质包含SEQ ID NO:13-16中任一项的氨基酸序列。
在一些实施例中,该嵌合基因包含密码子优化用于在植物中表达的核苷酸序列。在其他实施例中,该嵌合基因包含SEQ ID NO:5-12中的任一项。在另外的实施例中,所产生的蛋白质包含SEQ ID NO:13-20中任一项的氨基酸序列。
在另外的实施例中,本发明提供了产生抗有害生物(例如,抗昆虫)转基因植物的方法,该方法包括向植物中引入包含编码本发明的Cry蛋白的核苷酸序列的本发明的多核苷酸、嵌合基因、重组载体、表达盒或核酸分子,其中该核苷酸序列被表达于该植物中,由此赋予该植物对至少欧洲玉米蛀虫的抗性,并且产生抗有害生物(例如,抗昆虫)转基因植物。在一些实施例中,与缺乏本发明的多核苷酸、嵌合基因、重组载体、表达盒或核酸分子的对照植物相比,抗有害生物转基因植物对至少黑色地老虎(小地老虎)有抗性。在一些实施例中,通过转化植物实现引入。在其他实施例中,通过使包含本发明的嵌合基因、重组载体、表达盒或核酸分子的第一植物与不同的第二植物杂交来实现引入。
在一些实施例中,对至少黑色地老虎(小地老虎)有抗性的本发明的转基因植物还对至少一种另外的昆虫有抗性,其中该另外的昆虫包括但不限于欧洲玉米蛀虫(欧洲玉米螟)、秋黏虫(草地贪夜蛾)、玉米穗蛾(玉米穗虫)、甘蔗螟(小蔗螟),绒毛豆毛虫(黎豆夜蛾)、大豆夜蛾(大豆尺蠖),西南玉米蛀虫(西南玉米螟)、西部豆切根虫(西部豆夜蛾)、烟夜蛾(烟芽夜蛾)、亚洲玉米蛀虫(亚洲玉米螟)、棉螟蛉(棉铃虫)、条纹蛀茎虫(二化螟)、粉蛀茎虫(非洲大螟)或水稻卷叶螟(稻纵卷叶螟)及其任何组合。
在另外的实施例中,提供了控制至少黑色地老虎(小地老虎)昆虫的方法,该方法包括向这些昆虫递送有效量的本发明的Cry蛋白。为了有效,该Cry蛋白首先被昆虫经口摄取。然而,该Cry蛋白可以按许多公认的方式被递送至该昆虫。用于将蛋白质经口递送至昆虫的方式包括但不限于将该蛋白质提供于(1)转基因植物中,其中该昆虫取食(摄取)该转基因植物的一个或多个部分,由此摄取在该转基因植物中表达的多肽;(2)一种或多种配制的蛋白质组合物中,它们可以被施用至或掺入例如昆虫生长介质中;(3)一种或多种蛋白质组合物中,它们可以被施用至表面,例如喷雾在植物部分的表面,然后当该昆虫取食喷雾的一个或多个植物部分时组合物被该昆虫摄取;(4)饵基;(5)经注射进该昆虫;或(6)任何其他本领域公认的蛋白质递送系统。因此,可以使用经口递送至昆虫的任何方法来递送本发明的毒性Cry蛋白。在一些具体实施例中,将本发明的Cry蛋白经口递送至昆虫,其中该昆虫摄取转基因植物的一个或多个部分。
在其他实施例中,将本发明的Cry蛋白经口递送至昆虫,其中该昆虫摄取用包含本发明的Cry蛋白的组合物喷雾的植物的一个或多个部分。可以使用本领域技术人员已知的用于将化合物、组合物、配制品等施用于植物表面的任何方法将本发明的组合物递送至植物表面。递送至或接触植物或其部分的一些非限制性实例包括喷雾、撒粉、喷洒、分散、下雾、雾化、撒播、浸泡、土壤注入、土壤掺入、浸透(例如,根、土壤处理)、浸渍、灌注、涂覆、叶或茎渗透、侧施或种子处理等及其组合。用于使植物或其部分与一种或多种化合物、一种或多种组合物或一种或多种配制品接触的这些和其他程序是本领域技术人员熟知的。
在一些实施例中,本发明涵盖为农民提供控制鳞翅目昆虫有害生物的手段的方法,该方法包括向该农民供应或出售植物材料如种子,该植物材料包含能够表达本发明的Cry蛋白的多核苷酸、嵌合基因、表达盒或重组载体,如上所述的。
本发明的实施例可以通过参考以下实例而被更好地理解。前述的和以下的本发明的实施例以及各种实施例的描述不是旨在限制权利要求书,而是对其具有说明性。因此,应理解的是权利要求书不旨在限制这些实例的具体细节。本领域技术人员应理解的是本发明的其他实施例可以在不偏离本披露的精神和范围的情况下进行实践,本披露的范围是由所附权利要求书限定的。实例
实例1.活性Bt菌株的鉴别
将来自当前收集物中存在的孢子的苏云金芽孢杆菌分离株在T3+盘尼西林琼脂平板上进行培养和维持。在28℃下使每种分离株需氧地生长在24孔深块中约10天直至孢子形成,将其通过用考马斯蓝/乙酸染色并且用显微镜可视化来验证。孢子形成之后,将可溶的和不溶的部分都针对感兴趣的鳞翅目种类的活性进行测试。在表面污染生物测定中测试各部分,在该生物测定中将各部分覆盖到多种类人工饲料上。针对至少四种鳞翅目种类(包括玉米穗虫(玉米穗蛾)、小地老虎(黑色地老虎)、欧洲玉米螟(欧洲玉米蛀虫)和草地贪夜蛾(秋黏虫)筛选每种分离物,其中样本量为12只新生幼虫。每个测定的持续时间为在室温下约7天;针对死亡率以及幼虫生长抑制对这些平板评分。将在相对于阴性对照观察到的死亡率增加30%时认为是有效的。基于初始昆虫测试,选择菌株C0633、C2080、M0262和M1455用于进一步分析。
实例2.Bt基因的分离和测序
Fosmid基因组文库构建:对于在实例1中鉴别的一些Bt菌株,使用Park等人(FEMSMicrobiol.Lett.284:28-34(2008)[FEMS微生物学简讯,284:28-34(2008)])描述的fosmid文库方法来分离编码推定活性蛋白质的基因。使用CopyControlTMFosmid LibraryProduction Kit(Epicentre公司,麦迪逊,威斯康星州)根据制造商方案构建fosmid文库。简言之,将来自每个Bt菌株的纯化DNA(约0.5μg)经酶处理以对平端进行修端(endrepair),并且然后连接到fosmid载体pCC1FOS(Epicentre公司)中。在体外包装进λ噬菌体和感染大肠杆菌(E.coli)后,将细菌细胞铺板在含有12.5μg/ml氯霉素的Luria-Bertani(LB)上。在选择菌落之前,将各板在约37℃下孵育24h。将经转染的大肠杆菌菌落转移到含有150μl含氯霉素的LB培养基的96孔平板中,并在37℃下孵育24h。
菌落杂交筛选:以300cfu/100x 15mm L-琼脂加15μg/ml氯霉素平板的密度铺板fosmid文库。总计铺板3000板fosmid。使用Immobilon-Ny+87mm过滤圆片(EMD密理博公司(Millipore),比勒利卡,马萨诸塞州)进行膜杂交。如下完成菌落转移:将滤器置于各板上约5min,然后使用镊子,将滤器从琼脂表面转移并将菌落向上放置在用0.5M NaOH浸泡5min的Whatman滤纸上。然后将菌落滤器放置于2X SSC中浸泡5分钟的Whatman滤纸上。用设定为2000x 100μJ的UV (Stratagene公司,拉荷亚,加利福尼亚州)将DNA固定在膜上。然后将滤器在Whatman滤纸上进行空气干燥。如供应商所述的,将滤器预杂交,并在65℃下在250mM NaPO4(pH 7.0)、7%SDS、1%BSA中杂交。将杂交滤器在65℃下在2X SSC、0.5%SDS中洗涤30min,随后在65℃下在0.2X SSC、0.2%SDS中洗涤30min。在-80℃下用增感屏将滤器暴露于X射线胶片(XAR,飞世尔科技公司(Fisher Scientific),匹兹堡,宾夕法尼亚州)过夜。将阳性菌落影印(patched)到L琼脂(加有15μg/ml氯霉素)上。
杂交探针:设计PCR引物以从指定为C0633的Bt菌株的基因组DNA扩增cry9B样基因的720bp片段。引物对包括具有序列AAACATGAACCGAAATAATCAAAATG(SEQ ID NO:21)的指定为OAR2613a的正向引物和具有序列ATCCGTCCCTTGTGCGTGTAAA(SEQ ID NO:22)的指定为OAR2615a的反向引物。PCR反应在以下循环条件下运行:[94℃,5min],12x[94℃,30sec,57℃至51℃,每次循环下降0.5℃,30sec,72℃2.5min],以及35x[94℃,30sec,52℃,30sec,72℃,2.5min]。该反应含有1X One 缓冲液(新英格兰生物实验室(New EnglandBiolabs),贝弗利,马萨诸塞州)、200um dNTP、80ng DNA、2.5U One DNA聚合酶、50ng各引物和无菌蒸馏水至50μl总反应。
在含有溴化乙锭的1%琼脂糖TAE凝胶上分离得到的扩增子。在UV光下观察扩增子,并从凝胶中切出。使用凝胶提取试剂盒如供应商(凯杰公司(Qiagen),巴伦西亚,加利福尼亚州)所述的分离DNA。使用Rediprime II随机引物标记系统(通用电气医疗集团(GEHealthcare),匹兹堡,宾夕法尼亚州),用EasyTidedCTP 3000Ci/mmol(珀金埃尔默公司(Perkin Elmer),沃尔瑟姆,马萨诸塞州)标记探针。使用Micro Bio-Spin30色谱柱(伯乐公司(BioRad),赫拉克勒斯(Hercules),加利福尼亚州)除去未掺入的核苷酸。将探针在添加至杂交溶液中之前在95℃下加热5min。
Bt基因测序:遵循制造商的说明书(凯杰公司)制备2-4个独立克隆的DNA制剂。根据制造商的说明书,使用BigDyeTMTerminator Kit(应用生物系统公司(AppliedBiosystems)),福斯特城,加利福尼亚州)进行用针对感兴趣的预测核苷酸序列的两条链设计的引物的测序反应。将反应产物在ABI373或ABI377测序仪上进行电泳。使用Phred/Phrap/Consed软件包(华盛顿大学)对所有测序数据进行分析,其误差率在共有序列水平下等于或小于10-4。将序列用程序SequencherTM(4.7版,基因密码公司(Gene Codes Corp.),安阿伯,密歇根州)进行组装。每个基因被测序到4X覆盖度。
实例3.Bt基因克隆和合成
设计Cry9特异性引物对以促进cry9型基因的鉴别和克隆。将引物对设计成伴随加入PmeI限制性位点与cry9型基因的5'末端杂交,并伴随加入AscI限制性位点与3'末端杂交。用于扩增5'末端的引物对包括具有序列GTTTAAACATGAATCGAAATAATCAAAATG(SEQ IDNO:23)的正向引物和具有序列GGCGCGCCCTACTCTTGTGTTTCAATAAA(SEQ ID NO:24)的反向引物。用于扩增3'末端的引物对包括具有序列GTTTAAACATGAATCAAAATAAACACGGA(SEQ ID NO:25)的正向引物和具有序列GGCGCGCCTTACTGTTGGGTTTCCATGAACT(SEQ ID NO:26)的反向引物。插入的限制性位点在相应的引物中加下划线。使用以下循环条件进行PCR反应:[94℃,5min]和30x[94℃,30sec,45℃,30sec,72℃,3.5min]。该反应含有1X OneTaq缓冲液、200umdNTP、80ng DNA、2.5U OneTaq DNA聚合酶(新英格兰生物实验室)、50μg各引物和无菌蒸馏水至50μl总反应。
如供应商(生命技术公司(Life Technologies))所述的,将所得扩增子克隆到TOPO pCR 4.0载体中。如供应商(新英格兰生物实验室)所述的,将分离的质粒DNA用PmeI和AscI消化。
将PmeI/AscI片段克隆到设计用于在大肠杆菌和苏云金芽孢杆菌中表达的指定为pCIB5634`的穿梭载体中。将pCIB5634`载体用PmeI和AscI消化。将消化的载体和基因片段通过在基于1%琼脂糖Tris乙酸盐EDTA缓冲液的凝胶上跑胶而纯化。从凝胶中切出各片段,并使用QIAGEN凝胶提取试剂盒如供应商所述的进行清理。使用来自新英格兰生物实验室的连接试剂盒如供应商所述的将各片段连接在一起。如供应商所述的,将连接反应转化到TOP10细胞(生命技术公司)中,并将其铺板在含有100mg/ml氨苄青霉素的L-琼脂上。从单菌落中分离质粒DNA,并将鉴别的克隆再次测序到2X覆盖度以证实序列正确。
选择用于重组产生但未直接克隆出基因组DNA的一些Bt基因被提交给第三销售方用于全基因合成。将这些合成的Bt基因亚克隆到上述穿梭载体中,用于随后的表达和测试进一步的生物活性。
实例4.基因组组装与分析
使用全基因组测序方法鉴别本发明的一些Bt基因。简言之,使用Covaris S2超声波装置(Covaris公司,沃本,马萨诸塞州)剪切芽孢杆菌属DNA,其中将程序DNA_400bp设为工作循环:10%;强度:4;循环/脉冲:200。将DNA用UltraTMEnd Repair/dA-加尾模块(新英格兰生物实验室公司(New England Biolabs,Inc.),伊普斯维奇,马萨诸塞州)处理。如供应商(新英格兰生物实验室公司,伊普斯维奇,马萨诸塞州)所述的,使用NEBQuick LigationTM连接生物科技(Biooscience)有索引的1-57个适配子(1-27巴西,28-57美国、英国和瑞士)。如供应商(贝克曼库尔特公司(Beckman Coulter,Inc.),印第安纳波利斯,印第安纳州)所述的,使用Agencourt AMPure XP珠粒清理连接物。
将文库如下进行大小分级:将50uL样品与45ul 75%珠粒混合物(25%AMPure珠粒加75%NaCl/PEG溶液TekNova目录号P4136)混合。搅拌混合物并置于磁性支架上。将所得上清液转移至新孔中并且添加45ul 50%珠粒混合物(50%AMPure珠粒加50%NaCl/PEG溶液TekNova目录号P4136)。搅拌此混合物并置于磁性支架上。除去所得上清液并且用80%乙醇洗涤珠粒。添加25uL的洗脱缓冲液(EB)并且将混合物置于磁性支架上。除去所得最终上清液并置于1.5mL管中。这个方法产生了525个DNA碱基对(bp)(插入物加适配子)大小范围内的文库。
使用KAPA Biosystem HiFi Hot Start(KAPA生物系统公司(Kapa Biosystems,Inc.),威尔明顿,马萨诸塞州)使用以下循环条件扩增大小确定的DNA文库:[98℃,45s];12x[98℃,15s,60℃,30s,72℃,30s];[72℃,1min]。每个反应含有:5ul DNA文库、1uL生物科技通用引物(25uM)、18uL无菌水、1uL生物科技有索引的引物(25uM)、25ul2X KAPA HiFi聚合酶。
使用高灵敏度芯片在Agilent 2100Bioanalyzer(安捷伦科技公司(AgilentTechnologies),圣克拉拉,加利福尼亚州)上跑文库,以确定文库大小范围和平均插入物大小。使用标准的制造商测序方案(亿明达公司(Illumina,Inc.),圣地亚哥,加利福尼亚州)在HiSeq 2500测序系统上针对配对末端(PE)测序(100个循环/读数;12-24个文库/泳道)处理所有文库。
开发用以鉴别和表征可能的毒性基因的芽孢杆菌属计算分析工具来优先化引导物,用于进一步实验室测试。
上述基因组组装与分析以及基因组文库分析在苏云金芽孢杆菌菌株中鉴别出四个Cry9样基因,它们对至少黑色地老虎(小地老虎)有毒性。这些Cry9样基因和蛋白质的鉴别特征示于表1中。
表1.苏云金芽孢杆菌菌株中鉴别的Cry9样基因。
实例5.BT0044、BT0051、BT0068和BT0128与已知Bt Cry蛋白的同源性
利用本发明蛋白质的氨基酸序列搜索蛋白质数据库揭示它们与已知的杀昆虫蛋白同源。使用BLAST算法将本发明蛋白质的氨基酸序列与由NCBI维护的非冗余(nr)数据库进行比较,揭示以下蛋白质与本发明的序列具有一批(block)最强的氨基酸一致性(表2)。
表2.本发明的Cry蛋白与已知Cry蛋白的一致性百分比。
实例6.重组宿主细胞中的Bt蛋白表达
芽孢杆菌属表达。将感兴趣的基因通过上述含有适当的Cry蛋白启动子和红霉素抗性标记的pCIB5634`表达载体在不具有可观察到的鞘翅目或鳞翅目活性的无晶芽孢杆菌数菌株中进行表达。通过电穿孔将构建体转化到宿主菌株中,并且随后在含有红霉素的琼脂平板上进行选择。使这些重组菌株在T3培养基中于28℃下生长4-5天至孢子形成阶段。收获细胞沉淀并且在溶解于含有2mM DTT的高pH碳酸盐缓冲液(50mM)中之前反复洗涤。
大肠杆菌表达。使用pET28a或pET29a载体(EMD密理博公司)在各个大肠杆菌菌株中表达感兴趣的基因。通过电穿孔转化构建体,并且随后在含有卡那霉素的琼脂平板上进行选择。使这些重组菌株生长并且使用IPTG诱导在28℃下诱导表达。将细胞再悬浮于含有2mM DTT的高pH碳酸盐缓冲液(50mM)中并且然后使用Microfluidics LV-1匀浆器打碎。
表达分析。然后通过离心澄清所得的细胞裂解液(来自任一宿主),并通过SDS-PAGE和电泳图谱(BioRad Experion)分析样品的纯度。经由Bradford或Thermo660测定确定总蛋白浓度。然后在生物测定中测试经纯化的Cry蛋白。
实例7.Cry蛋白在生物测定中的活性
使用本领域公认的人工饲料生物测定法针对以下昆虫有害生物种类中的一种或多种测试实例6中产生的蛋白质:秋黏虫(FAW;草地贪夜蛾)、玉米穗蛾(CEW;玉米穗虫)、欧洲玉米蛀虫(ECB;欧洲玉米螟)、黑色地老虎(BCW;小地老虎)、甘蔗螟(SCB;小蔗螟)、绒毛豆毛虫(VBC;黎豆夜蛾)、大豆夜蛾(SBL;大豆尺夜蛾(Pseudoplusia includens))、西南玉米蛀虫(SWCB;西南玉米螟)、西部豆切根虫(WBCW;西部豆夜蛾)、烟夜蛾(TBW;烟芽夜蛾)、亚洲玉米蛀虫(ACB;亚洲玉米螟)、棉螟蛉(CBW;棉铃虫)、条纹蛀茎虫(SSB;二化螟)、粉蛀茎虫(PSB;水稻大螟(Sesamia inferens))和水稻卷叶螟(RLF;稻纵卷叶螟)。
向24孔平板中的人工昆虫饲料(Bioserv公司,弗伦奇敦,新泽西州)的表面施加等量的溶液中的蛋白质。在饲料表面干燥之后,向每个孔中添加有待测试的昆虫种类的幼虫。将这些平板密封并且保持在就温度、光照以及相对湿度而言的环境实验室条件下。阳性对照组由暴露于非常具活性且广谱的野生型芽孢杆菌属菌株的幼虫组成。阴性对照组由暴露于仅用缓冲溶液处理的昆虫饲料的幼虫和未处理的昆虫饲料(即只有饲料)上的幼虫组成。约120小时之后评估死亡率并且相对于对照评分。
结果示于表3中,其中“-”意指与对照相比无活性,“+/-”意指与对照相比0-10%的活性(此类别还包括具有强烈幼虫生长抑制的0%死亡率),“+”意指与对照相比10%-25%的活性,“++”意指与对照相比25%-75%的活性,并且“+++”意指与对照相比75%-100%的活性。
表3.Cry蛋白的生物测定结果。
实例8.Cry蛋白在模拟胃液测定中的结局
某些Cry蛋白已经在植物中表达,并且来自这样的植物的种子每年出售给农民用于控制各种昆虫有害生物。使这样的自我保护的杀有害生物产品经历各种监管机构的审查和注册,包括例如美国环保署(EPA)。
膳食暴露是人类可以暴露于转基因植物中表达的Cry蛋白的主要途径。哺乳动物急性经口毒性和蛋白质消化率是EPA对人体健康风险评估的终点。Cry蛋白安全性的进一步科学证据在于,它们已显示使用模拟胃液在体外迅速降解。用代表性Cry1、Cry2和Cry3蛋白进行的七次体外测定的结果确立这些蛋白质通常在30秒内迅速降解。这些结果支持更广泛的结论,即Cry蛋白的这些组的成员(共享显著的氨基酸序列一致性)在人类摄入后可能迅速降解。另一方面考虑是Cry蛋白是否可能引起过敏反应。所证实的Cry蛋白在体外迅速降解应使这种发生的可能性降至最低。相比之下,食物过敏原通常保留在体外胃肠道模型中,而无过敏史的常见食物蛋白在模拟胃液中迅速降解(Metcalfe et al.1996[Metcalfe等人,1996])。
通过对模拟胃液(SGF)中蛋白质消化率的分析可以获得对蛋白质潜在过敏原性的另外的见解。在迄今为止已经测试的转基因植物中表达的几乎所有Cry蛋白都被快速消化,并且因此已被确定为不会引起过敏。然而,发现在称为Starlink的转基因玉米产品中发现的Cry9C蛋白对SGF部分稳定。虽然Starlink Cry9C对动物无毒性,但部分消化率和部分加工稳定性的特性使得EPA很难绝对排除Starlink Cry9C蛋白可成为食物过敏原的可能性,最终导致开发Starlink的公司从美国市场召回产品。
目前,不存在用于确定新颖蛋白的致敏可能性的确定性测试。因此,EPA使用证据权衡法,其中考虑以下因素:性状来源;与已知过敏原的氨基酸序列比较;以及该蛋白质的生物化学性质,包括模拟胃液(SGF)中的体外消化率和糖基化。
模拟胃液(SGF)测定在代表哺乳动物上消化道的严格控制的条件下测量测试蛋白的体外消化率。简言之,在37℃下经一小时的时间段,将细菌产生的测试Cry蛋白(在0.5-5mg/ml的浓度下)以10单位的胃蛋白酶活性/μg测试蛋白的比率暴露于胃蛋白酶(来自猪胃粘膜,溶解于2mg/ml NaCl中,pH 1.2)。将样品在1、2、5、10、30和60分钟时取出,并通过加入预热的(95℃-2分钟)终止缓冲液(65%0.5M碳酸氢钠(pH 11)、35%Tricine加样缓冲液)立即淬灭,以使胃蛋白酶立即失活,并返回加热再持续5分钟。一旦测定完成,便通过SDS-PAGE在10%-20%Tris-Tricine凝胶(肽可见低至1kDa)上检查时间点样品和对照(只有测试蛋白,只有胃蛋白酶))以跟踪由胃蛋白酶进行的消化的动力学和水平。
SGF测定的结果表明本发明的所有Cry蛋白质都非常迅速地降解。这些结果提供了如下证据,即尽管本发明的Cry蛋白与Cry9蛋白家族有关,但是与某些公开的结果相比,例如Starlink中的Cry9C,它们对SGF测定的响应是非常不同的,这表明在蛋白质中的关键胃蛋白酶切割位点具有显著的结构差异。这些结果进一步表明本发明的Cry蛋白的致敏可能性是极小的。
实例9.BT-0051的诱变
预测蛋白质中的抗原区域有助于合成可引发与完整蛋白质有反应性的抗体且区分紧密相关的蛋白质的肽的合理方法。对于这个实例,天然BT-0051的氨基酸序列(SEQ IDNO:6)被叠加到Cry8Ea1蛋白(登录号3EB7;worldwide web.rcsb.org/pdb/的ProteinDatabank;还参见Berman et al.,2000.Nuc.Acids Res.28:235-242[Berman等人,2000,核酸研究,28:235-242])的晶体结构上,并且使用载体NTI 8.0(赛默飞世尔科技公司(ThermoFisher Scientific,Inc.),沃尔瑟姆,马萨诸塞州;还参见Welling et al.1985.FEBSLett.188:215-218[Welling等人,1985,欧洲生化学会联合会快报,188:215-218])将预测的抗性区域映射到该结构上。选择适合的诱变区域由选取结构域I外的非保守区中的环结构域组成。已知参与Cry蛋白受体识别的环如预测参与蛋白酶活化的任何残基一样从选择中被淘汰。这留下了一个由SEQ ID NO:6的342-354代表的诱变区域。基于以下预期选取变化L350I、N351Q和T354S(SEQ ID NO:18),即它们将导致相对于天然BT-0051的最小结构变化或功能变化。这样的变化产生允许突变型BT-0051(mBT-0051;SEQ ID NO:18)与天然BT-0051(SEQ ID NO:14)和其他相关Cry9蛋白区分开的抗原区域。
实例10.针对植物表达的基因定向
在植物中表达之前,在自动化基因合成平台(金斯瑞公司(Genscript,Inc.),皮斯卡塔韦,新泽西州)上合成包含编码Bt Cry蛋白BT-0044、BT-0051、BT-0068和BT-0128中每种的核苷酸序列(分别为SEQ ID NO:5-8)的合成多核苷酸以及包含编码突变型Bt Cry蛋白mBT-0044、mBT-0051、mBT-0068和mBT-0128中每种的核苷酸序列(分别为SEQ ID NO:17-20)的合成多核苷酸。用于这个实例,制备包含可操作地连接至Cry蛋白编码序列(该编码序列可操作地连接至NOS终止子)的玉蜀黍泛素启动子(Ubi1)的第一表达盒,并且制备包含可操作地连接至磷酸甘露糖异构酶(PMI)编码序列(该编码序列可操作地连接至NOS终止子)的Ubi1启动子的第二表达盒。PMI的表达允许在甘露糖上正向选择转基因植物。将两个表达盒克隆进适于农杆菌介导的玉蜀黍转化的载体中。
实例11.Cry蛋白在植物中的表达
未成熟的玉蜀黍胚的转化基本上如在以下文献中描述的来进行:Negrotto etal.,2000,Plant Cell Reports 19:798 803[Negrotto等人,2000,植物细胞报告,19:798-803]内。简言之,使包含描述于实例12中的载体的农杆菌菌株LBA4404(pSB1)在28℃下在YEP(酵母提取物(5g/L)、蛋白胨(10g/L)、NaCl(5g/L)、15g/l琼脂,pH 6.8)固体培养基上生长2-4天。将大约0.8X 109个农杆菌细胞悬浮于补充有100μM As的LS-inf培养基中。在这个培养基中对细菌预诱导大约30-60分钟。
将来自自交玉蜀黍系的未成熟胚从8-12天大的穗中切除到液体LS-inf+100μM As中。用新鲜的感染培养基漂洗这些胚。然后添加农杆菌溶液,并且将这些胚涡旋30秒并且允许其与细菌一起沉降5分钟。然后将这些胚盾片向上地转移到LSA培养基中,并且在暗处培养两到三天。随后,将每皮氏板(petri plate)大约20与25个之间的胚转移至补充有头孢噻肟(250mg/l)和硝酸银(1.6mg/l)的LSDc培养基中,并且在大约28℃下在黑暗中培养10天。
将产生胚性愈伤组织的未成熟胚转移至LSD1M0.5S培养基中。在这种培养基上对培养物进行持续大约6周的选择,在约3周时进行传代培养步骤。将存活的愈伤组织转移至补充有甘露糖的Reg1培养基中。在光照中(16小时光照/8小时黑暗方案)培养之后,然后将绿色组织转移至没有生长调节剂的Reg2培养基中并且孵育约1-2周。将这些小植株转移至含有Reg3培养基的Magenta GA-7盒(Magenta公司,芝加哥,伊利诺伊州)中并使其在光照中生长。约2-3周之后,通过PCR测试植物的PMI基因和Bt cry基因的存在。将来自PCR测定的阳性植物转移至温室用于进一步评估。
在叶切除生物测定中,针对拷贝数(通过Taqman分析确定)、蛋白质表达水平(通过ELISA确定)和对抗感兴趣的昆虫种类的功效对转基因植物进行评估。确切地说,从单拷贝事件(V3-V4阶段)中切取叶组织并且用新生幼虫侵染,然后在室温下孵育5天。叶圆片生物测定的样品量取决于所测试的昆虫种类(欧洲玉米蛀虫(ECB),n=10;玉米穗蛾(CEW),n=3,黑色地老虎(BCW),n=5)而变化。在大约第3天和第5天读取评估组织损伤和死亡率的读数;样品相对于阴性对照的损伤使用以下量表进行评定:“+”:<5%组织损伤,所有幼虫死亡;“+/-”:5%-20%组织损伤,所有幼虫死亡;或“-”:>20%组织损伤,一些幼虫活着和/或进展到2龄。
转基因植物组织生物测定的结果证实当在转基因植物中表达时本发明的Cry蛋白对于目标昆虫是有毒的。例如,在用本发明的嵌合基因稳定转化的玉蜀黍中表达的mBT-0051对于至少黑色地老虎(小地老虎)连同亚洲玉米蛀虫(亚洲玉米螟)、条纹蛀茎虫(二化螟)和棉螟蛉(棉铃虫)是有活性的。
序列表
<110> Syngenta Participations AG
Bramlett, Matthew Richard
Seguin, Katherine
Kramer, Vance Cary
Rose, Mark Scott
<120> 用于控制植物有害生物的组合物和方法
<130> 80668-WO-REG-ORG-P-1
<160> 26
<170> PatentIn version 3.5
<210> 1
<211> 3444
<212> DNA
<213> 苏云金芽孢杆菌
<400> 1
atggatttag acggtaataa aactgaaact gagactgaaa ttgtaaatgg ttccgaaagt 60
agtatcgatc catcaagcgt gtcttatgcg ggaaataaca gctattcttc cgctttgaat 120
ctcaattctt gtcaaaacag agggattgca cagtgggtta atacgcttgg aggtgcaatc 180
ggtcaggctg tatccatagg aacatccatc atttccttgc ttgcggcgcc tacgcttact 240
ggaagtattt cgttagcttt taatcttata aggagaatgg ggacaggcag taatggaagc 300
tctatttcgg acttgtcaat atgtgactta ctatccataa ttaatttacg tgtaagtcaa 360
gctgtattga acgacgggat tgcagatttt aacggctcag tggctgtata tgatctctat 420
ttgcatgctt tacgcagttg gaacaataac cctaatgctg ctaccgcgga ggaacttcgc 480
actcgttttc gtattgcaga ttccgaattc gaacgtatct taacgcgggg gtccttgaca 540
catggtggtt cattagctag acaagatgct caagtgttac tgttaccttc ttttgtaaat 600
gctgcctatc ttcatttact tatattaagg gatgctagca gatatggggc tagctggggc 660
ttgtttaata cgacaccaca tatcaattat ccagtaagat tacaacaact tattggttct 720
tatacccatt actgcacaca ttggtataat caaggtttaa atgaaatcag acaacgaggc 780
aatacggctg tcaattggtt ggaatttcat agatacagaa gagatatgac attaatggta 840
ctagatgttg tgtcattatt ttcagcgctt gatactataa ggtatccgaa cgcaaccgtt 900
gtccaattaa gcagaaccgt ttatacagac ccgattggtt ttgtaaatcg tggaagcggc 960
aacagattaa gctggtttga ttggcggaat caagctaatt tttcaacgct agaaagtgaa 1020
atgccaaccc cctcgtctcc actttctttg aatcatatga gtatatttac gggtcccctt 1080
actttacctg tctctcctaa tacccataga gccagggtat ggtatggcaa tcaaaatatg 1140
ttcacaacgg gtagtcaaaa ttcaggtcaa acaacaaact ctattcaaaa catttcgggt 1200
ttagaaatat ttagaataga ttctcaagcc tgtaatctaa acaataattc gtatggcgtg 1260
aaccgagctg aattttttca tggcgctagt cagggctccc aaagatctgt ttatcaaggc 1320
tatattagac aaagtggatt ggacaacccg gtagttatga atcttcaaag ctttttacct 1380
ggcgaaaatt cagcgacacc aaccgcacaa gattatacgc atatattaag taatcctgtt 1440
aatataagag gaggacttcg acaaatagta gctgatcgtc gttcttctgt agtcgtttat 1500
ggttggacac acaaaagttt gagtcgacgt agtttagttg caccagatca aattactcaa 1560
gtacctgctg ttaaagcaag tccctcatcc cattgtacca tcattgcagg acctggattt 1620
acgggcgggg atctcgtaag tctgcaacca aatggacaac tcgttatacc gtttcaggta 1680
tcggcgccag aaacaaatta tcatattcga atatgttatg tttctacgtc cgactgttcc 1740
ataaatacaa tatgtaatga tgagacccat ttaagtacgt tgccttccac aacctcatca 1800
cttgaaaatt tacaatgtaa ccatttgcat tattttaacg tgggcacttt caaacctacg 1860
atagatagta aactaacgct tgtaaataca agtccaaatg caaatattat catcgacaaa 1920
attgaattta ttcccgtaga tacggcccaa caacaaaatg aggatctaga agcagcaaaa 1980
aaagcggtgg cgagcttgtt tacacgcaca agggacggat tacaagtaaa tgtgaaagat 2040
tatcaagtcg atcaagcggc aaatttagtg tcatgcttat cagatgaaca atatgggtat 2100
gacaaaaaga tgttattgga agcggtacgt gcggcaaaac gacttagccg agaacgcaac 2160
ttacttcagg acccagattt taatacaatc aatagtacag aagaaaatgg atggaaagca 2220
agtaacggcg ttactattag tgagggcggg ccattctata aaggccgtgc aattcagcta 2280
gcaagtgcac gagaaaatta cccaacatac atctatcaaa aagtagatgc atcggagtta 2340
aagccgtata cacgttatag actggatggg ttcgtgaaga gtagtcaaga tttagaaatt 2400
gatctcattc accatcataa agtccatctt gtgaaaaatg taccagataa tttagtatct 2460
gatacttacc cagatgattc ttgtagtgga atcaatcgat gtcaggaaca acagatggta 2520
aatgcgcaac tggaaacaga gcatcatcat ccgatggatt gctgtgaagc agctcaaaca 2580
catgagtttt cttcctatat tgatacaggg gatttaaatt cgagtgtaga ccagggaatc 2640
tgggcgatct ttaaagttcg aacaaccgat ggttatgcga cgttaggaaa tcttgaattg 2700
gtagaggtcg gaccgttatc gggtgaatct ttagaacgtg aacaaaggga taatacaaaa 2760
tggagtgcag agctaggaag aaagcgtgca gaaacagatc gcgtgtatca agatgccaaa 2820
caatccatca atcatttatt tgtggattat caagatcaac aattaaatcc agaaataggg 2880
atggcagata ttatggacgc tcaaaatctt gtcgcatcaa tttcagatgt atatagcgat 2940
gccgtactgc aaatccctgg aattaactat gagatttaca cagagctgtc caatcgctta 3000
caacaagcat cgtatctgta tacgtctcga aatgcggtgc aaaatgggga ctttaacaac 3060
gggctagata gctggaatgc aacagcgggt gcatcggtac aacaggatgg caatacgcat 3120
ttcttagttc tttctcattg ggatgcacaa gtttctcaac aatttagagt gcagccgaat 3180
tgtaaatatg tattacgtgt aacagcagag aaagtaggcg gcggagacgg atacgtgact 3240
atccgggatg atgctcatca tacagaaacg cttacattta atgcatgtga ttatgatata 3300
aatggcacgt acgtgactga taatacgtat ctaacaaaag aagtggtatt ccatccggag 3360
acacaacaca tgtgggtaga ggtaaatgaa acagaaggtg catttcatat agatagtatt 3420
gaattcgttg aaacagaaaa gtaa 3444
<210> 2
<211> 3474
<212> DNA
<213> 苏云金芽孢杆菌
<400> 2
atgaatcgaa ataatcaaaa tgaatatgaa attattgatg ccccccattg tgggtgtcca 60
tcagatgacg atgtgaagta tcctttggca agtgacccaa atgcagcgtt acaaaatatg 120
aactataaag attacttaca aatgacagat gaggactaca ctgattctta tataaatcct 180
agtttatcta ttagtggtag agatgcagtt cagactgcgc ttactgttgt tgggagaata 240
ctcggggctt taggtgttcc gttttctgga caaatagtga gtttttatca attcctttta 300
aatacactgt ggccagttaa tgatacagct atatgggaag ctttcatgcg acaggtggag 360
gaacttgtca atcaacaaat aacagaattt gcaagaaatc aggcacttgc aagattgcaa 420
ggattaggag attcttttaa tgtatatcaa cgttcccttc aaaattggtt ggctgatcga 480
aatgatacac gaaatttaag tgttgttcgt gctcaattta tagctttaga ccttgatttt 540
gttaatgcta ttccattgtt tgcagtaaat ggacagcagg ttccattact gtcagtatat 600
gcacaagctg tgaatttaca tttgttatta ttaaaagatg catctctttt tggagaagga 660
tggggattca cacaggggga aatttccaca tattatgacc gtcaattgga actaaccgct 720
aggtacacta attactgtga aacttggtat aatacaggtt tagatcgttt aagaggaaca 780
aatactgaaa gttggttaag atatcatcaa ttccgtagag aaatgacttt agtggtatta 840
gatgttgtgg cgctatttcc atattatgat gtacgacttt atccaacggg atcaaaccca 900
cagcttacac gtgaggtata tacagatccg attgtattta atccaccagc taatgttgga 960
ctttgccgac gttggggtac taatccctat aatacttttt ctgagctcga aaatgccttc 1020
attcgcccac cacatctttt tgataggctg aatagcttaa caatcagcag taatcgattt 1080
ccagtttcat ctaattttat ggattattgg tcaggacata cgttacgccg tagttatctg 1140
aacgattcag cagtacaaga agatagttat ggcctaatta caaccacaag agcaacaatt 1200
aatcccggag ttgatggaac aaaccgcata gagtcaacgg cagtagattt tcgttctgca 1260
ttgataggta tatatggcgt gaatagagct tcttttgtcc caggaggctt gtttaatggt 1320
acgacttctc ctgctaatgg aggatgtaga gatctctatg atacaaatga tgaattacca 1380
ccagatgaaa gtaccggaag ttcaacccat agactatctc atgttacctt ttttagcttt 1440
caaactaatc aggctggatc tatagctaat gcaggaagtg tacctactta tgtttggacc 1500
cgtcgtgatg tggaccttaa taatacgatt accccaaata gaattacaca attaccattg 1560
gtaaaggcat ctgcacctgt ttcgggtact acggtcttaa aaggtccagg atttacagga 1620
gggggtatac tccgaagaac aactaatggc acatttggaa cgttaagagt aacggttaat 1680
tcaccattaa cacaacaata tcgcctaaga gttcgttttg cctcaacagg aaatttcagt 1740
ataaggttac tccgtggagg ggtttctatc ggtgatgtta gattagggag cacaatgaac 1800
agagggcagg aactaactta cgaatccttt ttcacaagag agtttactac tactggtccg 1860
ttcaatccgc cttttacatt tacacaagct caagagattc taacagtgaa tgcagaaggt 1920
gttagcaccg gtggtgaata ttatatagat agaattgaaa ttgtccctgt gaatccggca 1980
cgagaagcgg aagaggattt agaagcggcg aagaaagcgg tggcgagctt gtttacacgt 2040
acaagagatg gattacaggt aaatgtgaca gattaccaag tggatcgagc ggcaaattta 2100
gtgtcatgct tatcagatga acaatattcg catgataaaa agatgttatt ggaagccgta 2160
cgcgcagcaa aacgcctcag ccgcgaacgc aacttacttc aagatccaga ttttaataca 2220
atcaatagta cagaagaaaa tggctggaag gcaagtaacg gtgttactat tagcgagggc 2280
ggtccattct ttaaaggtcg tgcacttcag ttagcaagcg caagagaaaa ttatccaaca 2340
tacatttatc aaaaagtaga tgcatcggtg ttaaagcctt atacacgcta tagactagat 2400
ggatttgtga agagtagtca agatttagaa attgatctca tccaccatca taaagtccat 2460
cttgtaaaaa atgtaccaga taatttagta tctgatactt actcagatgg ttcttgcagc 2520
ggaatcaacc gttgtgatga acagcagcag gtagatatgc agctagatgc ggagcatcat 2580
ccaatggatt gctgtgaagc ggctcaaaca catgagtttt cttcctatat taatacaggg 2640
gatctaaatg caagtgtaga tcagggcatt tgggttgtat taaaagttcg aacaacagat 2700
gggtatgcga cgttaggaaa tcttgaattg gtagaggttg ggccattatc gggtgaatct 2760
ctagaacgcg aacaaagaga taatgcgaaa tggaatgcag agctaggaag aaagcgtgca 2820
gaaacagatc gcgtgtatct agctgcgaaa caagcaatta atcatctatt tgtagactat 2880
caagatcaac aattaaatcc agaaattggg ctagcggaaa taaatgaagc ttcaaatctt 2940
gtgaagtcaa tttcgggtgt atatagtgat acactattac agattcctgg aattaactac 3000
gaaatttaca cagagttatc cgatcgatta caacaagcat cgtatctgta tacgtctcga 3060
aatgccgtgc aaaatggaga ctttaacagt ggtctagata gttggaatgc aacaacagat 3120
gcatcggttc agcaagatgg cagtacacat ttcttagttc tttcgcattg ggatgcacaa 3180
gtttcccaac aaatgagagt aaatttgaat tgtaagtatg ttttacgtgt aacagcaaaa 3240
aaagtaggag gcggagatgg atacgtcaca atccgagatg gcgctcatca ccaagaaact 3300
cttacattta atgcatgtga ctacgatgta aatggtacgt atgtcaatga caattcgtac 3360
ataacaaaag aagtggtatt ctacccagag acaaaacata tgtgggtaga ggtgagtgaa 3420
tccgaaggtt cattctatat agacagtatt gagtttattg aaacacaaga gtag 3474
<210> 3
<211> 3522
<212> DNA
<213> 苏云金芽孢杆菌
<400> 3
atgaatcgaa ataatcaagg tgaatatgaa attattgacg cttccacttg tggttgttcg 60
tcagatgatg ttgttcaata tcctttggca agagatccga atgctgcatt ccaaaatatg 120
aattataaag attatttgaa aatgtctgac ggagactacg tcgattctta tataaaccca 180
ggcttatcta ttggtcgtag agatgtgacc ctaactggag ttggtattgt tgcgctaata 240
gtagggactt taggtggtcc agttgggggt atagtaactg gcttgatttc ctctctttta 300
ggattattgt ggccaagtaa tgataatgat gtatgggaag cttttatggc acaaatagaa 360
gagctaattg aacaaaggat agcagatcaa gtagtaagga atgcactcga taacttaact 420
ggattgcgcg attattataa tcaataccta ttagcattgg aggagtggca ggaaaggccg 480
aacgctgtaa gatctacctt agtttttaat agatttgaaa ccctgcattc tcactttgta 540
actagtatgc caagctttgg tagtggccct ggaagtgaaa ggtatgcggt acaattgctg 600
acagtttatg cacaagcggc aaatctgcat ttgttattat taagagatgc tgacatttat 660
ggggcaaggt ggggacttcg tgaatctcag attgatttat attttaatga gctacaaaat 720
cgtactcgag attataccaa tcattgtgta actgcgtaca ataatgggtt agaggagata 780
cgaggaacaa gccctgcaag ttggttgagg taccatcaat tccgtagaga gacaacacta 840
atagcattgg atttagtggc gatattccca tattacaacg tacgagaata tccaattggg 900
gtaaatcctc agcttacacg tgatgtatat acagatccaa taggggttac tttcagaaga 960
gaagattggg aaacaggagt agaatgcaga ccatgggtaa atactcctta catgagcttt 1020
tcggatcttg aaaatgcaat aattcgtcca ccacatctat ttgaaacatt acgtaattta 1080
acaattcata caggtcgata taacctagta ggaggggcga gatttattga aggatgggtc 1140
ggacattctg taacaaatac tcgcttgggt aattcaacag tatttacaag taattatggt 1200
tctttgccac ctcgttttca agtttttaat tttactaatt ttgatgttta ccaaattaat 1260
acgagagcag attctacagg tacctttaga atccctggat ttgcagttac aagggcccaa 1320
ttcattccgg gtgggactta ttcagtagct caccgagatc caggggcatg tcaacaagat 1380
tatgattcaa ttgaagagtt accaagtcta gacccggatg aacctattaa tagaagttat 1440
agtcatagat tatcgcatgt taccctttat aaatatactc tctcagatac agattatgga 1500
gttatcaatt atacagatta tggaagtatg cctgcatatg tctggacaca tcgcgatgtg 1560
gaccttacta acacgattac tgcagataga attacacaac tcccattagt aaaggcatct 1620
acactacctg cgggtactac tgtggtaaaa ggcccaggat ttacaggagg agatatactc 1680
cgaagaacaa ctaatggaac atttgggaca ttacatgtaa gggttaattc accattaaca 1740
caacaatatc gcctaagagt tcgttttgcc tcaacaggaa atttcagtat aagggtactc 1800
cgtggaggga cttctatcgg tgatgctaga tttgggagca caatgaacag aggacaggaa 1860
ctaacttacg aatcctttgt cacaagagag tttactacta ctggtccgtt caatccgcct 1920
tttacattta cacaaactca agaaattcta acagtgaatg cagaaggtgt tagcaccggt 1980
ggtgaatatt atatagatag tattgagatt gttcctgtaa atccgacgcg agaggcggaa 2040
gaggatctag aagcagcgaa gaaagcggtg gcgagcttgt ttacacgtac aagggacgga 2100
ttacaagtaa atgtgacaga ttaccaagtg gatcgagcgg caaatttagt gttatgctta 2160
tcagatgaac aatatgcgca tgataaaaag atgttattgg aagccgtacg cgcagccaaa 2220
cgactcagcc gcgagcgtaa cttgcttcaa gatccagatt tcaatgaaat aaatagtacg 2280
gaagatagtg gttggaagac aagtaacggc attatcatta gtgagggtgg tccattcttt 2340
aaaggtcgtg cccttcagct agcaagcgca cgtgaaaatt acccaacata catctatcaa 2400
aaggtagact catcaatgtt aaaaccttat acacgatata aactagatgg atttgtgcaa 2460
agtagtcaag atttagaaat tgaactcatt caccatcata aagtccacct cgtgaaaaat 2520
gtaccagata atttagtact tgatacttac ccagatggtt cttgcaacgg aattaaccga 2580
tgtgaggaac aacagatggt gaattcgcaa ctagaaacag aacatcatcc aatggattgc 2640
tgtgaagcat cccaaacaca tgagttttct tcctatattc atacaggtga cctaaatgca 2700
agtgtagatc aaggcatttg ggttgtattg aagattcgga caacagatgg ttctgcgacg 2760
ttaggaaatc ttgaattggt agaggttggt ccattatcgg gtgaatctct agaacgtgaa 2820
caaagagata atgcgaaatg gaatgcagag ttaggaagga agcgtgcaga agcagatcgc 2880
gtgtatcaag gtgcgaaaca agcaattaac catctatttg tagactatca agatcaacaa 2940
ttaaatccag aagttgggct agcagaaatt agtgaagctc gaaatcttat cgaatcaatt 3000
tcagatgtat attgcgatgc agtactgcga attcctggaa ttaactacga gatgtataca 3060
gagttatcta atcgtctaca acaagcagcg tatctgtata cgtctcgaaa tgccgtgcaa 3120
aacggggact ttaacagcgg tttagatagt tggaatgcaa caactgatgc gacggttcag 3180
caggatggca atatgtattt cttagttctt tcccattggg atgcacaagt ttctcaacaa 3240
tttagagtac agccgaattg taaatatgtg ttacgtgtga cagcgaagaa agtagggaac 3300
ggagatggat atgttacgat ccaagatggc gctcatcacc gagaaacact tacattcaat 3360
gcatgtgact acgatgtaaa tggtacgcat gtaaatgaca attcgtatat tacaaaagaa 3420
ttggagttct atccaaagac agaacatatg tgggtagagg taagtgaaac agaaggtacc 3480
ttctatatag acagcattga gctaattgaa acacaagagt ag 3522
<210> 4
<211> 3540
<212> DNA
<213> 苏云金芽孢杆菌
<400> 4
atgggaggaa aaagtatgaa tcgaaataat caaggtgaat atgaaattat tgacgcttcc 60
acttgtggtt gttcgtcaga tgatgttgtt caatatcctt tggcaagaga tccgaatgct 120
gcattccaaa atatgaatta taaagattat ttgaaaatgt ctgacggaga ctacgtcgat 180
tcttatataa acccaggctt atctattggt cgtagagatg tgaccctaac tggagttggt 240
attgttgcgc taatagtagg gactttaggt ggtccagttg ggggtatagt aactggcttg 300
atttcctctc ttttaggatt attgtggcca agtaatgata atgatgtatg ggaagctttt 360
atggcacaaa tagaagagct aattgaacaa aggatagcag atcaagtagt aaggaatgca 420
ctcgataact taactggatt gcgcgattat tataatcaat acctattagc attggaggag 480
tggcaggaaa ggccgaacgc tgtaagatct accttagttt ttaatagatt tgaaaccctg 540
cattctcact ttgtaactag tatgccaagc tttggtagtg gccctggaag tgaaaggtat 600
gcggtacaat tgctgacagt ttatgcacaa gcggcaaatc tgcatttgtt attattaaga 660
gatgctgaca tttatggggc aaggtgggga cttcgtgaat ctcagattga tttatatttt 720
aatgagctac aaaatcgtac tcgagattat accaatcatt gtgtaactgc gtacaataat 780
gggttagagg agatacgagg aacaagccct gcaagttggt tgaggtacca tcaattccgt 840
agagagacaa cactaatagc attggattta gtggcgatat tcccatatta caacgtacga 900
gaatatccaa ttggggtaaa tcctcagctt acacgtgatg tatatacaga tccaataggg 960
gttactttca gaagagaaga ttgggaaaca ggagtagaat gcagaccatg ggtaaatact 1020
ccttacatga gcttttcgga tcttgaaaat gcaataattc gtccaccaca tctatttgaa 1080
acattacgta atttaacaat tcatacaggt cgatataacc tagtaggagg ggcgagattt 1140
attgaaggat gggtcggaca ttctgtaaca aatactcgct tgggtaattc aacagtattt 1200
acaagtaatt atggttcttt gccacctcgt tttcaagttt ttaattttac taattttgat 1260
gtttaccaaa ttaatacgag agcagattct acaggtacct ttagaatccc tggatttgca 1320
gttacaaggg cccaattcat tccgggtggg acttattcag tagctcaccg agatccaggg 1380
gcatgtcaac aagattatga ttcaattgaa gagttaccaa gtctagaccc ggatgaacct 1440
attaatagaa gttatagtca tagattatcg catgttaccc tttataaata tactctctca 1500
gatacagatt atggagttat caattataca gattatggaa gtatgcctgc atatgtctgg 1560
acacatcgcg atgtggacct tactaacacg attactgcag atagaattac acaactccca 1620
ttagtaaagg catctacact acctgcgggt actactgtgg taaaaggccc aggatttaca 1680
ggaggagata tactccgaag aacaactaat ggaacatttg ggacattaca tgtaagggtt 1740
aattcaccat taacacaaca atatcgccta agagttcgtt ttgcctcaac aggaaatttc 1800
agtataaggg tactccgtgg agggacttct atcggtgatg ctagatttgg gagcacaatg 1860
aacagaggac aggaactaac ttacgaatcc tttgtcacaa gagagtttac tactactggt 1920
ccgttcaatc cgccttttac atttacacaa actcaagaaa ttctaacagt gaatgcagaa 1980
ggtgttagca ccggtggtga atattatata gatagtattg agattgttcc tgtaaatccg 2040
acgcgagagg cggaagagga tctagaagca gcgaagaaag cggtggcgag cttgtttaca 2100
cgtacaaggg acggattaca agtaaatgtg acagattatc aagtcgatca agcggcaaat 2160
ttagtgtcat gcttatcaga tgaacaatat gggtatgaca aaaagatgtt attggaagcg 2220
gtacgcgcgg caaaacgcct cagccgagaa cgtaacttac ttcaagatcc agattttaat 2280
acaatcaata gtacagaaga aaatggatgg aaagcaagta acggcgttac tattagtgag 2340
ggcggtccat tctataaagg ccgtgcactt cagctagcaa gtgcacgaga aaattatcca 2400
acatacattt atcaaaaagt agatgcatcg gagttaaaac cttatacacg atatagacta 2460
gatgggttcg tgaagagtag tcaagattta gaaattgatc tcattcacca tcataaagtc 2520
catcttgtga aaaatgtacc agataattta gtatctgata cttacccaga tgattcttgt 2580
agtggaatca atcgatgtca ggaacaacag atggtaaatg cgcaactgga aacagagcat 2640
catcatccga tggattgctg tgaagcagct caaacacatg agttttcttc ctatattgat 2700
acaggggatt taaattcgag tgtagaccag ggaatctggg cgatctttaa agttcgaaca 2760
accgatggtt atgcgacgtt aggaaatctt gaattggtag aggtcggacc gttatcgggt 2820
gaatctttag aacgtgaaca aagggataat acaaaatgga gtgcagagct aggaagaaag 2880
cgtgcagaaa cagatcgcgt gtatcaagat gccaaacaat ccatcaatca tttatttgtg 2940
gattatcaag atcaacaatt aaatccagaa atagggatgg cagatattat ggacgctcaa 3000
aatcttgtcg catcaatttc agatgtatat agcgatgccg tactgcaaat ccctggaatt 3060
aactatgaga tttacacaga gctgtccaat cgcttacaac aagcatcgta tctgtatacg 3120
tctcgaaatg cggtgcaaaa tggggacttt aacaacgggc tagatagctg gaatgcaaca 3180
gcgggtgcat cggtacaaca ggatggcaat acgcatttct tagttctttc tcattgggat 3240
gcacaagttt ctcaacaatt tagagtgcag ccgaattgta aatatgtatt acgtgtaaca 3300
gcagagaaag taggcggcgg agacggatac gtgactatcc gggatggtgc tcatcataca 3360
gaaacgctta catttaatgc atgtgattat gatataaatg gcacgtacgt gactgataat 3420
acgtatctaa caaaagaagt gatattctat tcacatacag aacacatgtg ggtagaggta 3480
aatgaaacag aaggtgcatt tcatatagat agtattgaat tcgttgaaac agaaaagtaa 3540
<210> 5
<211> 3444
<212> DNA
<213> 人工序列
<220>
<223> 合成Cry基因
<400> 5
atggacctgg atgggaataa gacagagaca gagaccgaga ttgtgaatgg gagcgagagc 60
agcattgacc cgagcagcgt ttcgtacgct gggaacaata gctactccag cgccctgaac 120
ctcaattcgt gccagaatag gggcatcgct cagtgggtta acacgctggg cggggctatt 180
gggcaggccg tgagcatcgg cacatctatc atttcactcc tggccgcgcc gacactcact 240
gggtctattt cactggcctt caatctcatc aggaggatgg ggaccggctc caacggctcg 300
tctatttccg acctgagcat ctgcgatctc ctgagcatca ttaacctgcg ggtttcgcag 360
gctgtgctca acgacgggat cgctgatttc aatggctccg ttgctgtgta cgacctgtac 420
ctccacgccc tgcgcagctg gaacaataac cctaacgctg ctactgctga ggagctgagg 480
acccgcttca ggatcgccga ttcggagttc gagaggattc tgacgagggg ctcgctcaca 540
catggcggct ccctcgcccg ccaggacgct caggtcctcc tgctcccgtc cttcgttaac 600
gcggcttacc tgcacctgct catcctccgc gatgcttcgc gctacggggc ctcttggggc 660
ctcttcaaca ccacgccgca tatcaattac cccgtgaggc tgcagcagct cattggcagc 720
tacacgcact actgcacaca ttggtacaac caggggctga atgagatccg gcagcgcggc 780
aacactgccg tgaattggct cgagttccac cgctaccgcc gcgacatgac gctgatggtc 840
ctcgatgtgg tctcgctgtt ctctgccctc gacacgatcc gctacccgaa cgctacagtt 900
gtgcagctca gccgcactgt ctacaccgat ccgattggct tcgttaaccg cgggtcaggc 960
aataggctgt cctggttcga ctggaggaac caggcgaatt tctctactct cgagtcagag 1020
atgccgaccc cctcatcccc actgagcctc aaccacatgt cgatcttcac tgggcctctg 1080
accctcccag tgtcccctaa cacgcatagg gcccgggtct ggtacggcaa ccagaatatg 1140
ttcacaactg ggtcacagaa ctccggccag accacgaact ctattcagaa tatctcaggc 1200
ctggagattt tccgcatcga ctctcaggcg tgcaatctca ataacaattc atacggcgtg 1260
aacagggcgg agttcttcca cggggctagc cagggctcgc agcggtctgt ctaccaggga 1320
tacatccgcc agagcggcct ggacaaccct gtcgttatga atctgcagtc tttcctccca 1380
ggcgagaact cagccacccc tacggcgcag gattacaccc acattctgtc caacccggtt 1440
aatatcaggg gcgggctcag gcagattgtg gccgacaggc gctcctccgt ggtcgtttac 1500
ggctggacgc acaagtccct gagcaggagg tcactcgtgg ctccagacca gatcacccag 1560
gtcccagccg ttaaggcgtc cccttcttca cattgcacta tcattgccgg cccaggcttc 1620
accggcgggg acctggtgtc gctccagccc aacggccagc tcgtcatccc gttccaggtt 1680
tctgcgcccg agacgaacta ccacattcgc atctgctacg tctcgacgtc tgattgcagc 1740
attaacacaa tctgcaatga cgagacgcat ctgtccacac tcccgagcac aacttccagc 1800
ctggagaacc tccagtgcaa tcacctgcat tacttcaacg tgggcacttt caagccaacc 1860
atcgactcga agctgacgct cgtcaacaca tctcctaacg ctaacatcat tatcgacaag 1920
atcgagttca tcccggtgga taccgcccag cagcagaacg aggacctcga ggccgcgaag 1980
aaggctgtcg cctccctgtt cacacgcact agggacggcc tccaggtcaa tgttaaggac 2040
taccaggtgg atcaggctgc caacctggtc tcatgcctct ccgacgagca gtacggctac 2100
gataagaaga tgctgctcga ggccgtgagg gctgctaaga ggctgagcag ggagaggaac 2160
ctgctccagg accccgattt caacacaatc aactcgaccg aggagaacgg gtggaaggcg 2220
tcaaatggcg tcaccatctc cgagggcggg ccattctaca agggcagggc tattcagctc 2280
gcgtctgctc gggagaacta ccccacatac atctaccaga aggtggatgc ctccgagctg 2340
aagccataca cccgctaccg cctcgacggc ttcgtcaagt cgtctcagga cctggagatt 2400
gatctcatcc accatcacaa ggtgcacctg gtcaagaacg ttccggacaa tctcgtgagc 2460
gatacgtacc ccgacgattc atgctccgga atcaacaggt gccaggagca gcagatggtc 2520
aacgcgcagc tggagaccga gcatcaccat ccgatggact gctgcgaggc tgctcagacg 2580
cacgagttct catcctacat cgacacaggg gatctgaaca gctcggtcga tcagggcatt 2640
tgggccatct tcaaggttag gaccacggac gggtacgcta ccctcggcaa cctggagctg 2700
gtggaggtcg ggccactgag cggcgagtcg ctcgagaggg agcagaggga caacactaag 2760
tggtccgctg agctgggccg caagagggct gagaccgacc gcgtctacca ggatgccaag 2820
cagagcatca atcacctgtt cgttgactac caggatcagc agctcaaccc cgagattggc 2880
atggcggaca tcatggatgc tcagaacctg gtggccagca tctcggacgt gtacagcgat 2940
gcggtcctcc agattccagg aatcaactac gagatctaca cggagctgtc gaacaggctc 3000
cagcaggcct cctacctgta cacaagccgg aacgcggtcc agaatgggga cttcaacaat 3060
ggcctcgatt catggaatgc tacggctggg gcttccgtgc agcaggatgg caacacacac 3120
ttcctggtcc tctcccattg ggacgcgcag gttagccagc agttccgcgt gcagccgaac 3180
tgcaagtatg tgctgagggt cactgctgag aaggttggcg ggggcgacgg ctacgtgacc 3240
atcagggacg atgcgcacca taccgagacg ctgacattca acgcttgcga ctacgacatc 3300
aacggcacct acgtgacaga caacacttac ctaaccaagg aggtggtctt ccacccggag 3360
actcagcata tgtgggttga ggtgaacgag accgagggcg ccttccacat agactccatc 3420
gagttcgtcg agaccgagaa gtga 3444
<210> 6
<211> 3474
<212> DNA
<213> 人工序列
<220>
<223> 合成Cry基因
<400> 6
atgaacagga acaaccagaa cgagtacgag attattgacg ccccccattg cggctgcccc 60
tccgacgacg atgtgaagta cccactggct agcgacccca acgctgctct gcagaacatg 120
aattacaagg attacctcca gatgaccgac gaggattaca cggactcgta catcaaccca 180
tccctcagca tttcgggcag ggacgctgtc cagacagccc tgactgtggt cggccgcatc 240
ctcggggcgc tgggcgttcc cttctcaggc cagattgtgt ccttctacca gttcctcctg 300
aataccctct ggccagtgaa cgacacggcg atctgggagg ctttcatgcg ccaggtggag 360
gagctggtca atcagcagat tacggagttc gccaggaacc aggctctcgc gcggctgcag 420
ggcctcgggg actccttcaa tgtctaccag aggagcctgc agaactggct cgccgaccgc 480
aacgatacca ggaatctctc cgttgtgcgc gcccagttca tcgcgctcga cctggatttc 540
gtgaatgcca ttcctctgtt cgctgtgaac ggccagcagg tcccgctcct gtccgtttac 600
gctcaggccg tgaacctgca tctcctgctc ctgaaggatg cttcgctctt cggcgagggg 660
tggggcttca cacagggcga gatctctact tactacgacc gccagctcga gctgacagcg 720
aggtacacta attactgcga gacctggtac aacacggggc tggacaggct caggggaacc 780
aacacggagt cctggctccg ctaccaccag ttccgcaggg agatgactct ggtcgttctc 840
gatgtggtcg ccctgttccc atactacgac gtccgcctct acccaaccgg ctccaaccct 900
cagctgacaa gggaggtgta cactgaccct atcgtcttca acccaccagc taatgtgggg 960
ctctgcaggc gctggggaac caacccgtac aatacgttca gcgagctgga gaacgcgttc 1020
atccggccac ctcatctgtt cgatcgcctc aactctctca ccatttccag caataggttc 1080
cctgtctcgt ctaacttcat ggactactgg tctggccaca cgctgaggcg gagctacctc 1140
aacgattcgg ctgtgcagga ggactcctac ggcctcatca ccacgacacg ggccaccatt 1200
aacccggggg tcgatggcac caaccggatc gagtcgacgg cggtggactt ccgctctgct 1260
ctcatcggga tttacggcgt taacagggct tccttcgtgc caggcgggct gttcaatggc 1320
actaccagcc cagctaacgg cgggtgcagg gacctgtacg ataccaacga cgagctgcca 1380
ccagacgagt ccacaggctc atccactcat cgcctctcgc acgtcacatt cttctctttc 1440
cagactaatc aggccgggtc aatcgcgaac gctggctccg ttcccaccta cgtgtggacg 1500
cgcagggacg tcgatctgaa caacacgatc actccgaacc gcattacgca gctcccactg 1560
gtgaaggctt ctgctccagt ctcaggcacg acagttctga aggggcccgg cttcaccggc 1620
gggggcatcc tccggcgcac taccaatggg accttcggca cgctgagggt gaccgtcaac 1680
agcccactga cgcagcagta caggctccgc gtgaggttcg cttctacggg caatttctca 1740
atcaggctcc tgaggggggg cgtgagcatt ggggacgtca ggctgggctc gacaatgaac 1800
cggggccagg agctgacata cgagagcttc ttcactcgcg agttcacgac aactggccca 1860
ttcaatccac ctttcacctt cacgcaggcc caggagatcc tcacagttaa cgctgagggc 1920
gtgtcgactg ggggcgagta ctacattgat aggatcgaga ttgttccagt gaacccagct 1980
agggaggctg aggaggacct ggaggctgcc aagaaggctg tggccagcct gttcacacgc 2040
actagggacg gcctccaggt caatgttacc gattaccagg tcgacagggc ggctaacctg 2100
gtttcatgcc tctccgatga gcagtactcc cacgacaaga agatgctcct ggaggccgtc 2160
cgggctgcta agcgcctgtc acgggagcgc aacctcctgc aggaccctga tttcaacacg 2220
atcaactcca ctgaggagaa tgggtggaag gccagcaacg gcgtgaccat ttcggagggg 2280
ggcccgttct tcaagggccg cgcgctccag ctggctagcg ctagggagaa ctaccctacg 2340
tacatctacc agaaggtcga tgcgtcggtt ctgaagccgt acacacgcta ccgcctcgac 2400
ggcttcgtga agtcctccca ggatctggag atcgacctca ttcaccatca caaggtccat 2460
ctggttaaga acgtgcccga caatctcgtc tccgatacct acagcgacgg gtcctgcagc 2520
ggaatcaacc gctgcgatga gcagcagcag gtggatatgc agctcgacgc cgagcatcac 2580
ccaatggact gctgcgaggc tgcccagacc cacgagttct cttcctacat caatacgggg 2640
gatctgaacg cctccgttga ccagggcatt tgggttgtgc tcaaagtgag gaccacggac 2700
gggtacgcta ccctgggcaa cctcgagctg gtggaggtcg ggccgctgag cggcgagtcg 2760
ctcgagaggg agcagaggga taacgctaag tggaatgctg agctgggcag gaagagggct 2820
gagaccgaca gggtctacct ggctgctaag caggcgatca atcacctctt cgtggattac 2880
caggaccagc agctgaaccc tgagatcggc ctcgctgaga ttaacgaggc ctctaatctg 2940
gtcaagtcga tctctggggt ttactcagat actctcctgc agatcccggg aattaactac 3000
gagatttaca ccgagctgtc cgaccggctc cagcaggctt cctacctcta cacgagccgc 3060
aacgccgtgc agaatgggga tttcaactcg ggcctggact cttggaacgc gacaactgat 3120
gcttctgtcc agcaggacgg ctcaacccat ttcctcgtgc tgtcacactg ggacgctcag 3180
gtgtcccagc agatgagggt caacctgaat tgcaagtacg tcctcagggt tacggcgaag 3240
aaggtcgggg gcggggatgg ctacgtcaca atcagggacg gcgcgcatca ccaggagacc 3300
ctcacgttca atgcttgcga ctacgatgtc aacggcacat acgttaacga caattcctac 3360
atcactaagg aggtcgtttt ctaccccgag accaagcaca tgtgggttga ggtgtctgag 3420
tcggagggct cgttctacat tgatagcatt gagttcattg agacgcagga gtga 3474
<210> 7
<211> 3522
<212> DNA
<213> 人工序列
<220>
<223> 合成Cry基因
<400> 7
atgaaccgga acaaccaggg cgagtacgag attattgatg cctccacttg cggctgctcc 60
tcagatgatg tcgtccagta cccactcgct cgcgacccga acgctgcttt ccagaacatg 120
aattacaagg actacctgaa gatgtctgac ggcgattacg tcgattcata cattaaccca 180
ggcctgtcca tcgggaggag ggacgtcacg ctcacaggcg tcgggatcgt tgctctcatt 240
gtgggcaccc tgggcggccc agttggcggg attgtgacgg gcctgatctc cagcctcctg 300
gggctcctgt ggccaagcaa cgacaatgat gtgtgggagg ccttcatggc gcagatcgag 360
gagctgattg agcagaggat cgctgaccag gtggtccgga acgccctgga caatctcacc 420
ggcctgaggg attactacaa ccagtacctc ctggctctcg aggagtggca ggagaggccc 480
aatgccgtga ggtctacgct ggtcttcaac cggttcgaga cgctccattc acacttcgtg 540
acatcaatgc catccttcgg cagcgggcct ggcagcgagc gctacgcggt tcagctcctg 600
accgtgtacg ctcaggctgc caacctgcac ctcctgctcc tgagggacgc tgatatctac 660
ggcgctcggt gggggctcag ggagtcccag atcgacctct acttcaacga gctgcagaat 720
cggacgcgcg attacacaaa ccattgcgtc acagcctaca acaatggcct ggaggagatc 780
agggggactt cgccagcttc ttggctgcgc taccaccagt tccggcgcga gaccacgctc 840
attgccctcg acctggtggc gatcttccca tactacaatg tcagggagta cccaattggc 900
gttaaccctc agctcacgcg ggacgtgtac acagatccga tcggcgtcac gttcaggcgg 960
gaggactggg agacaggcgt cgagtgcagg ccgtgggtta ataccccata catgtctttc 1020
tcagatctgg agaacgccat cattaggccg ccccatctct tcgagacgct ccggaatctg 1080
acgattcaca caggcaggta caacctggtc ggcggggcga ggttcatcga gggctgggtc 1140
gggcattccg ttactaatac caggctgggc aacagcactg tgttcaccag caattacggg 1200
tcgctcccac ctcggttcca ggtgttcaac ttcacgaatt tcgacgtcta ccagatcaac 1260
acacgggccg attcgacggg cacattccgc attccggggt tcgcggtcac tagggctcag 1320
ttcatccccg gcgggaccta ctccgtggct caccgcgacc caggcgcttg ccagcaggac 1380
tacgattcaa ttgaggagct gccctccctg gacccagatg agcctatcaa ccggtcctac 1440
agccatcgcc tctcacacgt caccctgtac aagtacactc tctccgacac cgattacggc 1500
gtgatcaatt acaccgacta cgggagcatg ccagcttacg tgtggacgca tcgcgacgtc 1560
gatctgacta acaccattac ggcggatagg atcacgcagc tcccgctggt gaaggcttcg 1620
acactccccg ccggcacaac tgttgtgaag gggcccggct tcaccggcgg ggacatcctg 1680
aggaggacca cgaatggcac gttcgggaca ctccacgtga gggtcaacag cccactgacc 1740
cagcagtaca ggctccgggt ccgcttcgct tcgacgggca acttctctat tagggtgctg 1800
aggggcggga catctatcgg cgacgctcgc ttcgggtcaa ctatgaacag gggccaggag 1860
ctgacttacg agtccttcgt gacccgcgag ttcacaacta ccggcccgtt caatccgccc 1920
ttcacattca ctcagaccca ggagatcctg actgtcaacg ctgagggcgt ttcgaccggc 1980
ggggagtact acatcgactc tattgagatc gttccagtga acccaaccag ggaggctgag 2040
gaggatctcg aggctgctaa gaaggccgtc gcgagcctgt tcacgaggac acgggacggc 2100
ctccaggtca atgttacgga ctaccaggtt gatagggctg ctaacctcgt gctgtgcctc 2160
tccgacgagc agtacgccca cgataagaag atgctcctgg aggcggtgag ggctgctaag 2220
aggctgagca gggagaggaa cctcctgcag gaccctgatt tcaacgagat caattctact 2280
gaggactcag gctggaagac cagcaacggg atcattatct cggagggcgg gccgttcttc 2340
aagggccggg ccctgcagct cgcttccgct cgcgagaact accctaccta catctaccag 2400
aaggtggact cgtctatgct gaagccgtac acgaggtaca agctcgacgg cttcgtgcag 2460
tcatcccagg atctcgagat tgagctgatc caccatcaca aggtgcacct cgtcaagaac 2520
gttccagaca atctggtcct cgacacctac cctgatggct cgtgcaacgg aatcaaccgc 2580
tgcgaggagc agcagatggt gaactctcag ctggagacgg agcatcaccc tatggactgc 2640
tgcgaggcct cacagactca tgagttcagc tcgtacatcc acaccggcga cctcaacgcg 2700
tctgtcgatc aggggatttg ggtcgttctg aagatcagga cgacagacgg ctcggctacc 2760
ctcgggaacc tggagctggt ggaggtcggc cccctgtcag gggagtccct cgagagggag 2820
cagagggaca acgccaagtg gaatgctgag ctgggccgga agcgcgctga ggctgatcgc 2880
gtgtaccagg gcgctaagca ggccatcaat cacctcttcg tcgactacca ggatcagcag 2940
ctgaaccctg aggttggcct cgcggagatc agcgaggctc ggaacctgat tgagtcgatc 3000
tctgacgtgt actgcgatgc cgtcctccgc attccgggaa tcaactacga gatgtacacg 3060
gagctgtcca acaggctgca gcaggctgct tacctgtaca caagccgcaa cgcggtgcag 3120
aatggcgact tcaactccgg gctcgatagc tggaatgcta ctaccgacgc caccgttcag 3180
caggatggca acatgtactt cctggtgctc agccactggg acgcccaggt ttcgcagcag 3240
ttccgcgtgc agccaaattg caagtatgtg ctgagggtca cagcgaagaa ggtcgggaac 3300
ggcgacggct acgtgactat ccaggatggc gcgcatcacc gcgagactct gaccttcaat 3360
gcttgcgact acgatgttaa cggcacgcat gtgaacgaca attcctacat tacaaaggag 3420
ctggagttct acccgaagac tgagcacatg tgggttgagg tgagcgagac tgagggcacc 3480
ttctacatag attcgatcga gctgattgag acccaggagt ga 3522
<210> 8
<211> 3540
<212> DNA
<213> 人工序列
<220>
<223> 合成Cry基因
<400> 8
atggggggga agtctatgaa caggaacaac cagggcgagt acgagattat tgatgcctcc 60
acatgcgggt gctccagcga cgacgtggtc cagtacccac tcgctcgcga ccctaacgct 120
gctttccaga acatgaatta caaggactac ctgaagatgt ccgacggcga ttacgtggat 180
agctacatta acccaggcct ctcgatcggg aggagggacg tcactctgac cggggttggc 240
atcgtggcgc tgattgttgg cacactcggc gggcctgtgg gcgggattgt cactggcctc 300
atctccagcc tcctggggct cctgtggcca tccaacgaca atgatgtctg ggaggcgttc 360
atggctcaga tcgaggagct gattgagcag cgcatcgcgg accaggtggt caggaacgct 420
ctcgacaatc tgaccggcct cagggattac tacaaccagt acctcctggc tctcgaggag 480
tggcaggaga ggccaaatgc cgtgcgctcc acgctcgttt tcaaccgctt cgagaccctg 540
cacagccatt tcgtgacgag catgccgtcg ttcgggtctg gccccgggtc ggagcgctac 600
gctgtgcagc tcctgaccgt ctacgcccag gctgccaacc tccacctcct gctcctgcgc 660
gacgctgata tctacggcgc caggtggggg ctcagggaga gccagatcga cctgtacttc 720
aacgagctgc agaatcggac acgcgattac actaaccact gcgtcaccgc ctacaacaat 780
ggcctcgagg agatcagggg gacgtcacca gcttcctggc tccgctacca ccagttccgg 840
agggagacca cgctgattgc gctcgacctg gtggctatct tcccctacta caatgtgcgc 900
gagtacccga ttggcgtcaa cccccagctg accagggacg tttacaccga cccgatcggc 960
gtgacattca ggcgggagga ctgggagact ggcgtggagt gcaggccgtg ggtcaatacc 1020
ccatacatgt ctttctcaga cctcgagaac gccatcatta ggccgcccca cctgttcgag 1080
acgctgagga atctcaccat tcatacgggc aggtacaacc tggtcggcgg ggcgcgcttc 1140
atcgagggct gggttgggca ctcagtgacg aacacaaggc tcggcaattc cacagtgttc 1200
acttccaact acggcagcct gccacctcgg ttccaggttt tcaacttcac aaatttcgac 1260
gtgtaccaga tcaacactag ggccgattcg actggcacct tccggattcc agggttcgcc 1320
gttacccgcg cgcagttcat ccctggcggg acgtactccg tggctcaccg cgacccgggc 1380
gcttgccagc aggactacga tagcattgag gagctgccct cgctcgaccc agatgagcct 1440
atcaacaggt cctacagcca ccggctgtct catgtcaccc tctacaagta caccctgtca 1500
gacacggatt acggcgtgat caattacacc gactacgggt ccatgccagc ttacgtttgg 1560
acgcaccggg acgtggatct cacgaacaca attactgccg accgcatcac acagctccca 1620
ctggtgaagg ccagcactct gcctgcgggc acaactgttg tgaagggccc tgggttcacc 1680
ggcggggaca tcctcaggag gaccacgaat ggcaccttcg ggacgctgca tgtccgcgtt 1740
aactccccgc tcacacagca gtacaggctg cgggtgcgct tcgcttcgac tggcaacttc 1800
tctattcgcg tcctcagggg cgggacctcc atcggcgacg ctaggttcgg gagcacgatg 1860
aacaggggcc aggagctgac atacgagtcc ttcgtcacta gggagttcac aactaccggc 1920
ccgttcaatc cgcccttcac cttcacgcag acacaggaga ttctcaccgt taacgctgag 1980
ggcgtgagca cgggcgggga gtactacatc gactcgatcg agattgtgcc agtcaaccca 2040
accagggagg ctgaggagga tctggaggct gctaagaagg ccgtggcgag cctcttcact 2100
aggacccggg acggcctgca ggttaatgtg acggactacc aggtcgatca ggccgcgaac 2160
ctggttagct gcctctcgga cgagcagtac ggctacgata agaagatgct cctggaggcc 2220
gtccgcgctg ctaagaggct ctcgagggag aggaacctcc tgcaggaccc cgatttcaac 2280
acaattaatt ctactgagga gaacggctgg aaggcctcta atggggtgac catctcagag 2340
ggcgggccat tctacaaggg cagggcgctc cagctggctt cagctcggga gaactacccc 2400
acctacatct accagaaggt cgacgcctcc gagctgaagc catacacgcg ctaccgcctg 2460
gatggcttcg tgaagtcgtc tcaggacctg gagatcgatc tcattcacca tcacaaggtc 2520
cacctcgtta agaacgtgcc ggacaatctg gtctccgata cctaccccga cgattcgtgc 2580
tctggaatca acaggtgcca ggagcagcag atggtgaacg cccagctcga gacggagcat 2640
caccatccta tggactgctg cgaggcggct cagacccatg agttctcatc ctacatcgac 2700
acgggcgatc tcaacagctc ggtcgaccag gggatctggg cgattttcaa ggttaggacg 2760
acagatggct acgctaccct ggggaatctc gagctggtcg aggttggccc cctctctggg 2820
gagtcactgg agagggagca gagggacaac acaaagtggt ctgctgagct gggcaggaag 2880
cgggctgaga ctgaccgcgt ctaccaggat gccaagcagt ccatcaatca cctcttcgtg 2940
gactaccagg atcagcagct gaaccctgag attggcatgg ctgacatcat ggatgcccag 3000
aacctcgtcg cgtcaatctc cgacgtctac agcgatgcgg ttctgcagat cccgggcatt 3060
aattacgaga tctacacaga gctgtcgaac aggctccagc aggcgtcata cctctacacg 3120
tcccggaacg ctgtgcagaa tggcgacttc aacaatgggc tggattcgtg gaatgcgaca 3180
gctggcgcct ctgtgcagca ggacgggaac actcacttcc tcgtcctgtc tcattgggat 3240
gcccaggtct cacagcagtt ccgggttcag ccgaactgca agtatgtgct gcgcgttacc 3300
gctgagaaag tgggcggggg cgacggctac gtcacgatcc gcgatggggc tcaccatacg 3360
gagacactca ctttcaacgc ctgcgactac gatatcaatg gcacatacgt tactgacaac 3420
acctacctga cgaaggaggt catcttctac tcccacacag agcatatgtg ggtggaggtc 3480
aacgagactg agggcgcctt ccacatcgac agcattgagt tcgtggagac cgagaagtga 3540
<210> 9
<211> 3444
<212> DNA
<213> 人工序列
<220>
<223> 合成Cry基因
<400> 9
atggacctgg atgggaataa gacagagaca gagaccgaga ttgtgaatgg gagcgagagc 60
agcattgacc cgagcagcgt ttcgtacgct gggaacaata gctactccag cgccctgaac 120
ctcaattcgt gccagaatag gggcatcgct cagtgggtta acacgctggg cggggctatt 180
gggcaggccg tgagcatcgg cacatctatc atttcactcc tggccgcgcc gacactcact 240
gggtctattt cactggcctt caatctcatc aggaggatgg ggaccggctc caacggctcg 300
tctatttccg acctgagcat ctgcgatctc ctgagcatca ttaacctgcg ggtttcgcag 360
gctgtgctca acgacgggat cgctgatttc aatggctccg ttgctgtgta cgacctgtac 420
ctccacgccc tgcgcagctg gaacaataac cctaacgctg ctactgctga ggagctgagg 480
acccgcttca ggatcgccga ttcggagttc gagaggattc tgacgagggg ctcgctcaca 540
catggcggct ccctcgcccg ccaggacgct caggtcctcc tgctcccgtc cttcgttaac 600
gcggcttacc tgcacctgct catcctccgc gatgcttcgc gctacggggc ctcttggggc 660
ctcttcaaca ccacgccgca tatcaattac cccgtgaggc tgcagcagct cattggcagc 720
tacacgcact actgcacaca ttggtacaac caggggctga atgagatccg gcagcgcggc 780
aacactgccg tgaattggct cgagttccac cgctaccgcc gcgacatgac gctgatggtc 840
ctcgatgtgg tctcgctgtt ctctgccctc gacacgatcc gctacccgaa cgctacagtt 900
gtgcagctca gccgcactgt ctacaccgat ccgattggct tcgttaaccg cgggtcaggc 960
aataggctgt cctggttcga ctggaggaac caggcgaatt tctctactct cgagtcagag 1020
atgccgaccc cctcatcccc actgagcctc aaccacatgt cgatcttcac tgggcctctg 1080
accctcccag tgtcccctaa cacgcatagg gcccgggtct ggtacggcaa ccagaatatg 1140
ttcacaactg ggtcacagaa ctccggccag accacgaact ctattcagaa tatctcaggc 1200
ctggagattt tccgcatcga ctctcaggcg tgcaatctca ataacaattc atacggcgtg 1260
aacagggcgg agttcttcca cggggctagc cagggctcgc agcggtctgt ctaccaggga 1320
tacatccgcc agagcggcct ggacaaccct gtcgttatga atctgcagtc tttcctccca 1380
ggcgagaact cagccacccc tacggcgcag gattacaccc acattctgtc caacccggtt 1440
aatatcaggg gcgggctcag gcagattgtg gccgacaggc gctcctccgt ggtcgtttac 1500
ggctggacgc acaagtccct gagcaggagg tcactcgtgg ctccagacca gatcacccag 1560
gtcccagccg ttaaggcgtc cccttcttca cattgcacta tcattgccgg cccaggcttc 1620
accggcgggg acctggtgtc gctccagccc aacggccagc tcgtcatccc gttccaggtt 1680
tctgcgcccg agacgaacta ccacattcgc atctgctacg tctcgacgtc tgattgcagc 1740
attaacacaa tctgcaatga cgagacgcat ctgtccacac tcccgagcac aacttccagc 1800
ctggagaacc tccagtgcaa tcacctgcat tacttcaacg tgggcacttt caagccaacc 1860
atcgactcga agctgacgct cgtcaacaca tctcctaacg ctaacatcat tatcgacaag 1920
atcgagttca tcccggtgga taccgcccag cagcagaacg aggacctcga ggccgcgaag 1980
aaggctgtcg cctccctgtt cacacgcact agggacggcc tccaggtcaa tgttaaggac 2040
taccaggtgg atcaggctgc caacctggtc tcatgcctct ccgacgagca gtacggctac 2100
gataagaaga tgctgctcga ggccgtgagg gctgctaaga ggctgagcag ggagaggaac 2160
ctgctccagg accccgattt caacacaatc aactcgaccg aggagaacgg gtggaaggcg 2220
tcaaatggcg tcaccatctc cgagggcggg ccattctaca agggcagggc tattcagctc 2280
gcgtctgctc gggagaacta ccccacatac atctaccaga aggtggatgc ctccgagctg 2340
aagccataca cccgctaccg cctcgacggc ttcgtcaagt cgtctcagga cctggagatt 2400
gatctcatcc accatcacaa ggtgcacctg gtcaagaacg ttccggacaa tctcgtgagc 2460
gatacgtacc ccgacgattc atgctccgga atcaacaggt gccaggagca gcagatggtc 2520
aacgcgcagc tggagaccga gcatcaccat ccgatggact gctgcgaggc tgctcagacg 2580
cacgagttct catcctacat cgacacaggg gatctgaaca gctcggtcga tcagggcatt 2640
tgggccatct tcaaggttag gaccacggac gggtacgcta ccctcggcaa cctggagctg 2700
gtggaggtcg ggccactgag cggcgagtcg ctcgagaggg agcagaggga caacactaag 2760
tggtccgctg agctgggccg caagagggct gagaccgacc gcgtctacca ggatgccaag 2820
cagagcatca atcacctgtt cgttgactac caggatcagc agctcaaccc cgagattggc 2880
atggcggaca tcatggatgc tcagaacctg gtggccagca tctcggacgt gtacagcgat 2940
gcggtcctcc agattccagg aatcaactac gagatctaca cggagctgtc gaacaggctc 3000
cagcaggcct cctacctgta cacaagccgg aacgcggtcc agaatgggga cttcaacaat 3060
ggcctcgatt catggaatgc tacggctggg gcttccgtgc agcaggatgg caacacacac 3120
ttcctggtcc tctcccattg ggacgcgcag gttagccagc agttccgcgt gcagccgaac 3180
tgcaagtatg tgctgagggt cactgctgag aaggttggcg ggggcgacgg ctacgtgacc 3240
atcagggacg atgcgcacca taccgagacg ctgacattca acgcttgcga ctacgacatc 3300
aacggcacct acgtgacaga caacacttac atcaccaagg aggtggtctt ccacccggag 3360
actcagcata tgtgggttga ggtgaacgag accgagggcg ccttccacct tgactccatc 3420
gagttcgtcg agaccgagaa gtga 3444
<210> 10
<211> 3474
<212> DNA
<213> 人工序列
<220>
<223> 合成Cry基因
<400> 10
atgaacagga acaaccagaa cgagtacgag attattgacg ccccccattg cggctgcccc 60
tccgacgacg atgtgaagta cccactggct agcgacccca acgctgctct gcagaacatg 120
aattacaagg attacctcca gatgaccgac gaggattaca cggactcgta catcaaccca 180
tccctcagca tttcgggcag ggacgctgtc cagacagccc tgactgtggt cggccgcatc 240
ctcggggcgc tgggcgttcc cttctcaggc cagattgtgt ccttctacca gttcctcctg 300
aataccctct ggccagtgaa cgacacggcg atctgggagg ctttcatgcg ccaggtggag 360
gagctggtca atcagcagat tacggagttc gccaggaacc aggctctcgc gcggctgcag 420
ggcctcgggg actccttcaa tgtctaccag aggagcctgc agaactggct cgccgaccgc 480
aacgatacca ggaatctctc cgttgtgcgc gcccagttca tcgcgctcga cctggatttc 540
gtgaatgcca ttcctctgtt cgctgtgaac ggccagcagg tcccgctcct gtccgtttac 600
gctcaggccg tgaacctgca tctcctgctc ctgaaggatg cttcgctctt cggcgagggg 660
tggggcttca cacagggcga gatctctact cactacgacc gccagctcga gctgacagcg 720
aggtacacta attactgcga gacctggtac aacacggggc tggacaggct caggggaacc 780
aacacggagt cctggctccg ctaccaccag ttccgcaggg agatgactct ggtcgttctc 840
gatgtggtcg ccctgttccc atactacgac gtccgcctct acccaaccgg ctccaaccct 900
cagctgacaa gggaggtgta cactgaccct atcgtcttca acccaccagc taatgtgggg 960
ctctgcaggc gctggggaac caacccgtac aatacgttca gcgagctgga gaacgcgttc 1020
atccggccac ctcatctgtt cgatcgcatc cagtctctct caatttccag caataggttc 1080
cctgtctcgt ctaacttcat ggactactgg tctggccaca cgctgaggcg gagctacctc 1140
aacgattcgg ctgtgcagga ggactcctac ggcctcatca ccacgacacg ggccaccatt 1200
aacccggggg tcgatggcac caaccggatc gagtcgacgg cggtggactt ccgctctgct 1260
ctcatcggga tttacggcgt taacagggct tccttcgtgc caggcgggct gttcaatggc 1320
actaccagcc cagctaacgg cgggtgcagg gacctgtacg ataccaacga cgagctgcca 1380
ccagacgagt ccacaggctc atccactcat cgcctctcgc acgtcacatt cttctctttc 1440
cagactaatc aggccgggtc aatcgcgaac gctggctccg ttcccaccta cgtgtggacg 1500
cgcagggacg tcgatctgaa caacacgatc actccgaacc gcattacgca gctcccactg 1560
gtgaaggctt ctgctccagt ctcaggcacg acagttctga aggggcccgg cttcaccggc 1620
gggggcatcc tccggcgcac taccaatggg accttcggca cgctgagggt gaccgtcaac 1680
agcccactga cgcagcagta caggctccgc gtgaggttcg cttctacggg caatttctca 1740
atcaggctcc tgaggggggg cgtgagcatt ggggacgtca ggctgggctc gacaatgaac 1800
cggggccagg agctgacata cgagagcttc ttcactcgcg agttcacgac aactggccca 1860
ttcaatccac ctttcacctt cacgcaggcc caggagatcc tcacagttaa cgctgagggc 1920
gtgtcgactg ggggcgagta ctacattgat aggatcgaga ttgttccagt gaacccagct 1980
agggaggctg aggaggacct ggaggctgcc aagaaggctg tggccagcct gttcacacgc 2040
actagggacg gcctccaggt caatgttacc gattaccagg tcgacagggc ggctaacctg 2100
gtttcatgcc tctccgatga gcagtactcc cacgacaaga agatgctcct ggaggccgtc 2160
cgggctgcta agcgcctgtc acgggagcgc aacctcctgc aggaccctga tttcaacacg 2220
atcaactcca ctgaggagaa tgggtggaag gccagcaacg gcgtgaccat ttcggagggg 2280
ggcccgttct tcaagggccg cgcgctccag ctggctagcg ctagggagaa ctaccctacg 2340
tacatctacc agaaggtcga tgcgtcggtt ctgaagccgt acacacgcta ccgcctcgac 2400
ggcttcgtga agtcctccca ggatctggag atcgacctca ttcaccatca caaggtccat 2460
ctggttaaga acgtgcccga caatctcgtc tccgatacct acagcgacgg gtcctgcagc 2520
ggaatcaacc gctgcgatga gcagcagcag gtggatatgc agctcgacgc cgagcatcac 2580
ccaatggact gctgcgaggc tgcccagacc cacgagttct cttcctacat caatacgggg 2640
gatctgaacg cctccgttga ccagggcatt tgggttgtgc tcaaagtgag gaccacggac 2700
gggtacgcta ccctgggcaa cctcgagctg gtggaggtcg ggccgctgag cggcgagtcg 2760
ctcgagaggg agcagaggga taacgctaag tggaatgctg agctgggcag gaagagggct 2820
gagaccgaca gggtctacct ggctgctaag caggcgatca atcacctctt cgtggattac 2880
caggaccagc agctgaaccc tgagatcggc ctcgctgaga ttaacgaggc ctctaatctg 2940
gtcaagtcga tctctggggt ttactcagat actctcctgc agatcccggg aattaactac 3000
gagatttaca ccgagctgtc cgaccggctc cagcaggctt cctacctcta cacgagccgc 3060
aacgccgtgc agaatgggga tttcaactcg ggcctggact cttggaacgc gacaactgat 3120
gcttctgtcc agcaggacgg ctcaacccat ttcctcgtgc tgtcacactg ggacgctcag 3180
gtgtcccagc agatgagggt caacctgaat tgcaagtacg tcctcagggt tacggcgaag 3240
aaggtcgggg gcggggatgg ctacgtcaca atcagggacg gcgcgcatca ccaggagacc 3300
ctcacgttca atgcttgcga ctacgatgtc aacggcacat acgttaacga caattcctac 3360
atcactaagg aggtcgtttt ctaccccgag accaagcaca tgtgggttga ggtgtctgag 3420
tcggagggct cgttctacat tgatagcatt gagttcattg agacgcagga gtga 3474
<210> 11
<211> 3522
<212> DNA
<213> 人工序列
<220>
<223> 合成Cry基因
<400> 11
atgaaccgga acaaccaggg cgagtacgag attattgatg cctccacttg cggctgctcc 60
tcagatgatg tcgtccagta cccactcgct cgcgacccga acgctgcttt ccagaacatg 120
aattacaagg actacctgaa gatgtctgac ggcgattacg tcgattcata cattaaccca 180
ggcctgtcca tcgggaggag ggacgtcacg ctcacaggcg tcgggatcgt tgctctcatt 240
gtgggcaccc tgggcggccc agttggcggg attgtgacgg gcctgatctc cagcctcctg 300
gggctcctgt ggccaagcaa cgacaatgat gtgtgggagg ccttcatggc gcagatcgag 360
gagctgattg agcagaggat cgctgaccag gtggtccgga acgccctgga caatctcacc 420
ggcctgaggg attactacaa ccagtacctc ctggctctcg aggagtggca ggagaggccc 480
aatgccgtga ggtctacgct ggtcttcaac cggttcgaga cgctccattc acacttcgtg 540
acatcaatgc catccttcgg cagcgggcct ggcagcgagc gctacgcggt tcagctcctg 600
accgtgtacg ctcaggctgc caacctgcac ctcctgctcc tgagggacgc tgatatctac 660
ggcgctcggt gggggctcag ggagtcccag atcgacctct acttcaacga gctgcagaat 720
cggacgcgcg attacacaaa ccattgcgtc acagcctaca acaatggcct ggaggagatc 780
agggggactt cgccagcttc ttggctgcgc taccaccagt tccggcgcga gaccacgctc 840
attgccctcg acctggtggc gatcttccca tactacaatg tcagggagta cccaattggc 900
gttaaccctc agctcacgcg ggacgtgtac acagatccga tcggcgtcac gttcaggcgg 960
gaggactggg agacaggcgt cgagtgcagg ccgtgggtta ataccccata catgtctttc 1020
tcagatctgg agaacgccat cattaggccg ccccatctct tcgagacgct ccggaatctg 1080
acgattcaca caggcaggta caacctggtc ggcggggcga ggttcatcga gggctgggtc 1140
gggcattccg ttactaatac caggctgggc aacagcactg tgttcaccag caattacggg 1200
tcgctcccac ctcggttcca ggtgttcaac ttcacgaatt tcgacgtcta ccagatcaac 1260
acacgggccg attcgacggg cacattccgc attccggggt tcgcggtcac tagggctcag 1320
ttcatccccg gcgggaccta ctccgtggct caccgcgacc caggcgcttg ccagcaggac 1380
tacgattcaa ttgaggagct gccctccctg gacccagatg agcctatcaa ccggtcctac 1440
agccatcgcc tctcacacgt caccctgtac aagtacactc tctccgacac cgattacggc 1500
gtgatcaatt acaccgacta cgggagcatg ccagcttacg tgtggacgca tcgcgacgtc 1560
gatctgacta acaccattac ggcggatagg atcacgcagc tcccgctggt gaaggcttcg 1620
acactccccg ccggcacaac tgttgtgaag gggcccggct tcaccggcgg ggacatcctg 1680
aggaggacca cgaatggcac gttcgggaca ctccacgtga gggtcaacag cccactgacc 1740
cagcagtaca ggctccgggt ccgcttcgct tcgacgggca acttctctat tagggtgctg 1800
aggggcggga catctatcgg cgacgctcgc ttcgggtcaa ctatgaacag gggccaggag 1860
ctgacttacg agtccttcgt gacccgcgag ttcacaacta ccggcccgtt caatccgccc 1920
ttcacattca ctcagaccca ggagatcctg actgtcaacg ctgagggcgt ttcgaccggc 1980
ggggagtact acatcgactc tattgagatc gttccagtga acccaaccag ggaggctgag 2040
gaggatctcg aggctgctaa gaaggccgtc gcgagcctgt tcacgaggac acgggacggc 2100
ctccaggtca atgttacgga ctaccaggtt gatagggctg ctaacctcgt gctgtgcctc 2160
tccgacgagc agtacgccca cgataagaag atgctcctgg aggcggtgag ggctgctaag 2220
aggctgagca gggagaggaa cctcctgcag gaccctgatt tcaacgagat caattctact 2280
gaggactcag gctggaagac cagcaacggg atcattatct cggagggcgg gccgttcttc 2340
aagggccggg ccctgcagct cgcttccgct cgcgagaact accctaccta catctaccag 2400
aaggtggact cgtctatgct gaagccgtac acgaggtaca agctcgacgg cttcgtgcag 2460
tcatcccagg atctcgagat tgagctgatc caccatcaca aggtgcacct cgtcaagaac 2520
gttccagaca atctggtcct cgacacctac cctgatggct cgtgcaacgg aatcaaccgc 2580
tgcgaggagc agcagatggt gaactctcag ctggagacgg agcatcaccc tatggactgc 2640
tgcgaggcct cacagactca tgagttcagc tcgtacatcc acaccggcga cctcaacgcg 2700
tctgtcgatc aggggatttg ggtcgttctg aagatcagga cgacagacgg ctcggctacc 2760
ctcgggaacc tggagctggt ggaggtcggc cccctgtcag gggagtccct cgagagggag 2820
cagagggaca acgccaagtg gaatgctgag ctgggccgga agcgcgctga ggctgatcgc 2880
gtgtaccagg gcgctaagca ggccatcaat cacctcttcg tcgactacca ggatcagcag 2940
ctgaaccctg aggttggcct cgcggagatc agcgaggctc ggaacctgat tgagtcgatc 3000
tctgacgtgt actgcgatgc cgtcctccgc attccgggaa tcaactacga gatgtacacg 3060
gagctgtcca acaggctgca gcaggctgct tacctgtaca caagccgcaa cgcggtgcag 3120
aatggcgact tcaactccgg gctcgatagc tggaatgcta ctaccgacgc caccgttcag 3180
caggatggca acatgtactt cctggtgctc agccactggg acgcccaggt ttcgcagcag 3240
ttccgcgtgc agccaaattg caagtatgtg ctgagggtca cagcgaagaa ggtcgggaac 3300
ggcgacggct acgtgactat ccaggatggc gcgcatcacc gcgagactct gaccttcaat 3360
gcttgcgact acgatgttaa cggcacgcat gtgaacgaca attcctacct cacaaaggag 3420
ctggagttct acccgaagac tgagcacatg tgggttgagg tgagcgagac tgagggcacc 3480
ttctaccttg attcgatcga gctgattgag acccaggagt ga 3522
<210> 12
<211> 3540
<212> DNA
<213> 人工序列
<220>
<223> 合成Cry基因
<400> 12
atggggggga agtctatgaa caggaacaac cagggcgagt acgagattat tgatgcctcc 60
acatgcgggt gctccagcga cgacgtggtc cagtacccac tcgctcgcga ccctaacgct 120
gctttccaga acatgaatta caaggactac ctgaagatgt ccgacggcga ttacgtggat 180
agctacatta acccaggcct ctcgatcggg aggagggacg tcactctgac cggggttggc 240
atcgtggcgc tgattgttgg cacactcggc gggcctgtgg gcgggattgt cactggcctc 300
atctccagcc tcctggggct cctgtggcca tccaacgaca atgatgtctg ggaggcgttc 360
atggctcaga tcgaggagct gattgagcag cgcatcgcgg accaggtggt caggaacgct 420
ctcgacaatc tgaccggcct cagggattac tacaaccagt acctcctggc tctcgaggag 480
tggcaggaga ggccaaatgc cgtgcgctcc acgctcgttt tcaaccgctt cgagaccctg 540
cacagccatt tcgtgacgag catgccgtcg ttcgggtctg gccccgggtc ggagcgctac 600
gctgtgcagc tcctgaccgt ctacgcccag gctgccaacc tccacctcct gctcctgcgc 660
gacgctgata tctacggcgc caggtggggg ctcagggaga gccagatcga cctgtacttc 720
aacgagctgc agaatcggac acgcgattac actaaccact gcgtcaccgc ctacaacaat 780
ggcctcgagg agatcagggg gacgtcacca gcttcctggc tccgctacca ccagttccgg 840
agggagacca cgctgattgc gctcgacctg gtggctatct tcccctacta caatgtgcgc 900
gagtacccga ttggcgtcaa cccccagctg accagggacg tttacaccga cccgatcggc 960
gtgacattca ggcgggagga ctgggagact ggcgtggagt gcaggccgtg ggtcaatacc 1020
ccatacatgt ctttctcaga cctcgagaac gccatcatta ggccgcccca cctgttcgag 1080
acgctgagga atctcaccat tcatacgggc aggtacaacc tggtcggcgg ggcgcgcttc 1140
atcgagggct gggttgggca ctcagtgacg aacacaaggc tcggcaattc cacagtgttc 1200
acttccaact acggcagcct gccacctcgg ttccaggttt tcaacttcac aaatttcgac 1260
gtgtaccaga tcaacactag ggccgattcg actggcacct tccggattcc agggttcgcc 1320
gttacccgcg cgcagttcat ccctggcggg acgtactccg tggctcaccg cgacccgggc 1380
gcttgccagc aggactacga tagcattgag gagctgccct cgctcgaccc agatgagcct 1440
atcaacaggt cctacagcca ccggctgtct catgtcaccc tctacaagta caccctgtca 1500
gacacggatt acggcgtgat caattacacc gactacgggt ccatgccagc ttacgtttgg 1560
acgcaccggg acgtggatct cacgaacaca attactgccg accgcatcac acagctccca 1620
ctggtgaagg ccagcactct gcctgcgggc acaactgttg tgaagggccc tgggttcacc 1680
ggcggggaca tcctcaggag gaccacgaat ggcaccttcg ggacgctgca tgtccgcgtt 1740
aactccccgc tcacacagca gtacaggctg cgggtgcgct tcgcttcgac tggcaacttc 1800
tctattcgcg tcctcagggg cgggacctcc atcggcgacg ctaggttcgg gagcacgatg 1860
aacaggggcc aggagctgac atacgagtcc ttcgtcacta gggagttcac aactaccggc 1920
ccgttcaatc cgcccttcac cttcacgcag acacaggaga ttctcaccgt taacgctgag 1980
ggcgtgagca cgggcgggga gtactacatc gactcgatcg agattgtgcc agtcaaccca 2040
accagggagg ctgaggagga tctggaggct gctaagaagg ccgtggcgag cctcttcact 2100
aggacccggg acggcctgca ggttaatgtg acggactacc aggtcgatca ggccgcgaac 2160
ctggttagct gcctctcgga cgagcagtac ggctacgata agaagatgct cctggaggcc 2220
gtccgcgctg ctaagaggct ctcgagggag aggaacctcc tgcaggaccc cgatttcaac 2280
acaattaatt ctactgagga gaacggctgg aaggcctcta atggggtgac catctcagag 2340
ggcgggccat tctacaaggg cagggcgctc cagctggctt cagctcggga gaactacccc 2400
acctacatct accagaaggt cgacgcctcc gagctgaagc catacacgcg ctaccgcctg 2460
gatggcttcg tgaagtcgtc tcaggacctg gagatcgatc tcattcacca tcacaaggtc 2520
cacctcgtta agaacgtgcc ggacaatctg gtctccgata cctaccccga cgattcgtgc 2580
tctggaatca acaggtgcca ggagcagcag atggtgaacg cccagctcga gacggagcat 2640
caccatccta tggactgctg cgaggcggct cagacccatg agttctcatc ctacatcgac 2700
acgggcgatc tcaacagctc ggtcgaccag gggatctggg cgattttcaa ggttaggacg 2760
acagatggct acgctaccct ggggaatctc gagctggtcg aggttggccc cctctctggg 2820
gagtcactgg agagggagca gagggacaac acaaagtggt ctgctgagct gggcaggaag 2880
cgggctgaga ctgaccgcgt ctaccaggat gccaagcagt ccatcaatca cctcttcgtg 2940
gactaccagg atcagcagct gaaccctgag attggcatgg ctgacatcat ggatgcccag 3000
aacctcgtcg cgtcaatctc cgacgtctac agcgatgcgg ttctgcagat cccgggcatt 3060
aattacgaga tctacacaga gctgtcgaac aggctccagc aggcgtcata cctctacacg 3120
tcccggaacg ctgtgcagaa tggcgacttc aacaatgggc tggattcgtg gaatgcgaca 3180
gctggcgcct ctgtgcagca ggacgggaac actcacttcc tcgtcctgtc tcattgggat 3240
gcccaggtct cacagcagtt ccgggttcag ccgaactgca agtatgtgct gcgcgttacc 3300
gctgagaaag tgggcggggg cgacggctac gtcacgatcc gcgatggggc tcaccatacg 3360
gagacactca ctttcaacgc ctgcgactac gatatcaatg gcacatacgt tactgacaac 3420
acctacctga cgaaggaggt catcttctac tcccacacag agcatatgtg ggtggaggtc 3480
aacgagactg agggcgcctt ccacctcgac agccttgagt tcgtggagac cgagaagtga 3540
<210> 13
<211> 1147
<212> PRT
<213> 苏云金芽孢杆菌
<400> 13
Met Asp Leu Asp Gly Asn Lys Thr Glu Thr Glu Thr Glu Ile Val Asn
1 5 10 15
Gly Ser Glu Ser Ser Ile Asp Pro Ser Ser Val Ser Tyr Ala Gly Asn
20 25 30
Asn Ser Tyr Ser Ser Ala Leu Asn Leu Asn Ser Cys Gln Asn Arg Gly
35 40 45
Ile Ala Gln Trp Val Asn Thr Leu Gly Gly Ala Ile Gly Gln Ala Val
50 55 60
Ser Ile Gly Thr Ser Ile Ile Ser Leu Leu Ala Ala Pro Thr Leu Thr
65 70 75 80
Gly Ser Ile Ser Leu Ala Phe Asn Leu Ile Arg Arg Met Gly Thr Gly
85 90 95
Ser Asn Gly Ser Ser Ile Ser Asp Leu Ser Ile Cys Asp Leu Leu Ser
100 105 110
Ile Ile Asn Leu Arg Val Ser Gln Ala Val Leu Asn Asp Gly Ile Ala
115 120 125
Asp Phe Asn Gly Ser Val Ala Val Tyr Asp Leu Tyr Leu His Ala Leu
130 135 140
Arg Ser Trp Asn Asn Asn Pro Asn Ala Ala Thr Ala Glu Glu Leu Arg
145 150 155 160
Thr Arg Phe Arg Ile Ala Asp Ser Glu Phe Glu Arg Ile Leu Thr Arg
165 170 175
Gly Ser Leu Thr His Gly Gly Ser Leu Ala Arg Gln Asp Ala Gln Val
180 185 190
Leu Leu Leu Pro Ser Phe Val Asn Ala Ala Tyr Leu His Leu Leu Ile
195 200 205
Leu Arg Asp Ala Ser Arg Tyr Gly Ala Ser Trp Gly Leu Phe Asn Thr
210 215 220
Thr Pro His Ile Asn Tyr Pro Val Arg Leu Gln Gln Leu Ile Gly Ser
225 230 235 240
Tyr Thr His Tyr Cys Thr His Trp Tyr Asn Gln Gly Leu Asn Glu Ile
245 250 255
Arg Gln Arg Gly Asn Thr Ala Val Asn Trp Leu Glu Phe His Arg Tyr
260 265 270
Arg Arg Asp Met Thr Leu Met Val Leu Asp Val Val Ser Leu Phe Ser
275 280 285
Ala Leu Asp Thr Ile Arg Tyr Pro Asn Ala Thr Val Val Gln Leu Ser
290 295 300
Arg Thr Val Tyr Thr Asp Pro Ile Gly Phe Val Asn Arg Gly Ser Gly
305 310 315 320
Asn Arg Leu Ser Trp Phe Asp Trp Arg Asn Gln Ala Asn Phe Ser Thr
325 330 335
Leu Glu Ser Glu Met Pro Thr Pro Ser Ser Pro Leu Ser Leu Asn His
340 345 350
Met Ser Ile Phe Thr Gly Pro Leu Thr Leu Pro Val Ser Pro Asn Thr
355 360 365
His Arg Ala Arg Val Trp Tyr Gly Asn Gln Asn Met Phe Thr Thr Gly
370 375 380
Ser Gln Asn Ser Gly Gln Thr Thr Asn Ser Ile Gln Asn Ile Ser Gly
385 390 395 400
Leu Glu Ile Phe Arg Ile Asp Ser Gln Ala Cys Asn Leu Asn Asn Asn
405 410 415
Ser Tyr Gly Val Asn Arg Ala Glu Phe Phe His Gly Ala Ser Gln Gly
420 425 430
Ser Gln Arg Ser Val Tyr Gln Gly Tyr Ile Arg Gln Ser Gly Leu Asp
435 440 445
Asn Pro Val Val Met Asn Leu Gln Ser Phe Leu Pro Gly Glu Asn Ser
450 455 460
Ala Thr Pro Thr Ala Gln Asp Tyr Thr His Ile Leu Ser Asn Pro Val
465 470 475 480
Asn Ile Arg Gly Gly Leu Arg Gln Ile Val Ala Asp Arg Arg Ser Ser
485 490 495
Val Val Val Tyr Gly Trp Thr His Lys Ser Leu Ser Arg Arg Ser Leu
500 505 510
Val Ala Pro Asp Gln Ile Thr Gln Val Pro Ala Val Lys Ala Ser Pro
515 520 525
Ser Ser His Cys Thr Ile Ile Ala Gly Pro Gly Phe Thr Gly Gly Asp
530 535 540
Leu Val Ser Leu Gln Pro Asn Gly Gln Leu Val Ile Pro Phe Gln Val
545 550 555 560
Ser Ala Pro Glu Thr Asn Tyr His Ile Arg Ile Cys Tyr Val Ser Thr
565 570 575
Ser Asp Cys Ser Ile Asn Thr Ile Cys Asn Asp Glu Thr His Leu Ser
580 585 590
Thr Leu Pro Ser Thr Thr Ser Ser Leu Glu Asn Leu Gln Cys Asn His
595 600 605
Leu His Tyr Phe Asn Val Gly Thr Phe Lys Pro Thr Ile Asp Ser Lys
610 615 620
Leu Thr Leu Val Asn Thr Ser Pro Asn Ala Asn Ile Ile Ile Asp Lys
625 630 635 640
Ile Glu Phe Ile Pro Val Asp Thr Ala Gln Gln Gln Asn Glu Asp Leu
645 650 655
Glu Ala Ala Lys Lys Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp
660 665 670
Gly Leu Gln Val Asn Val Lys Asp Tyr Gln Val Asp Gln Ala Ala Asn
675 680 685
Leu Val Ser Cys Leu Ser Asp Glu Gln Tyr Gly Tyr Asp Lys Lys Met
690 695 700
Leu Leu Glu Ala Val Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn
705 710 715 720
Leu Leu Gln Asp Pro Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn
725 730 735
Gly Trp Lys Ala Ser Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe
740 745 750
Tyr Lys Gly Arg Ala Ile Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro
755 760 765
Thr Tyr Ile Tyr Gln Lys Val Asp Ala Ser Glu Leu Lys Pro Tyr Thr
770 775 780
Arg Tyr Arg Leu Asp Gly Phe Val Lys Ser Ser Gln Asp Leu Glu Ile
785 790 795 800
Asp Leu Ile His His His Lys Val His Leu Val Lys Asn Val Pro Asp
805 810 815
Asn Leu Val Ser Asp Thr Tyr Pro Asp Asp Ser Cys Ser Gly Ile Asn
820 825 830
Arg Cys Gln Glu Gln Gln Met Val Asn Ala Gln Leu Glu Thr Glu His
835 840 845
His His Pro Met Asp Cys Cys Glu Ala Ala Gln Thr His Glu Phe Ser
850 855 860
Ser Tyr Ile Asp Thr Gly Asp Leu Asn Ser Ser Val Asp Gln Gly Ile
865 870 875 880
Trp Ala Ile Phe Lys Val Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly
885 890 895
Asn Leu Glu Leu Val Glu Val Gly Pro Leu Ser Gly Glu Ser Leu Glu
900 905 910
Arg Glu Gln Arg Asp Asn Thr Lys Trp Ser Ala Glu Leu Gly Arg Lys
915 920 925
Arg Ala Glu Thr Asp Arg Val Tyr Gln Asp Ala Lys Gln Ser Ile Asn
930 935 940
His Leu Phe Val Asp Tyr Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly
945 950 955 960
Met Ala Asp Ile Met Asp Ala Gln Asn Leu Val Ala Ser Ile Ser Asp
965 970 975
Val Tyr Ser Asp Ala Val Leu Gln Ile Pro Gly Ile Asn Tyr Glu Ile
980 985 990
Tyr Thr Glu Leu Ser Asn Arg Leu Gln Gln Ala Ser Tyr Leu Tyr Thr
995 1000 1005
Ser Arg Asn Ala Val Gln Asn Gly Asp Phe Asn Asn Gly Leu Asp
1010 1015 1020
Ser Trp Asn Ala Thr Ala Gly Ala Ser Val Gln Gln Asp Gly Asn
1025 1030 1035
Thr His Phe Leu Val Leu Ser His Trp Asp Ala Gln Val Ser Gln
1040 1045 1050
Gln Phe Arg Val Gln Pro Asn Cys Lys Tyr Val Leu Arg Val Thr
1055 1060 1065
Ala Glu Lys Val Gly Gly Gly Asp Gly Tyr Val Thr Ile Arg Asp
1070 1075 1080
Asp Ala His His Thr Glu Thr Leu Thr Phe Asn Ala Cys Asp Tyr
1085 1090 1095
Asp Ile Asn Gly Thr Tyr Val Thr Asp Asn Thr Tyr Leu Thr Lys
1100 1105 1110
Glu Val Val Phe His Pro Glu Thr Gln His Met Trp Val Glu Val
1115 1120 1125
Asn Glu Thr Glu Gly Ala Phe His Ile Asp Ser Ile Glu Phe Val
1130 1135 1140
Glu Thr Glu Lys
1145
<210> 14
<211> 1157
<212> PRT
<213> 苏云金芽孢杆菌
<400> 14
Met Asn Arg Asn Asn Gln Asn Glu Tyr Glu Ile Ile Asp Ala Pro His
1 5 10 15
Cys Gly Cys Pro Ser Asp Asp Asp Val Lys Tyr Pro Leu Ala Ser Asp
20 25 30
Pro Asn Ala Ala Leu Gln Asn Met Asn Tyr Lys Asp Tyr Leu Gln Met
35 40 45
Thr Asp Glu Asp Tyr Thr Asp Ser Tyr Ile Asn Pro Ser Leu Ser Ile
50 55 60
Ser Gly Arg Asp Ala Val Gln Thr Ala Leu Thr Val Val Gly Arg Ile
65 70 75 80
Leu Gly Ala Leu Gly Val Pro Phe Ser Gly Gln Ile Val Ser Phe Tyr
85 90 95
Gln Phe Leu Leu Asn Thr Leu Trp Pro Val Asn Asp Thr Ala Ile Trp
100 105 110
Glu Ala Phe Met Arg Gln Val Glu Glu Leu Val Asn Gln Gln Ile Thr
115 120 125
Glu Phe Ala Arg Asn Gln Ala Leu Ala Arg Leu Gln Gly Leu Gly Asp
130 135 140
Ser Phe Asn Val Tyr Gln Arg Ser Leu Gln Asn Trp Leu Ala Asp Arg
145 150 155 160
Asn Asp Thr Arg Asn Leu Ser Val Val Arg Ala Gln Phe Ile Ala Leu
165 170 175
Asp Leu Asp Phe Val Asn Ala Ile Pro Leu Phe Ala Val Asn Gly Gln
180 185 190
Gln Val Pro Leu Leu Ser Val Tyr Ala Gln Ala Val Asn Leu His Leu
195 200 205
Leu Leu Leu Lys Asp Ala Ser Leu Phe Gly Glu Gly Trp Gly Phe Thr
210 215 220
Gln Gly Glu Ile Ser Thr Tyr Tyr Asp Arg Gln Leu Glu Leu Thr Ala
225 230 235 240
Arg Tyr Thr Asn Tyr Cys Glu Thr Trp Tyr Asn Thr Gly Leu Asp Arg
245 250 255
Leu Arg Gly Thr Asn Thr Glu Ser Trp Leu Arg Tyr His Gln Phe Arg
260 265 270
Arg Glu Met Thr Leu Val Val Leu Asp Val Val Ala Leu Phe Pro Tyr
275 280 285
Tyr Asp Val Arg Leu Tyr Pro Thr Gly Ser Asn Pro Gln Leu Thr Arg
290 295 300
Glu Val Tyr Thr Asp Pro Ile Val Phe Asn Pro Pro Ala Asn Val Gly
305 310 315 320
Leu Cys Arg Arg Trp Gly Thr Asn Pro Tyr Asn Thr Phe Ser Glu Leu
325 330 335
Glu Asn Ala Phe Ile Arg Pro Pro His Leu Phe Asp Arg Leu Asn Ser
340 345 350
Leu Thr Ile Ser Ser Asn Arg Phe Pro Val Ser Ser Asn Phe Met Asp
355 360 365
Tyr Trp Ser Gly His Thr Leu Arg Arg Ser Tyr Leu Asn Asp Ser Ala
370 375 380
Val Gln Glu Asp Ser Tyr Gly Leu Ile Thr Thr Thr Arg Ala Thr Ile
385 390 395 400
Asn Pro Gly Val Asp Gly Thr Asn Arg Ile Glu Ser Thr Ala Val Asp
405 410 415
Phe Arg Ser Ala Leu Ile Gly Ile Tyr Gly Val Asn Arg Ala Ser Phe
420 425 430
Val Pro Gly Gly Leu Phe Asn Gly Thr Thr Ser Pro Ala Asn Gly Gly
435 440 445
Cys Arg Asp Leu Tyr Asp Thr Asn Asp Glu Leu Pro Pro Asp Glu Ser
450 455 460
Thr Gly Ser Ser Thr His Arg Leu Ser His Val Thr Phe Phe Ser Phe
465 470 475 480
Gln Thr Asn Gln Ala Gly Ser Ile Ala Asn Ala Gly Ser Val Pro Thr
485 490 495
Tyr Val Trp Thr Arg Arg Asp Val Asp Leu Asn Asn Thr Ile Thr Pro
500 505 510
Asn Arg Ile Thr Gln Leu Pro Leu Val Lys Ala Ser Ala Pro Val Ser
515 520 525
Gly Thr Thr Val Leu Lys Gly Pro Gly Phe Thr Gly Gly Gly Ile Leu
530 535 540
Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu Arg Val Thr Val Asn
545 550 555 560
Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Val Arg Phe Ala Ser Thr
565 570 575
Gly Asn Phe Ser Ile Arg Leu Leu Arg Gly Gly Val Ser Ile Gly Asp
580 585 590
Val Arg Leu Gly Ser Thr Met Asn Arg Gly Gln Glu Leu Thr Tyr Glu
595 600 605
Ser Phe Phe Thr Arg Glu Phe Thr Thr Thr Gly Pro Phe Asn Pro Pro
610 615 620
Phe Thr Phe Thr Gln Ala Gln Glu Ile Leu Thr Val Asn Ala Glu Gly
625 630 635 640
Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Arg Ile Glu Ile Val Pro
645 650 655
Val Asn Pro Ala Arg Glu Ala Glu Glu Asp Leu Glu Ala Ala Lys Lys
660 665 670
Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gln Val Asn
675 680 685
Val Thr Asp Tyr Gln Val Asp Arg Ala Ala Asn Leu Val Ser Cys Leu
690 695 700
Ser Asp Glu Gln Tyr Ser His Asp Lys Lys Met Leu Leu Glu Ala Val
705 710 715 720
Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gln Asp Pro
725 730 735
Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser
740 745 750
Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe Phe Lys Gly Arg Ala
755 760 765
Leu Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile Tyr Gln
770 775 780
Lys Val Asp Ala Ser Val Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp
785 790 795 800
Gly Phe Val Lys Ser Ser Gln Asp Leu Glu Ile Asp Leu Ile His His
805 810 815
His Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser Asp
820 825 830
Thr Tyr Ser Asp Gly Ser Cys Ser Gly Ile Asn Arg Cys Asp Glu Gln
835 840 845
Gln Gln Val Asp Met Gln Leu Asp Ala Glu His His Pro Met Asp Cys
850 855 860
Cys Glu Ala Ala Gln Thr His Glu Phe Ser Ser Tyr Ile Asn Thr Gly
865 870 875 880
Asp Leu Asn Ala Ser Val Asp Gln Gly Ile Trp Val Val Leu Lys Val
885 890 895
Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu
900 905 910
Val Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg Asp Asn
915 920 925
Ala Lys Trp Asn Ala Glu Leu Gly Arg Lys Arg Ala Glu Thr Asp Arg
930 935 940
Val Tyr Leu Ala Ala Lys Gln Ala Ile Asn His Leu Phe Val Asp Tyr
945 950 955 960
Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly Leu Ala Glu Ile Asn Glu
965 970 975
Ala Ser Asn Leu Val Lys Ser Ile Ser Gly Val Tyr Ser Asp Thr Leu
980 985 990
Leu Gln Ile Pro Gly Ile Asn Tyr Glu Ile Tyr Thr Glu Leu Ser Asp
995 1000 1005
Arg Leu Gln Gln Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val
1010 1015 1020
Gln Asn Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Ala Thr
1025 1030 1035
Thr Asp Ala Ser Val Gln Gln Asp Gly Ser Thr His Phe Leu Val
1040 1045 1050
Leu Ser His Trp Asp Ala Gln Val Ser Gln Gln Met Arg Val Asn
1055 1060 1065
Leu Asn Cys Lys Tyr Val Leu Arg Val Thr Ala Lys Lys Val Gly
1070 1075 1080
Gly Gly Asp Gly Tyr Val Thr Ile Arg Asp Gly Ala His His Gln
1085 1090 1095
Glu Thr Leu Thr Phe Asn Ala Cys Asp Tyr Asp Val Asn Gly Thr
1100 1105 1110
Tyr Val Asn Asp Asn Ser Tyr Ile Thr Lys Glu Val Val Phe Tyr
1115 1120 1125
Pro Glu Thr Lys His Met Trp Val Glu Val Ser Glu Ser Glu Gly
1130 1135 1140
Ser Phe Tyr Ile Asp Ser Ile Glu Phe Ile Glu Thr Gln Glu
1145 1150 1155
<210> 15
<211> 1173
<212> PRT
<213> 苏云金芽孢杆菌
<400> 15
Met Asn Arg Asn Asn Gln Gly Glu Tyr Glu Ile Ile Asp Ala Ser Thr
1 5 10 15
Cys Gly Cys Ser Ser Asp Asp Val Val Gln Tyr Pro Leu Ala Arg Asp
20 25 30
Pro Asn Ala Ala Phe Gln Asn Met Asn Tyr Lys Asp Tyr Leu Lys Met
35 40 45
Ser Asp Gly Asp Tyr Val Asp Ser Tyr Ile Asn Pro Gly Leu Ser Ile
50 55 60
Gly Arg Arg Asp Val Thr Leu Thr Gly Val Gly Ile Val Ala Leu Ile
65 70 75 80
Val Gly Thr Leu Gly Gly Pro Val Gly Gly Ile Val Thr Gly Leu Ile
85 90 95
Ser Ser Leu Leu Gly Leu Leu Trp Pro Ser Asn Asp Asn Asp Val Trp
100 105 110
Glu Ala Phe Met Ala Gln Ile Glu Glu Leu Ile Glu Gln Arg Ile Ala
115 120 125
Asp Gln Val Val Arg Asn Ala Leu Asp Asn Leu Thr Gly Leu Arg Asp
130 135 140
Tyr Tyr Asn Gln Tyr Leu Leu Ala Leu Glu Glu Trp Gln Glu Arg Pro
145 150 155 160
Asn Ala Val Arg Ser Thr Leu Val Phe Asn Arg Phe Glu Thr Leu His
165 170 175
Ser His Phe Val Thr Ser Met Pro Ser Phe Gly Ser Gly Pro Gly Ser
180 185 190
Glu Arg Tyr Ala Val Gln Leu Leu Thr Val Tyr Ala Gln Ala Ala Asn
195 200 205
Leu His Leu Leu Leu Leu Arg Asp Ala Asp Ile Tyr Gly Ala Arg Trp
210 215 220
Gly Leu Arg Glu Ser Gln Ile Asp Leu Tyr Phe Asn Glu Leu Gln Asn
225 230 235 240
Arg Thr Arg Asp Tyr Thr Asn His Cys Val Thr Ala Tyr Asn Asn Gly
245 250 255
Leu Glu Glu Ile Arg Gly Thr Ser Pro Ala Ser Trp Leu Arg Tyr His
260 265 270
Gln Phe Arg Arg Glu Thr Thr Leu Ile Ala Leu Asp Leu Val Ala Ile
275 280 285
Phe Pro Tyr Tyr Asn Val Arg Glu Tyr Pro Ile Gly Val Asn Pro Gln
290 295 300
Leu Thr Arg Asp Val Tyr Thr Asp Pro Ile Gly Val Thr Phe Arg Arg
305 310 315 320
Glu Asp Trp Glu Thr Gly Val Glu Cys Arg Pro Trp Val Asn Thr Pro
325 330 335
Tyr Met Ser Phe Ser Asp Leu Glu Asn Ala Ile Ile Arg Pro Pro His
340 345 350
Leu Phe Glu Thr Leu Arg Asn Leu Thr Ile His Thr Gly Arg Tyr Asn
355 360 365
Leu Val Gly Gly Ala Arg Phe Ile Glu Gly Trp Val Gly His Ser Val
370 375 380
Thr Asn Thr Arg Leu Gly Asn Ser Thr Val Phe Thr Ser Asn Tyr Gly
385 390 395 400
Ser Leu Pro Pro Arg Phe Gln Val Phe Asn Phe Thr Asn Phe Asp Val
405 410 415
Tyr Gln Ile Asn Thr Arg Ala Asp Ser Thr Gly Thr Phe Arg Ile Pro
420 425 430
Gly Phe Ala Val Thr Arg Ala Gln Phe Ile Pro Gly Gly Thr Tyr Ser
435 440 445
Val Ala His Arg Asp Pro Gly Ala Cys Gln Gln Asp Tyr Asp Ser Ile
450 455 460
Glu Glu Leu Pro Ser Leu Asp Pro Asp Glu Pro Ile Asn Arg Ser Tyr
465 470 475 480
Ser His Arg Leu Ser His Val Thr Leu Tyr Lys Tyr Thr Leu Ser Asp
485 490 495
Thr Asp Tyr Gly Val Ile Asn Tyr Thr Asp Tyr Gly Ser Met Pro Ala
500 505 510
Tyr Val Trp Thr His Arg Asp Val Asp Leu Thr Asn Thr Ile Thr Ala
515 520 525
Asp Arg Ile Thr Gln Leu Pro Leu Val Lys Ala Ser Thr Leu Pro Ala
530 535 540
Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu
545 550 555 560
Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu His Val Arg Val Asn
565 570 575
Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Val Arg Phe Ala Ser Thr
580 585 590
Gly Asn Phe Ser Ile Arg Val Leu Arg Gly Gly Thr Ser Ile Gly Asp
595 600 605
Ala Arg Phe Gly Ser Thr Met Asn Arg Gly Gln Glu Leu Thr Tyr Glu
610 615 620
Ser Phe Val Thr Arg Glu Phe Thr Thr Thr Gly Pro Phe Asn Pro Pro
625 630 635 640
Phe Thr Phe Thr Gln Thr Gln Glu Ile Leu Thr Val Asn Ala Glu Gly
645 650 655
Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Ser Ile Glu Ile Val Pro
660 665 670
Val Asn Pro Thr Arg Glu Ala Glu Glu Asp Leu Glu Ala Ala Lys Lys
675 680 685
Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gln Val Asn
690 695 700
Val Thr Asp Tyr Gln Val Asp Arg Ala Ala Asn Leu Val Leu Cys Leu
705 710 715 720
Ser Asp Glu Gln Tyr Ala His Asp Lys Lys Met Leu Leu Glu Ala Val
725 730 735
Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gln Asp Pro
740 745 750
Asp Phe Asn Glu Ile Asn Ser Thr Glu Asp Ser Gly Trp Lys Thr Ser
755 760 765
Asn Gly Ile Ile Ile Ser Glu Gly Gly Pro Phe Phe Lys Gly Arg Ala
770 775 780
Leu Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile Tyr Gln
785 790 795 800
Lys Val Asp Ser Ser Met Leu Lys Pro Tyr Thr Arg Tyr Lys Leu Asp
805 810 815
Gly Phe Val Gln Ser Ser Gln Asp Leu Glu Ile Glu Leu Ile His His
820 825 830
His Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Leu Asp
835 840 845
Thr Tyr Pro Asp Gly Ser Cys Asn Gly Ile Asn Arg Cys Glu Glu Gln
850 855 860
Gln Met Val Asn Ser Gln Leu Glu Thr Glu His His Pro Met Asp Cys
865 870 875 880
Cys Glu Ala Ser Gln Thr His Glu Phe Ser Ser Tyr Ile His Thr Gly
885 890 895
Asp Leu Asn Ala Ser Val Asp Gln Gly Ile Trp Val Val Leu Lys Ile
900 905 910
Arg Thr Thr Asp Gly Ser Ala Thr Leu Gly Asn Leu Glu Leu Val Glu
915 920 925
Val Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg Asp Asn
930 935 940
Ala Lys Trp Asn Ala Glu Leu Gly Arg Lys Arg Ala Glu Ala Asp Arg
945 950 955 960
Val Tyr Gln Gly Ala Lys Gln Ala Ile Asn His Leu Phe Val Asp Tyr
965 970 975
Gln Asp Gln Gln Leu Asn Pro Glu Val Gly Leu Ala Glu Ile Ser Glu
980 985 990
Ala Arg Asn Leu Ile Glu Ser Ile Ser Asp Val Tyr Cys Asp Ala Val
995 1000 1005
Leu Arg Ile Pro Gly Ile Asn Tyr Glu Met Tyr Thr Glu Leu Ser
1010 1015 1020
Asn Arg Leu Gln Gln Ala Ala Tyr Leu Tyr Thr Ser Arg Asn Ala
1025 1030 1035
Val Gln Asn Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Ala
1040 1045 1050
Thr Thr Asp Ala Thr Val Gln Gln Asp Gly Asn Met Tyr Phe Leu
1055 1060 1065
Val Leu Ser His Trp Asp Ala Gln Val Ser Gln Gln Phe Arg Val
1070 1075 1080
Gln Pro Asn Cys Lys Tyr Val Leu Arg Val Thr Ala Lys Lys Val
1085 1090 1095
Gly Asn Gly Asp Gly Tyr Val Thr Ile Gln Asp Gly Ala His His
1100 1105 1110
Arg Glu Thr Leu Thr Phe Asn Ala Cys Asp Tyr Asp Val Asn Gly
1115 1120 1125
Thr His Val Asn Asp Asn Ser Tyr Ile Thr Lys Glu Leu Glu Phe
1130 1135 1140
Tyr Pro Lys Thr Glu His Met Trp Val Glu Val Ser Glu Thr Glu
1145 1150 1155
Gly Thr Phe Tyr Ile Asp Ser Ile Glu Leu Ile Glu Thr Gln Glu
1160 1165 1170
<210> 16
<211> 1179
<212> PRT
<213> 苏云金芽孢杆菌
<400> 16
Met Gly Gly Lys Ser Met Asn Arg Asn Asn Gln Gly Glu Tyr Glu Ile
1 5 10 15
Ile Asp Ala Ser Thr Cys Gly Cys Ser Ser Asp Asp Val Val Gln Tyr
20 25 30
Pro Leu Ala Arg Asp Pro Asn Ala Ala Phe Gln Asn Met Asn Tyr Lys
35 40 45
Asp Tyr Leu Lys Met Ser Asp Gly Asp Tyr Val Asp Ser Tyr Ile Asn
50 55 60
Pro Gly Leu Ser Ile Gly Arg Arg Asp Val Thr Leu Thr Gly Val Gly
65 70 75 80
Ile Val Ala Leu Ile Val Gly Thr Leu Gly Gly Pro Val Gly Gly Ile
85 90 95
Val Thr Gly Leu Ile Ser Ser Leu Leu Gly Leu Leu Trp Pro Ser Asn
100 105 110
Asp Asn Asp Val Trp Glu Ala Phe Met Ala Gln Ile Glu Glu Leu Ile
115 120 125
Glu Gln Arg Ile Ala Asp Gln Val Val Arg Asn Ala Leu Asp Asn Leu
130 135 140
Thr Gly Leu Arg Asp Tyr Tyr Asn Gln Tyr Leu Leu Ala Leu Glu Glu
145 150 155 160
Trp Gln Glu Arg Pro Asn Ala Val Arg Ser Thr Leu Val Phe Asn Arg
165 170 175
Phe Glu Thr Leu His Ser His Phe Val Thr Ser Met Pro Ser Phe Gly
180 185 190
Ser Gly Pro Gly Ser Glu Arg Tyr Ala Val Gln Leu Leu Thr Val Tyr
195 200 205
Ala Gln Ala Ala Asn Leu His Leu Leu Leu Leu Arg Asp Ala Asp Ile
210 215 220
Tyr Gly Ala Arg Trp Gly Leu Arg Glu Ser Gln Ile Asp Leu Tyr Phe
225 230 235 240
Asn Glu Leu Gln Asn Arg Thr Arg Asp Tyr Thr Asn His Cys Val Thr
245 250 255
Ala Tyr Asn Asn Gly Leu Glu Glu Ile Arg Gly Thr Ser Pro Ala Ser
260 265 270
Trp Leu Arg Tyr His Gln Phe Arg Arg Glu Thr Thr Leu Ile Ala Leu
275 280 285
Asp Leu Val Ala Ile Phe Pro Tyr Tyr Asn Val Arg Glu Tyr Pro Ile
290 295 300
Gly Val Asn Pro Gln Leu Thr Arg Asp Val Tyr Thr Asp Pro Ile Gly
305 310 315 320
Val Thr Phe Arg Arg Glu Asp Trp Glu Thr Gly Val Glu Cys Arg Pro
325 330 335
Trp Val Asn Thr Pro Tyr Met Ser Phe Ser Asp Leu Glu Asn Ala Ile
340 345 350
Ile Arg Pro Pro His Leu Phe Glu Thr Leu Arg Asn Leu Thr Ile His
355 360 365
Thr Gly Arg Tyr Asn Leu Val Gly Gly Ala Arg Phe Ile Glu Gly Trp
370 375 380
Val Gly His Ser Val Thr Asn Thr Arg Leu Gly Asn Ser Thr Val Phe
385 390 395 400
Thr Ser Asn Tyr Gly Ser Leu Pro Pro Arg Phe Gln Val Phe Asn Phe
405 410 415
Thr Asn Phe Asp Val Tyr Gln Ile Asn Thr Arg Ala Asp Ser Thr Gly
420 425 430
Thr Phe Arg Ile Pro Gly Phe Ala Val Thr Arg Ala Gln Phe Ile Pro
435 440 445
Gly Gly Thr Tyr Ser Val Ala His Arg Asp Pro Gly Ala Cys Gln Gln
450 455 460
Asp Tyr Asp Ser Ile Glu Glu Leu Pro Ser Leu Asp Pro Asp Glu Pro
465 470 475 480
Ile Asn Arg Ser Tyr Ser His Arg Leu Ser His Val Thr Leu Tyr Lys
485 490 495
Tyr Thr Leu Ser Asp Thr Asp Tyr Gly Val Ile Asn Tyr Thr Asp Tyr
500 505 510
Gly Ser Met Pro Ala Tyr Val Trp Thr His Arg Asp Val Asp Leu Thr
515 520 525
Asn Thr Ile Thr Ala Asp Arg Ile Thr Gln Leu Pro Leu Val Lys Ala
530 535 540
Ser Thr Leu Pro Ala Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr
545 550 555 560
Gly Gly Asp Ile Leu Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu
565 570 575
His Val Arg Val Asn Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Val
580 585 590
Arg Phe Ala Ser Thr Gly Asn Phe Ser Ile Arg Val Leu Arg Gly Gly
595 600 605
Thr Ser Ile Gly Asp Ala Arg Phe Gly Ser Thr Met Asn Arg Gly Gln
610 615 620
Glu Leu Thr Tyr Glu Ser Phe Val Thr Arg Glu Phe Thr Thr Thr Gly
625 630 635 640
Pro Phe Asn Pro Pro Phe Thr Phe Thr Gln Thr Gln Glu Ile Leu Thr
645 650 655
Val Asn Ala Glu Gly Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Ser
660 665 670
Ile Glu Ile Val Pro Val Asn Pro Thr Arg Glu Ala Glu Glu Asp Leu
675 680 685
Glu Ala Ala Lys Lys Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp
690 695 700
Gly Leu Gln Val Asn Val Thr Asp Tyr Gln Val Asp Gln Ala Ala Asn
705 710 715 720
Leu Val Ser Cys Leu Ser Asp Glu Gln Tyr Gly Tyr Asp Lys Lys Met
725 730 735
Leu Leu Glu Ala Val Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn
740 745 750
Leu Leu Gln Asp Pro Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn
755 760 765
Gly Trp Lys Ala Ser Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe
770 775 780
Tyr Lys Gly Arg Ala Leu Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro
785 790 795 800
Thr Tyr Ile Tyr Gln Lys Val Asp Ala Ser Glu Leu Lys Pro Tyr Thr
805 810 815
Arg Tyr Arg Leu Asp Gly Phe Val Lys Ser Ser Gln Asp Leu Glu Ile
820 825 830
Asp Leu Ile His His His Lys Val His Leu Val Lys Asn Val Pro Asp
835 840 845
Asn Leu Val Ser Asp Thr Tyr Pro Asp Asp Ser Cys Ser Gly Ile Asn
850 855 860
Arg Cys Gln Glu Gln Gln Met Val Asn Ala Gln Leu Glu Thr Glu His
865 870 875 880
His His Pro Met Asp Cys Cys Glu Ala Ala Gln Thr His Glu Phe Ser
885 890 895
Ser Tyr Ile Asp Thr Gly Asp Leu Asn Ser Ser Val Asp Gln Gly Ile
900 905 910
Trp Ala Ile Phe Lys Val Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly
915 920 925
Asn Leu Glu Leu Val Glu Val Gly Pro Leu Ser Gly Glu Ser Leu Glu
930 935 940
Arg Glu Gln Arg Asp Asn Thr Lys Trp Ser Ala Glu Leu Gly Arg Lys
945 950 955 960
Arg Ala Glu Thr Asp Arg Val Tyr Gln Asp Ala Lys Gln Ser Ile Asn
965 970 975
His Leu Phe Val Asp Tyr Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly
980 985 990
Met Ala Asp Ile Met Asp Ala Gln Asn Leu Val Ala Ser Ile Ser Asp
995 1000 1005
Val Tyr Ser Asp Ala Val Leu Gln Ile Pro Gly Ile Asn Tyr Glu
1010 1015 1020
Ile Tyr Thr Glu Leu Ser Asn Arg Leu Gln Gln Ala Ser Tyr Leu
1025 1030 1035
Tyr Thr Ser Arg Asn Ala Val Gln Asn Gly Asp Phe Asn Asn Gly
1040 1045 1050
Leu Asp Ser Trp Asn Ala Thr Ala Gly Ala Ser Val Gln Gln Asp
1055 1060 1065
Gly Asn Thr His Phe Leu Val Leu Ser His Trp Asp Ala Gln Val
1070 1075 1080
Ser Gln Gln Phe Arg Val Gln Pro Asn Cys Lys Tyr Val Leu Arg
1085 1090 1095
Val Thr Ala Glu Lys Val Gly Gly Gly Asp Gly Tyr Val Thr Ile
1100 1105 1110
Arg Asp Gly Ala His His Thr Glu Thr Leu Thr Phe Asn Ala Cys
1115 1120 1125
Asp Tyr Asp Ile Asn Gly Thr Tyr Val Thr Asp Asn Thr Tyr Leu
1130 1135 1140
Thr Lys Glu Val Ile Phe Tyr Ser His Thr Glu His Met Trp Val
1145 1150 1155
Glu Val Asn Glu Thr Glu Gly Ala Phe His Ile Asp Ser Ile Glu
1160 1165 1170
Phe Val Glu Thr Glu Lys
1175
<210> 17
<211> 1147
<212> PRT
<213> 人工序列
<220>
<223> 突变型BT-0044
<400> 17
Met Asp Leu Asp Gly Asn Lys Thr Glu Thr Glu Thr Glu Ile Val Asn
1 5 10 15
Gly Ser Glu Ser Ser Ile Asp Pro Ser Ser Val Ser Tyr Ala Gly Asn
20 25 30
Asn Ser Tyr Ser Ser Ala Leu Asn Leu Asn Ser Cys Gln Asn Arg Gly
35 40 45
Ile Ala Gln Trp Val Asn Thr Leu Gly Gly Ala Ile Gly Gln Ala Val
50 55 60
Ser Ile Gly Thr Ser Ile Ile Ser Leu Leu Ala Ala Pro Thr Leu Thr
65 70 75 80
Gly Ser Ile Ser Leu Ala Phe Asn Leu Ile Arg Arg Met Gly Thr Gly
85 90 95
Ser Asn Gly Ser Ser Ile Ser Asp Leu Ser Ile Cys Asp Leu Leu Ser
100 105 110
Ile Ile Asn Leu Arg Val Ser Gln Ala Val Leu Asn Asp Gly Ile Ala
115 120 125
Asp Phe Asn Gly Ser Val Ala Val Tyr Asp Leu Tyr Leu His Ala Leu
130 135 140
Arg Ser Trp Asn Asn Asn Pro Asn Ala Ala Thr Ala Glu Glu Leu Arg
145 150 155 160
Thr Arg Phe Arg Ile Ala Asp Ser Glu Phe Glu Arg Ile Leu Thr Arg
165 170 175
Gly Ser Leu Thr His Gly Gly Ser Leu Ala Arg Gln Asp Ala Gln Val
180 185 190
Leu Leu Leu Pro Ser Phe Val Asn Ala Ala Tyr Leu His Leu Leu Ile
195 200 205
Leu Arg Asp Ala Ser Arg Tyr Gly Ala Ser Trp Gly Leu Phe Asn Thr
210 215 220
Thr Pro His Ile Asn Tyr Pro Val Arg Leu Gln Gln Leu Ile Gly Ser
225 230 235 240
Tyr Thr His Tyr Cys Thr His Trp Tyr Asn Gln Gly Leu Asn Glu Ile
245 250 255
Arg Gln Arg Gly Asn Thr Ala Val Asn Trp Leu Glu Phe His Arg Tyr
260 265 270
Arg Arg Asp Met Thr Leu Met Val Leu Asp Val Val Ser Leu Phe Ser
275 280 285
Ala Leu Asp Thr Ile Arg Tyr Pro Asn Ala Thr Val Val Gln Leu Ser
290 295 300
Arg Thr Val Tyr Thr Asp Pro Ile Gly Phe Val Asn Arg Gly Ser Gly
305 310 315 320
Asn Arg Leu Ser Trp Phe Asp Trp Arg Asn Gln Ala Asn Phe Ser Thr
325 330 335
Leu Glu Ser Glu Met Pro Thr Pro Ser Ser Pro Leu Ser Leu Asn His
340 345 350
Met Ser Ile Phe Thr Gly Pro Leu Thr Leu Pro Val Ser Pro Asn Thr
355 360 365
His Arg Ala Arg Val Trp Tyr Gly Asn Gln Asn Met Phe Thr Thr Gly
370 375 380
Ser Gln Asn Ser Gly Gln Thr Thr Asn Ser Ile Gln Asn Ile Ser Gly
385 390 395 400
Leu Glu Ile Phe Arg Ile Asp Ser Gln Ala Cys Asn Leu Asn Asn Asn
405 410 415
Ser Tyr Gly Val Asn Arg Ala Glu Phe Phe His Gly Ala Ser Gln Gly
420 425 430
Ser Gln Arg Ser Val Tyr Gln Gly Tyr Ile Arg Gln Ser Gly Leu Asp
435 440 445
Asn Pro Val Val Met Asn Leu Gln Ser Phe Leu Pro Gly Glu Asn Ser
450 455 460
Ala Thr Pro Thr Ala Gln Asp Tyr Thr His Ile Leu Ser Asn Pro Val
465 470 475 480
Asn Ile Arg Gly Gly Leu Arg Gln Ile Val Ala Asp Arg Arg Ser Ser
485 490 495
Val Val Val Tyr Gly Trp Thr His Lys Ser Leu Ser Arg Arg Ser Leu
500 505 510
Val Ala Pro Asp Gln Ile Thr Gln Val Pro Ala Val Lys Ala Ser Pro
515 520 525
Ser Ser His Cys Thr Ile Ile Ala Gly Pro Gly Phe Thr Gly Gly Asp
530 535 540
Leu Val Ser Leu Gln Pro Asn Gly Gln Leu Val Ile Pro Phe Gln Val
545 550 555 560
Ser Ala Pro Glu Thr Asn Tyr His Ile Arg Ile Cys Tyr Val Ser Thr
565 570 575
Ser Asp Cys Ser Ile Asn Thr Ile Cys Asn Asp Glu Thr His Leu Ser
580 585 590
Thr Leu Pro Ser Thr Thr Ser Ser Leu Glu Asn Leu Gln Cys Asn His
595 600 605
Leu His Tyr Phe Asn Val Gly Thr Phe Lys Pro Thr Ile Asp Ser Lys
610 615 620
Leu Thr Leu Val Asn Thr Ser Pro Asn Ala Asn Ile Ile Ile Asp Lys
625 630 635 640
Ile Glu Phe Ile Pro Val Asp Thr Ala Gln Gln Gln Asn Glu Asp Leu
645 650 655
Glu Ala Ala Lys Lys Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp
660 665 670
Gly Leu Gln Val Asn Val Lys Asp Tyr Gln Val Asp Gln Ala Ala Asn
675 680 685
Leu Val Ser Cys Leu Ser Asp Glu Gln Tyr Gly Tyr Asp Lys Lys Met
690 695 700
Leu Leu Glu Ala Val Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn
705 710 715 720
Leu Leu Gln Asp Pro Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn
725 730 735
Gly Trp Lys Ala Ser Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe
740 745 750
Tyr Lys Gly Arg Ala Ile Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro
755 760 765
Thr Tyr Ile Tyr Gln Lys Val Asp Ala Ser Glu Leu Lys Pro Tyr Thr
770 775 780
Arg Tyr Arg Leu Asp Gly Phe Val Lys Ser Ser Gln Asp Leu Glu Ile
785 790 795 800
Asp Leu Ile His His His Lys Val His Leu Val Lys Asn Val Pro Asp
805 810 815
Asn Leu Val Ser Asp Thr Tyr Pro Asp Asp Ser Cys Ser Gly Ile Asn
820 825 830
Arg Cys Gln Glu Gln Gln Met Val Asn Ala Gln Leu Glu Thr Glu His
835 840 845
His His Pro Met Asp Cys Cys Glu Ala Ala Gln Thr His Glu Phe Ser
850 855 860
Ser Tyr Ile Asp Thr Gly Asp Leu Asn Ser Ser Val Asp Gln Gly Ile
865 870 875 880
Trp Ala Ile Phe Lys Val Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly
885 890 895
Asn Leu Glu Leu Val Glu Val Gly Pro Leu Ser Gly Glu Ser Leu Glu
900 905 910
Arg Glu Gln Arg Asp Asn Thr Lys Trp Ser Ala Glu Leu Gly Arg Lys
915 920 925
Arg Ala Glu Thr Asp Arg Val Tyr Gln Asp Ala Lys Gln Ser Ile Asn
930 935 940
His Leu Phe Val Asp Tyr Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly
945 950 955 960
Met Ala Asp Ile Met Asp Ala Gln Asn Leu Val Ala Ser Ile Ser Asp
965 970 975
Val Tyr Ser Asp Ala Val Leu Gln Ile Pro Gly Ile Asn Tyr Glu Ile
980 985 990
Tyr Thr Glu Leu Ser Asn Arg Leu Gln Gln Ala Ser Tyr Leu Tyr Thr
995 1000 1005
Ser Arg Asn Ala Val Gln Asn Gly Asp Phe Asn Asn Gly Leu Asp
1010 1015 1020
Ser Trp Asn Ala Thr Ala Gly Ala Ser Val Gln Gln Asp Gly Asn
1025 1030 1035
Thr His Phe Leu Val Leu Ser His Trp Asp Ala Gln Val Ser Gln
1040 1045 1050
Gln Phe Arg Val Gln Pro Asn Cys Lys Tyr Val Leu Arg Val Thr
1055 1060 1065
Ala Glu Lys Val Gly Gly Gly Asp Gly Tyr Val Thr Ile Arg Asp
1070 1075 1080
Asp Ala His His Thr Glu Thr Leu Thr Phe Asn Ala Cys Asp Tyr
1085 1090 1095
Asp Ile Asn Gly Thr Tyr Val Thr Asp Asn Thr Tyr Ile Thr Lys
1100 1105 1110
Glu Val Val Phe His Pro Glu Thr Gln His Met Trp Val Glu Val
1115 1120 1125
Asn Glu Thr Glu Gly Ala Phe His Leu Asp Ser Ile Glu Phe Val
1130 1135 1140
Glu Thr Glu Lys
1145
<210> 18
<211> 1157
<212> PRT
<213> 人工序列
<220>
<223> 突变型BT-0051
<400> 18
Met Asn Arg Asn Asn Gln Asn Glu Tyr Glu Ile Ile Asp Ala Pro His
1 5 10 15
Cys Gly Cys Pro Ser Asp Asp Asp Val Lys Tyr Pro Leu Ala Ser Asp
20 25 30
Pro Asn Ala Ala Leu Gln Asn Met Asn Tyr Lys Asp Tyr Leu Gln Met
35 40 45
Thr Asp Glu Asp Tyr Thr Asp Ser Tyr Ile Asn Pro Ser Leu Ser Ile
50 55 60
Ser Gly Arg Asp Ala Val Gln Thr Ala Leu Thr Val Val Gly Arg Ile
65 70 75 80
Leu Gly Ala Leu Gly Val Pro Phe Ser Gly Gln Ile Val Ser Phe Tyr
85 90 95
Gln Phe Leu Leu Asn Thr Leu Trp Pro Val Asn Asp Thr Ala Ile Trp
100 105 110
Glu Ala Phe Met Arg Gln Val Glu Glu Leu Val Asn Gln Gln Ile Thr
115 120 125
Glu Phe Ala Arg Asn Gln Ala Leu Ala Arg Leu Gln Gly Leu Gly Asp
130 135 140
Ser Phe Asn Val Tyr Gln Arg Ser Leu Gln Asn Trp Leu Ala Asp Arg
145 150 155 160
Asn Asp Thr Arg Asn Leu Ser Val Val Arg Ala Gln Phe Ile Ala Leu
165 170 175
Asp Leu Asp Phe Val Asn Ala Ile Pro Leu Phe Ala Val Asn Gly Gln
180 185 190
Gln Val Pro Leu Leu Ser Val Tyr Ala Gln Ala Val Asn Leu His Leu
195 200 205
Leu Leu Leu Lys Asp Ala Ser Leu Phe Gly Glu Gly Trp Gly Phe Thr
210 215 220
Gln Gly Glu Ile Ser Thr His Tyr Asp Arg Gln Leu Glu Leu Thr Ala
225 230 235 240
Arg Tyr Thr Asn Tyr Cys Glu Thr Trp Tyr Asn Thr Gly Leu Asp Arg
245 250 255
Leu Arg Gly Thr Asn Thr Glu Ser Trp Leu Arg Tyr His Gln Phe Arg
260 265 270
Arg Glu Met Thr Leu Val Val Leu Asp Val Val Ala Leu Phe Pro Tyr
275 280 285
Tyr Asp Val Arg Leu Tyr Pro Thr Gly Ser Asn Pro Gln Leu Thr Arg
290 295 300
Glu Val Tyr Thr Asp Pro Ile Val Phe Asn Pro Pro Ala Asn Val Gly
305 310 315 320
Leu Cys Arg Arg Trp Gly Thr Asn Pro Tyr Asn Thr Phe Ser Glu Leu
325 330 335
Glu Asn Ala Phe Ile Arg Pro Pro His Leu Phe Asp Arg Ile Gln Ser
340 345 350
Leu Ser Ile Ser Ser Asn Arg Phe Pro Val Ser Ser Asn Phe Met Asp
355 360 365
Tyr Trp Ser Gly His Thr Leu Arg Arg Ser Tyr Leu Asn Asp Ser Ala
370 375 380
Val Gln Glu Asp Ser Tyr Gly Leu Ile Thr Thr Thr Arg Ala Thr Ile
385 390 395 400
Asn Pro Gly Val Asp Gly Thr Asn Arg Ile Glu Ser Thr Ala Val Asp
405 410 415
Phe Arg Ser Ala Leu Ile Gly Ile Tyr Gly Val Asn Arg Ala Ser Phe
420 425 430
Val Pro Gly Gly Leu Phe Asn Gly Thr Thr Ser Pro Ala Asn Gly Gly
435 440 445
Cys Arg Asp Leu Tyr Asp Thr Asn Asp Glu Leu Pro Pro Asp Glu Ser
450 455 460
Thr Gly Ser Ser Thr His Arg Leu Ser His Val Thr Phe Phe Ser Phe
465 470 475 480
Gln Thr Asn Gln Ala Gly Ser Ile Ala Asn Ala Gly Ser Val Pro Thr
485 490 495
Tyr Val Trp Thr Arg Arg Asp Val Asp Leu Asn Asn Thr Ile Thr Pro
500 505 510
Asn Arg Ile Thr Gln Leu Pro Leu Val Lys Ala Ser Ala Pro Val Ser
515 520 525
Gly Thr Thr Val Leu Lys Gly Pro Gly Phe Thr Gly Gly Gly Ile Leu
530 535 540
Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu Arg Val Thr Val Asn
545 550 555 560
Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Val Arg Phe Ala Ser Thr
565 570 575
Gly Asn Phe Ser Ile Arg Leu Leu Arg Gly Gly Val Ser Ile Gly Asp
580 585 590
Val Arg Leu Gly Ser Thr Met Asn Arg Gly Gln Glu Leu Thr Tyr Glu
595 600 605
Ser Phe Phe Thr Arg Glu Phe Thr Thr Thr Gly Pro Phe Asn Pro Pro
610 615 620
Phe Thr Phe Thr Gln Ala Gln Glu Ile Leu Thr Val Asn Ala Glu Gly
625 630 635 640
Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Arg Ile Glu Ile Val Pro
645 650 655
Val Asn Pro Ala Arg Glu Ala Glu Glu Asp Leu Glu Ala Ala Lys Lys
660 665 670
Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gln Val Asn
675 680 685
Val Thr Asp Tyr Gln Val Asp Arg Ala Ala Asn Leu Val Ser Cys Leu
690 695 700
Ser Asp Glu Gln Tyr Ser His Asp Lys Lys Met Leu Leu Glu Ala Val
705 710 715 720
Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gln Asp Pro
725 730 735
Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser
740 745 750
Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe Phe Lys Gly Arg Ala
755 760 765
Leu Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile Tyr Gln
770 775 780
Lys Val Asp Ala Ser Val Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp
785 790 795 800
Gly Phe Val Lys Ser Ser Gln Asp Leu Glu Ile Asp Leu Ile His His
805 810 815
His Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser Asp
820 825 830
Thr Tyr Ser Asp Gly Ser Cys Ser Gly Ile Asn Arg Cys Asp Glu Gln
835 840 845
Gln Gln Val Asp Met Gln Leu Asp Ala Glu His His Pro Met Asp Cys
850 855 860
Cys Glu Ala Ala Gln Thr His Glu Phe Ser Ser Tyr Ile Asn Thr Gly
865 870 875 880
Asp Leu Asn Ala Ser Val Asp Gln Gly Ile Trp Val Val Leu Lys Val
885 890 895
Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu
900 905 910
Val Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg Asp Asn
915 920 925
Ala Lys Trp Asn Ala Glu Leu Gly Arg Lys Arg Ala Glu Thr Asp Arg
930 935 940
Val Tyr Leu Ala Ala Lys Gln Ala Ile Asn His Leu Phe Val Asp Tyr
945 950 955 960
Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly Leu Ala Glu Ile Asn Glu
965 970 975
Ala Ser Asn Leu Val Lys Ser Ile Ser Gly Val Tyr Ser Asp Thr Leu
980 985 990
Leu Gln Ile Pro Gly Ile Asn Tyr Glu Ile Tyr Thr Glu Leu Ser Asp
995 1000 1005
Arg Leu Gln Gln Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val
1010 1015 1020
Gln Asn Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Ala Thr
1025 1030 1035
Thr Asp Ala Ser Val Gln Gln Asp Gly Ser Thr His Phe Leu Val
1040 1045 1050
Leu Ser His Trp Asp Ala Gln Val Ser Gln Gln Met Arg Val Asn
1055 1060 1065
Leu Asn Cys Lys Tyr Val Leu Arg Val Thr Ala Lys Lys Val Gly
1070 1075 1080
Gly Gly Asp Gly Tyr Val Thr Ile Arg Asp Gly Ala His His Gln
1085 1090 1095
Glu Thr Leu Thr Phe Asn Ala Cys Asp Tyr Asp Val Asn Gly Thr
1100 1105 1110
Tyr Val Asn Asp Asn Ser Tyr Ile Thr Lys Glu Val Val Phe Tyr
1115 1120 1125
Pro Glu Thr Lys His Met Trp Val Glu Val Ser Glu Ser Glu Gly
1130 1135 1140
Ser Phe Tyr Ile Asp Ser Ile Glu Phe Ile Glu Thr Gln Glu
1145 1150 1155
<210> 19
<211> 1173
<212> PRT
<213> 人工序列
<220>
<223> 突变型BT-0068
<400> 19
Met Asn Arg Asn Asn Gln Gly Glu Tyr Glu Ile Ile Asp Ala Ser Thr
1 5 10 15
Cys Gly Cys Ser Ser Asp Asp Val Val Gln Tyr Pro Leu Ala Arg Asp
20 25 30
Pro Asn Ala Ala Phe Gln Asn Met Asn Tyr Lys Asp Tyr Leu Lys Met
35 40 45
Ser Asp Gly Asp Tyr Val Asp Ser Tyr Ile Asn Pro Gly Leu Ser Ile
50 55 60
Gly Arg Arg Asp Val Thr Leu Thr Gly Val Gly Ile Val Ala Leu Ile
65 70 75 80
Val Gly Thr Leu Gly Gly Pro Val Gly Gly Ile Val Thr Gly Leu Ile
85 90 95
Ser Ser Leu Leu Gly Leu Leu Trp Pro Ser Asn Asp Asn Asp Val Trp
100 105 110
Glu Ala Phe Met Ala Gln Ile Glu Glu Leu Ile Glu Gln Arg Ile Ala
115 120 125
Asp Gln Val Val Arg Asn Ala Leu Asp Asn Leu Thr Gly Leu Arg Asp
130 135 140
Tyr Tyr Asn Gln Tyr Leu Leu Ala Leu Glu Glu Trp Gln Glu Arg Pro
145 150 155 160
Asn Ala Val Arg Ser Thr Leu Val Phe Asn Arg Phe Glu Thr Leu His
165 170 175
Ser His Phe Val Thr Ser Met Pro Ser Phe Gly Ser Gly Pro Gly Ser
180 185 190
Glu Arg Tyr Ala Val Gln Leu Leu Thr Val Tyr Ala Gln Ala Ala Asn
195 200 205
Leu His Leu Leu Leu Leu Arg Asp Ala Asp Ile Tyr Gly Ala Arg Trp
210 215 220
Gly Leu Arg Glu Ser Gln Ile Asp Leu Tyr Phe Asn Glu Leu Gln Asn
225 230 235 240
Arg Thr Arg Asp Tyr Thr Asn His Cys Val Thr Ala Tyr Asn Asn Gly
245 250 255
Leu Glu Glu Ile Arg Gly Thr Ser Pro Ala Ser Trp Leu Arg Tyr His
260 265 270
Gln Phe Arg Arg Glu Thr Thr Leu Ile Ala Leu Asp Leu Val Ala Ile
275 280 285
Phe Pro Tyr Tyr Asn Val Arg Glu Tyr Pro Ile Gly Val Asn Pro Gln
290 295 300
Leu Thr Arg Asp Val Tyr Thr Asp Pro Ile Gly Val Thr Phe Arg Arg
305 310 315 320
Glu Asp Trp Glu Thr Gly Val Glu Cys Arg Pro Trp Val Asn Thr Pro
325 330 335
Tyr Met Ser Phe Ser Asp Leu Glu Asn Ala Ile Ile Arg Pro Pro His
340 345 350
Leu Phe Glu Thr Leu Arg Asn Leu Thr Ile His Thr Gly Arg Tyr Asn
355 360 365
Leu Val Gly Gly Ala Arg Phe Ile Glu Gly Trp Val Gly His Ser Val
370 375 380
Thr Asn Thr Arg Leu Gly Asn Ser Thr Val Phe Thr Ser Asn Tyr Gly
385 390 395 400
Ser Leu Pro Pro Arg Phe Gln Val Phe Asn Phe Thr Asn Phe Asp Val
405 410 415
Tyr Gln Ile Asn Thr Arg Ala Asp Ser Thr Gly Thr Phe Arg Ile Pro
420 425 430
Gly Phe Ala Val Thr Arg Ala Gln Phe Ile Pro Gly Gly Thr Tyr Ser
435 440 445
Val Ala His Arg Asp Pro Gly Ala Cys Gln Gln Asp Tyr Asp Ser Ile
450 455 460
Glu Glu Leu Pro Ser Leu Asp Pro Asp Glu Pro Ile Asn Arg Ser Tyr
465 470 475 480
Ser His Arg Leu Ser His Val Thr Leu Tyr Lys Tyr Thr Leu Ser Asp
485 490 495
Thr Asp Tyr Gly Val Ile Asn Tyr Thr Asp Tyr Gly Ser Met Pro Ala
500 505 510
Tyr Val Trp Thr His Arg Asp Val Asp Leu Thr Asn Thr Ile Thr Ala
515 520 525
Asp Arg Ile Thr Gln Leu Pro Leu Val Lys Ala Ser Thr Leu Pro Ala
530 535 540
Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu
545 550 555 560
Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu His Val Arg Val Asn
565 570 575
Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Val Arg Phe Ala Ser Thr
580 585 590
Gly Asn Phe Ser Ile Arg Val Leu Arg Gly Gly Thr Ser Ile Gly Asp
595 600 605
Ala Arg Phe Gly Ser Thr Met Asn Arg Gly Gln Glu Leu Thr Tyr Glu
610 615 620
Ser Phe Val Thr Arg Glu Phe Thr Thr Thr Gly Pro Phe Asn Pro Pro
625 630 635 640
Phe Thr Phe Thr Gln Thr Gln Glu Ile Leu Thr Val Asn Ala Glu Gly
645 650 655
Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Ser Ile Glu Ile Val Pro
660 665 670
Val Asn Pro Thr Arg Glu Ala Glu Glu Asp Leu Glu Ala Ala Lys Lys
675 680 685
Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gln Val Asn
690 695 700
Val Thr Asp Tyr Gln Val Asp Arg Ala Ala Asn Leu Val Leu Cys Leu
705 710 715 720
Ser Asp Glu Gln Tyr Ala His Asp Lys Lys Met Leu Leu Glu Ala Val
725 730 735
Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gln Asp Pro
740 745 750
Asp Phe Asn Glu Ile Asn Ser Thr Glu Asp Ser Gly Trp Lys Thr Ser
755 760 765
Asn Gly Ile Ile Ile Ser Glu Gly Gly Pro Phe Phe Lys Gly Arg Ala
770 775 780
Leu Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile Tyr Gln
785 790 795 800
Lys Val Asp Ser Ser Met Leu Lys Pro Tyr Thr Arg Tyr Lys Leu Asp
805 810 815
Gly Phe Val Gln Ser Ser Gln Asp Leu Glu Ile Glu Leu Ile His His
820 825 830
His Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Leu Asp
835 840 845
Thr Tyr Pro Asp Gly Ser Cys Asn Gly Ile Asn Arg Cys Glu Glu Gln
850 855 860
Gln Met Val Asn Ser Gln Leu Glu Thr Glu His His Pro Met Asp Cys
865 870 875 880
Cys Glu Ala Ser Gln Thr His Glu Phe Ser Ser Tyr Ile His Thr Gly
885 890 895
Asp Leu Asn Ala Ser Val Asp Gln Gly Ile Trp Val Val Leu Lys Ile
900 905 910
Arg Thr Thr Asp Gly Ser Ala Thr Leu Gly Asn Leu Glu Leu Val Glu
915 920 925
Val Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg Asp Asn
930 935 940
Ala Lys Trp Asn Ala Glu Leu Gly Arg Lys Arg Ala Glu Ala Asp Arg
945 950 955 960
Val Tyr Gln Gly Ala Lys Gln Ala Ile Asn His Leu Phe Val Asp Tyr
965 970 975
Gln Asp Gln Gln Leu Asn Pro Glu Val Gly Leu Ala Glu Ile Ser Glu
980 985 990
Ala Arg Asn Leu Ile Glu Ser Ile Ser Asp Val Tyr Cys Asp Ala Val
995 1000 1005
Leu Arg Ile Pro Gly Ile Asn Tyr Glu Met Tyr Thr Glu Leu Ser
1010 1015 1020
Asn Arg Leu Gln Gln Ala Ala Tyr Leu Tyr Thr Ser Arg Asn Ala
1025 1030 1035
Val Gln Asn Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Ala
1040 1045 1050
Thr Thr Asp Ala Thr Val Gln Gln Asp Gly Asn Met Tyr Phe Leu
1055 1060 1065
Val Leu Ser His Trp Asp Ala Gln Val Ser Gln Gln Phe Arg Val
1070 1075 1080
Gln Pro Asn Cys Lys Tyr Val Leu Arg Val Thr Ala Lys Lys Val
1085 1090 1095
Gly Asn Gly Asp Gly Tyr Val Thr Ile Gln Asp Gly Ala His His
1100 1105 1110
Arg Glu Thr Leu Thr Phe Asn Ala Cys Asp Tyr Asp Val Asn Gly
1115 1120 1125
Thr His Val Asn Asp Asn Ser Tyr Ile Thr Lys Glu Leu Glu Phe
1130 1135 1140
Tyr Pro Lys Thr Glu His Met Trp Val Glu Val Ser Glu Thr Glu
1145 1150 1155
Gly Thr Phe Tyr Ile Asp Ser Ile Glu Leu Ile Glu Thr Gln Glu
1160 1165 1170
<210> 20
<211> 1179
<212> PRT
<213> 人工序列
<220>
<223> 突变型BT-0128
<400> 20
Met Gly Gly Lys Ser Met Asn Arg Asn Asn Gln Gly Glu Tyr Glu Ile
1 5 10 15
Ile Asp Ala Ser Thr Cys Gly Cys Ser Ser Asp Asp Val Val Gln Tyr
20 25 30
Pro Leu Ala Arg Asp Pro Asn Ala Ala Phe Gln Asn Met Asn Tyr Lys
35 40 45
Asp Tyr Leu Lys Met Ser Asp Gly Asp Tyr Val Asp Ser Tyr Ile Asn
50 55 60
Pro Gly Leu Ser Ile Gly Arg Arg Asp Val Thr Leu Thr Gly Val Gly
65 70 75 80
Ile Val Ala Leu Ile Val Gly Thr Leu Gly Gly Pro Val Gly Gly Ile
85 90 95
Val Thr Gly Leu Ile Ser Ser Leu Leu Gly Leu Leu Trp Pro Ser Asn
100 105 110
Asp Asn Asp Val Trp Glu Ala Phe Met Ala Gln Ile Glu Glu Leu Ile
115 120 125
Glu Gln Arg Ile Ala Asp Gln Val Val Arg Asn Ala Leu Asp Asn Leu
130 135 140
Thr Gly Leu Arg Asp Tyr Tyr Asn Gln Tyr Leu Leu Ala Leu Glu Glu
145 150 155 160
Trp Gln Glu Arg Pro Asn Ala Val Arg Ser Thr Leu Val Phe Asn Arg
165 170 175
Phe Glu Thr Leu His Ser His Phe Val Thr Ser Met Pro Ser Phe Gly
180 185 190
Ser Gly Pro Gly Ser Glu Arg Tyr Ala Val Gln Leu Leu Thr Val Tyr
195 200 205
Ala Gln Ala Ala Asn Leu His Leu Leu Leu Leu Arg Asp Ala Asp Ile
210 215 220
Tyr Gly Ala Arg Trp Gly Leu Arg Glu Ser Gln Ile Asp Leu Tyr Phe
225 230 235 240
Asn Glu Leu Gln Asn Arg Thr Arg Asp Tyr Thr Asn His Cys Val Thr
245 250 255
Ala Tyr Asn Asn Gly Leu Glu Glu Ile Arg Gly Thr Ser Pro Ala Ser
260 265 270
Trp Leu Arg Tyr His Gln Phe Arg Arg Glu Thr Thr Leu Ile Ala Leu
275 280 285
Asp Leu Val Ala Ile Phe Pro Tyr Tyr Asn Val Arg Glu Tyr Pro Ile
290 295 300
Gly Val Asn Pro Gln Leu Thr Arg Asp Val Tyr Thr Asp Pro Ile Gly
305 310 315 320
Val Thr Phe Arg Arg Glu Asp Trp Glu Thr Gly Val Glu Cys Arg Pro
325 330 335
Trp Val Asn Thr Pro Tyr Met Ser Phe Ser Asp Leu Glu Asn Ala Ile
340 345 350
Ile Arg Pro Pro His Leu Phe Glu Thr Leu Arg Asn Leu Thr Ile His
355 360 365
Thr Gly Arg Tyr Asn Leu Val Gly Gly Ala Arg Phe Ile Glu Gly Trp
370 375 380
Val Gly His Ser Val Thr Asn Thr Arg Leu Gly Asn Ser Thr Val Phe
385 390 395 400
Thr Ser Asn Tyr Gly Ser Leu Pro Pro Arg Phe Gln Val Phe Asn Phe
405 410 415
Thr Asn Phe Asp Val Tyr Gln Ile Asn Thr Arg Ala Asp Ser Thr Gly
420 425 430
Thr Phe Arg Ile Pro Gly Phe Ala Val Thr Arg Ala Gln Phe Ile Pro
435 440 445
Gly Gly Thr Tyr Ser Val Ala His Arg Asp Pro Gly Ala Cys Gln Gln
450 455 460
Asp Tyr Asp Ser Ile Glu Glu Leu Pro Ser Leu Asp Pro Asp Glu Pro
465 470 475 480
Ile Asn Arg Ser Tyr Ser His Arg Leu Ser His Val Thr Leu Tyr Lys
485 490 495
Tyr Thr Leu Ser Asp Thr Asp Tyr Gly Val Ile Asn Tyr Thr Asp Tyr
500 505 510
Gly Ser Met Pro Ala Tyr Val Trp Thr His Arg Asp Val Asp Leu Thr
515 520 525
Asn Thr Ile Thr Ala Asp Arg Ile Thr Gln Leu Pro Leu Val Lys Ala
530 535 540
Ser Thr Leu Pro Ala Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr
545 550 555 560
Gly Gly Asp Ile Leu Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu
565 570 575
His Val Arg Val Asn Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Val
580 585 590
Arg Phe Ala Ser Thr Gly Asn Phe Ser Ile Arg Val Leu Arg Gly Gly
595 600 605
Thr Ser Ile Gly Asp Ala Arg Phe Gly Ser Thr Met Asn Arg Gly Gln
610 615 620
Glu Leu Thr Tyr Glu Ser Phe Val Thr Arg Glu Phe Thr Thr Thr Gly
625 630 635 640
Pro Phe Asn Pro Pro Phe Thr Phe Thr Gln Thr Gln Glu Ile Leu Thr
645 650 655
Val Asn Ala Glu Gly Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Ser
660 665 670
Ile Glu Ile Val Pro Val Asn Pro Thr Arg Glu Ala Glu Glu Asp Leu
675 680 685
Glu Ala Ala Lys Lys Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp
690 695 700
Gly Leu Gln Val Asn Val Thr Asp Tyr Gln Val Asp Gln Ala Ala Asn
705 710 715 720
Leu Val Ser Cys Leu Ser Asp Glu Gln Tyr Gly Tyr Asp Lys Lys Met
725 730 735
Leu Leu Glu Ala Val Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn
740 745 750
Leu Leu Gln Asp Pro Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn
755 760 765
Gly Trp Lys Ala Ser Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe
770 775 780
Tyr Lys Gly Arg Ala Leu Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro
785 790 795 800
Thr Tyr Ile Tyr Gln Lys Val Asp Ala Ser Glu Leu Lys Pro Tyr Thr
805 810 815
Arg Tyr Arg Leu Asp Gly Phe Val Lys Ser Ser Gln Asp Leu Glu Ile
820 825 830
Asp Leu Ile His His His Lys Val His Leu Val Lys Asn Val Pro Asp
835 840 845
Asn Leu Val Ser Asp Thr Tyr Pro Asp Asp Ser Cys Ser Gly Ile Asn
850 855 860
Arg Cys Gln Glu Gln Gln Met Val Asn Ala Gln Leu Glu Thr Glu His
865 870 875 880
His His Pro Met Asp Cys Cys Glu Ala Ala Gln Thr His Glu Phe Ser
885 890 895
Ser Tyr Ile Asp Thr Gly Asp Leu Asn Ser Ser Val Asp Gln Gly Ile
900 905 910
Trp Ala Ile Phe Lys Val Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly
915 920 925
Asn Leu Glu Leu Val Glu Val Gly Pro Leu Ser Gly Glu Ser Leu Glu
930 935 940
Arg Glu Gln Arg Asp Asn Thr Lys Trp Ser Ala Glu Leu Gly Arg Lys
945 950 955 960
Arg Ala Glu Thr Asp Arg Val Tyr Gln Asp Ala Lys Gln Ser Ile Asn
965 970 975
His Leu Phe Val Asp Tyr Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly
980 985 990
Met Ala Asp Ile Met Asp Ala Gln Asn Leu Val Ala Ser Ile Ser Asp
995 1000 1005
Val Tyr Ser Asp Ala Val Leu Gln Ile Pro Gly Ile Asn Tyr Glu
1010 1015 1020
Ile Tyr Thr Glu Leu Ser Asn Arg Leu Gln Gln Ala Ser Tyr Leu
1025 1030 1035
Tyr Thr Ser Arg Asn Ala Val Gln Asn Gly Asp Phe Asn Asn Gly
1040 1045 1050
Leu Asp Ser Trp Asn Ala Thr Ala Gly Ala Ser Val Gln Gln Asp
1055 1060 1065
Gly Asn Thr His Phe Leu Val Leu Ser His Trp Asp Ala Gln Val
1070 1075 1080
Ser Gln Gln Phe Arg Val Gln Pro Asn Cys Lys Tyr Val Leu Arg
1085 1090 1095
Val Thr Ala Glu Lys Val Gly Gly Gly Asp Gly Tyr Val Thr Ile
1100 1105 1110
Arg Asp Gly Ala His His Thr Glu Thr Leu Thr Phe Asn Ala Cys
1115 1120 1125
Asp Tyr Asp Ile Asn Gly Thr Tyr Val Thr Asp Asn Thr Tyr Leu
1130 1135 1140
Thr Lys Glu Val Ile Phe Tyr Ser His Thr Glu His Met Trp Val
1145 1150 1155
Glu Val Asn Glu Thr Glu Gly Ala Phe His Leu Asp Ser Leu Glu
1160 1165 1170
Phe Val Glu Thr Glu Lys
1175
<210> 21
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> OAR2613a正向引物
<400> 21
aaacatgaac cgaaataatc aaaatg 26
<210> 22
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> OAR2615a反向引物
<400> 22
atccgtccct tgtgcgtgta aa 22
<210> 23
<211> 30
<212> DNA
<213> 人工序列
<220>
<223> OAR2611a-F正向引物
<400> 23
gtttaaacat gaatcgaaat aatcaaaatg 30
<210> 24
<211> 29
<212> DNA
<213> 人工序列
<220>
<223> OAR2612a-R反向引物
<400> 24
ggcgcgccct actcttgtgt ttcaataaa 29
<210> 25
<211> 29
<212> DNA
<213> 人工序列
<220>
<223> OAR2768-F正向引物
<400> 25
gtttaaacat gaatcaaaat aaacacgga 29
<210> 26
<211> 31
<212> DNA
<213> 人工序列
<220>
<223> OAR2769-R反向引物
<400> 26
ggcgcgcctt actgttgggt ttccatgaac t 31
Claims (6)
1.一种嵌合基因,该嵌合基因包含可操作地连接至核酸分子的异源启动子,该核酸分子由编码对至少小地老虎有毒的蛋白质的核苷酸序列组成,其中该核苷酸序列是合成序列,该合成序列已经进行密码子优化用于在转基因生物中表达,并且编码SEQ ID NO:18或SEQ ID NO:14的蛋白质。
2.如权利要求1所述的嵌合基因,其中该核苷酸序列编码SEQ ID NO:18的蛋白质。
3.如权利要求1所述的嵌合基因,其中该转基因生物是细菌或植物。
4.一种重组载体,该重组载体包含如权利要求1所述的嵌合基因。
5.一种转基因细菌细胞,该转基因细菌细胞包含如权利要求1所述的嵌合基因或如权利要求4所述的重组载体。
6.一种产生抗昆虫转基因植物的方法,该方法包括:向植物中引入如权利要求1所述的嵌合基因,由此赋予该植物对至少小地老虎的抗性,并且产生抗昆虫转基因植物。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110747274.7A CN113736809B (zh) | 2014-12-12 | 2015-12-03 | 用于控制植物有害生物的组合物和方法 |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462090899P | 2014-12-12 | 2014-12-12 | |
US62/090,899 | 2014-12-12 | ||
PCT/US2015/063610 WO2016094159A1 (en) | 2014-12-12 | 2015-12-03 | Compositions and methods for controlling plant pests |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110747274.7A Division CN113736809B (zh) | 2014-12-12 | 2015-12-03 | 用于控制植物有害生物的组合物和方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107109418A CN107109418A (zh) | 2017-08-29 |
CN107109418B true CN107109418B (zh) | 2021-07-16 |
Family
ID=56107966
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110747274.7A Active CN113736809B (zh) | 2014-12-12 | 2015-12-03 | 用于控制植物有害生物的组合物和方法 |
CN201580067474.0A Active CN107109418B (zh) | 2014-12-12 | 2015-12-03 | 用于控制植物有害生物的组合物和方法 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110747274.7A Active CN113736809B (zh) | 2014-12-12 | 2015-12-03 | 用于控制植物有害生物的组合物和方法 |
Country Status (12)
Country | Link |
---|---|
US (4) | US10407693B2 (zh) |
EP (1) | EP3230456B1 (zh) |
CN (2) | CN113736809B (zh) |
AR (1) | AR102882A1 (zh) |
BR (2) | BR112017012495A2 (zh) |
CA (1) | CA2969667A1 (zh) |
MX (2) | MX2017007602A (zh) |
PH (1) | PH12017501018A1 (zh) |
PT (1) | PT3230456T (zh) |
RU (1) | RU2745322C2 (zh) |
UA (1) | UA124758C2 (zh) |
WO (1) | WO2016094159A1 (zh) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016094165A1 (en) | 2014-12-12 | 2016-06-16 | Syngenta Participations Ag | Compositions and methods for controlling plant pests |
CA3043493A1 (en) * | 2016-11-23 | 2018-05-31 | BASF Agricultural Solutions Seed US LLC | Axmi669 and axmi991 toxin genes and methods for their use |
RU2663347C1 (ru) * | 2017-07-12 | 2018-08-03 | Общество с ограниченной ответственностью "Дока-Генные Технологии" | Способ доставки биологически активных макромолекул в клетки растений |
AU2020226376A1 (en) * | 2019-02-20 | 2021-08-05 | Syngenta Crop Protection Ag | Engineered pesticidal proteins and methods of controlling plant pests |
CN109776659B (zh) * | 2019-03-14 | 2021-01-29 | 中国农业科学院生物技术研究所 | cry2Ah-vp基因在抗黏虫中的应用 |
EP4107171A4 (en) * | 2020-02-21 | 2024-03-13 | Basf Agricultural Solutions Seed Us Llc | TOXIN GENE AND METHODS OF USE |
WO2022155876A1 (en) * | 2021-01-22 | 2022-07-28 | Syngenta Biotechnology China Co., Ltd. | Control of noctuid, crambid, and pyralid pests |
CN116768990B (zh) * | 2023-08-16 | 2023-11-07 | 莱肯生物科技(海南)有限公司 | 一种人工智能辅助生成的杀虫蛋白 |
CN117304266B (zh) * | 2023-11-29 | 2024-01-30 | 深圳市维琪科技股份有限公司 | 一种皮肤处理多肽、组合物及其应用 |
CN118325926B (zh) * | 2024-06-14 | 2024-10-11 | 江苏省农业科学院 | 二化螟唾液蛋白二硫键异构酶在提高烟草对草地贪夜蛾和蚜虫抗性中的应用及方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1994005771A2 (en) * | 1992-08-27 | 1994-03-17 | Plant Genetic Systems N.V. | New bacillus thuringiensis strains and their insecticidal proteins |
WO1998000546A2 (en) * | 1996-07-01 | 1998-01-08 | Mycogen Corporation | Bacillus thuringiensis toxins active against noctuidae pests |
WO1999033991A3 (en) * | 1997-12-31 | 1999-09-10 | Mycogen Corp | TOXINS ACTIVE AGAINST $i(OSTRINIA NUBILALIS) |
CN1390259A (zh) * | 1999-09-15 | 2003-01-08 | 孟山都技术有限公司 | 对鳞翅目昆虫有活性的苏云金芽孢杆菌δ内毒素组合物及其使用方法 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US788471A (en) | 1904-07-06 | 1905-04-25 | W H Flowers | Bale-tie buckle. |
US5187091A (en) * | 1990-03-20 | 1993-02-16 | Ecogen Inc. | Bacillus thuringiensis cryiiic gene encoding toxic to coleopteran insects |
WO1994024264A1 (en) * | 1993-04-09 | 1994-10-27 | Plant Genetic Systems N.V. | New bacillus thuringiensis strains and their insecticidal proteins |
AU7518100A (en) * | 1999-09-17 | 2001-04-24 | Aventis Cropscience N.V. | Insect-resistant rice plants |
WO2001087940A2 (en) * | 2000-05-15 | 2001-11-22 | Monsanto Technology Llc | Polypeptide composionns toxic to anthonomus insects, and use thereof |
US7629504B2 (en) * | 2003-12-22 | 2009-12-08 | Pioneer Hi-Bred International, Inc. | Bacillus thuringiensis cry9 nucleic acids |
KR101156893B1 (ko) * | 2005-08-31 | 2012-06-21 | 몬산토 테크놀로지 엘엘씨 | 살충 단백질을 암호화하는 뉴클레오티드 서열들 |
AU2009262153B2 (en) * | 2008-06-25 | 2014-02-27 | BASF Agricultural Solutions Seed US LLC | Toxin genes and methods for their use |
WO2012038480A2 (en) | 2010-09-22 | 2012-03-29 | Bayer Cropscience Ag | Use of biological or chemical control agents for controlling insects and nematodes in resistant crops |
WO2013134734A2 (en) | 2012-03-09 | 2013-09-12 | Vestaron Corporation | Toxic peptide production, peptide expression in plants and combinations of cysteine rich peptides |
KR20160094985A (ko) * | 2013-12-09 | 2016-08-10 | 애쓰닉스 코포레이션 | 바실루스 투린기엔시스 유래의 axmi477, axmi482, axmi486 및 axmi525 독소 유전자 및 그의 사용 방법 |
-
2015
- 2015-12-03 EP EP15867122.2A patent/EP3230456B1/en active Active
- 2015-12-03 WO PCT/US2015/063610 patent/WO2016094159A1/en active Application Filing
- 2015-12-03 AR ARP150103944A patent/AR102882A1/es unknown
- 2015-12-03 CN CN202110747274.7A patent/CN113736809B/zh active Active
- 2015-12-03 BR BR112017012495-5A patent/BR112017012495A2/pt not_active Application Discontinuation
- 2015-12-03 US US15/534,074 patent/US10407693B2/en active Active
- 2015-12-03 RU RU2017124614A patent/RU2745322C2/ru active
- 2015-12-03 CA CA2969667A patent/CA2969667A1/en active Pending
- 2015-12-03 MX MX2017007602A patent/MX2017007602A/es unknown
- 2015-12-03 BR BR122018075074-0A patent/BR122018075074B1/pt not_active IP Right Cessation
- 2015-12-03 CN CN201580067474.0A patent/CN107109418B/zh active Active
- 2015-12-03 PT PT158671222T patent/PT3230456T/pt unknown
- 2015-12-03 UA UAA201707199A patent/UA124758C2/uk unknown
-
2017
- 2017-06-01 PH PH12017501018A patent/PH12017501018A1/en unknown
- 2017-06-09 MX MX2023000046A patent/MX2023000046A/es unknown
-
2019
- 2019-07-29 US US16/524,873 patent/US10612039B2/en active Active
-
2020
- 2020-02-25 US US16/799,912 patent/US11261459B2/en active Active
-
2022
- 2022-01-19 US US17/578,620 patent/US11680272B2/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1994005771A2 (en) * | 1992-08-27 | 1994-03-17 | Plant Genetic Systems N.V. | New bacillus thuringiensis strains and their insecticidal proteins |
WO1998000546A2 (en) * | 1996-07-01 | 1998-01-08 | Mycogen Corporation | Bacillus thuringiensis toxins active against noctuidae pests |
WO1999033991A3 (en) * | 1997-12-31 | 1999-09-10 | Mycogen Corp | TOXINS ACTIVE AGAINST $i(OSTRINIA NUBILALIS) |
CN1390259A (zh) * | 1999-09-15 | 2003-01-08 | 孟山都技术有限公司 | 对鳞翅目昆虫有活性的苏云金芽孢杆菌δ内毒素组合物及其使用方法 |
Non-Patent Citations (2)
Title |
---|
苏云金芽胞杆菌新型cry9基因克隆、表达及活性分析;苏慧琴;《中国优秀硕士学位论文全文数据库》;20120315(第03期);D046-92 * |
苏云金芽胞杆菌杀虫基因的鉴定与克隆;谢月霞;《中国优秀硕士学位论文全文数据库》;20091015(第10期);D046-121 * |
Also Published As
Publication number | Publication date |
---|---|
WO2016094159A1 (en) | 2016-06-16 |
US20200181641A1 (en) | 2020-06-11 |
US20190345514A1 (en) | 2019-11-14 |
US20220135999A1 (en) | 2022-05-05 |
RU2017124614A3 (zh) | 2019-06-25 |
US20170335340A1 (en) | 2017-11-23 |
RU2017124614A (ru) | 2019-01-15 |
US11680272B2 (en) | 2023-06-20 |
PT3230456T (pt) | 2024-08-02 |
US10612039B2 (en) | 2020-04-07 |
CN107109418A (zh) | 2017-08-29 |
BR122018075074B1 (pt) | 2022-08-09 |
US11261459B2 (en) | 2022-03-01 |
PH12017501018A1 (en) | 2017-12-11 |
BR112017012495A2 (pt) | 2018-04-10 |
CN113736809B (zh) | 2024-07-19 |
CN113736809A (zh) | 2021-12-03 |
RU2745322C2 (ru) | 2021-03-23 |
MX2017007602A (es) | 2017-10-19 |
EP3230456A4 (en) | 2018-10-17 |
UA124758C2 (uk) | 2021-11-17 |
EP3230456B1 (en) | 2024-05-22 |
US10407693B2 (en) | 2019-09-10 |
MX2023000046A (es) | 2023-02-01 |
AR102882A1 (es) | 2017-03-29 |
EP3230456A1 (en) | 2017-10-18 |
CA2969667A1 (en) | 2016-06-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11578105B2 (en) | Compositions and methods for controlling plant pests | |
CN107109418B (zh) | 用于控制植物有害生物的组合物和方法 | |
CN107849098B (zh) | 用于控制植物有害生物的组合物和方法 | |
WO2017146899A1 (en) | Compositions and methods for controlling plant pests | |
US11060105B2 (en) | Compositions and methods for controlling plant pests | |
CN113302199A (zh) | 用于控制昆虫有害生物的组合物和方法 | |
CN117024535A (zh) | 用于控制植物有害生物的组合物和方法 | |
CN111148837A (zh) | 用于控制植物有害生物的组合物和方法 | |
US20220322680A1 (en) | Compositions and methods for controlling plant pests |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |