CN112175093B - 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 - Google Patents
对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 Download PDFInfo
- Publication number
- CN112175093B CN112175093B CN202011054128.8A CN202011054128A CN112175093B CN 112175093 B CN112175093 B CN 112175093B CN 202011054128 A CN202011054128 A CN 202011054128A CN 112175093 B CN112175093 B CN 112175093B
- Authority
- CN
- China
- Prior art keywords
- leu
- thr
- glu
- ser
- asn
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 290
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 282
- 230000000749 insecticidal effect Effects 0.000 title claims abstract description 245
- 230000002401 inhibitory effect Effects 0.000 title claims abstract description 50
- 241000607479 Yersinia pestis Species 0.000 title claims description 81
- 231100000331 toxic Toxicity 0.000 title description 8
- 230000002588 toxic effect Effects 0.000 title description 8
- 239000000203 mixture Substances 0.000 claims abstract description 28
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 17
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 13
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 13
- 241000196324 Embryophyta Species 0.000 claims description 204
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 122
- 241000238631 Hexapoda Species 0.000 claims description 101
- 230000000694 effects Effects 0.000 claims description 58
- 102000040430 polynucleotide Human genes 0.000 claims description 40
- 108091033319 polynucleotide Proteins 0.000 claims description 40
- 239000002157 polynucleotide Substances 0.000 claims description 40
- 230000009261 transgenic effect Effects 0.000 claims description 40
- 230000001580 bacterial effect Effects 0.000 claims description 36
- 238000000034 method Methods 0.000 claims description 20
- 239000012634 fragment Substances 0.000 claims description 17
- 241000258937 Hemiptera Species 0.000 claims description 11
- 239000003112 inhibitor Substances 0.000 claims description 11
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 9
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 8
- 241000589158 Agrobacterium Species 0.000 claims description 7
- 239000000126 substance Substances 0.000 claims description 6
- 241000588722 Escherichia Species 0.000 claims description 4
- 241000255777 Lepidoptera Species 0.000 claims description 3
- 241001465754 Metazoa Species 0.000 claims description 3
- 241000589516 Pseudomonas Species 0.000 claims description 3
- 241000589180 Rhizobium Species 0.000 claims description 3
- 235000013312 flour Nutrition 0.000 claims description 3
- 235000012054 meals Nutrition 0.000 claims description 3
- 241000193764 Brevibacillus brevis Species 0.000 claims description 2
- 241000254173 Coleoptera Species 0.000 claims description 2
- 241000588698 Erwinia Species 0.000 claims description 2
- 108010060231 Insect Proteins Proteins 0.000 claims description 2
- 241000588748 Klebsiella Species 0.000 claims description 2
- 241000209510 Liliopsida Species 0.000 claims description 2
- 241001414989 Thysanoptera Species 0.000 claims description 2
- 241001233957 eudicotyledons Species 0.000 claims description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 79
- 210000004027 cell Anatomy 0.000 description 133
- 230000014509 gene expression Effects 0.000 description 118
- 108010077245 asparaginyl-proline Proteins 0.000 description 41
- 241000255967 Helicoverpa zea Species 0.000 description 40
- 239000002773 nucleotide Substances 0.000 description 38
- 125000003729 nucleotide group Chemical group 0.000 description 38
- 241000880493 Leptailurus serval Species 0.000 description 35
- 108020004414 DNA Proteins 0.000 description 32
- 108700026244 Open Reading Frames Proteins 0.000 description 31
- 108010093581 aspartyl-proline Proteins 0.000 description 30
- 108010038633 aspartylglutamate Proteins 0.000 description 30
- 244000068988 Glycine max Species 0.000 description 29
- 108010050848 glycylleucine Proteins 0.000 description 28
- 108010051242 phenylalanylserine Proteins 0.000 description 27
- 108700012359 toxins Proteins 0.000 description 27
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 26
- 241000256247 Spodoptera exigua Species 0.000 description 25
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 24
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 23
- 238000006467 substitution reaction Methods 0.000 description 23
- 108010061238 threonyl-glycine Proteins 0.000 description 23
- 241001477931 Mythimna unipuncta Species 0.000 description 22
- 108010044940 alanylglutamine Proteins 0.000 description 22
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 22
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 21
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 21
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 21
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 20
- 240000008042 Zea mays Species 0.000 description 20
- 108010005233 alanylglutamic acid Proteins 0.000 description 20
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 19
- 229920000742 Cotton Polymers 0.000 description 19
- 244000299507 Gossypium hirsutum Species 0.000 description 19
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 19
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 19
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 19
- 108020001507 fusion proteins Proteins 0.000 description 19
- 102000037865 fusion proteins Human genes 0.000 description 19
- 108090000765 processed proteins & peptides Proteins 0.000 description 19
- 239000003053 toxin Substances 0.000 description 19
- 231100000765 toxin Toxicity 0.000 description 19
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 18
- 108091026890 Coding region Proteins 0.000 description 18
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 18
- 235000010469 Glycine max Nutrition 0.000 description 18
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 18
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 18
- 235000005822 corn Nutrition 0.000 description 18
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 18
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 17
- 108020004511 Recombinant DNA Proteins 0.000 description 17
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 17
- 108010047495 alanylglycine Proteins 0.000 description 17
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 17
- 108010049041 glutamylalanine Proteins 0.000 description 17
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 17
- 108010009298 lysylglutamic acid Proteins 0.000 description 17
- 108010029020 prolylglycine Proteins 0.000 description 17
- 210000001519 tissue Anatomy 0.000 description 17
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 16
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 16
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 16
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 16
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 16
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 16
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 16
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 16
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 16
- 238000004166 bioassay Methods 0.000 description 16
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 16
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 15
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 15
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 15
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 15
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 15
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 15
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 15
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 15
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 15
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 15
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 15
- 241000255901 Tortricidae Species 0.000 description 15
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 15
- 239000000047 product Substances 0.000 description 15
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 14
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 14
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 14
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 14
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 14
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 14
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 14
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 14
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 14
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 14
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 14
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 14
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 14
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 14
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 14
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 14
- MSHXWFKYXJTLEZ-CIUDSAMLSA-N Gln-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MSHXWFKYXJTLEZ-CIUDSAMLSA-N 0.000 description 14
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 14
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 14
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 14
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 14
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 14
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 14
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 14
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 14
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 14
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 14
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 14
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 14
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 14
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 14
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 14
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 14
- 108010079005 RDV peptide Proteins 0.000 description 14
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 14
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 14
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 14
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 14
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 14
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 14
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 14
- 241000256251 Spodoptera frugiperda Species 0.000 description 14
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 14
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 14
- VBPDMBAFBRDZSK-HOUAVDHOSA-N Thr-Asn-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VBPDMBAFBRDZSK-HOUAVDHOSA-N 0.000 description 14
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 14
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 14
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 14
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 14
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 14
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 14
- WCTYCXZYBNKEIV-SXNHZJKMSA-N Trp-Glu-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 WCTYCXZYBNKEIV-SXNHZJKMSA-N 0.000 description 14
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 14
- SGQSAIFDESQBRA-IHPCNDPISA-N Trp-Tyr-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SGQSAIFDESQBRA-IHPCNDPISA-N 0.000 description 14
- XQMGDVVKFRLQKH-BBRMVZONSA-N Trp-Val-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O)=CNC2=C1 XQMGDVVKFRLQKH-BBRMVZONSA-N 0.000 description 14
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 14
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 14
- 150000001413 amino acids Chemical class 0.000 description 14
- 108010013835 arginine glutamate Proteins 0.000 description 14
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 14
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 14
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 14
- 108010025306 histidylleucine Proteins 0.000 description 14
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 14
- 108010018625 phenylalanylarginine Proteins 0.000 description 14
- 231100000167 toxic agent Toxicity 0.000 description 14
- 239000003440 toxic substance Substances 0.000 description 14
- 230000009466 transformation Effects 0.000 description 14
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 13
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 13
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 13
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 13
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 13
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 13
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 13
- ORXCYAFUCSTQGY-FXQIFTODSA-N Asn-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N ORXCYAFUCSTQGY-FXQIFTODSA-N 0.000 description 13
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 13
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 13
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 13
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 13
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 13
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 13
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 13
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 13
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 13
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 13
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 13
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 13
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 13
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 13
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 13
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 13
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 13
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 13
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 13
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 13
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 13
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 13
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 13
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 13
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 13
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 13
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 13
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 13
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 13
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 13
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 13
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 13
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 13
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 13
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 13
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 13
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 13
- PNKDNKGMEHJTJQ-BPUTZDHNSA-N Trp-Arg-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PNKDNKGMEHJTJQ-BPUTZDHNSA-N 0.000 description 13
- UHXOYRWHIQZAKV-SZMVWBNQSA-N Trp-Pro-Arg Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UHXOYRWHIQZAKV-SZMVWBNQSA-N 0.000 description 13
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 13
- XOVDRAVPGHTYLP-JYJNAYRXSA-N Tyr-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O XOVDRAVPGHTYLP-JYJNAYRXSA-N 0.000 description 13
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 13
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 13
- 108010078144 glutaminyl-glycine Proteins 0.000 description 13
- 108010024607 phenylalanylalanine Proteins 0.000 description 13
- 108010070643 prolylglutamic acid Proteins 0.000 description 13
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 13
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 13
- 108010073969 valyllysine Proteins 0.000 description 13
- 239000013598 vector Substances 0.000 description 13
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 12
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 12
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 12
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 12
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 12
- 230000001276 controlling effect Effects 0.000 description 12
- 108010087823 glycyltyrosine Proteins 0.000 description 12
- 108010012581 phenylalanylglutamate Proteins 0.000 description 12
- 108010076441 Ala-His-His Proteins 0.000 description 11
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 11
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 11
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 11
- 241000346285 Ostrinia furnacalis Species 0.000 description 11
- 241000500437 Plutella xylostella Species 0.000 description 11
- 108010047857 aspartylglycine Proteins 0.000 description 11
- 102000004196 processed proteins & peptides Human genes 0.000 description 11
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 10
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 10
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 10
- 241000193830 Bacillus <bacterium> Species 0.000 description 10
- 241000894006 Bacteria Species 0.000 description 10
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 10
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 10
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 10
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 10
- 210000003763 chloroplast Anatomy 0.000 description 10
- 241000894007 species Species 0.000 description 10
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 9
- 101150102464 Cry1 gene Proteins 0.000 description 9
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 9
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 9
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 9
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 9
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 9
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 9
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 9
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 9
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 9
- 108010087924 alanylproline Proteins 0.000 description 9
- 108010092854 aspartyllysine Proteins 0.000 description 9
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 9
- 108010089804 glycyl-threonine Proteins 0.000 description 9
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 9
- 108010081551 glycylphenylalanine Proteins 0.000 description 9
- 230000008685 targeting Effects 0.000 description 9
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 8
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 8
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 8
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 8
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 8
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 8
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 8
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 8
- 241000426497 Chilo suppressalis Species 0.000 description 8
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 8
- 244000020551 Helianthus annuus Species 0.000 description 8
- 235000003222 Helianthus annuus Nutrition 0.000 description 8
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 8
- 240000007594 Oryza sativa Species 0.000 description 8
- 235000007164 Oryza sativa Nutrition 0.000 description 8
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 8
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 8
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 8
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 8
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 8
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 8
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 8
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 8
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 8
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 8
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 8
- 238000009472 formulation Methods 0.000 description 8
- 108010010147 glycylglutamine Proteins 0.000 description 8
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 8
- 229920001184 polypeptide Polymers 0.000 description 8
- 235000009566 rice Nutrition 0.000 description 8
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 8
- 238000012360 testing method Methods 0.000 description 8
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 8
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 7
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 7
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 7
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 7
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 7
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 7
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 7
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 7
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 7
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 7
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 7
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 7
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 7
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 7
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 7
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 7
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 7
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 7
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 7
- 241000879145 Diatraea grandiosella Species 0.000 description 7
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 7
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 7
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 7
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 7
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 7
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 7
- 241001147381 Helicoverpa armigera Species 0.000 description 7
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 7
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 7
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 7
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 7
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 7
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 7
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 7
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 7
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 7
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 7
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 7
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 7
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 7
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 7
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 7
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 7
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 7
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 7
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 7
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 7
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 7
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 7
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 7
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 7
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 7
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 7
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 7
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 7
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 7
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 7
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 7
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 7
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 7
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 7
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 7
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 7
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 7
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 7
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 7
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 7
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 7
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 7
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 7
- 108010070944 alanylhistidine Proteins 0.000 description 7
- 239000003795 chemical substances by application Substances 0.000 description 7
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 7
- 108010020688 glycylhistidine Proteins 0.000 description 7
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 6
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 6
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 6
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 6
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 6
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 6
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 6
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 6
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 6
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 6
- RYOLKFYZBHMYFW-WDSOQIARSA-N Lys-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 RYOLKFYZBHMYFW-WDSOQIARSA-N 0.000 description 6
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 6
- 241001147398 Ostrinia nubilalis Species 0.000 description 6
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 6
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 6
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 6
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 6
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 6
- 241001363650 Thysanoplusia orichalcea Species 0.000 description 6
- 108700019146 Transgenes Proteins 0.000 description 6
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 6
- 240000006365 Vitis vinifera Species 0.000 description 6
- 235000014787 Vitis vinifera Nutrition 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 235000013305 food Nutrition 0.000 description 6
- 108010084760 glycyl-tyrosyl-glycyl-aspartate Proteins 0.000 description 6
- 108010040030 histidinoalanine Proteins 0.000 description 6
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 239000002243 precursor Substances 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 108010084932 tryptophyl-proline Proteins 0.000 description 6
- 108010078580 tyrosylleucine Proteins 0.000 description 6
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 5
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 5
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 5
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 5
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 5
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 5
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 5
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 5
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 5
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 5
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 5
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 5
- XEEIQMGZRFFSRD-XVYDVKMFSA-N Cys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N XEEIQMGZRFFSRD-XVYDVKMFSA-N 0.000 description 5
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 5
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 5
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 5
- DXSBGVKEPHDOTD-UBHSHLNASA-N Cys-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N DXSBGVKEPHDOTD-UBHSHLNASA-N 0.000 description 5
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 5
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 5
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 5
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 5
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 5
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 5
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 5
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 5
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 5
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 5
- 241000730161 Haritalodes derogata Species 0.000 description 5
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 5
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 5
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 5
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 5
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 5
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 5
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 5
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 5
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 5
- XBAJINCXDBTJRH-WDSOQIARSA-N Lys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N XBAJINCXDBTJRH-WDSOQIARSA-N 0.000 description 5
- 108010066427 N-valyltryptophan Proteins 0.000 description 5
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 5
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 5
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 5
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 5
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 5
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 5
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 5
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 5
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 5
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 5
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 5
- SWSUXOKZKQRADK-FDARSICLSA-N Trp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SWSUXOKZKQRADK-FDARSICLSA-N 0.000 description 5
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 5
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 5
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 5
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 5
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 5
- 230000009471 action Effects 0.000 description 5
- 108010036533 arginylvaline Proteins 0.000 description 5
- 239000002775 capsule Substances 0.000 description 5
- 244000038559 crop plants Species 0.000 description 5
- 108010060199 cysteinylproline Proteins 0.000 description 5
- 108010054155 lysyllysine Proteins 0.000 description 5
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 4
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 4
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 4
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 4
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 4
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 4
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 4
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 4
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 4
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 4
- CIBWFJFMOBIFTE-CIUDSAMLSA-N Asn-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N CIBWFJFMOBIFTE-CIUDSAMLSA-N 0.000 description 4
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 4
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 4
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 4
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 4
- OSZBYGVKAFZWKC-FXQIFTODSA-N Asn-Pro-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O OSZBYGVKAFZWKC-FXQIFTODSA-N 0.000 description 4
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 4
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 4
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 4
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 4
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 4
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 4
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 4
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 4
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 4
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 4
- 241000193388 Bacillus thuringiensis Species 0.000 description 4
- DEVDFMRWZASYOF-ZLUOBGJFSA-N Cys-Asn-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DEVDFMRWZASYOF-ZLUOBGJFSA-N 0.000 description 4
- UDPSLLFHOLGXBY-FXQIFTODSA-N Cys-Glu-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDPSLLFHOLGXBY-FXQIFTODSA-N 0.000 description 4
- JEFZIKRIDLHOIF-BYPYZUCNSA-N Gln-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(O)=O JEFZIKRIDLHOIF-BYPYZUCNSA-N 0.000 description 4
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 4
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 4
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 4
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 4
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 4
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 4
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 4
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 4
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 4
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 4
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 4
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 4
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 4
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 4
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 4
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 4
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 4
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 4
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 4
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 4
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 4
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 4
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 4
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 4
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 4
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 4
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 4
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 4
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 4
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 4
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 4
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 4
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 4
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 4
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 4
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 4
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 4
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 4
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 4
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 4
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 4
- 241000193386 Lysinibacillus sphaericus Species 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 241000721451 Pectinophora gossypiella Species 0.000 description 4
- 241000255969 Pieris brassicae Species 0.000 description 4
- 241000672509 Polyocha depressella Species 0.000 description 4
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 4
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 4
- LCWXSALTPTZKNM-CIUDSAMLSA-N Pro-Cys-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O LCWXSALTPTZKNM-CIUDSAMLSA-N 0.000 description 4
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 4
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 4
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 4
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 4
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 4
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 4
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 4
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 4
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 4
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 4
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 4
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 4
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 4
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 4
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 4
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 4
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 4
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 4
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 4
- YTCNLMSUXPCFBW-SXNHZJKMSA-N Trp-Ile-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O YTCNLMSUXPCFBW-SXNHZJKMSA-N 0.000 description 4
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 4
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 4
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 4
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 4
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 4
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 229940097012 bacillus thuringiensis Drugs 0.000 description 4
- 108010031100 chloroplast transit peptides Proteins 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 239000013078 crystal Substances 0.000 description 4
- 108010069495 cysteinyltyrosine Proteins 0.000 description 4
- 239000010432 diamond Substances 0.000 description 4
- 229910003460 diamond Inorganic materials 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000035611 feeding Effects 0.000 description 4
- 230000035558 fertility Effects 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 230000005764 inhibitory process Effects 0.000 description 4
- 230000001404 mediated effect Effects 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 3
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 3
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 3
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 3
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 3
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 3
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 3
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 3
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 3
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 3
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 3
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 3
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 3
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 3
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 3
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 3
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 3
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 3
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 3
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 3
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 3
- 235000011301 Brassica oleracea var capitata Nutrition 0.000 description 3
- 241000193417 Brevibacillus laterosporus Species 0.000 description 3
- 241000579895 Chlorostilbon Species 0.000 description 3
- 101710151559 Crystal protein Proteins 0.000 description 3
- ZLFRUAFDAIFNHN-LKXGYXEUSA-N Cys-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O ZLFRUAFDAIFNHN-LKXGYXEUSA-N 0.000 description 3
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 241000289763 Dasygaster padockina Species 0.000 description 3
- 241000353522 Earias insulana Species 0.000 description 3
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 3
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 3
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 3
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 3
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 3
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 3
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 3
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 3
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 3
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 3
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 3
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 3
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 3
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 3
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 3
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 3
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 3
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 3
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 3
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 3
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 3
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 3
- 235000004341 Gossypium herbaceum Nutrition 0.000 description 3
- 240000002024 Gossypium herbaceum Species 0.000 description 3
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 3
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 3
- OZBDSFBWIDPVDA-BZSNNMDCSA-N His-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N OZBDSFBWIDPVDA-BZSNNMDCSA-N 0.000 description 3
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 3
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 3
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 3
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 3
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 3
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 3
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 3
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 3
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 3
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 3
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 3
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 3
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 3
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 3
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 3
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 3
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 3
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 3
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 3
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 3
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 3
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 3
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 3
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 3
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 3
- 241000256250 Spodoptera littoralis Species 0.000 description 3
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 3
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 3
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 3
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 3
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 3
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 3
- 241001439624 Trichina Species 0.000 description 3
- QNTBGBCOEYNAPV-CWRNSKLLSA-N Trp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O QNTBGBCOEYNAPV-CWRNSKLLSA-N 0.000 description 3
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 3
- SNJAPSVIPKUMCK-NWLDYVSISA-N Trp-Glu-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SNJAPSVIPKUMCK-NWLDYVSISA-N 0.000 description 3
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 3
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 3
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 3
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 3
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 3
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 3
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 3
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 3
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 3
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 3
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 3
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 3
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 3
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 3
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 3
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 3
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 3
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 3
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 3
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 3
- 235000009754 Vitis X bourquina Nutrition 0.000 description 3
- 235000012333 Vitis X labruscana Nutrition 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- -1 cell homogenates Substances 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 235000013399 edible fruits Nutrition 0.000 description 3
- 239000010976 emerald Substances 0.000 description 3
- 229910052876 emerald Inorganic materials 0.000 description 3
- 230000000967 entomopathogenic effect Effects 0.000 description 3
- 230000001747 exhibiting effect Effects 0.000 description 3
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 108010053037 kyotorphin Proteins 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 108010012058 leucyltyrosine Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 108010090894 prolylleucine Proteins 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 231100000419 toxicity Toxicity 0.000 description 3
- 230000001988 toxicity Effects 0.000 description 3
- 230000035899 viability Effects 0.000 description 3
- ZWZOCNTYMUOGPQ-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylpentanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylpentanoic acid Chemical compound CCC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)CC)C(O)=O ZWZOCNTYMUOGPQ-UHFFFAOYSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 2
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 2
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- 241001259789 Amyelois transitella Species 0.000 description 2
- 241000203069 Archaea Species 0.000 description 2
- 241001002470 Archips argyrospila Species 0.000 description 2
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 2
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 2
- GDVDRMUYICMNFJ-CIUDSAMLSA-N Arg-Cys-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O GDVDRMUYICMNFJ-CIUDSAMLSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 2
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 2
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 2
- BRCVLJZIIFBSPF-ZLUOBGJFSA-N Asn-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BRCVLJZIIFBSPF-ZLUOBGJFSA-N 0.000 description 2
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 2
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 2
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- MLJZMGIXXMTEPO-UBHSHLNASA-N Asn-Trp-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O MLJZMGIXXMTEPO-UBHSHLNASA-N 0.000 description 2
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 2
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 2
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 2
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 2
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- AAIUGNSRQDGCDC-ZLUOBGJFSA-N Asp-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O AAIUGNSRQDGCDC-ZLUOBGJFSA-N 0.000 description 2
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 2
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 2
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 2
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 2
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 2
- ZQFZEBRNAMXXJV-KKUMJFAQSA-N Asp-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O ZQFZEBRNAMXXJV-KKUMJFAQSA-N 0.000 description 2
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 2
- 235000000832 Ayote Nutrition 0.000 description 2
- 108700003918 Bacillus Thuringiensis insecticidal crystal Proteins 0.000 description 2
- 101000878902 Bacillus thuringiensis Pesticidal crystal protein Cry6Aa Proteins 0.000 description 2
- 101000878906 Bacillus thuringiensis Pesticidal crystal protein Cry6Ba Proteins 0.000 description 2
- 101100497219 Bacillus thuringiensis subsp. kurstaki cry1Ac gene Proteins 0.000 description 2
- 101100007621 Bacillus thuringiensis subsp. morrisoni cry1Ka gene Proteins 0.000 description 2
- 239000002028 Biomass Substances 0.000 description 2
- 240000007124 Brassica oleracea Species 0.000 description 2
- 235000003899 Brassica oleracea var acephala Nutrition 0.000 description 2
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 description 2
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 2
- 240000003259 Brassica oleracea var. botrytis Species 0.000 description 2
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 2
- 108010049994 Chloroplast Proteins Proteins 0.000 description 2
- 235000005976 Citrus sinensis Nutrition 0.000 description 2
- 240000002319 Citrus sinensis Species 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 240000004244 Cucurbita moschata Species 0.000 description 2
- 235000009854 Cucurbita moschata Nutrition 0.000 description 2
- 235000009804 Cucurbita pepo subsp pepo Nutrition 0.000 description 2
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 2
- UISYPAHPLXGLNH-ACZMJKKPSA-N Cys-Asn-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UISYPAHPLXGLNH-ACZMJKKPSA-N 0.000 description 2
- GCDLPNRHPWBKJJ-WDSKDSINSA-N Cys-Gly-Glu Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GCDLPNRHPWBKJJ-WDSKDSINSA-N 0.000 description 2
- JUUMIGUJJRFQQR-KKUMJFAQSA-N Cys-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O JUUMIGUJJRFQQR-KKUMJFAQSA-N 0.000 description 2
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 2
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 2
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 2
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 2
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 2
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 2
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 2
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 2
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 2
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 2
- CBOVGULVQSVMPT-CIUDSAMLSA-N Glu-Pro-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CBOVGULVQSVMPT-CIUDSAMLSA-N 0.000 description 2
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 2
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 description 2
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 2
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 2
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- RIUZKUJUPVFAGY-HOTGVXAUSA-N Gly-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)CN RIUZKUJUPVFAGY-HOTGVXAUSA-N 0.000 description 2
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 241001201676 Hedya nubiferana Species 0.000 description 2
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 2
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 2
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 2
- AKAPKBNIVNPIPO-KKUMJFAQSA-N His-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 AKAPKBNIVNPIPO-KKUMJFAQSA-N 0.000 description 2
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 2
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 2
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 2
- MKWFGXSFLYNTKC-XIRDDKMYSA-N His-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N MKWFGXSFLYNTKC-XIRDDKMYSA-N 0.000 description 2
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 2
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 2
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 2
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 2
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 2
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 2
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 2
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 2
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 2
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 2
- 206010061217 Infestation Diseases 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 2
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 2
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 2
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 2
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- 241001261104 Lobesia botrana Species 0.000 description 2
- 241000193981 Loxostege sticticalis Species 0.000 description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 2
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 2
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 2
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 2
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 2
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 2
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 2
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 2
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 2
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 2
- UXJHNUBJSQQIOC-SZMVWBNQSA-N Met-Trp-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O UXJHNUBJSQQIOC-SZMVWBNQSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 241000208125 Nicotiana Species 0.000 description 2
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 2
- 241000256259 Noctuidae Species 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 2
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 2
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 2
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 2
- 241001525654 Phyllocnistis citrella Species 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 2
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- 241000209051 Saccharum Species 0.000 description 2
- 240000000111 Saccharum officinarum Species 0.000 description 2
- 235000007201 Saccharum officinarum Nutrition 0.000 description 2
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 2
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 2
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 2
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 2
- AXOHAHIUJHCLQR-IHRRRGAJSA-N Ser-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CO)N AXOHAHIUJHCLQR-IHRRRGAJSA-N 0.000 description 2
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 2
- 240000003768 Solanum lycopersicum Species 0.000 description 2
- 244000062793 Sorghum vulgare Species 0.000 description 2
- 241001521235 Spodoptera eridania Species 0.000 description 2
- 241000985245 Spodoptera litura Species 0.000 description 2
- 244000152045 Themeda triandra Species 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 2
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 2
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 2
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- 241000255993 Trichoplusia ni Species 0.000 description 2
- 244000098338 Triticum aestivum Species 0.000 description 2
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 2
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 2
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 2
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 2
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 2
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 2
- PYJKETPLFITNKS-IHRRRGAJSA-N Tyr-Pro-Asn Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O PYJKETPLFITNKS-IHRRRGAJSA-N 0.000 description 2
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 2
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 2
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 2
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 2
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 2
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 2
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 2
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 2
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 2
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 2
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 2
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 239000013043 chemical agent Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000013065 commercial product Substances 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 210000000172 cytosol Anatomy 0.000 description 2
- 235000005911 diet Nutrition 0.000 description 2
- 230000037213 diet Effects 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 238000009313 farming Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 238000003306 harvesting Methods 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 239000002917 insecticide Substances 0.000 description 2
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 235000009973 maize Nutrition 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000006384 oligomerization reaction Methods 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 235000021317 phosphate Nutrition 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 230000003234 polygenic effect Effects 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 231100000654 protein toxin Toxicity 0.000 description 2
- 235000015136 pumpkin Nutrition 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 210000004215 spore Anatomy 0.000 description 2
- 239000007921 spray Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 238000011426 transformation method Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 108010036387 trimethionine Proteins 0.000 description 2
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 2
- 108010032276 tyrosyl-glutamyl-tyrosyl-glutamic acid Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- NOOLISFMXDJSKH-UTLUCORTSA-N (+)-Neomenthol Chemical compound CC(C)[C@@H]1CC[C@@H](C)C[C@@H]1O NOOLISFMXDJSKH-UTLUCORTSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- BAAVRTJSLCSMNM-CMOCDZPBSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-4-carboxybutanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]pentanedioic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 BAAVRTJSLCSMNM-CMOCDZPBSA-N 0.000 description 1
- ZHVOBYWXERUHMN-KVJKMEBSSA-N 3-[(3s,5r,8r,9s,10s,13s,14s,17s)-10,13-dimethyl-3-[(2r,3r,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-2,3,4,5,6,7,8,9,11,12,14,15,16,17-tetradecahydro-1h-cyclopenta[a]phenanthren-17-yl]-2h-furan-5-one Chemical compound O([C@@H]1C[C@H]2CC[C@@H]3[C@@H]([C@]2(CC1)C)CC[C@]1([C@H]3CC[C@@H]1C=1COC(=O)C=1)C)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O ZHVOBYWXERUHMN-KVJKMEBSSA-N 0.000 description 1
- 241000208140 Acer Species 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- HFBFSOAKPUZCCO-ZLUOBGJFSA-N Ala-Cys-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HFBFSOAKPUZCCO-ZLUOBGJFSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 1
- AAXVGJXZKHQQHD-LSJOCFKGSA-N Ala-His-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N AAXVGJXZKHQQHD-LSJOCFKGSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- 241000234282 Allium Species 0.000 description 1
- 235000005254 Allium ampeloprasum Nutrition 0.000 description 1
- 240000006108 Allium ampeloprasum Species 0.000 description 1
- 235000002732 Allium cepa var. cepa Nutrition 0.000 description 1
- 240000002234 Allium sativum Species 0.000 description 1
- 241000483758 Amphipoea oculea Species 0.000 description 1
- 240000007087 Apium graveolens Species 0.000 description 1
- 235000015849 Apium graveolens Dulce Group Nutrition 0.000 description 1
- 235000010591 Appio Nutrition 0.000 description 1
- 101000768857 Arabidopsis thaliana 3-phosphoshikimate 1-carboxyvinyltransferase, chloroplastic Proteins 0.000 description 1
- 235000017060 Arachis glabrata Nutrition 0.000 description 1
- 244000105624 Arachis hypogaea Species 0.000 description 1
- 235000010777 Arachis hypogaea Nutrition 0.000 description 1
- 235000018262 Arachis monticola Nutrition 0.000 description 1
- 241000103017 Archips occidentalis Species 0.000 description 1
- 241001423656 Archips rosana Species 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 1
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- RYRQZJVFDVWURI-SRVKXCTJSA-N Arg-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N RYRQZJVFDVWURI-SRVKXCTJSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- ZTRJUKDEALVRMW-SRVKXCTJSA-N Asn-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZTRJUKDEALVRMW-SRVKXCTJSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- WCRQQIPFSXFIRN-LPEHRKFASA-N Asn-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N WCRQQIPFSXFIRN-LPEHRKFASA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- GBAWQWASNGUNQF-ZLUOBGJFSA-N Asp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N GBAWQWASNGUNQF-ZLUOBGJFSA-N 0.000 description 1
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- KNOGLZBISUBTFW-QRTARXTBSA-N Asp-Trp-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O KNOGLZBISUBTFW-QRTARXTBSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- 244000075850 Avena orientalis Species 0.000 description 1
- 235000007319 Avena orientalis Nutrition 0.000 description 1
- 235000007558 Avena sp Nutrition 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 101100497229 Bacillus thuringiensis cry1Be gene Proteins 0.000 description 1
- 101100114760 Bacillus thuringiensis cry3Bb gene Proteins 0.000 description 1
- 101100497221 Bacillus thuringiensis subsp. alesti cry1Ae gene Proteins 0.000 description 1
- 101100275683 Bacillus thuringiensis subsp. kurstaki cry2Ab gene Proteins 0.000 description 1
- 108020000946 Bacterial DNA Proteins 0.000 description 1
- 241000219310 Beta vulgaris subsp. vulgaris Species 0.000 description 1
- 241000255789 Bombyx mori Species 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 240000002791 Brassica napus Species 0.000 description 1
- 235000006008 Brassica napus var napus Nutrition 0.000 description 1
- 235000017647 Brassica oleracea var italica Nutrition 0.000 description 1
- 244000178937 Brassica oleracea var. capitata Species 0.000 description 1
- 235000010149 Brassica rapa subsp chinensis Nutrition 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 235000000536 Brassica rapa subsp pekinensis Nutrition 0.000 description 1
- 241000499436 Brassica rapa subsp. pekinensis Species 0.000 description 1
- 244000188595 Brassica sinapistrum Species 0.000 description 1
- 241000186146 Brevibacterium Species 0.000 description 1
- 235000010773 Cajanus indicus Nutrition 0.000 description 1
- 244000105627 Cajanus indicus Species 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 235000002566 Capsicum Nutrition 0.000 description 1
- 101710167800 Capsid assembly scaffolding protein Proteins 0.000 description 1
- 235000003255 Carthamus tinctorius Nutrition 0.000 description 1
- 244000020518 Carthamus tinctorius Species 0.000 description 1
- 235000009024 Ceanothus sanguineus Nutrition 0.000 description 1
- 235000010523 Cicer arietinum Nutrition 0.000 description 1
- 244000045195 Cicer arietinum Species 0.000 description 1
- 244000241235 Citrullus lanatus Species 0.000 description 1
- 235000012828 Citrullus lanatus var citroides Nutrition 0.000 description 1
- 241000207199 Citrus Species 0.000 description 1
- 241000008892 Cnaphalocrocis patnalis Species 0.000 description 1
- 244000060011 Cocos nucifera Species 0.000 description 1
- 235000013162 Cocos nucifera Nutrition 0.000 description 1
- 240000007154 Coffea arabica Species 0.000 description 1
- 241000219112 Cucumis Species 0.000 description 1
- 235000015510 Cucumis melo subsp melo Nutrition 0.000 description 1
- 240000008067 Cucumis sativus Species 0.000 description 1
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 1
- 241001635274 Cydia pomonella Species 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- ANRWXLYGJRSQEQ-CIUDSAMLSA-N Cys-His-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ANRWXLYGJRSQEQ-CIUDSAMLSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- CLEFUAZULXANBU-MELADBBJSA-N Cys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N)C(=O)O CLEFUAZULXANBU-MELADBBJSA-N 0.000 description 1
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 1
- IOLWXFWVYYCVTJ-NRPADANISA-N Cys-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N IOLWXFWVYYCVTJ-NRPADANISA-N 0.000 description 1
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- YAHZABJORDUQGO-NQXXGFSBSA-N D-ribulose 1,5-bisphosphate Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)C(=O)COP(O)(O)=O YAHZABJORDUQGO-NQXXGFSBSA-N 0.000 description 1
- NOOLISFMXDJSKH-UHFFFAOYSA-N DL-menthol Natural products CC(C)C1CCC(C)CC1O NOOLISFMXDJSKH-UHFFFAOYSA-N 0.000 description 1
- 235000002767 Daucus carota Nutrition 0.000 description 1
- 244000000626 Daucus carota Species 0.000 description 1
- 241000522190 Desmodium Species 0.000 description 1
- 206010012559 Developmental delay Diseases 0.000 description 1
- 235000014466 Douglas bleu Nutrition 0.000 description 1
- 241001057636 Dracaena deremensis Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 244000004281 Eucalyptus maculata Species 0.000 description 1
- 241000221017 Euphorbiaceae Species 0.000 description 1
- 235000016623 Fragaria vesca Nutrition 0.000 description 1
- 240000009088 Fragaria x ananassa Species 0.000 description 1
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 1
- 101710160621 Fusion glycoprotein F0 Proteins 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- SSWAFVQFQWOJIJ-XIRDDKMYSA-N Gln-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N SSWAFVQFQWOJIJ-XIRDDKMYSA-N 0.000 description 1
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- VOUSELYGTNGEPB-NUMRIWBASA-N Gln-Thr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O VOUSELYGTNGEPB-NUMRIWBASA-N 0.000 description 1
- SYTFJIQPBRJSOK-NKIYYHGXSA-N Gln-Thr-His Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 SYTFJIQPBRJSOK-NKIYYHGXSA-N 0.000 description 1
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 1
- QZQYITIKPAUDGN-GVXVVHGQSA-N Gln-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QZQYITIKPAUDGN-GVXVVHGQSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 1
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- OHWJUIXZHVIXJJ-GUBZILKMSA-N Glu-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N OHWJUIXZHVIXJJ-GUBZILKMSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- WVWZIPOJECFDAG-AVGNSLFASA-N Glu-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N WVWZIPOJECFDAG-AVGNSLFASA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- RZMXBFUSQNLEQF-QEJZJMRPSA-N Glu-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RZMXBFUSQNLEQF-QEJZJMRPSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- UYPPAMNTTMJHJW-KCTSRDHCSA-N Gly-Ile-Trp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UYPPAMNTTMJHJW-KCTSRDHCSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- VLIJYPMATZSOLL-YUMQZZPRSA-N Gly-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VLIJYPMATZSOLL-YUMQZZPRSA-N 0.000 description 1
- LPHQAFLNEHWKFF-QXEWZRGKSA-N Gly-Met-Ile Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LPHQAFLNEHWKFF-QXEWZRGKSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- 241001441330 Grapholita molesta Species 0.000 description 1
- LYSMQLXUCAKELQ-DCAQKATOSA-N His-Asp-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N LYSMQLXUCAKELQ-DCAQKATOSA-N 0.000 description 1
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 1
- CYHWWHKRCKHYGQ-GUBZILKMSA-N His-Cys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CYHWWHKRCKHYGQ-GUBZILKMSA-N 0.000 description 1
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 1
- BQFGKVYHKCNEMF-DCAQKATOSA-N His-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 BQFGKVYHKCNEMF-DCAQKATOSA-N 0.000 description 1
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 1
- JBSLJUPMTYLLFH-MELADBBJSA-N His-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O JBSLJUPMTYLLFH-MELADBBJSA-N 0.000 description 1
- BILZDIPAKWZFSG-PYJNHQTQSA-N His-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N BILZDIPAKWZFSG-PYJNHQTQSA-N 0.000 description 1
- MFQVZYSPCIZFMR-MGHWNKPDSA-N His-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N MFQVZYSPCIZFMR-MGHWNKPDSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- SLFSYFJKSIVSON-SRVKXCTJSA-N His-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SLFSYFJKSIVSON-SRVKXCTJSA-N 0.000 description 1
- ABCCKUZDWMERKT-AVGNSLFASA-N His-Pro-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O ABCCKUZDWMERKT-AVGNSLFASA-N 0.000 description 1
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 1
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- 235000008694 Humulus lupulus Nutrition 0.000 description 1
- 244000025221 Humulus lupulus Species 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- VUPHVQCDULLACF-NAKRPEOUSA-N Ile-Met-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N VUPHVQCDULLACF-NAKRPEOUSA-N 0.000 description 1
- IMRKCLXPYOIHIF-ZPFDUUQYSA-N Ile-Met-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IMRKCLXPYOIHIF-ZPFDUUQYSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 244000017020 Ipomoea batatas Species 0.000 description 1
- 235000002678 Ipomoea batatas Nutrition 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- 235000003228 Lactuca sativa Nutrition 0.000 description 1
- 240000008415 Lactuca sativa Species 0.000 description 1
- 240000003553 Leptospermum scoparium Species 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 235000004431 Linum usitatissimum Nutrition 0.000 description 1
- 240000006240 Linum usitatissimum Species 0.000 description 1
- 241000208682 Liquidambar Species 0.000 description 1
- 235000006552 Liquidambar styraciflua Nutrition 0.000 description 1
- 241000594033 Liriomyza bryoniae Species 0.000 description 1
- 235000015459 Lycium barbarum Nutrition 0.000 description 1
- 241000721703 Lymantria dispar Species 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 1
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- SKUOQDYMJFUMOE-ULQDDVLXSA-N Lys-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N SKUOQDYMJFUMOE-ULQDDVLXSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 1
- 240000004658 Medicago sativa Species 0.000 description 1
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 1
- 235000014435 Mentha Nutrition 0.000 description 1
- 241001072983 Mentha Species 0.000 description 1
- 244000246386 Mentha pulegium Species 0.000 description 1
- 235000016257 Mentha pulegium Nutrition 0.000 description 1
- 235000004357 Mentha x piperita Nutrition 0.000 description 1
- MVQGZYIOMXAFQG-GUBZILKMSA-N Met-Ala-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MVQGZYIOMXAFQG-GUBZILKMSA-N 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 1
- 241000819714 Monema flavescens Species 0.000 description 1
- 240000005561 Musa balbisiana Species 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- 108010034522 NNQQ peptide Proteins 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 240000007817 Olea europaea Species 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241001012098 Omiodes indicata Species 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 241001310339 Paenibacillus popilliae Species 0.000 description 1
- 241001520808 Panicum virgatum Species 0.000 description 1
- 239000006002 Pepper Substances 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 1
- 244000046052 Phaseolus vulgaris Species 0.000 description 1
- NOFBJKKOPKJDCO-KKXDTOCCSA-N Phe-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOFBJKKOPKJDCO-KKXDTOCCSA-N 0.000 description 1
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- ZZVUXQCQPXSUFH-JBACZVJFSA-N Phe-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ZZVUXQCQPXSUFH-JBACZVJFSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 1
- 241000018646 Pinus brutia Species 0.000 description 1
- 235000011613 Pinus brutia Nutrition 0.000 description 1
- 241001236219 Pinus echinata Species 0.000 description 1
- 235000005018 Pinus echinata Nutrition 0.000 description 1
- 235000017339 Pinus palustris Nutrition 0.000 description 1
- 241000218621 Pinus radiata Species 0.000 description 1
- 235000008577 Pinus radiata Nutrition 0.000 description 1
- 235000008566 Pinus taeda Nutrition 0.000 description 1
- 241000218679 Pinus taeda Species 0.000 description 1
- 235000016761 Piper aduncum Nutrition 0.000 description 1
- 240000003889 Piper guineense Species 0.000 description 1
- 235000017804 Piper guineense Nutrition 0.000 description 1
- 235000008184 Piper nigrum Nutrition 0.000 description 1
- 235000010582 Pisum sativum Nutrition 0.000 description 1
- 240000004713 Pisum sativum Species 0.000 description 1
- 244000292697 Polygonum aviculare Species 0.000 description 1
- 235000006386 Polygonum aviculare Nutrition 0.000 description 1
- 241000219000 Populus Species 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- SZZBUDVXWZZPDH-BQBZGAKWSA-N Pro-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 SZZBUDVXWZZPDH-BQBZGAKWSA-N 0.000 description 1
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- CZCCVJUUWBMISW-FXQIFTODSA-N Pro-Ser-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O CZCCVJUUWBMISW-FXQIFTODSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 1
- BVRBCQBUNGAWFP-KKUMJFAQSA-N Pro-Tyr-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O BVRBCQBUNGAWFP-KKUMJFAQSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 101710130420 Probable capsid assembly scaffolding protein Proteins 0.000 description 1
- 108010078762 Protein Precursors Proteins 0.000 description 1
- 102000014961 Protein Precursors Human genes 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 101100457857 Pseudomonas entomophila (strain L48) mnl gene Proteins 0.000 description 1
- 240000001416 Pseudotsuga menziesii Species 0.000 description 1
- 235000005386 Pseudotsuga menziesii var menziesii Nutrition 0.000 description 1
- 235000014443 Pyrus communis Nutrition 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 244000088415 Raphanus sativus Species 0.000 description 1
- 235000006140 Raphanus sativus var sativus Nutrition 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 235000004443 Ricinus communis Nutrition 0.000 description 1
- 241000220317 Rosa Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 101710204410 Scaffold protein Proteins 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- DGPGKMKUNGKHPK-QEJZJMRPSA-N Ser-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGPGKMKUNGKHPK-QEJZJMRPSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- 235000002597 Solanum melongena Nutrition 0.000 description 1
- 244000061458 Solanum melongena Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 241000488874 Sonchus Species 0.000 description 1
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 1
- 235000009337 Spinacia oleracea Nutrition 0.000 description 1
- 244000300264 Spinacia oleracea Species 0.000 description 1
- 241000256248 Spodoptera Species 0.000 description 1
- 235000021536 Sugar beet Nutrition 0.000 description 1
- 244000269722 Thea sinensis Species 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 1
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 241000243774 Trichinella Species 0.000 description 1
- 241000219793 Trifolium Species 0.000 description 1
- 235000019714 Triticale Nutrition 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- TZNNEYFZZAHLBL-BPUTZDHNSA-N Trp-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O TZNNEYFZZAHLBL-BPUTZDHNSA-N 0.000 description 1
- RSUXQZNWAOTBQF-XIRDDKMYSA-N Trp-Arg-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RSUXQZNWAOTBQF-XIRDDKMYSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- NOFFAYIYPAUNRM-HKUYNNGSSA-N Trp-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NOFFAYIYPAUNRM-HKUYNNGSSA-N 0.000 description 1
- MKDXQPMIQPTTAW-SIXJUCDHSA-N Trp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N MKDXQPMIQPTTAW-SIXJUCDHSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- OJKVFAWXPGCJMF-BPUTZDHNSA-N Trp-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CO)C(=O)O OJKVFAWXPGCJMF-BPUTZDHNSA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- WNGMGTMSUBARLB-RXVVDRJESA-N Trp-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(=O)NCC(O)=O)=CNC2=C1 WNGMGTMSUBARLB-RXVVDRJESA-N 0.000 description 1
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- DWJQKEZKLQCHKO-SRVKXCTJSA-N Tyr-Asn-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O DWJQKEZKLQCHKO-SRVKXCTJSA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 1
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- PLVVHGFEMSDRET-IHPCNDPISA-N Tyr-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PLVVHGFEMSDRET-IHPCNDPISA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- FZADUTOCSFDBRV-RNXOBYDBSA-N Tyr-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FZADUTOCSFDBRV-RNXOBYDBSA-N 0.000 description 1
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- WFTKOJGOOUJLJV-VKOGCVSHSA-N Val-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O)NC(=O)[C@@H]([NH3+])C(C)C)=CNC2=C1 WFTKOJGOOUJLJV-VKOGCVSHSA-N 0.000 description 1
- AYHNXCJKBLYVOA-KSZLIROESA-N Val-Trp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N AYHNXCJKBLYVOA-KSZLIROESA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- 241001002356 Valeriana edulis Species 0.000 description 1
- 241000482268 Zea mays subsp. mays Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- FJJCIZWZNKZHII-UHFFFAOYSA-N [4,6-bis(cyanoamino)-1,3,5-triazin-2-yl]cyanamide Chemical compound N#CNC1=NC(NC#N)=NC(NC#N)=N1 FJJCIZWZNKZHII-UHFFFAOYSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000009418 agronomic effect Effects 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 230000011681 asexual reproduction Effects 0.000 description 1
- 238000013465 asexual reproduction Methods 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 210000004666 bacterial spore Anatomy 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 230000000853 biopesticidal effect Effects 0.000 description 1
- VEMKTZHHVJILDY-UXHICEINSA-N bioresmethrin Chemical compound CC1(C)[C@H](C=C(C)C)[C@H]1C(=O)OCC1=COC(CC=2C=CC=CC=2)=C1 VEMKTZHHVJILDY-UXHICEINSA-N 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 150000004657 carbamic acid derivatives Chemical class 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 150000001805 chlorine compounds Chemical class 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 235000020971 citrus fruits Nutrition 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 235000016213 coffee Nutrition 0.000 description 1
- 235000013353 coffee beverage Nutrition 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 101150065438 cry1Ab gene Proteins 0.000 description 1
- 101150049404 cry1Ca gene Proteins 0.000 description 1
- 101150102059 cry3Aa gene Proteins 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 239000000428 dust Substances 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000004634 feeding behavior Effects 0.000 description 1
- 230000003031 feeding effect Effects 0.000 description 1
- 231100000502 fertility decrease Toxicity 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 235000004611 garlic Nutrition 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 231100000086 high toxicity Toxicity 0.000 description 1
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 235000001050 hortel pimenta Nutrition 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 238000011901 isothermal amplification Methods 0.000 description 1
- 230000002147 killing effect Effects 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 230000007758 mating behavior Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229940041616 menthol Drugs 0.000 description 1
- 239000013264 metal-organic assembly Substances 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 235000019713 millet Nutrition 0.000 description 1
- 238000003801 milling Methods 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 208000009091 myxoma Diseases 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 235000014571 nuts Nutrition 0.000 description 1
- 206010073131 oligoastrocytoma Diseases 0.000 description 1
- 235000020232 peanut Nutrition 0.000 description 1
- 210000002824 peroxisome Anatomy 0.000 description 1
- 230000000361 pesticidal effect Effects 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 230000014639 sexual reproduction Effects 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- 230000028070 sporulation Effects 0.000 description 1
- 238000005507 spraying Methods 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010087967 type I signal peptidase Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 241000228158 x Triticosecale Species 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01N—PRESERVATION OF BODIES OF HUMANS OR ANIMALS OR PLANTS OR PARTS THEREOF; BIOCIDES, e.g. AS DISINFECTANTS, AS PESTICIDES OR AS HERBICIDES; PEST REPELLANTS OR ATTRACTANTS; PLANT GROWTH REGULATORS
- A01N47/00—Biocides, pest repellants or attractants, or plant growth regulators containing organic compounds containing a carbon atom not being member of a ring and having no bond to a carbon or hydrogen atom, e.g. derivatives of carbonic acid
- A01N47/08—Biocides, pest repellants or attractants, or plant growth regulators containing organic compounds containing a carbon atom not being member of a ring and having no bond to a carbon or hydrogen atom, e.g. derivatives of carbonic acid the carbon atom having one or more single bonds to nitrogen atoms
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01N—PRESERVATION OF BODIES OF HUMANS OR ANIMALS OR PLANTS OR PARTS THEREOF; BIOCIDES, e.g. AS DISINFECTANTS, AS PESTICIDES OR AS HERBICIDES; PEST REPELLANTS OR ATTRACTANTS; PLANT GROWTH REGULATORS
- A01N63/00—Biocides, pest repellants or attractants, or plant growth regulators containing microorganisms, viruses, microbial fungi, animals or substances produced by, or obtained from, microorganisms, viruses, microbial fungi or animals, e.g. enzymes or fermentates
- A01N63/50—Isolated enzymes; Isolated proteins
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/32—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
- C07K14/325—Bacillus thuringiensis crystal peptides, i.e. delta-endotoxins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8286—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for insect resistance
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Pest Control & Pesticides (AREA)
- Microbiology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Physics & Mathematics (AREA)
- Environmental Sciences (AREA)
- Agronomy & Crop Science (AREA)
- Dentistry (AREA)
- Cell Biology (AREA)
- Insects & Arthropods (AREA)
- Crystallography & Structural Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Agricultural Chemicals And Associated Chemicals (AREA)
- Catching Or Destruction (AREA)
- Pretreatment Of Seeds And Plants (AREA)
Abstract
公开了编码新型嵌合杀昆虫蛋白质的核苷酸序列,所述嵌合杀昆虫蛋白质表现出鳞翅目抑制活性。具体实施方案提供了含有编码一种或多种所述嵌合杀昆虫蛋白质的重组核酸分子的组合物以及经过转化的植物、植物部分和种子。
Description
本申请是申请日为2015年10月15日、申请号为201580055840.0、发明名称为“对鳞翅目害虫具有毒性或抑制性的新型嵌合杀昆虫蛋白质”的发明专利申请的分案申请。
对相关申请的引用
本申请要求2014年10月16日提交的美国临时申请No.62/064,989的权益,该申请以全文引用的方式并入本文中。
序列表的并入
序列表的计算机可读形式与本申请一起通过电子提交方式提交并且以全文引用的方式并入本申请。序列表包含在2016年8月23日创建的文件中,具有文件名P34230WO00_Seq_PCT_Art34.txt,并且大小是898,229字节(如在操作系统中所测量)。
发明领域
本发明总体上涉及昆虫抑制性蛋白质领域。在本申请中公开了对农作物和种子的农业相关害虫表现出昆虫抑制性活性的一类新型的嵌合杀昆虫蛋白质。具体来说,所公开的类别的蛋白质对鳞翅目昆虫害虫表现出了杀昆虫活性。提供了含有编码所公开的毒素蛋白质中的一种或多种的重组核酸分子的植物、植物部分和种子。
发明背景
提高重要农业植物,尤其包括玉米、大豆、甘蔗、水稻、小麦、蔬菜和棉花的作物产量已经变得越来越重要。除了日益增长的人口对用于提供食物、衣物和能量的农产品的需要日益增长以外,预计气候相关的影响和来自于日益增长的人口将土地用于除农业耕作以外的用途的压力也会减少可耕种耕地的量。这些因素已经导致了关于粮食保障的严峻预测,特别是在植物生物技术和农业技术不存在重大改进的情况下。鉴于这些压力,技术、农业技术和害虫治理的环境可持续改进是在有限量的可耕种耕地上使作物增产的重要工具。
昆虫,特别是鳞翅目内的昆虫,被认为是破坏农田作物,从而降低受侵扰区域中的作物产量的主要原因。会对农业造成负面影响的鳞翅目害虫物种包括但不限于草地粘虫(草地贪夜蛾)、甜菜粘虫(甜菜夜蛾)、披肩粘虫(蓓带夜蛾)、黑切根虫(球菜夜蛾)、甘蓝尺蠖(粉纹夜蛾)、大豆尺蠖(大豆夜蛾)、黎豆毛虫(黎豆夜蛾)、绿叶虫(苜蓿绿夜蛾)、烟草芽虫(烟芽夜蛾)、颗粒切根虫(黄地老虎)、粘虫(一星粘虫)、西方切根虫(灰地老虎)、欧洲玉米钻心虫(欧洲玉米螟)、脐橙虫(脐橙螟)、玉米根结网虫(玉米根网螟)、草地结网虫(水稻切叶野螟)、向日葵蛾(向日葵螟)、小玉米茎钻心虫(南美玉米苗斑螟)、苹果蛾(苹果蠹蛾)、葡萄果蛾(葡萄小食心虫)、东方果蛾(梨小食心虫)、向日葵芽小卷蛾(向日葵芽卷叶蛾)、钻背蛾(小菜蛾)、粉红棉铃虫(棉红铃虫)、粉红钻茎虫(大螟)、吉普赛蛾(舞毒蛾)、棉叶虫(棉叶波纹夜蛾)、果树卷叶虫(果树黄卷蛾)、欧洲卷叶虫(玫瑰黄卷蛾)、亚洲水稻钻心虫或水稻钻茎虫(二化螟)、水稻卷叶虫(稻纵卷叶螟)、玉米根结网虫(玉米根网螟)、蓝草结网虫(早熟禾草螟)、西南玉米钻心虫(西南玉米螟)、蔗螟(小蔗螟)、多刺棉铃虫(埃及金刚钻)、斑点棉铃虫(翠纹金刚钻)、古棉铃虫(棉铃实夜蛾)、玉米穗虫、大豆荚虫或棉铃虫(玉米穗夜蛾)、草地结网虫(水稻切叶野螟)、欧洲葡萄蛾(欧洲葡萄缀穗蛾)、柑桔潜叶虫(柑桔潜叶蛾)、大白蝴蝶(大菜粉蛾)、菜青虫或小白蝴蝶(小菜粉蛾)、烟草切根虫或茶蚕(斜纹夜蛾)和番茄潜叶虫(番茄潜叶蛾)。
在历史上,农业依赖于密集施用作为害虫防治剂的合成化学杀昆虫剂。除了出现抗性问题以外,对环境和人类健康的忧虑也刺激了生物杀虫剂的研究和开发。这种研究工作导致逐渐发现和使用多种昆虫病原微生物物种,包括细菌。
当发现了昆虫病原细菌,尤其是属于芽孢杆菌属的细菌,并且将其开发为生物害虫防治剂时,生物防治模式发生了变化。细菌苏云金芽孢杆菌(Bt)的菌株已经被用作杀昆虫蛋白质的来源,因为人们发现Bt菌株会对特定的昆虫显示出高毒性。已知Bt菌株会在孢子形成开始时和在稳定生长期期间产生位于伴孢结晶包涵体内的δ内毒素(例如Cry蛋白),而且还已知其能产生分泌型杀昆虫蛋白质。在被易感昆虫摄取后,δ内毒素以及分泌型毒素在中肠皮膜细胞表面发挥其作用,从而破坏细胞膜,导致细胞破环和死亡。除Bt以外的细菌物种中也鉴别出了编码杀昆虫蛋白质的基因,所述细菌物种包括其他芽孢杆菌和多种其他细菌物种,诸如侧孢短芽孢杆菌(Brevibacillus laterosporus)、球形赖氨酸芽孢杆菌(Lysinibacillus sphaericus)(“Ls”先前被称为球形芽孢杆菌(Bacillus sphaericus))和日本甲虫芽孢杆菌(Paenibacillus popilliae)。
结晶型和分泌型可溶性杀昆虫蛋白质毒素对其宿主具有高度特异性,并且已经得以在世界范围内被接受作为化学杀昆虫剂的替代物。举例来说,杀昆虫毒素蛋白质已经被用于多种农业应用中,以防止重要农业植物遭到昆虫侵扰,减少对化学杀虫剂施用的需要和增加产量。通过机械方法诸如将含有多种细菌菌株的微生物制剂分散至植物表面上的喷雾和通过使用旨在产生表达杀昆虫毒素蛋白质的转基因植物和种子的基因转化技术将杀昆虫毒素蛋白质用来防治作物植物的农业相关害虫。
表达杀昆虫蛋白质的转基因植物的使用已经在全世界得到了采用。举例来说,在2012年,有2610万公顷种植了表达Bt毒素的转基因作物(James,C.,Global Status ofCommercialized Biotech/GM Crops:2012.ISAAA Brief第44期)。转基因昆虫防护作物的全球使用和被用于这些作物中的杀昆虫蛋白质的有限数目已经对能赋予针对目前所利用的杀昆虫蛋白质的抗性的现有昆虫等位基因产生了选择压力。
目标害虫中发展对杀昆虫蛋白质的抗性引起持续需要发现和开发可用于控制昆虫对表达杀昆虫蛋白质的转基因作物的抗性增加的杀昆虫蛋白质新形式。具有提高的效力并且对更宽范围的易感昆虫物种表现出控制的新杀昆虫蛋白质将减少可能发展抗性等位基因的存活昆虫的数目。另外,在一种植物中使用两种或更多种对同一昆虫害虫有毒并且呈现出不同的作用模式的转基因杀昆虫蛋白质能降低任何单一目标昆虫物种中出现抗性的概率。
因此,极度需要鉴别出具有改善的杀昆虫性质,诸如与农业耕作中目前使用的毒素相比对更宽范围的目标昆虫害虫物种和不同的作用模式的功效有所增加的其他杀昆虫蛋白质。为了满足这个需要,本发明公开了对主要目标鳞翅目害虫物种表现出活性的新型Cry1嵌合杀昆虫蛋白质。
本领域中已知Cry1晶体蛋白质家族成员会对鳞翅目害虫表现出生物活性。Cry1晶体蛋白质的前体形式由两个近似相等尺寸的片段组成。所述前体蛋白质的羧基末端部分,称为原毒素片段,能使晶体形成稳定并且不表现杀昆虫活性。所述前体蛋白质的胺基末端部分包含Cry1蛋白的毒素片段,并且基于Cry1家族成员内的保守或基本保守序列的比对,可以进一步再分成三个结构域,即结构域I、结构域II和结构域III。结构域I包含活性毒素片段的约前三分之一,而且已经被证明对通道形成是不可或缺的。结构域II和结构域III都牵涉受体结合和昆虫物种特异性,这取决于所研究的昆虫和杀昆虫蛋白质。
由对本领域中已知的众多天然杀昆虫蛋白质的结构域结构进行拣选来任意地产生具有增强的嵌合蛋白质的可能性微乎其微。这是蛋白质结构、寡聚和释放杀昆虫蛋白质片段所需的激活(包括对嵌合前体进行正确蛋白水解处理,如果以此种形式表达的话)的复杂特性的结果。只有通过小心地选择各亲本蛋白质内的原毒素和具体靶标来产生嵌合结构才可能构建出与得到嵌合体的亲本蛋白质相比表现出有所改善的杀昆虫活性的功能嵌合杀昆虫毒素。本领域中已知,重新组装原毒素与任何两种或更多种彼此不同的毒素的毒素结构域I、结构域II和结构域III往往会构建出表现出错误晶体形成或完全缺乏针对优选目标昆虫害虫物种的任何可检测杀昆虫活性的蛋白质。只有通过试错法才能设计出有效的杀昆虫嵌合体,而且即使那样,熟练的技术人员最后也不一定能得到与可能得到嵌合体组成原毒素或毒素结构域的任何单个亲本毒素蛋白质相比表现出等效或改善的杀昆虫活性的嵌合体。举例来说,文献报告了由两种或更多种晶体蛋白质前体构建或组装嵌合蛋白质的众多实例。参见例如Jacqueline S.Knight等人,“A Strategy for Shuffling NumerousBacillus thuringiensis Crystal Protein Domains.”J.Economic Entomology,97(6)(2004):1805-1813;Bosch等人(美国专利No.6,204,246);Malvar和Gilmer(美国专利No.6,017,534)。在这些实例中的每一个中,所得嵌合体中有许多与得到嵌合体组分的前体蛋白质相比不能表现出等效或有所改善的杀昆虫或晶体形成性质。
发明概述
提供了编码对鳞翅类植物害虫具有毒性的嵌合杀昆虫蛋白质的重组核酸分子。每一种嵌合杀昆虫蛋白质都可以单独使用或与彼此以及与其他杀昆虫蛋白质和昆虫抑制剂组合于制剂和活体中;从而提供农业系统中目前使用的杀昆虫蛋白质和杀昆虫化学物质的替代物。
在某些实施方案中,本文中公开了一种嵌合杀昆虫蛋白质,其包含如SEQ ID NO:21、10、28、7、4、13、16、19、23、25、30、33、36、39、41、43、45、47、50、53、55、57、59、61、63、65、67、69、71、73、75、77、79、81、83、85、87、89、91、93、95、97、99、101、103、105、107、109或111中的任意者中所示出的氨基酸序列。这种嵌合杀昆虫蛋白质对鳞翅目昆虫物种表现出抑制活性,所述鳞翅目昆虫物种诸如但不限于黎豆夜蛾、小蔗螟、南美玉米苗斑螟、玉米穗夜蛾、烟芽夜蛾、大豆夜蛾、考斯夜蛾、亚热带粘虫、草地贪夜蛾、甜菜夜蛾、棉铃实夜蛾、斜纹夜蛾、棉红铃虫、西南玉米螟、翠纹金刚钻、南美棉铃虫和薄荷灰夜蛾。
在另一个实施方案中,公开了一种编码嵌合杀昆虫蛋白质的多核苷酸,其中所述多核苷酸可操作地连接至异源启动子,并且所述嵌合杀昆虫蛋白质包含如SEQ ID NO:21、10、28、7、4、13、16、19、23、25、30、33、36、39、41、43、45、47、50、53、55、57、59、61、63、65、67、69、71、73、75、77、79、81、83、85、87、89、91、93、95、97、99、101、103、105、107、109或111中的任意者中所示出的氨基酸序列。还设想了一种编码嵌合杀昆虫蛋白质的多核苷酸,其中所述多核苷酸包含的核苷酸序列任选地:在严格条件下与如SEQ ID NO:1、2、3、5、6、8、9、11、12、14、15、17、18、20、22、24、26、27、29、31、32、34、35、37、38、40、42、44、46、48、49、51、52、54、56、58、60、62、64、66、68、70、72、74、76、78、80、82、84、86、88、90、92、94、96、98、100、102、104、106、108、110、112、113、114、115、116、117、118、119、120、121、122、123、124、125、126、127、128、129或130中的任意者中所示出的多核苷酸序列的反向互补序列杂交;或编码包含如SEQ ID NO:21、10、28、7、4、13、16、19、23、25、30、33、36、39、41、43、45、47、50、53、55、57、59、61、63、65、67、69、71、73、75、77、79、81、83、85、87、89、91、93、95、97、99、101、103、105、107、109或111中的任意者中所示出的氨基酸序列的嵌合杀昆虫蛋白质。
在其他实施方案中,本文中公开了一种宿主细胞,其包含SEQ ID NO:1、2、3、5、6、8、9、11、12、14、15、17、18、20、22、24、26、27、29、31、32、34、35、37、38、40、42、44、46、48、49、51、52、54、56、58、60、62、64、66、68、70、72、74、76、78、80、82、84、86、88、90、92、94、96、98、100、102、104、106、108、110、112、113、114、115、116、117、118、119、120、121、122、123、124、125、126、127、128、129或130中的任意者中所示出的多核苷酸,其中所述宿主细胞是选自由细菌宿主细胞或植物宿主细胞组成的群组。所设想的细菌宿主包括土壤杆菌、根瘤菌、芽孢杆菌、短芽孢杆菌、埃希氏杆菌、假单胞菌、克雷伯氏杆菌和欧文氏菌;并且其中所述芽孢杆菌属是蜡样芽胞杆菌或苏云金芽孢杆菌,所述短芽孢杆菌是侧孢短芽孢杆菌,且所述埃希氏杆菌是大肠埃希氏杆菌。所设想的植物细胞包括单子叶植物和双子叶植物。
本文中所公开的其他实施方案包括昆虫抑制性组合物,其包含有包含如SEQ IDNO:21、10、28、7、4、13、16、19、23、25、30、33、36、39、41、43、45、47、50、53、55、57、59、61、63、65、67、69、71、73、75、77、79、81、83、85、87、89、91、93、95、97、99、101、103、105、107、109或111中的任意者中所示出的氨基酸序列的嵌合杀昆虫蛋白质。在某些实施方案中,所述昆虫抑制性组合物还包含与所述嵌合杀昆虫蛋白质不同的至少一种昆虫抑制剂。所设想的与所述嵌合杀昆虫蛋白质不同的昆虫抑制剂包括昆虫抑制性蛋白质、昆虫抑制性dsRNA分子和昆虫抑制性化学物质。这些与所述嵌合杀昆虫蛋白质不同的昆虫抑制剂可以对鳞翅目、鞘翅目、半翅目、同翅目或缨翅目的一种或多种害虫物种表现出活性。
在又另一个实施方案中,本文中公开了一种种子,其包含昆虫抑制有效量的以下各物:包含如SEQ ID NO:21、10、28、7、4、13、16、19、23、25、30、33、36、39、41、43、45、47、50、53、55、57、59、61、63、65、67、69、71、73、75、77、79、81、83、85、87、89、91、93、95、97、99、101、103、105、107、109或111中的任意者中所示出的氨基酸序列的嵌合杀昆虫蛋白质;或SEQ IDNO:1、2、3、5、6、8、9、11、12、14、15、17、18、20、22、24、26、27、29、31、32、34、35、37、38、40、42、44、46、48、49、51、52、54、56、58、60、62、64、66、68、70、72、74、76、78、80、82、84、86、88、90、92、94、96、98、100、102、104、106、108、110、112、113、114、115、116、117、118、119、120、121、122、123、124、125、126、127、128、129或130中的任意者中所示出的多核苷酸。
还设想了一种防治鳞翅目害虫的方法,所述方法包括使所述鳞翅目害虫与抑制量的本发明嵌合杀昆虫蛋白质接触。
在另一个实施方案中,本文中公开了一种转基因植物细胞、植物或植物部分,其包含嵌合杀昆虫蛋白质,其中:所述嵌合杀昆虫蛋白质包含如SEQ ID NO:21、10、28、7、4、13、16、19、23、25、30、33、36、39、41、43、45、47、50、53、55、57、59、61、63、65、67、69、71、73、75、77、79、81、83、85、87、89、91、93、95、97、99、101、103、105、107、109或111中的任意者中所示出的任意氨基酸序列;或所述嵌合杀昆虫蛋白质包含如下蛋白质:与SEQ ID NO:21、10具有至少94%同一性;与SEQ ID NO:28具有至少93%同一性;与SEQ ID NO:7具有至少87%同一性;与SEQ ID NO:4具有至少90%同一性;与SEQ ID NO:13具有至少91%同一性;与SEQ IDNO:16具有至少64%同一性;与SEQ ID NO:19具有至少66%同一性;与SEQ ID NO:23具有至少86%同一性;与SEQ ID NO:25具有至少91%同一性;与SEQ ID NO:30具有至少94%同一性;与SEQ ID NO:33具有至少91%同一性;与SEQ ID NO:36具有至少64%同一性;与SEQ IDNO:39具有至少66%同一性;与SEQ ID NO:41具有至少94%同一性;与SEQ ID NO:43具有至少84%同一性;与SEQ ID NO:45具有至少93%同一性;与SEQ ID NO:47具有至少94%同一性;与SEQ ID NO:50具有至少91%同一性;或与SEQ ID NO:53具有至少93%同一性;或与SEQ ID NO:85、93、105具有至少87%同一性;或与SEQ ID NO:55、57、59、61、63、65、67、69、71、73、75、77、79具有至少85%同一性;或与SEQ ID NO:91、87、89具有至少88%同一性;或与SEQ ID NO:107、111具有至少89%同一性;或与SEQ ID NO:97具有至少90%同一性;与SEQ ID NO:109具有至少91%同一性;或与SEQ ID NO:83具有至少93%同一性;或与SEQ IDNO:91或103具有至少94%同一性;或与SEQ ID NO:95、101具有至少95%同一性;或与SEQID NO:99具有至少98%同一性。还设想了防治鳞翅目害虫的方法,其包括使所述害虫暴露于这种转基因植物细胞、植物或植物部分,其中所述植物细胞、植物或植物部分表达鳞翅目抑制量的所述嵌合杀昆虫蛋白质。
在本文中的其他实施方案中,提供了来源于所述植物细胞、植物或植物部分的商品产品,其中所述产品包含可检测量的所述嵌合杀昆虫蛋白质。所设想的商品产品包括植物生物质、油、膳食、动物饲料、面粉、薄片、糠、棉绒、外壳和经过处理的种子。
本文中所公开的又另一种方法是一种生产包含嵌合杀昆虫蛋白质的种子的方法,所述方法包括:种植至少一个包含嵌合杀昆虫蛋白质的种子;从所述种子长出植物;和从所述植物收获种子,其中所述收获的种子包含所述嵌合杀昆虫蛋白质。
本文中还设想了编码嵌合杀昆虫蛋白质的重组多核苷酸分子,其包含选自由以下各项组成的群组的核苷酸序列:1、2、3、5、6、8、9、11、12、14、15、17、18、20、22、24、26、27、29、31、32、34、35、37、38、40、42、44、46、48、49、51、52、54、56、58、60、62、64、66、68、70、72、74、76、78、80、82、84、86、88、90、92、94、96、98、100、102、104、106、108、110、112、113、114、115、116、117、118、119、120、121、122、123、124、125、126、127、128、129或130;和任选地,编码与所述嵌合杀昆虫蛋白质不同的昆虫抑制剂的多核苷酸序列。
本文中所设想的另一种重组核酸分子包含可操作地连接至编码嵌合杀昆虫蛋白质的多核苷酸片段的异源启动子,其中:所述嵌合杀昆虫蛋白质包含如SEQ ID NO:21、10、28、7、4、13、16、19、23、25、30、33、36、39、41、43、45、47、50、53、55、57、59、61、63、65、67、69、71、73、75、77、79、81、83、85、87、89、91、93、95、97、99、101、103、105、107、109或111中的任意者中所示出的任意氨基酸序列;或所述嵌合杀昆虫蛋白质包含如下蛋白质:与SEQ ID NO:21、10具有至少94%同一性;与SEQ ID NO:28具有至少93%同一性;与SEQ ID NO:7具有至少87%同一性;与SEQ ID NO:4具有至少90%同一性;与SEQ ID NO:13具有至少91%同一性;与SEQ ID NO:16具有至少64%同一性;与SEQ IDNO:19具有至少66%同一性;与SEQ IDNO:23具有至少86%同一性;与SEQ ID NO:25具有至少91%同一性;与SEQ ID NO:30具有至少94%同一性;与SEQ ID NO:33具有至少91%同一性;与SEQ ID NO:36具有至少64%同一性;与SEQ ID NO:39具有至少66%同一性;与SEQ ID NO:41具有至少94%同一性;与SEQ IDNO:43具有至少84%同一性;与SEQ ID NO:45具有至少93%同一性;与SEQ ID NO:47具有至少94%同一性;与SEQ ID NO:50具有至少91%同一性;或与SEQ ID NO:53具有至少93%同一性;或与SEQ ID NO:85、93、105具有87%同一性;或与SEQ ID NO:55、57、59、61、63、65、67、69、71、73、75、77、79具有至少85%同一性;或与SEQ ID NO:91、87、89具有至少88%同一性;或与SEQ ID NO:107、111具有至少89%同一性;或与SEQ ID NO:97具有至少90%同一性;或与SEQ ID NO:109具有至少91%同一性;或与SEQ ID NO:83具有至少93%同一性;或与SEQ ID NO:91、103具有至少94%同一性;或与SEQ ID NO:95、101具有至少95%同一性;或与SEQ ID NO:99具有至少98%同一性;或者所述多核苷酸片段与具有如SEQ ID NO:1、2、3、5、6、8、9、11、12、14、15、17、18、20、22、24、26、27、29、31、32、34、35、37、38、40、42、44、46、48、49、51、52、54、56、58、60、62、64、66、68、70、72、74、76、78、80、82、84、86、88、90、92、94、96、98、100、102、104、106、108、110、112、113、114、115、116、117、118、119、120、121、122、123、124、125、126、127、128、129或130中的任意者中所示出的核苷酸序列的多核苷酸杂交。
本发明的其他实施方案、特征和优点将从以下详细描述、实施例和权利要求书中显而易见。
序列简述
SEQ ID NO:1是用于在细菌细胞中表达的编码TIC1100的重组DNA序列。
SEQ ID NO:2是用于在植物细胞中表达的编码TIC1100的合成DNA序列。
SEQ ID NO:3是用于在植物细胞中表达的编码TIC1100的合成DNA序列。
SEQ ID NO:4是TIC1100的氨基酸序列。
SEQ ID NO:5是用于在细菌细胞中表达的编码TIC860的重组DNA序列。
SEQ ID NO:6是用于在植物细胞中表达的编码TIC860的合成DNA序列。
SEQ ID NO:7是TIC860的氨基酸序列。
SEQ ID NO:8是用于在细菌细胞中表达的编码TIC867的重组DNA序列。
SEQ ID NO:9是用于在植物细胞中表达的编码TIC867的合成DNA序列。
SEQ ID NO:10是TIC867的氨基酸序列。
SEQ ID NO:11是用于在细菌细胞中表达的编码TIC867_20的重组DNA序列。
SEQ ID NO:12是用于在植物细胞中表达的编码TIC867_20的合成DNA序列。
SEQ ID NO:13是TIC867_20的氨基酸序列。
SEQ ID NO:14是用于在细菌细胞中表达的编码TIC867_21的重组DNA序列。
SEQ ID NO:15是用于在植物细胞中表达的编码TIC867_21的合成DNA序列。
SEQ ID NO:16是TIC867_21的氨基酸序列。
SEQ ID NO:17是用于在细菌细胞中表达的编码TIC867_22的重组DNA序列。
SEQ ID NO:18是用于在植物细胞中表达的编码TIC867_22的合成DNA序列。
SEQ ID NO:19是TIC867_22的氨基酸序列。
SEQ ID NO:20是用于在植物细胞中表达的编码TIC867_23的合成DNA序列。
SEQ ID NO:21是TIC867_23的氨基酸序列。
SEQ ID NO:22是用于在植物细胞中表达的编码TIC867_24的合成DNA序列。
SEQ ID NO:23是TIC867_24的氨基酸序列。
SEQ ID NO:24是用于在植物细胞中表达的编码TIC867_24的合成DNA序列。
SEQ ID NO:25是TIC867_25的氨基酸序列。
SEQ ID NO:26是用于在细菌细胞中表达的编码TIC868的重组DNA序列。
SEQ ID NO:27是用于在植物细胞中表达的编码TIC868的合成DNA序列。
SEQ ID NO:28是TIC868的氨基酸序列。
SEQ ID NO:29是用于在植物细胞中表达的编码TIC868_9的合成DNA序列。
SEQ ID NO:30是TIC868_9的氨基酸序列。
SEQ ID NO:31是用于在细菌细胞中表达的编码TIC868_10的重组DNA序列。
SEQ ID NO:32是用于在植物细胞中表达的编码TIC868变异体TIC868_10的合成DNA序列。
SEQ ID NO:33是TIC868_10的氨基酸序列。
SEQ ID NO:34是用于在细菌细胞中表达的编码TIC868_11的重组DNA序列。
SEQ ID NO:35是用于在植物细胞中表达的编码TIC868_11的合成DNA序列。
SEQ ID NO:36是TIC868_11的氨基酸序列。
SEQ ID NO:37是用于在细菌细胞中表达的编码TIC868_12的重组DNA序列。
SEQ ID NO:38是用于在植物细胞中表达的编码TIC868_12的合成DNA序列。
SEQ ID NO:39是TIC868_12的氨基酸序列。
SEQ ID NO:40是用于在植物细胞中表达的编码TIC868_13的合成DNA序列。
SEQ ID NO:41是TIC868_13的氨基酸序列。
SEQ ID NO:42是用于在植物细胞中表达的编码TIC868_14的合成DNA序列。
SEQ ID NO:43是TIC868_14的氨基酸序列。
SEQ ID NO:44是用于在植物细胞中表达的编码TIC868_15的合成DNA序列。
SEQ ID NO:45是TIC868_15的氨基酸序列。
SEQ ID NO:46是用于在植物细胞中表达的编码TIC868_29的合成DNA序列。
SEQ ID NO:47是TIC868_29的氨基酸序列。
SEQ ID NO:48是用于在细菌细胞中表达的编码TIC869的重组DNA序列。
SEQ ID NO:49是用于在植物细胞中表达的编码TIC869的合成DNA序列。
SEQ ID NO:50是TIC869的氨基酸序列。
SEQ ID NO:51是用于在细菌细胞中表达的编码TIC836的重组DNA序列。
SEQ ID NO:52是用于在植物细胞中表达的编码TIC836的合成DNA序列。
SEQ ID NO:53是TIC836的氨基酸序列。
SEQ ID NO:54是编码嵌合TIC713氨基酸序列的DNA序列。
SEQ ID NO:55是从SEQ ID NO:54中所示出的开放阅读框翻译而来的TIC713氨基酸序列。
SEQ ID NO:56是编码嵌合TIC843氨基酸序列的DNA序列。
SEQ ID NO:57是从SEQ ID NO:56中所示出的开放阅读框翻译而来的TIC843氨基酸序列。
SEQ ID NO:58是编码嵌合TIC862氨基酸序列的DNA序列。
SEQ ID NO:59是从SEQ ID NO:58中所示出的开放阅读框翻译而来的TIC862氨基酸序列。
SEQ ID NO:60是编码嵌合TIC1099氨基酸序列的DNA序列。
SEQ ID NO:61是从SEQ ID NO:60中所示出的开放阅读框翻译而来的TIC1099氨基酸序列。
SEQ ID NO:62是编码嵌合TIC1099-T507E氨基酸序列的DNA序列。
SEQ ID NO:63是从SEQ ID NO:62中所示出的开放阅读框翻译而来的TIC1099-T507E氨基酸序列。
SEQ ID NO:64是编码嵌合TIC1099-R522K氨基酸序列的DNA序列。
SEQ ID NO:65是从SEQ ID NO:64中所示出的开放阅读框翻译而来的TIC1099-R522K氨基酸序列。
SEQ ID NO:66是编码嵌合TIC1099-K490S氨基酸序列的DNA序列。
SEQ ID NO:67是从SEQ ID NO:66中所示出的开放阅读框翻译而来的TIC1099-K490S氨基酸序列。
SEQ ID NO:68是编码嵌合TIC1099-T562R氨基酸序列的DNA序列。
SEQ ID NO:69是从SEQ ID NO:68中所示出的开放阅读框翻译而来的TIC1099-T562R氨基酸序列。
SEQ ID NO:70是编码嵌合TIC1099-S553R氨基酸序列的DNA序列。
SEQ ID NO:71是从SEQ ID NO:70中所示出的开放阅读框翻译而来的TIC1099-S553R氨基酸序列。
SEQ ID NO:72是编码嵌合TIC1099-G498D氨基酸序列的DNA序列。
SEQ ID NO:73是从SEQ ID NO:72中所示出的开放阅读框翻译而来的TIC1099-G498D氨基酸序列。
SEQ ID NO:74是编码嵌合TIC1099-K490A氨基酸序列的DNA序列。
SEQ ID NO:75是从SEQ ID NO:74中所示出的开放阅读框翻译而来的TIC1099-K490A氨基酸序列。
SEQ ID NO:76是编码嵌合TIC1099-E564A氨基酸序列的DNA序列。
SEQ ID NO:77是从SEQ ID NO:76中所示出的开放阅读框翻译而来的TIC1099-E564A氨基酸序列。
SEQ ID NO:78是编码嵌合TIC1103氨基酸序列的DNA序列。
SEQ ID NO:79是从SEQ ID NO:78中所示出的开放阅读框翻译而来的TIC1103氨基酸序列。
SEQ ID NO:80是编码嵌合TIC1101氨基酸序列的DNA序列。
SEQ ID NO:81是从SEQ ID NO:80中所示出的开放阅读框翻译而来的TIC1101氨基酸序列。
SEQ ID NO:82是编码嵌合TIC845氨基酸序列的DNA序列。
SEQ ID NO:83是从SEQ ID NO:82中所示出的开放阅读框翻译而来的TIC845氨基酸序列。
SEQ ID NO:84是编码嵌合TIC846氨基酸序列的DNA序列。
SEQ ID NO:85是从SEQ ID NO:84中所示出的开放阅读框翻译而来的TIC846氨基酸序列。
SEQ ID NO:86是编码嵌合TIC858氨基酸序列的DNA序列。
SEQ ID NO:87是从SEQ ID NO:86中所示出的开放阅读框翻译而来的TIC858氨基酸序列。
SEQ ID NO:88是编码嵌合TIC865氨基酸序列的DNA序列。
SEQ ID NO:89是从SEQ ID NO:88中所示出的开放阅读框翻译而来的TIC865氨基酸序列。
SEQ ID NO:90是编码嵌合TIC866氨基酸序列的DNA序列。
SEQ ID NO:91是从SEQ ID NO:90中所示出的开放阅读框翻译而来的TIC866氨基酸序列。
SEQ ID NO:92是编码嵌合TIC838氨基酸序列的DNA序列。
SEQ ID NO:93是从SEQ ID NO:92中所示出的开放阅读框翻译而来的TIC838氨基酸序列。
SEQ ID NO:94是编码嵌合TIC839氨基酸序列的DNA序列。
SEQ ID NO:95是从SEQ ID NO:94中所示出的开放阅读框翻译而来的TIC839氨基酸序列。
SEQ ID NO:96是编码嵌合TIC841氨基酸序列的DNA序列。
SEQ ID NO:97是从SEQ ID NO:96中所示出的开放阅读框翻译而来的TIC841氨基酸序列。
SEQ ID NO:98是编码嵌合TIC842氨基酸序列的DNA序列。
SEQ ID NO:99是从SEQ ID NO:98中所示出的开放阅读框翻译而来的TIC842氨基酸序列。
SEQ ID NO:100是编码嵌合TIC850氨基酸序列的DNA序列。
SEQ ID NO:101是从SEQ ID NO:100中所示出的开放阅读框翻译而来的TIC850氨基酸序列。
SEQ ID NO:102是编码嵌合TIC859氨基酸序列的DNA序列。
SEQ ID NO:103是从SEQ ID NO:102中所示出的开放阅读框翻译而来的TIC859氨基酸序列。
SEQ ID NO:104是编码嵌合TIC861氨基酸序列的DNA序列。
SEQ ID NO:105是从SEQ ID NO:104中所示出的开放阅读框翻译而来的TIC861氨基酸序列。
SEQ ID NO:106是编码嵌合TIC848氨基酸序列的DNA序列。
SEQ ID NO:107是从SEQ ID NO:106中所示出的开放阅读框翻译而来的TIC848氨基酸序列。
SEQ ID NO:108是编码嵌合TIC849氨基酸序列的DNA序列。
SEQ ID NO:109是从SEQ ID NO:108中所示出的开放阅读框翻译而来的TIC849氨基酸序列。
SEQ ID NO:110是编码嵌合TIC847氨基酸序列的DNA序列。
SEQ ID NO:111是从SEQ ID NO:110中所示出的开放阅读框翻译而来的TIC847氨基酸序列。
SEQ ID NO:112是用于在植物细胞中表达的编码TIC713的合成DNA序列。
SEQ ID NO:113是用于在植物细胞中表达的编码TIC713的合成DNA序列。
SEQ ID NO:114是用于在植物细胞中表达的编码TIC843的合成DNA序列。
SEQ ID NO:115是用于在植物细胞中表达的编码TIC862的合成DNA序列。
SEQ ID NO:116是用于在植物细胞中表达的编码TIC1099的合成DNA序列。
SEQ ID NO:117是用于在植物细胞中表达的编码TIC1103的合成DNA序列。
SEQ ID NO:118是用于在植物细胞中表达的编码TIC845的合成DNA序列。
SEQ ID NO:119是用于在植物细胞中表达的编码TIC846的合成DNA序列。
SEQ ID NO:120是用于在植物细胞中表达的编码TIC858的合成DNA序列。
SEQ ID NO:121是用于在植物细胞中表达的编码TIC866的合成DNA序列。
SEQ ID NO:122是用于在植物细胞中表达的编码TIC838的合成DNA序列。
SEQ ID NO:123是用于在植物细胞中表达的编码TIC841的合成DNA序列。
SEQ ID NO:124是用于在植物细胞中表达的编码TIC842的合成DNA序列。
SEQ ID NO:125是用于在植物细胞中表达的编码TIC850的合成DNA序列。
SEQ ID NO:126是用于在植物细胞中表达的编码TIC859的合成DNA序列。
SEQ ID NO:127是用于在植物细胞中表达的编码TIC861的合成DNA序列。
SEQ ID NO:128是用于在植物细胞中表达的编码TIC848的合成DNA序列。
SEQ ID NO:129是用于在植物细胞中表达的编码TIC849的合成DNA序列。
SEQ ID NO:130是用于在植物细胞中表达的编码TIC847的合成DNA序列。
发明详述
农业害虫防治领域中的问题的特征是需要对目标害虫有效果、对目标害虫物种表现出广谱毒性、能够在植物中表达而不会引起不希望的农艺学问题并且与市面上用于植物的当前毒素相比可提供替代作用模式的新杀昆虫蛋白质。本文中公开了新型嵌合杀昆虫蛋白质,并且解决了这些需要中的每一个,特别是对抗较宽范围的鳞翅目昆虫害虫。
为了避免发展或规避昆虫对目前使用的杀昆虫蛋白质的抗性,需要具有不同的作用模式(MOA)以及广谱性和功效的新杀昆虫蛋白质用于鳞翅目防治。解决这种需要的一种方式是从不同的生物学来源,优选地从细菌、真菌或植物中发现新的杀昆虫蛋白质。另一种方法是在表现出结构相似性的各种Bt蛋白质之间互换片段以产生具有昆虫抑制性质的新嵌合Bt蛋白质。本领域中已知由对本领域中已知的众多天然杀昆虫晶体蛋白质的结构域结构进行再拣选来产生具有增强的嵌合蛋白质的可能性微乎其微。参见例如JacquelineS.Knight等人,“A Strategy for Shuffling Numerous Bacillus thuringiensisCrystal Protein Domains.”J.Economic Entomology,97(6)(2004):1805-1813。
本文中公开了编码新型嵌合杀昆虫蛋白质的重组核酸分子序列。这些杀昆虫蛋白质解决了本领域中持续需要工程改造其他毒性杀昆虫蛋白质以具有改善的杀昆虫性质(诸如对较宽范围的目标昆虫害虫物种的功效有所增加)和不同的作用模式。这组蛋白质的成员,包括本文中所公开的示例性蛋白质,对鳞翅目昆虫害虫物种表现出杀昆虫活性。
术语“片段(segment/fragment)”在本申请中用于描述比描述所公开的嵌合杀昆虫蛋白质的完整氨基酸或核酸序列短的连续氨基酸或核酸序列。本申请中还公开了表现出昆虫抑制活性的片段,如果将所述片段与所述嵌合杀昆虫蛋白质的相应部分比对,那么将得到在所述片段与所述嵌合杀昆虫蛋白质的相应部分之间存在从约65%至约100%的任何分数百分比的氨基酸序列同一性。
在本申请中提及术语“活性的”或“活性”、“杀虫活性”或“杀虫的”,或者“杀昆虫活性”、“昆虫抑制性”或“杀昆虫的”是指毒性剂,诸如杀昆虫蛋白质,在抑制(抑制生长、摄食、繁殖力或存活力)、遏制(遏制生长、摄食、繁殖力或存活力)、控制(控制害虫侵扰、控制含有有效量的杀昆虫蛋白质的特定作物上的害虫摄食活动)或杀死(引起发病、死亡或降低繁殖力)害虫方面的功效。这些术语意图包括对害虫提供杀虫有效量的杀昆虫蛋白质的结果,其中使所述害虫暴露于所述杀昆虫蛋白质将导致发病、死亡、繁殖力降低或发育迟缓。这些术语还包括由于在植物中或在植物上提供杀虫有效量的杀昆虫蛋白质而将害虫从植物、植物组织、植物部分、种子、植物细胞或从植物可能生长的特定地理位置中驱离。一般来说,杀虫活性是指杀昆虫蛋白质在抑制特定目标害虫(包括但不限于鳞翅目昆虫)的生长、发育、存活力、摄食行为、交配行为、繁殖力方面有效或对由以这种蛋白质、蛋白质片段、蛋白质片段或多核苷酸为食的昆虫引起的不利作用产生任何可测量的减少的能力。所述杀昆虫蛋白质可以由植物产生,或可以被施用至所述植物或施用至所述植物所处位置内的环境。术语“生物活性”、“有效的”、“有效果的”或其变化形式也是本申请中可互换用于描述本发明的嵌合杀昆虫蛋白质对目标昆虫害虫的作用的术语。
当提供于目标害虫的食物中时,杀虫有效量的毒性剂在所述毒性剂接触所述害虫时表现出杀虫活性。毒性剂可以是杀昆虫蛋白质或本领域中已知的一种或多种化学试剂。杀昆虫化学试剂和杀昆虫蛋白质试剂可以单独使用或与彼此组合使用。化学试剂包括但不限于靶向目标害虫中的特定基因以便遏制的dsRNA分子、有机氯化物、有机磷酸酯、氨基甲酸酯、拟除虫菊酯、新烟碱和莱恩碱。杀昆虫蛋白质试剂包括本申请中示出的嵌合杀昆虫蛋白质,以及其他蛋白质毒性剂,包括靶向鳞翅目害虫物种的那些,以及用于防治其他植物害虫的蛋白质毒素,诸如本领域中可用于防治鞘翅目、缨翅目、半翅目和同翅目物种的Cry蛋白质。
意图提及害虫、特别是作物植物的害虫意指作物植物的昆虫害虫,特别是通过所公开的嵌合杀昆虫蛋白质加以防治的那些鳞翅目昆虫害虫。然而,当靶向这些害虫的毒性剂与所述嵌合杀昆虫蛋白质或同所述嵌合杀昆虫蛋白质具有65%至约100%同一性的蛋白质共处或共存时,提及害虫还可以包括植物的鞘翅目、半翅目和同翅目昆虫害虫以及线虫和真菌。
本文中所公开的嵌合杀昆虫蛋白质对鳞翅目昆虫物种的昆虫害虫(包括成虫、蛹、幼虫和初孵幼虫)以及半翅目昆虫物种(包括成虫和若虫)表现出杀昆虫活性。鳞翅目昆虫包括但不限于夜蛾科的粘虫、切根虫、尺蠖和夜蛾,例如草地粘虫(草地贪夜蛾)、甜菜粘虫(甜菜夜蛾)、披肩粘虫(蓓带夜蛾)、黑切根虫(球菜夜蛾)、甘蓝尺蠖(粉纹夜蛾)、大豆尺蠖(大豆夜蛾)、黎豆毛虫(黎豆夜蛾)、绿啃叶虫(苜蓿绿夜蛾)、烟草芽虫(烟芽夜蛾)、颗粒切根虫(黄地老虎)、粘虫(一星粘虫)、西方切根虫(灰地老虎);螟蛾科的钻心虫、鞘蛾、结网虫、球果虫、甘蓝虫和雕叶虫,例如,欧洲玉米钻心虫(欧洲玉米螟)、脐橙虫(脐橙螟)、玉米根结网虫(玉米根网螟)、草地结网虫(水稻切叶野螟)、向日葵蛾(向日葵螟)、小玉米茎钻心虫(南美玉米苗斑螟);卷蛾科的卷叶虫、芽虫、种虫和果虫,例如,苹果蛾(苹果蠹蛾)、葡萄果实蛾(葡萄小食心虫)、东方果蛾(梨小食心虫)、向日葵芽小卷蛾(向日葵芽卷叶蛾);和许多其他经济上重要的鳞翅目昆虫,例如钻背蛾(小菜蛾)、粉红棉铃虫(棉红铃虫)和吉普赛蛾(舞毒蛾)。其他鳞翅目昆虫害虫包括例如棉叶波纹夜蛾(棉叶虫)、果树黄卷蛾(果树卷叶虫)、玫瑰黄卷蛾(欧洲卷叶虫)和其他黄卷蛾属、二化螟(亚洲水稻钻心虫或水稻钻茎虫)、稻纵卷叶螟(水稻卷叶虫)、玉米根网螟(玉米根结网虫)、早熟禾草暝(蓝草结网虫)、西南玉米螟(西南玉米钻心虫)、小蔗螟(蔗螟)、埃及金刚钻(多刺棉铃虫)、翠纹金刚钻(斑点棉铃虫)、棉铃实夜蛾(美洲棉铃虫)、玉米穗夜蛾(玉米穗虫或棉铃虫)、烟芽夜蛾(烟草芽虫)、水稻切叶野螟(草地结网虫)、欧洲葡萄缀穗蛾(欧洲葡萄蛾)、柑桔潜叶蛾(柑桔潜叶虫)、大菜粉蛾(大白蝴蝶)、小菜粉蛾(菜青虫或小白蝴蝶)、小菜蛾(钻背蛾)、甜菜夜蛾(甜菜粘虫)、斜纹夜蛾(烟草切根虫、茶蚕)和番茄潜叶蛾(番茄潜叶虫)。
本申请中提及“分离的DNA分子”或者等效术语或短语意图意指所述DNA分子是单独或与其他组合物组合存在但不在其天然环境内的DNA分子。举例来说,只要生物体基因组DNA内天然存在的核酸元件,诸如编码序列、内含子序列、非翻译前导序列、启动子序列、转录终止序列等在所述生物体基因组内且在天然存在其的基因组内位置上,所述元件就不被认为是“分离的”。然而,只要这些元件中的每一个和这些元件的子部分不是在所述生物体基因组内且在天然存在其的基因组内位置上,所述元件在本公开的范围内就将是“分离的”。类似地,只要编码杀昆虫蛋白质或所述蛋白质的任何天然存在的杀昆虫变体的核苷酸序列不在从中天然存在编码所述蛋白质的序列的细菌DNA内,所述核苷酸序列就将是分离的核苷酸序列。出于本公开的目的,编码天然存在的杀昆虫蛋白质的氨基酸序列的合成核苷酸序列将被认为是分离的。出于本公开的目的,任何转基因核苷酸序列,即插入植物或细菌细胞基因组中或存在于染色体外载体中的DNA核苷酸序列将被认为是分离的核苷酸序列,无论其存在于质粒内或用于转化所述细胞的类似结构内、所述植物或细菌的基因组内,还是以可检测的量存在于来源于所述植物或细菌的组织、子代、生物样品或商品产品中。
如实施例中进一步描述,通过嵌合工作,由已知杀昆虫毒素(在本文中被称为“亲本蛋白质”的原毒素和毒素结构域构建了约八百四十四(844)个编码嵌合杀昆虫蛋白质的核苷酸序列,并且加以表达且在生物测定中测试鳞翅目活性。与从中得到其毒素组分的亲本蛋白质相比,所构建的嵌合杀昆虫蛋白质中少数表现出提高的鳞翅目活性或增大的鳞翅目谱。
这些具有提高的鳞翅目活性或增大的鳞翅目谱的新型嵌合杀昆虫蛋白质是由以下杀昆虫亲本蛋白质原毒素和毒素结构域构建:Cry1Ah(结构域I)、Cry1Bb1(结构域I和II)、Cry1Be2(结构域I和II)、Cry1Ja1(结构域I和II)、Cry1Fa1(结构域I和II)、Cry1Ac(结构域II和结构域原毒素)、Cry1Ca(结构域III和结构域原毒素)、Cry1Ka(结构域III和结构域原毒素)、Cry1Jx(结构域III)、Cry1Ab(结构域III)、Cry1Ab3(原毒素)、Cry1Da1(原毒素)、Cry4(原毒素)、Cry9(原毒素)、Cry1Be(原毒素),和结构域Cry1Ka(原毒素)。
具体来说,本发明的具有提高的鳞翅目活性或增大的鳞翅目谱的新型嵌合杀昆虫蛋白质包含以下原毒素与结构域组合:TIC1100/SEQ ID NO:4(结构域I-Cry1Ah、结构域II-Cry1Ac、结构域III-Cry1Ca、原毒素-Cry1Ac)、TIC860/SEQ ID NO:7(结构域I-Cry1Bb1、结构域II-Cry1BB1、结构域III-Cry1Ca、原毒素-Cry1Ac)、TIC867/SEQ ID NO:10(结构域I-Cry1Be2、结构域II-Cry1Be2、结构域III-Cry1Ka、原毒素-Cry1Ab3)、TIC868/SEQ ID NO:28(结构域I-Cry1Be2、结构域II-Cry1Be2和结构域III-Cry1Ca、原毒素-Cry1Ab3)、TIC869/SEQ ID NO:50(结构域I-Cry1Ja1、结构域II-Cry1Ja1、结构域III-Cry1Jx、原毒素-Cry1Ab3)和TIC836/SEQ ID NO:53(结构域I-Cry1Fa1、结构域II-Cry1Fa1、结构域III-Cry1Ab、原毒素-Cry1Ac)。
对于嵌合杀昆虫蛋白质TIC867和TIC868,还构建了引入氨基酸取代或替代原毒素结构域的变体。具体来说,TIC867和TIC868的这些变体包含以下氨基酸取代或替代原毒素结构域:TIC867_20/SEQ ID NO:13(替代原毒素结构域Cry1Da1)、TIC867_21/SEQ ID NO:16(替代原毒素结构域Cry4)、TIC867_22/SEQ ID NO:19(替代原毒素结构域Cry9)、TIC867_23/SEQ ID NO:21(替代原毒素结构域Cry1Be)、TIC867_24/SEQ ID NO:23(替代原毒素结构域Cry1Ka)、TIC867_25/SEQ ID NO:25(替代原毒素结构域Cry1Ka)、TIC868_9/SEQ ID NO:30(氨基酸修饰N240S_Y343Q_N349T)、TIC868_10/SEQ ID NO:33(替代原毒素结构域Cry1Da1)、TIC868_11/SEQ ID NO:36(替代原毒素结构域Cry4)、TIC868_12/SEQ ID NO:39(替代原毒素结构域Cry 9),TIC868_13/SEQ ID NO:41(替代原毒素结构域Cry1Be)、TIC868_14/SEQ ID NO:43(替代原毒素结构域Cry1Ka)、TIC868_15/SEQ ID NO:45(替代原毒素结构域Cry1Ca)和TIC868_29/SEQ ID NO:47(氨基酸修饰Q136Y_Y343Q_N349T)。
如实施例中所显示,这些TIC867和TIC868变体中的每一种都能改变亲本嵌合杀昆虫蛋白质的鳞翅目活性和/或减小鳞翅目活性谱,因此表明替代原毒素结构域和氨基酸取代对嵌合杀昆虫蛋白质TIC867和TIC868的杀昆虫活性和谱具有直接后果。
所述嵌合杀昆虫蛋白质中有许多种对多个鳞翅目昆虫害虫物种显示了杀昆虫活性。具体来说,本申请中所公开的新型嵌合杀昆虫蛋白质对一种或多种以下鳞翅目昆虫害虫表现出活性:黎豆毛虫(VBC,黎豆夜蛾)、蔗螟(SCB,小蔗螟)、小玉米茎钻心虫(LSCB,南美玉米苗斑螟)、玉米穗虫(CEW,玉米穗夜蛾)、大豆荚虫(SPW,玉米穗夜蛾)、棉铃虫(CBW,玉米穗夜蛾)、烟草芽虫(TBW,烟芽夜蛾)、大豆尺蠖(SBL,大豆夜蛾)、黑粘虫(BLAW,考斯夜蛾)、南方粘虫(SAW,亚热带粘虫)、草地粘虫(FAW,草地贪夜蛾)、甜菜粘虫(BAW,甜菜夜蛾)、古棉铃虫(OBW,棉铃实夜蛾)、东方叶虫(OLW,斜纹夜蛾)、粉红棉铃虫(PBW,棉红铃虫)、西南玉米钻心虫(SWCB,西南玉米螟)、斑点棉铃虫(SBW,翠纹金刚钻)、美洲棉铃虫(SABW,南美棉铃虫)和向日葵尺蠖(SFL,薄荷灰夜蛾)。因此,本申请中所描述的示例性蛋白质因常见功能而相关并且对鳞翅目昆虫物种的昆虫害虫,包括成虫、幼虫和蛹,表现出杀昆虫活性。
与所述嵌合杀昆虫蛋白质类似的蛋白质可以通过使用本领域中已知的基于计算机的算法与彼此比较来加以鉴别。举例来说,可以使用Clustal W比对,使用以下这些默认参数来分析蛋白质相对于嵌合杀昆虫蛋白质的氨基酸序列同一性:权重矩阵:blosum,空位开放罚分:10.0,空位延伸罚分:0.05,亲水性空位:On,亲水性残基:GPSNDQERK,残基特异性空位罚分:On(Thompson等人,(1994)Nucleic Acids Research,22:4673-4680)。通过100%×(氨基酸同一性/标的蛋白质的长度)的乘积来进一步计算氨基酸同一性百分比。还可利用本领域中的其他比对算法,得到与使用Clustal W比对获得的结果类似的结果,且涵盖在本申请中。
意图本申请中公开一种表现出昆虫抑制活性的查询蛋白质,在比对此类查询蛋白质与SEQ ID NO:4、7、10、13、16、19、21、23、25、28、30、33、36、39、41、43、45、47、50、53、55、57、59、61、63、65、67、69、71、73、75、77、79、81、83、85、87、89、91、93、95、97、99、101、103、105、107、109和111中所示出的标的嵌合杀昆虫蛋白质时,在所述查询蛋白质与所述标的蛋白质之间获得至少约64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或约100%氨基酸序列同一性(或此范围中的任何分数百分比)。
如本申请的实施例中进一步描述,设计编码所述嵌合杀昆虫蛋白质的合成序列或人工序列以供用于植物中。设计用于植物中的示例性合成核苷酸序列示出于SEQ ID NO:2和3(TIC1100)、SEQ ID NO:6(TIC860)、SEQ ID NO:9(TIC867)、SEQ ID NO:12(TIC867_20)、SEQ ID NO:15(TIC867_21)、SEQ ID NO:18(TIC867_22)、SEQ ID NO:20(TIC867_23)、SEQID NO:22(TIC867_24)、SEQ ID NO:24(TIC867_25)、SEQ ID NO:27(TIC868)、SEQ ID NO:29(TIC868_9)、SEQ ID NO:32(TIC868_10)、SEQ ID NO:35(TIC868_11)、SEQ ID NO:38(TIC868_12)、SEQ ID NO:40(TIC868_13)、SEQ ID NO:42(TIC868_14)、SEQ ID NO:44(TIC868_15)、SEQ ID NO:46(TIC868_29)、SEQ ID NO:49(TIC869)和SEQ ID NO:52(TIC836)、SEQ ID NO:112和113(TIC713)、SEQ ID NO:114(TIC843)、SEQ ID NO:115(TIC862)、SEQ ID NO:116(TIC1099)、SEQ ID NO:117(TIC1103)、SEQ ID NO:118(TIC845)、SEQ ID NO:119(TIC846)、SEQ ID NO:120(TIC858)、SEQ ID NO:121(TIC866)、SEQ ID NO:122(TIC838)、SEQ ID NO:123(TIC841)、SEQ ID NO:124(TIC842)、SEQ ID NO:125(TIC850)、SEQ ID NO:126(TIC859)、SEQ ID NO:127(TIC861)、SEQ ID NO:128(TIC848)、SEQ ID NO:129(TIC849)和SEQ ID NO:130(TIC847)中。
为了在植物细胞中表达,所述嵌合杀昆虫蛋白质可以被表达以存在于细胞溶质中或靶向植物细胞的各种细胞器。举例来说,使蛋白质靶向叶绿体可能导致转基因植物中的表达蛋白质的水平增加,同时防止出现脱靶表型。靶向还可能导致转基因事件中的害虫抗性功效增加。靶标肽或转运肽是指导蛋白质转运至细胞中特定区域,包括细胞核、线粒体、内质网(ER)、叶绿体、质外体、过氧化物酶体和胞质膜的短(3至70个氨基酸长)肽链。在蛋白质被转运之后通过信号肽酶使一些靶标肽从所述蛋白质上裂解。为了靶向叶绿体,蛋白质含有大约40至50个氨基酸的转运肽。关于叶绿体转运肽的使用的描述,参见美国专利No.5,188,642和5,728,925。许多位于叶绿体的蛋白质是从细胞核基因表达为前体,并且通过叶绿体转运肽(CTP)靶向叶绿体。此类分离的叶绿体蛋白质的实例包括但不限于与核酮糖-1,5-双磷酸羧化酶、铁氧化还原蛋白、铁氧化还原蛋白氧化还原酶、捕光复合体蛋白I和蛋白II、硫氧化还原蛋白F、烯醇丙酮莽草酸磷酸合酶(EPSPS)和美国专利No.7,193,133中所描述的转运肽的小亚单元(SSU)缔合的那些。已经在活体内和活体外证明,可以通过使用与异源CTP的蛋白质融合物使非叶绿体蛋白质靶向叶绿体,而且CTP足以使蛋白质靶向叶绿体。并入合适的叶绿体转运肽,诸如阿拉伯芥EPSPS CTP(CTP2)(参见Klee等人,Mol.Gen.Genet.210:437-442,1987)或矮牵牛EPSPS CTP(CTP4)(参见della-Cioppa等人,Proc.Natl.Acad.Sci.USA 83:6873-6877,1986)已经证明异源EPSPS蛋白质序列靶向转基因植物中的叶绿体(参见美国专利No.5,627,061、5,633,435和5,312,910;以及EP0218571、EP 189707、EP 508909和EP 924299)。为了使嵌合杀昆虫蛋白质靶向叶绿体,相对于已经设计的用于在植物细胞中进行最佳表达的编码嵌合杀昆虫蛋白质的合成编码序列,将编码叶绿体转运肽的序列的放在5'处的可操作键联中和框架中。
根据本领域中已知的转化方法和技术来构建含有这些合成或人工核苷酸序列的表达盒和载体并且将其引入玉米、棉花和大豆植物细胞中。使转化细胞再生出转化植物,观察到所述转化植物表达所述嵌合杀昆虫蛋白质。为了测试杀虫活性,在存在鳞翅目害虫幼虫的情况下使用获自转化植物的植物叶盘进行生物测定。设想了编码所述嵌合杀昆虫蛋白质的重组核酸分子组合物。举例来说,可以用重组DNA构建体表达所述嵌合杀昆虫蛋白质,其中编码嵌合杀昆虫蛋白质的具有ORF的多核苷酸分子可操作地连接至诸如启动子之类的基因表达元件和在所述构建体所意图的系统中表达所必需的任何其他调控元件。非限制性实例包括与合成嵌合杀昆虫蛋白质编码序列可操作地连接以便在植物中表达所述嵌合杀昆虫蛋白质的植物功能启动子或与嵌合杀昆虫蛋白质编码序列可操作地连接以便在Bt细菌或其他芽孢杆菌属中表达所述蛋白质的Bt功能启动子。其他元件可以可操作地连接至所述嵌合杀昆虫蛋白质编码序列,包括但不限于增强子、内含子、未翻译前导序列、编码蛋白质固定标签(HIS标签)、转位肽(即,质粒转运肽、信号肽)、针对翻译后修饰酶的多肽序列、核糖体结合位点和RNAi标靶位点。
本文中所提供的示例性重组多核苷酸分子包括但不限于与编码具有如SEQ IDNO:4(TIC1100)、7(TIC860)、10(TIC867)、13(TIC867_20)、16(TIC867_21)、19(TIC867_22)、21(TIC867_23)、23(TIC867_24)、25(TIC867_25)、28(TIC868)、30(TIC868_9)、33(TIC868_10)、36(TIC868_11)、39(TIC867_12)、41(TIC867_13)、43(TIC867_14)、45(TIC867_15)、47(TIC867_29)、50(TIC869)、53(TIC836)、55(TIC713)、57(TIC843)、59(TIC862)、61(TIC1099)、63(TIC1099-T507E)、65(TIC1099-R522K)、67(TIC1099-K490S)、69(TIC1099-T562R)、71(TIC1099-S533R)、73(TIC1099-G498D)、75(TIC1099-K490A)、77(TIC1099-E564A)、79(TIC1103)、81(TIC1101)、83(TIC845)、85(TIC846)、87(TIC858)、89(TIC865)、91(TIC866)、93(TIC838)、95(TIC839)、97(TIC841)、99(TIC842)、101(TIC850)、103(TIC859)、105(TIC861)、107(TIC848)、109(TIC849)和111(TIC847)中所示出的氨基酸序列的多肽或蛋白质的诸如SEQ ID NO:1、2、3、5、6、8、9、11、12、14、15、17、18、20、22、24、26、27、29、31、32、34、35、37、38、40、42、44、46、48、49、51、52、54、56、58、60、62、64、66、68、70、72、74、76、78、80、82、84、86、88、90、92、94、96、98、100、102、104、106、108、110、112、113、114、115、116、117、118、119、120、121、122、123、124、125、126、127、128、129和130之类的多核苷酸可操作地连接的异源启动子。异源启动子还可以可操作地连接至编码质粒靶向的嵌合杀昆虫蛋白质和未靶向的嵌合杀昆虫蛋白质的合成DNA编码序列。设想编码本文中所公开的嵌合杀昆虫蛋白质的重组核酸分子的密码子可以被同义密码子取代(在本领域中被称为沉默取代)。
包含嵌合杀昆虫蛋白质编码序列的重组DNA分子或构建体可以进一步包含编码一种或多种毒性剂的DNA区域,所述DNA区域可以经配置以便与编码嵌合杀昆虫蛋白质、不同于嵌合杀昆虫蛋白质的蛋白质、昆虫抑制性dsRNA分子或辅助蛋白质的DNA序列伴随表达或共同表达。辅助蛋白质包括但不限于辅因子、酶、结合伴侣或功能在于辅助昆虫抑制剂的效果,例如通过辅助其表达、影响其在植物中的稳定性、优化寡聚自由能、加强其毒性和增加其活性谱的其他试剂。辅助蛋白质可以例如促进一种或多种昆虫抑制剂的吸收或增强毒性剂的毒性效果。
可以组装重组DNA分子或构建体,使得所有蛋白质或dsRNA分子都可以从一种启动子表达,或者每一种蛋白质或dsRNA分子都在单独启动子控制或其一些组合下。本发明的蛋白质可以由多基因表达系统表达,其中嵌合杀昆虫蛋白质由常见核苷酸片段表达,所述核苷酸片段还含有其他开放阅读框和启动子,取决于所选表达系统的类型。举例来说,细菌多基因表达系统可以利用单个启动子来驱动多个连接/串联开放阅读框从单个操纵子内表达(即,多顺反子表达)。在另一个实例中,植物多基因表达系统可以利用多个未连接的表达盒,每一个表达盒表达不同的蛋白质或其他毒性剂,诸如一或多个dsRNA分子。
可以通过载体,例如质粒、杆状病毒、合成染色体、病毒体、粘粒、噬菌粒、噬菌体或病毒载体,将包含嵌合杀昆虫蛋白质编码序列的重组核酸分子或重组DNA构建体递送至宿主细胞。此类载体可用于实现嵌合杀昆虫蛋白质编码序列在宿主细胞中的稳定或暂时表达或者所编码的多肽的后续表达。包含嵌合杀昆虫蛋白质序列编码序列并且被引入宿主细胞中的外源重组多核苷酸或重组DNA构建体在本文中被称为“转基因”。
本文中提供了含有编码所述嵌合杀昆虫蛋白质中的任何一种或多种的多核苷酸的转基因细菌、转基因植物细胞、转基因植物和转基因植物部分。术语“细菌细胞”或“细菌”可以包括但不限于土壤杆菌、芽孢杆菌、埃希氏杆菌、沙门氏菌、假单胞菌或根瘤菌细胞。术语“植物细胞”或“植物”可以包括但不限于双子叶细胞或单子叶细胞。所设想的植物和植物细胞包括但不限于苜蓿、香蕉、大麦、豆、茎椰菜、甘蓝、芸苔、胡萝卜、木薯、蓖麻、花椰菜、芹菜、鹰嘴豆、大白菜、柑桔、椰子、咖啡、玉米、三叶草、棉花、葫芦、黄瓜、花旗松、茄子、桉树、亚麻、大蒜、葡萄、啤酒花、韭菜、莴苣、火炬松、小米、甜瓜、坚果、燕麦、橄榄树、洋葱、观赏植物、棕榈、牧草、豌豆、花生、胡椒、木豆、松树、马铃薯、白杨树、南瓜、辐射松、萝卜、油菜籽、水稻、根茎、黑麦、红花、灌木、高粱、南方松、大豆、菠菜、南瓜、草莓、糖用甜菜、甘蔗、向日葵、甜玉米、香枫、甜薯、柳枝稷、茶树、烟草、番茄、黑小麦、草坪草、西瓜和小麦植物细胞或植物。在某些实施方案中,提供了由转基因植物细胞再生的转基因植物和转基因植物部分。在某些实施方案中,所述转基因植物可以获自转基因种子、通过插枝、折断、研磨或其他方式使部分从植物脱离。在某些实施方案中,所述植物部分可以是种子、蒴、叶、花、茎、根或其任何部分,或者转基因植物部分的不可再生部分。如这种情况下所使用,转基因植物部分的“不可再生”部分是不能受到诱导而形成完整植物或不能受到诱导而形成能够进行有性繁殖和/或无性繁殖的完整植物的部分。在某些实施方案中,植物部分的不可再生部分是转基因种子、蒴、叶、花、茎或根的一部分。
提供了制造包含鳞翅目抑制量的嵌合杀昆虫蛋白质的转基因植物的方法。此类植物可以通过将编码本申请中所提供的嵌合杀昆虫蛋白质的多核苷酸引入植物细胞中和选择表达昆虫或鳞翅目抑制量的嵌合杀昆虫蛋白质的来源于所述植物细胞的植物来制造。植物可以通过再生、种子、花粉或分生组织转化技术而来源于植物细胞。用于转化植物的方法在本领域中是已知的。举例来说,土壤杆菌介导的转化描述于美国专利申请公布2009/0138985A1(大豆)、2008/0280361A1(大豆)、2009/0142837A1(玉米)、2008/0282432(棉花)和2008/0256667(棉花)中。
表达所述嵌合杀昆虫蛋白质的植物可以通过利用表达其他杀昆虫蛋白质和/或表达其他转基因特质(诸如其他昆虫防治特质)的转基因事件、除草剂耐受基因、赋予产量或应力耐受特质的基因等等进行育种来进行杂交,或可以将此类特质组合在单个载体中,从而将所述特质全部联系起来。
本申请中还公开了经过加工的植物产品,其中所述经过加工的产品包含可检测量的嵌合杀昆虫蛋白质、其昆虫抑制性片段或其任何区别部分。在某些实施方案中,所述经过加工的产品是选自由以下各项组成的群组:植物部分、植物生物质、油、膳食、糖、动物饲料、面粉、薄片、糠、棉绒、外壳、经过加工的种子和种子。在某些实施方案中,所述经过加工的产品是不可再生的。所述植物产品可以包含来源于转基因植物或转基因植物部分的商品或其他商业产品,其中所述商品或其他产品可以通过检测编码或包含嵌合杀昆虫蛋白质的特色部分的核苷酸片段或者表达的RNA或蛋白质来通过商业进行追踪。
本申请中还公开了用所述嵌合杀昆虫蛋白质来防治昆虫、特别是鳞翅目昆虫侵扰作物植物的方法。此类方法可以包括种植包含昆虫或鳞翅目抑制量的所述嵌合杀昆虫蛋白质的植物。在某些实施方案中,此类方法可以进一步包括以下操作中的任何一个或多个:(i)向植物或能产生植物的种子施用包含或编码嵌合杀昆虫蛋白质的任何组合物;和(ii)用编码嵌合杀昆虫蛋白质的多核苷酸对植物或能产生植物的植物细胞进行转化。一般来说,设想嵌合杀昆虫蛋白质可以提供于组合物中,提供于微生物中,或提供于转基因植物中,以赋予针对鳞翅目昆虫的昆虫抑制活性。
在某些实施方案中,所述嵌合杀昆虫蛋白质是通过培养经过转化以便在适合表达的条件下表达嵌合杀昆虫蛋白质的重组芽孢杆菌或任何其他重组细菌细胞而制备的昆虫抑制性组合物的杀昆虫活性成分。此类组合物可以通过对表达/产生所述嵌合杀昆虫蛋白质的此类重组细胞的培养物进行干燥、冻干、均质化、提取、过滤、离心、沉降或浓缩来制备。此类方法可以获得芽孢杆菌或其他昆虫病原细菌细胞提取物、细胞悬浮液、细胞匀浆、细胞溶解产物、细胞上清液、细胞滤液或细胞球粒。通过获得如此产生的嵌合杀昆虫蛋白质,包括所述嵌合杀昆虫蛋白质的组合物可以包括细菌细胞、细菌孢子和伴孢包涵体,并且可以经过配制以用于各种用途,包括作为农业昆虫抑制性喷雾产品或作为食物生物测定中的昆虫抑制性制剂。
上述化合物或制剂可以进一步包含农业上可接受的载体,诸如诱饵、粉末、粉尘、球粒、颗粒、喷雾、乳液、胶态悬浮液、水溶液、牙胞杆菌孢子或晶体制剂或种子处理。所述化合物或制剂还可以进一步包含经转化以表达一种或多种所述蛋白质的重组植物细胞、植物组织、种子或植物;或经转化以表达一种或多种所述蛋白质的细菌。取决于所述重组多肽固有的昆虫抑制或杀昆虫抑制水平和应用于植物或食物测定的化合物或制剂水平,所述化合物或制剂可以包括各种以重量计量的所述重组多肽,例如以重量计0.0001%至0.001%至0.01%至1%至99%的所述重组多肽。
在一个实施方案中,为了降低抗性发展的可能性,包含嵌合杀昆虫蛋白质的昆虫抑制组合物或转基因植物可以进一步包含至少一种其他毒性剂,所述毒性剂对相同鳞翅目昆虫物种表现出昆虫抑制活性但不同于所述嵌合杀昆虫蛋白质。用于此类组合物的其他可能毒性剂包括昆虫抑制性蛋白质和昆虫抑制性dsRNA分子。使用此类核糖核苷酸序列来防治昆虫害虫的一个实例描述于Baum等人(美国专利公布2006/0021087A1)中。用于防治鳞翅目害虫的此类其他多肽可以选自由昆虫抑制性蛋白质组成的群组,诸如但不限于Cry1A(美国专利No.5,880,275)、Cry1Ab、Cry1Ac、Cry1A.105、Cry1Ae、Cry1B(美国专利公布No.10/525,318)、Cry1C(美国专利No.6,033,874)、Cry1D、Cry1E、Cry1F和Cry1A/F嵌合体(美国专利No.7,070,982、6,962,705和6,713,063)、Cry1G、Cry1H、Cry1I、Cry1J、Cry1K、Cry1L、Cry2A、Cry2Ab(美国专利No.7,064,249)、Cry2Ae、Cry4B、Cry6、Cry7、Cry8、Cry9、Cry15、Cry43A、Cry43B、Cry51Aa1、ET66、TIC400、TIC800、TIC834、TIC1415、Vip3A、VIP3Ab、VIP3B、AXMI-001、AXMI-002、AXMI-030、AXMI-035和AXMI-045(美国专利公布2013-0117884A1)、AXMI-52、AXMI-58、AXMI-88、AXMI-97、AXMI-102、AXMI-112、AXMI-117、AXMI-100(美国专利公布2013-0310543A1)、AXMI-115、AXMI-113、AXMI-005(美国专利公布2013-0104259A1)、AXMI-134(美国专利公布2013-0167264A1)、AXMI-150(美国专利公布2010-0160231A1)、AXMI-184(美国专利公布2010-0004176A1)、AXMI-196、AXMI-204、AXMI-207、AXMI-209(美国专利公布2011-0030096A1)、AXMI-218、AXMI-220(美国专利公布2014-0245491A1)、AXMI-221z、AXMI-222z、AXMI-223z、AXMI-224z、AXMI-225z(美国专利公布2014-0196175A1)、AXMI-238(美国专利公布2014-0033363A1)、AXMI-270(美国专利公布2014-0223598A1)、AXMI-345(美国专利公布2014-0373195A1)、DIG-3(美国专利公布2013-0219570A1)、DIG-5(美国专利公布2010-0317569A1)、DIG-11(美国专利公布2010-0319093A1)、AfIP-1A及其衍生物(美国专利公布2014-0033361A1)、AfIP-1B及其衍生物(美国专利公布2014-0033361A1)、PIP-1APIP-1B(美国专利公布2014-0007292A1)、PSEEN3174(美国专利公布2014-0007292A1)、AECFG-592740(美国专利公布2014-0007292A1)、Pput_1063(美国专利公布2014-0007292A1)、Pput_1064(美国专利公布2014-0007292A1)、GS-135及其衍生物(美国专利公布2012-0233726A1)、GS153及其衍生物(美国专利公布2012-0192310A1)、GS154及其衍生物(美国专利公布2012-0192310A1)、GS155及其衍生物(美国专利公布2012-0192310A1)、如美国专利公布2012-0167259A1中所描述的SEQ ID NO:2及其衍生物、如美国专利公布2012-0047606A1中所描述的SEQ ID NO:2及其衍生物、如美国专利公布2011-0154536A1中所描述的SEQ ID NO:2及其衍生物、如美国专利公布2011-0112013A1中所描述的SEQ ID NO:2及其衍生物、如美国专利公布2010-0192256A1中所描述的SEQ ID NO:2和4及其衍生物、如美国专利公布2010-0077507A1中所描述的SEQ ID NO:2及其衍生物、如美国专利公布2010-0077508A1中所描述的SEQ ID NO:2及其衍生物、如美国专利公布2009-0313721A1中所描述的SEQ ID NO:2及其衍生物、如美国专利公布2010-0269221A1中所描述的SEQ ID NO:2或4及其衍生物、如美国专利No.7,772,465(B2)中所描述的SEQ ID NO:2及其衍生物、如WO2014/008054A2中所描述的CF161_0085及其衍生物、如美国专利公布US2008-0172762A1、US2011-0055968A1和US2012-0117690A1中所描述的鳞翅目毒性蛋白质及其衍生物、如US7510878(B2)中所描述的SEQ ID NO:2及其衍生物、如美国专利No.7812129(B1)中所描述的SEQ ID NO:2及其衍生物;等等。
在其他实施方案中,昆虫抑制组合物或转基因植物可以进一步包含对不受本发明的嵌合杀昆虫蛋白质抑制的昆虫害虫(诸如鞘翅目、半翅目和同翅目害虫)表现出昆虫抑制活性的至少一种其他毒性剂,以便扩大所获得的昆虫抑制的范围。
用于防治鞘翅目害虫的此类其他毒性剂可以选自由昆虫抑制性蛋白质组成的群组,诸如但不限于Cry3Bb(美国专利No.6,501,009)、Cry1C变体、Cry3A变体、Cry3、Cry3B、Cry34/35、5307、AXMI134(美国专利公布2013-0167264A1)、AXMI-184(美国专利公布2010-0004176A1)、AXMI-205(美国专利公布2014-0298538A1)、axmi207(美国专利公布2013-0303440A1)、AXMI-218、AXMI-220(美国专利公布20140245491A1)、AXMI-221z、AXMI-223z(美国专利公布2014-0196175A1)、AXMI-279(美国专利公布2014-0223599A1)、AXMI-R1和其变体(美国专利公布2010-0197592A1)、TIC407、TIC417、TIC431、TIC807、TIC853、TIC901、TIC1201、TIC3131、DIG-10(美国专利公布2010-0319092A1)、eHIP(美国专利申请公布No.2010/0017914)、IP3和其变体(美国专利公布2012-0210462A1)和ω-Hexatoxin-Hv1a(美国专利申请公布US2014-0366227A1)。
用于防治半翅目害虫的此类其他毒性剂可以选自由半翅目活性蛋白质组成的群组,诸如但不限于TIC1415(美国专利公布2013-0097735A1)、TIC807(美国专利No.8609936)、TIC834(美国专利公布2013-0269060A1)、AXMI-036(美国专利公布2010-0137216A1)和AXMI-171(美国专利公布2013-0055469A1)。用于防治鞘翅目、鳞翅目和半翅目昆虫害虫的其他多肽可以在由Neil Crickmore维护的苏云金芽孢杆菌毒素命名网站(www.btnomenclature.info)上找到。
可以使用本领域技术人员已知的方法,诸如聚合酶链反应(PCR)、热扩增和杂交来鉴别嵌合杀昆虫蛋白质编码序列和与所述嵌合杀昆虫蛋白质具有相当大同一性百分比的序列。举例来说,所述嵌合杀昆虫蛋白质可用于产生特异性结合相关蛋白质的抗体,并且可用于筛选和发现密切相关的其他蛋白质。
此外,编码所述嵌合杀昆虫蛋白质的核苷酸序列可以用作筛选用探针和引物,以便使用热循环或等温扩增和杂交方法来鉴别所述类别的其他成员。举例来说,来源于如SEQID NO:2中所示出的序列的寡核苷酸可用于确定来源于商品产品的脱氧核糖核酸样品中存在或不存在嵌合杀昆虫转基因。鉴于使用寡核苷酸的某些核酸检测方法的敏感性,预期来源于如SEQ ID NO:2中的任意者中所示出的序列的寡核苷酸可用于在来源于汇集来源的商品产品中检测相应嵌合杀昆虫蛋白质,其中仅一部分商品产品来源于含有SEQ ID NO:2中的任意者的转基因植物。
实施例
鉴于上文,本领域技术人员应了解,以下公开实施方案仅代表本发明,本发明可以用多种形式体现。因此,本文中所公开的具体结构和功能细节不应理解为具有限制性。
实施例1
鳞翅目活性新型嵌合杀昆虫蛋白质编码序列的生成和克隆
本实施例说明新型嵌合杀昆虫蛋白质的生成以及所述嵌合杀昆虫蛋白质的克隆和表达。
由用于产生编码新型嵌合杀昆虫蛋白质的多核苷酸序列的已知Cry蛋白质基因构建了重组核酸序列。将所得多核苷酸序列克隆至苏云金芽孢杆菌(Bt)表达质粒载体中。在证实多核苷酸序列之后,将所述表达质粒转化至Bt中并且表达。测定所表达的新型嵌合蛋白质的制剂对各种鳞翅目害虫的活性。
产生了许多编码嵌合杀昆虫蛋白质的多核苷酸序列并且在生物测定中加以测试。不是所有嵌合杀昆虫蛋白质都显示出活性。仅基于其在生物测定中显示的对特定鳞翅目的活性来选择一些嵌合杀昆虫蛋白质。还基于原始嵌合杀昆虫蛋白质TIC867和TIC868产生了引入氨基酸取代或替代原毒素结构域的氨基酸变体。本发明的嵌合杀昆虫蛋白质的组分(结构域I、II和III,以及原毒素)提供于表1中。还提供了TIC868变体中相对于原始TIC868蛋白质序列的氨基酸取代。
表1.新型嵌合杀虫蛋白质和其组分。
*使用标准IUPAC氨基酸代码来标识氨基酸突变。参见IUPAC-IUB JointCommission on Biochemical Nomenclature.Nomenclature and Symbolism for AminoAcids and Peptides.Eur.J.Biochem.138:9-37(1984)。第一个氨基酸序列缩写表示给定支架蛋白质中的原始氨基酸,数字表示所述氨基酸的位置,而第二个氨基酸序列缩写表示改进的变体蛋白质中放在该位置上的氨基酸。
实施例2
新型嵌合杀昆虫蛋白质对鳞翅目害虫显示出活性
本实施例说明了对实施例1中所描述的嵌合杀昆虫蛋白质的测试和针对所述嵌合杀昆虫蛋白质观测到的鳞翅目活性。
在Bt中表达编码嵌合杀昆虫蛋白质的多核苷酸序列。然后针对已知是玉米、甘蔗、大豆和棉花以及其他作物植物的害虫的多种鳞翅目昆虫来测定所表达的嵌合杀昆虫蛋白质。具体来说,测定杀昆虫蛋白质对黎豆毛虫(VBC,黎豆夜蛾)、蔗螟(SCB,小蔗螟)、小玉米茎钻心虫(LSCB,南美玉米苗斑螟)、玉米穗虫(CEW,玉米穗夜蛾)、烟草芽虫(TBW,烟芽夜蛾)、大豆尺蠖(SBL,大豆夜蛾)、黑粘虫(BLAW,考斯夜蛾)、南方粘虫(SAW,亚热带粘虫)、草地粘虫(FAW,草地贪夜蛾)、甜菜粘虫(BAW,甜菜夜蛾)、古棉铃虫(OBW,棉铃实夜蛾)、东方叶虫(OLW,斜纹夜蛾)、粉红棉铃虫(PBW,棉红铃虫)、黑切根虫(BCW,球菜夜蛾)、西南玉米钻心虫(SWCB,西南玉米螟)、斑点棉铃虫(SBW,翠纹金刚钻)和欧洲玉米钻心虫(ECB,欧洲玉米螟)的活性。玉米穗虫(CEW,玉米穗夜蛾)也称为大豆荚虫(SPW)和棉铃虫(CBW)。通过死亡率与发育迟缓评分以及MIC50评分的组合来测定活性。MIC50是指蜕变抑制浓度,其中将死幼虫和L1幼虫(未能蜕变至第二龄的幼虫)都考虑至所述评分中。表2示出了每一种嵌合杀昆虫蛋白质的活性。‘+’符号表示对特定昆虫害虫观测到的活性。
如以上表2中可见,大部分嵌合杀昆虫蛋白质对一种或多种鳞翅目害虫物种表现出活性。
实施例3
合成编码嵌合杀昆虫蛋白质的基因并且用于在植物中表达
本实施例说明合成编码嵌合杀昆虫蛋白质的多核苷酸以便在植物中表达。
构建合成编码序列以用于在植物中表达嵌合杀昆虫蛋白质。根据美国专利5,500,365中大体描述的方法来设计并合成所述合成序列,从而避免某些有害的问题序列,诸如ATTTA和富A/T植物聚腺苷酸化序列,同时保留嵌合杀昆虫蛋白质的氨基酸序列。表3中列出了用于在植物中表达的编码所述嵌合杀昆虫蛋白质的这些基因的核苷酸序列。
表3.设计用于在植物中使用的编码嵌合杀昆虫蛋白质的多核苷酸序列。
实施例4
用于在植物中表达嵌合杀昆虫蛋白质的表达盒
本实施例说明包含设计用于在植物中使用的编码嵌合杀昆虫蛋白质的多核苷酸序列的表达盒的构建。
用表3中提供的设计用于植物表达的编码嵌合杀昆虫蛋白质的多核苷酸序列构建了多种植物表达盒。此类表达盒可用于在植物原生质体中暂时表达或转化植物细胞。根据蛋白质在细胞内的最终落点来设计典型表达盒。用允许蛋白质被翻译并且保留在细胞溶质中的方式设计一组表达盒。设计具有与毒素蛋白质连续的转运肽以允许靶向细胞的细胞器(诸如叶绿体或质粒)的另一组表达盒。以5′端开始用启动子设计所有表达盒,其可以由可操作地连接以加强转基因的表达的多个启动子元件、增强子元件或本领域技术人员已知的其他表达元件构成。启动子序列通常连续跟随有相对于启动子处于3′的一或多个前导序列。相对于前导序列,内含子序列通常提供在3′以改进转基因的表达。相对于可操作地连接的启动子、前导序列和内含子构形,毒素或转运肽的编码序列和毒素的编码序列通常位于3′。3′UTR序列通常提供在编码序列的3′,以促进转录终止并且提供对所得转录产物的聚腺苷酸化非常重要的序列。以上所描述的所有元件都可操作地连接并且依次布置,通常有为了构建表达盒而设的其他序列。
实施例5
嵌合杀昆虫蛋白质在经稳定转化的玉米中的鳞翅目活性
本实施例说明嵌合杀昆虫蛋白质当表达在玉米植物中并且作为食物提供给相应玉米昆虫害虫时对鳞翅目害虫表现出的抑制活性。
使用土壤杆菌介导的转化方法,用实施例4中所描述的二元转化载体转化玉米品种LH244。通过本领域中已知的方法诱导经转化的细胞以形成植物。以类似于美国专利No.8,344,207中所描述的那些生物测定的方式,使用植物叶盘进行生物测定。使用未转化的LH244植物来获得用作阴性对照的组织。针对玉米穗虫(CEW,玉米穗夜蛾)、草地粘虫(FAW,草地贪夜蛾)、黑切根虫(BCW,球菜夜蛾)和西南玉米钻心虫(SWCB,西南玉米螟)评价来自每一个二元载体的多个转化事件。
对R0和F1代转基因植物进行叶盘生物测定。另外,对受鳞翅目昆虫害虫侵扰的表达某些嵌合杀昆虫蛋白质的完整转基因F1植物,评价叶损伤评级。还评价了表达TIC860和TIC868的F1转基因事件在农田中对FAW、CEW和SWCB的活性。表4中示出了测定结果。‘+’符号表示对特定昆虫害虫观测到的活性。如表4中可见,大部分嵌合杀昆虫蛋白质和许多嵌合杀昆虫蛋白质变体对一种或多种鳞翅目害虫物种显示出活性。
实施例6
所述嵌合杀昆虫蛋白质在经稳定转化的大豆中的鳞翅目活性
本实施例说明嵌合杀昆虫蛋白质当表达在大豆植物中并且作为食物提供给相应昆虫害虫时对鳞翅目害虫表现出的抑制活性。
重新设计所选嵌合杀昆虫蛋白质的编码序列以用于植物表达,克隆至二元植物转化载体中,并且用于转化大豆植物细胞。所述植物转化载体包含用于如实施例4中所描述来表达嵌合杀昆虫蛋白质的第一转基因盒和用于使用壮观霉素选择来选择经转化的植物细胞的第二转基因盒。在一些情况下,诸如在TIC1100、TIC860和TIC836的情况下,将叶绿体转运肽编码序列可操作地连接至嵌合杀昆虫编码序列。用靶向和非靶向TIC1100、TIC860和TIC836的质粒进行测定。以下表5示出了嵌合杀昆虫蛋白质和TIC867变异嵌合杀昆虫蛋白质以及用于在经稳定转化的大豆中表达的相关编码序列。
使用以上所描述的二元转化载体通过土壤杆菌介导的转化来转化大豆植物细胞。诱导所得经转化的植物细胞以形成完整大豆植物。收集叶组织并且用于如实施例5中所描述的生物测定中,或替代地,将冻干组织用于昆虫食物中以用于生物测定。对FAW、南方粘虫(SAW,亚热带粘虫)、大豆尺蠖(SBL,大豆夜蛾)、大豆荚虫(SPW,玉米穗夜蛾)、黎豆毛虫(VBC,黎豆夜蛾)、烟草芽虫(TBW,烟芽夜蛾)、黑粘虫(BLAW,考斯夜蛾)、小玉米茎钻心虫(LSCB,南美玉米苗斑螟)和古棉铃虫(OBW,棉铃实夜蛾)进行生物测定。
表5示出了每一种杀昆虫蛋白质在R0代植物中对所选鳞翅目物种的活性,其中‘+’表示活性。如表5中可见,经稳定转化的大豆中所表达的每一种嵌合杀昆虫蛋白质对多个鳞翅目物种显示出活性。特别值得注意的是,TIC867变体TIC867_23显示了对SPW的活性。
表5.来自经稳定转化的R0大豆叶组织的嵌合杀昆虫蛋白质的生物测定活性。
允许所选转化事件自花授粉,并且种植所得种子。从R1代植物收获叶组织并且用于饲喂生物测定。测定表达TIC1100、TIC860、TIC867、TIC868、TIC869和TIC836的R1植物对SAW、SBL、SPW和VBC的活性。表6示出了在这些测试中所观测到的活性。‘+’符号表示对特定昆虫害虫观测到的活性。如表6中所显示,来自R1代植物的大部分表达的嵌合杀昆虫蛋白质对一种或多种鳞翅目物种显示了活性。
表6.来自经稳定转化的R1大豆叶组织的嵌合杀昆虫蛋白质的生物测定活性。
毒素 | SAW | SBL | SPW | VBC |
TIC1100 | + | + | + | |
TIC860 | + | + | + | |
TIC867 | + | |||
TIC868 | + | + | + | |
TIC869 | + | + | + | |
TIC836 | + | + | + |
表7显示在网室中用表达TIC1100、TIC860和TIC836的经稳定转化的R1代大豆植物进行农田测试的结果。在网室中用于侵扰植物的物种包括SAW、SBL和SPW。抗性定义为小于或等于百分之十五的大豆植物发生脱叶。在这些网笼试验中观测到的抗性与表6中提供的在R1代大豆叶组织测定中观测到的抗性一致。‘+’符号表示对特定昆虫害虫观测到的活性。
表7.网室农田测试中所测试的R1代大豆中表达的TIC1100、TIC860和TIC836的活性谱。
毒素 | SAW | SBL | SPW |
TIC1100 | + | + | |
TIC860 | + | + | |
TIC836 | + | + |
还在阿根廷的两个不同的地方阿塞维多和冯特左拉(Fontezuela)在网室中用表达TIC867和TIC869的经稳定转化的R1代大豆植物进行农田测试。用于在网室中侵扰植物的物种包括南美棉铃虫(SABW,南美棉铃虫)、VBC、BLAW和向日葵尺蠖(SFL,薄荷灰夜蛾)。抗性定义为小于或等于百分之十五的大豆植物发生脱叶。以下表8示出了观测到的抗性。‘+’符号表示对特定昆虫害虫观测到的活性。如表8中所显示,表达TIC867的转基因大豆植物对BLAW和VBC显示了抗性。表达TIC869的转基因大豆植物对SABW、SFL、BLAW和VBC显示了抗性。
表8.网室农田测试中所测试的R1代大豆中表达的TIC867和TIC869的活性谱。
实施例7
嵌合杀昆虫蛋白质在经稳定转化的棉花中的鳞翅目活性
本实施例说明嵌合杀昆虫蛋白质当表达在棉花植物中并且作为食物提供给相应昆虫害虫时对鳞翅目害虫表现出的抑制活性。
重新设计所选嵌合杀昆虫蛋白质的编码序列以用于植物表达,克隆至二元植物转化载体中,并且用于转化棉花植物细胞。所得二元载体类似于实施例4中所描述的那些,并且被用于表达靶向和非靶向TIC860(编码序列:SEQ ID NO:6;蛋白质序列:SEQ ID NO:7)、TIC867(编码序列:SEQ ID NO:9;蛋白质序列:SEQ ID NO:10)、TIC868(编码序列:SEQ IDNO:27;蛋白质序列:SEQ ID NO:28)和TIC867_23(编码序列:SEQ ID NO:20;蛋白质序列:SEQ ID NO:23)的质粒。
通过土壤杆菌介导的转化方法来转化棉花植物细胞。诱导经转化的棉花细胞以形成完整植物。如实施例5中所描述将棉花叶组织用于对棉铃虫(CBW,玉米穗夜蛾)、FAW、TBW和SBL进行生物测定。表9示出了TIC860、TIC867和TIC868在经稳定转化的R0代棉花中对这些鳞翅目物种的观测活性,其中‘+’表示活性。如表9中可见,TIC860、TIC867和TIC868在经稳定转化的R0代棉花中对两种或更多种鳞翅目害虫物种显示了活性。
表9.来自经稳定转化的R0棉花叶组织的TIC860、TIC867和TIC868的生物测定活性。
毒素 | CBW | FAW | TBW | SBL |
TIC860 | + | + | ||
TIC867 | + | + | + | NT |
TIC868 | + | + |
所选转化事件被用于产生R1种子。测定表达TIC860、TIC867和TIC868的R1植物对CBW、FAW、TBW和SBL的抗性。将叶、蕾和蒴组织用于测定。表10示出了在这些测试中观测到的活性。‘+’符号表示对特定昆虫害虫观测到的活性。如表10中所显示,TIC860在叶组织中对FAW显示了活性。此外,嵌合杀昆虫蛋白质TIC867在叶、蕾和蒴组织中对CBW和FAW以及在叶中对TBW和SBL显示了活性。嵌合杀昆虫蛋白质TIC868在叶、蕾和蒴组织中对FAW以及在叶中对TBW和SBL显示了活性。
表10.来自经稳定转化的R1棉花叶组织的嵌合杀昆虫蛋白质的生物测定活性。
本文中所公开和要求保护的所有组合物都可以在不过度实验的情况下根据本公开内容来制造和执行。尽管已经依据上述说明性实施方案描述了本发明的组合物,但本领域技术人员将显而易见,可以在不背离本发明的真实原理、精神和范围的情况下对本文中所描述的组合物加以变化、改变、修改和变更。更具体来说,将显而易见,在化学上和生理学上相关的某些药剂可以取代本文中所描述的药剂,同时将实现相同或类似的结果。对本领域技术人员显而易见的所有此类取代和修改都被认为在如所附权利要求书所定义的本发明精神、范围和原理内。
本说明书中所引用的所有出版物和公开的专利文件都以引用的方式并入本文中,达到与明确地且个别地指示每一个别出版物或专利申请以引用的方式并入相同的程度。
序列表
<110> Monsanto Technology LLC
Baum, James A
Cerruti, Thomas A
Dart, Crystal L
English, Leigh H
Fu, Xiaoran
Guzov, Victor M
Howe, Arlene R
Morgenstern, Jay P
Roberts, James K
Salvador, Sara A
Wang, Jinling
<120> 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质
<130> P34230WO00/0022270.00098
<150> US 62/064989
<151> 2014-10-16
<160> 53
<170> PatentIn version 3.5
<210> 1
<211> 3570
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC1100的重组核苷酸序列。
<400> 1
atggagatag tgaataatca gaatcaatgc gtgccttata attgtttgaa taatcccgaa 60
atcgaaatat tagaaggcgg aagaatatca gttggtaata ccccaattga tatttctctt 120
tcgcttactc agtttctttt gagtgaattt gtcccaggtg cggggtttgt attaggatta 180
attgatttaa tatggggatt tgtaggtcct tcccaatggg acgcatttct tgctcaagtg 240
gaacagttaa ttaaccaaag aatagcagaa gctgtaagaa atacagcaat tcaggaatta 300
gagggaatgg cacgggttta tagaacctat gctactgctt ttgctgagtg ggaaaaagct 360
cctgatgacc cagagctaag agaagcacta cgtacacaat ttacagcaac tgagacttat 420
ataagtggaa gaatatccgt tttaaaaatt caaacttttg aagtacagct gttatcagtg 480
tttgcccaag ctgcaaattt acatttatct ttattaagag acgttgtgtt ttttgggcaa 540
agatggggtt tttcaacgac aaccgtaaat aattactaca atgatttaac agaagggatt 600
agtacctata cagattatgc tgtacgctgg tacaatacgg gattagaacg tgtatgggga 660
ccggattcta gagattgggt aaggtataat caatttagaa gagaattaac actaactgta 720
ttagatatcg ttgctctgtt cccgaattat gatagtagaa gatatccaat tcgaacagtt 780
tcccaattaa caagagaaat ttatacaaac ccagtattag aaaattttga tggtagtttt 840
cgaggctcgg ctcagggcat agaaagaagt attaggagtc cacatttgat ggatatactt 900
aacagtataa ccatctatac ggatgctcat aggggttatt attattggtc agggcatcaa 960
ataatggctt ctcctgtcgg tttttcgggg ccagaattca cgtttccgct atatggaacc 1020
atgggaaatg cagctccaca acaacgtatt gttgctcaac taggtcaggg cgtgtataga 1080
acattatcgt ccactttata tagaagacct tttaatatag ggataaataa tcaacaacta 1140
tctgttcttg acgggacaga atttgcttat ggaacctcct caaatttgcc atccgctgta 1200
tacagaaaaa gcggaacggt agattcgctg gatgaaatac cgccacagaa taacaacgtg 1260
ccacctaggc aaggatttag tcatcgatta agccatgttt caatgtttcg ttcaggcttt 1320
agtaatagta gtgtaagtat aataagagct cctatgttct cttggataca tcgtagtgct 1380
gaatttaata atataattgc atcggatagt attaatcaaa tacctttagt gaaaggattt 1440
agagtttggg ggggcacctc tgtcattaca ggaccaggat ttacaggagg ggatatcctt 1500
cgaagaaata cctttggtga ttttgtatct ctacaagtca atattaattc accaattacc 1560
caaagatacc gtttaagatt tcgttacgct tccagtaggg atgcacgagt tatagtatta 1620
acaggagcgg catccacagg agtgggaggc caagttagtg taaatatgcc tcttcagaaa 1680
actatggaaa taggggagaa cttaacatct agaacattta gatataccga ttttagtaat 1740
cctttttcat ttagagctaa tccagatata attgggataa gtgaacaacc tctatttggt 1800
gcaggttcta ttagtagcgg tgaactttat atagataaaa ttgaaattat tctagcagat 1860
gcaacatttg aagcagaatc tgatttagaa agagcgcaga aggcggtgaa tgcgctgttt 1920
acgtctacaa accaactagg gctaaaaaca aatgtaacgg attatcatat tgatcaagtg 1980
tccaatttag ttacgtattt atcggatgaa ttttgtctgg atgaaaagcg agaattgtcc 2040
gagaaagtca aacatgcgaa gcgactcagt gatgaacgca atttactcca agattcaaat 2100
ttcaaagaca ttaataggca accagaacgt gggtggggcg gaagtacagg gattaccatc 2160
caaggagggg atgacgtatt taaagaaaat tacgtcacac tatcaggtac ctttgatgag 2220
tgctatccaa catatttgta tcaaaaaatc gatgaatcaa aattaaaagc ctttacccgt 2280
tatcaattaa gagggtatat cgaagatagt caagacttag aaatctattt aattcgctac 2340
aatgcaaaac atgaaacagt aaatgtgcca ggtacgggtt ccttatggcc gctttcagcc 2400
caaagtccaa tcggaaagtg tggagagccg aatcgatgcg cgccacacct tgaatggaat 2460
cctgacttag attgttcgtg tagggatgga gaaaagtgtg cccatcattc gcatcatttc 2520
tccttagaca ttgatgtagg atgtacagac ttaaatgagg acctaggtgt atgggtgatc 2580
tttaagatta agacgcaaga tgggcacgca agactaggga atctagagtt tctcgaagag 2640
aaaccattag taggagaagc gctagctcgt gtgaaaagag cggagaaaaa atggagagac 2700
aaacgtgaaa aattggaatg ggaaacaaat atcgtttata aagaggcaaa agaatctgta 2760
gatgctttat ttgtaaactc tcaatatgat caattacaag cggatacgaa tattgccatg 2820
attcatgcgg cagataaacg tgttcatagc attcgagaag cttatctgcc tgagctgtct 2880
gtgattccgg gtgtcaatgc ggctattttt gaagaattag aagggcgtat tttcactgca 2940
ttctccctat atgatgcgag aaatgtcatt aaaaatggtg attttaataa tggcttatcc 3000
tgctggaacg tgaaagggca tgtagatgta gaagaacaaa acaaccaacg ttcggtcctt 3060
gttgttccgg aatgggaagc agaagtgtca caagaagttc gtgtctgtcc gggtcgtggc 3120
tatatccttc gtgtcacagc gtacaaggag ggatatggag aaggttgcgt aaccattcat 3180
gagatcgaga acaatacaga cgaactgaag tttagcaact gcgtagaaga ggaaatctat 3240
ccaaataaca cggtaacgtg taatgattat actgtaaatc aagaagaata cggaggtgcg 3300
tacacttctc gtaatcgagg atataacgaa gctccttccg taccagctga ttatgcgtca 3360
gtctatgaag aaaaatcgta tacagatgga cgaagagaga atccttgtga atttaacaga 3420
gggtataggg attacacgcc actaccagtt ggttatgtga caaaagaatt agaatacttc 3480
ccagaaaccg ataaggtatg gattgagatt ggagaaacgg aaggaacatt tatcgtggac 3540
agcgtggaat tactccttat ggaggaatga 3570
<210> 2
<211> 3570
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC1100的合成核苷酸序列。
<400> 2
atggagattg tgaacaacca gaaccagtgc gttccttaca actgcttgaa caaccctgag 60
attgagattc ttgagggtgg tagaatttct gttggcaaca ctcctattga catctctttg 120
agtttgactc aattcttgtt gagtgagttc gttcctggtg ctggtttcgt cttgggtttg 180
attgatttga tttggggttt cgttggtcct agtcaatggg atgctttctt ggctcaagtt 240
gagcaattga ttaaccagag gatcgctgag gctgtgagga acactgctat tcaagagttg 300
gagggtatgg ctagagttta cagaacttac gctactgctt tcgctgagtg ggagaaggct 360
cctgatgacc ctgagttgag ggaggctttg agaactcaat tcactgctac tgagacttac 420
atcagtggta gaatcagtgt cttgaagatt caaactttcg aggttcaatt gctttctgtg 480
ttcgctcaag ctgcaaactt gcacttgtct ttgcttagag atgttgtgtt ctttggtcaa 540
agatggggtt tctccactac taccgtgaac aattactaca acgatttgac tgagggtatt 600
tctacttaca ctgattacgc tgttagatgg tacaacactg gtttggagag agtttggggt 660
ccagattcca gagattgggt cagatacaac cagttcagaa gggagttgac tttgactgtc 720
ttggacattg ttgctctctt ccctaactac gatagtcgtc gttaccctat tagaactgtt 780
tctcaactta ctagggaaat ctacactaac cctgttcttg agaacttcga tggtagtttc 840
cgtggtagtg ctcaagggat tgagcgttct attcgttctc ctcatcttat ggacattctt 900
aactctatta ctatctacac tgatgctcat cgtggttact attactggtc tggtcatcaa 960
attatggcta gtcctgttgg tttcagtggt cctgagttca ctttccctct ttacggtact 1020
atgggcaacg ctgcacctca acagaggatc gttgctcaac ttggtcaagg tgtttacagg 1080
actctttctt caacccttta caggcgtcct ttcaacattg ggatcaacaa ccagcagctt 1140
tctgttcttg atggaaccga gttcgcttac ggaacctctt caaaccttcc tagtgctgtt 1200
tacaggaagt ctggaaccgt tgacagtctt gatgagattc caccgcagaa caataacgtt 1260
ccacccaggc aaggcttcag tcataggctt tctcatgttt ctatgttccg ctctggattc 1320
agcaactctt cagtttctat tatcagggct ccaatgttct cgtggattca taggtctgcc 1380
gagttcaaca acattatcgc ttccgatagc attaaccaga ttccacttgt taagggattc 1440
cgtgtttggg gaggcacctc tgttattacc ggaccaggct tcaccggagg cgacattctt 1500
cgtcgtaaca ccttcggaga tttcgtttca cttcaagtga acattaactc accaatcacc 1560
cagcgctaca ggcttcgctt ccgctacgca tcatccaggg atgcaagggt gatcgtgctt 1620
accggagcag cctcaaccgg agtgggaggc caagtgagcg tgaacatgcc acttcagaag 1680
acgatggaga tcggcgagaa ccttacctca agaacctttc gttacaccga tttcagcaac 1740
ccattcagct ttcgtgcaaa cccagacatc atagggatct cagagcagcc actgtttgga 1800
gctggatcaa tctcatccgg agagctttac atcgacaaga tcgagatcat actcgcagat 1860
gcaaccttcg aggctgagag cgatctggag cgtgcacaga aggcagtgaa cgcactcttt 1920
acctctacca accagctcgg actcaagacc aacgtgaccg attaccacat cgaccaagtg 1980
agcaacctcg tgacctacct ctcagatgag ttctgcttgg atgagaaacg cgaactcagc 2040
gagaaggtga agcacgcaaa gcgtctctca gatgagcgta acctcctcca ggatagcaat 2100
ttcaaggaca tcaatcgtca gccagagcgt ggatggggag gctcaaccgg aatcaccatc 2160
cagggaggcg atgatgtgtt taaggagaat tacgtgacac tctccggaac attcgatgag 2220
tgctacccaa catacctcta tcagaagatc gacgagtcca agctcaaggc gttcacccgt 2280
tatcagctcc gtggctacat cgaggatagt caagacctgg aaatctacct catccgctac 2340
aatgcaaagc acgagacagt gaatgtgcca ggaacaggct ccctctggcc actctccgca 2400
cagtctccaa tcggcaagtg cggcgagcca aatcgctgcg cgccacacct ggagtggaat 2460
cccgacctgg actgctcctg ccgcgacggc gagaagtgcg cccaccactc ccaccacttt 2520
agcctggaca tcgacgtggg ctgtacagac ctgaatgagg atctgggcgt gtgggtgatc 2580
tttaagatca agacacagga cggccacgcc cgcctgggca atctggagtt tctggaggag 2640
aagcctctgg tgggcgaagc cctggcccgc gtgaagcgcg ccgagaagaa atggcgcgac 2700
aaacgcgaga aactggaatg ggaaacaaac atcgtgtaca aagaagccaa agaatccgtg 2760
gacgccctat ttgtgaactc ccagtatgac cagctacagg ccgacacaaa catcgcgatg 2820
atccacgctg cggacaagcg cgtgcactcc atacgcgaag cctatctacc cgaactatcc 2880
gtgatacccg gcgtcaatgc cgcgatcttt gaagaattgg aaggccgcat cttcacagcc 2940
tttagcctct atgacgcccg aaatgtcatc aagaatggcg actttaacaa tgggctatcc 3000
tgttggaatg tcaaagggca cgtggacgtc gaagagcaga acaatcagcg atccgtctta 3060
gtcgtacccg aatgggaagc cgaagtctcc caggaagtcc gagtctgtcc tggtagaggt 3120
tacatcttga gagtgactgc ttacaaggag ggttacggtg agggatgcgt gactattcac 3180
gagattgaga acaacactga tgagttgaag ttcagtaact gcgtggagga ggaaatctac 3240
cccaacaaca ctgtgacttg taacgattac accgtgaacc aggaggaata cggaggcgct 3300
tacacctcca gaaaccgtgg atacaatgag gctccctcgg tccccgctga ttatgcctcc 3360
gtctatgagg agaagtccta caccgatgga aggcgcgaga atccctgcga gttcaatcgc 3420
ggctatcgag actacactcc gctacccgtt ggctatgtca caaaggaact ggaatacttc 3480
ccggaaacag acaaagtctg gatcgaaatc ggcgaaacag aagggacgtt catagtcgat 3540
agcgtagaac ttctccttat ggaagaatga 3570
<210> 3
<211> 3570
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC1100的合成核苷酸序列。
<400> 3
atggagattg tgaacaacca gaaccagtgc gttccttaca actgcttgaa caaccctgag 60
attgagattc ttgagggtgg tagaatttct gttggcaaca ctcctattga catctctttg 120
agtttgactc aattcttgtt gagtgagttc gttcctggtg ctggtttcgt cttgggtttg 180
attgatttga tttggggttt cgttggtcct agtcaatggg atgctttctt ggctcaagtt 240
gagcaattga ttaaccagag gatcgctgag gctgtgagga acactgctat tcaagagttg 300
gagggtatgg ctagagttta cagaacttac gctactgctt tcgctgagtg ggagaaggct 360
cctgatgacc ctgagttgag ggaggctttg agaactcaat tcactgctac tgagacttac 420
atcagtggta gaatcagtgt cttgaagatt caaactttcg aggttcaatt gctttctgtg 480
ttcgctcaag ctgcaaactt gcacttgtct ttgcttagag atgttgtgtt ctttggtcaa 540
agatggggtt tctccactac taccgtgaac aattactaca acgatttgac tgagggtatt 600
tctacttaca ctgattacgc tgttagatgg tacaacactg gtttggagag agtttggggt 660
ccagattcca gagattgggt cagatacaac cagttcagaa gggagttgac tttgactgtc 720
ttggacattg ttgctctctt ccctaactac gatagtcgtc gttaccctat tagaactgtt 780
tctcaactta ctagggaaat ctacactaac cctgttcttg agaacttcga tggtagtttc 840
cgtggtagtg ctcaagggat tgagcgttct attcgttctc ctcatcttat ggacattctt 900
aactctatta ctatctacac tgatgctcat cgtggttact attactggtc tggtcatcaa 960
attatggcta gtcctgttgg tttcagtggt cctgagttca ctttccctct ttacggtact 1020
atgggcaacg ctgcacctca acagaggatc gttgctcaac ttggtcaagg tgtttacagg 1080
actctttctt caacccttta caggcgtcct ttcaacattg ggatcaacaa ccagcagctt 1140
tctgttcttg atggaaccga gttcgcttac ggaacctctt caaaccttcc tagtgctgtt 1200
tacaggaagt ctggaaccgt tgacagtctt gatgagattc caccgcagaa caataacgtt 1260
ccacccaggc aaggcttcag tcataggctt tctcatgttt ctatgttccg ctctggattc 1320
agcaactctt cagtttctat tatcagggct ccaatgttct cgtggattca taggtctgcc 1380
gagttcaaca acattatcgc ttccgatagc attaaccaga ttccacttgt taagggattc 1440
cgtgtttggg gaggcacctc tgttattacc ggaccaggct tcaccggagg cgacattctt 1500
cgtcgtaaca ccttcggaga tttcgtttca cttcaagtga acattaactc accaatcacc 1560
cagcgctaca ggcttcgctt ccgctacgca tcatccaggg atgcaagggt gatcgtgctt 1620
accggagcag cctcaaccgg agtgggaggc caagtgagcg tgaacatgcc acttcagaag 1680
acgatggaga tcggcgagaa ccttacctca agaacctttc gttacaccga tttcagcaac 1740
ccattcagct ttcgtgcaaa cccagacatc atagggatct cagagcagcc actgtttgga 1800
gctggatcaa tctcatccgg agagctttac atcgacaaga tcgagatcat actcgcagat 1860
gcaaccttcg aggctgagag cgatctggag cgtgcacaga aggcagtgaa cgcactcttt 1920
acctctacca accagctcgg actcaagacc aacgtgaccg attaccacat cgaccaagtg 1980
agcaacctcg tgacctacct ctcagatgag ttctgcttgg atgagaaacg cgaactcagc 2040
gagaaggtga agcacgcaaa gcgtctctca gatgagcgta acctcctcca ggatagcaat 2100
ttcaaggaca tcaatcgtca gccagagcgt ggatggggag gctcaaccgg aatcaccatc 2160
cagggaggcg atgatgtgtt taaggagaat tacgtgacac tctccggaac attcgatgag 2220
tgctacccaa catacctcta tcagaagatc gacgagtcca agctcaaggc gttcacccgt 2280
tatcagctcc gtggctacat cgaggatagt caagacctgg aaatctacct catccgctac 2340
aatgcaaagc acgagacagt gaatgtacca ggaacaggct ccctctggcc actctccgca 2400
cagtctccaa tcggcaagtg cggcgagcca aatcgctgcg cgccacacct ggagtggaat 2460
cccgacctgg actgctcctg ccgcgacggc gagaagtgcg cccaccactc ccaccacttt 2520
agcctggaca tcgacgtggg ctgtacagac ctgaatgagg atctgggcgt gtgggtgatc 2580
tttaagatca agacacagga cggccacgcc cgcctgggca atctggagtt tctggaggag 2640
aagcctctgg tgggcgaagc cctggcccgc gtgaagcgcg ccgagaagaa atggcgcgac 2700
aaacgcgaga aactggaatg ggaaacaaac atcgtgtaca aagaagccaa agaatccgtg 2760
gacgccctat ttgtgaactc ccagtatgac cagctacagg ccgacacaaa catcgcgatg 2820
atccacgctg cggacaagcg cgtgcactcc atacgcgaag cctatctacc cgaactatcc 2880
gtgatacccg gcgtcaatgc cgcgatcttt gaagaattgg aaggccgcat cttcacagcc 2940
tttagcctct atgacgcccg aaatgtcatc aagaatggcg actttaacaa tgggctatcc 3000
tgttggaatg tcaaagggca cgtggacgtc gaagagcaga acaatcagcg atccgtctta 3060
gtcgtacccg aatgggaagc cgaagtctcc caggaagtcc gagtctgtcc tggtagaggt 3120
tacatcttga gagtgactgc ttacaaggag ggttacggtg agggatgcgt gactattcac 3180
gagattgaga acaacactga tgagttgaag ttcagtaact gcgtggagga ggaaatctac 3240
cccaacaaca ctgtgacttg taacgattac accgtgaacc aggaggaata cggaggcgct 3300
tacacctcca gaaaccgtgg atacaatgag gctccctcgg tccccgctga ttatgcctcc 3360
gtctatgagg agaagtccta caccgatgga aggcgcgaga atccctgcga gttcaatcgc 3420
ggctatcgag actacactcc gctacccgtt ggctatgtca caaaggaact ggaatacttc 3480
ccggaaacag acaaagtctg gatcgaaatc ggcgaaacag aagggacgtt catagtcgat 3540
agcgtagaac ttctccttat ggaagaatga 3570
<210> 4
<211> 1189
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质TIC1100的氨基酸序列。
<400> 4
Met Glu Ile Val Asn Asn Gln Asn Gln Cys Val Pro Tyr Asn Cys Leu
1 5 10 15
Asn Asn Pro Glu Ile Glu Ile Leu Glu Gly Gly Arg Ile Ser Val Gly
20 25 30
Asn Thr Pro Ile Asp Ile Ser Leu Ser Leu Thr Gln Phe Leu Leu Ser
35 40 45
Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Ile Asp Leu Ile
50 55 60
Trp Gly Phe Val Gly Pro Ser Gln Trp Asp Ala Phe Leu Ala Gln Val
65 70 75 80
Glu Gln Leu Ile Asn Gln Arg Ile Ala Glu Ala Val Arg Asn Thr Ala
85 90 95
Ile Gln Glu Leu Glu Gly Met Ala Arg Val Tyr Arg Thr Tyr Ala Thr
100 105 110
Ala Phe Ala Glu Trp Glu Lys Ala Pro Asp Asp Pro Glu Leu Arg Glu
115 120 125
Ala Leu Arg Thr Gln Phe Thr Ala Thr Glu Thr Tyr Ile Ser Gly Arg
130 135 140
Ile Ser Val Leu Lys Ile Gln Thr Phe Glu Val Gln Leu Leu Ser Val
145 150 155 160
Phe Ala Gln Ala Ala Asn Leu His Leu Ser Leu Leu Arg Asp Val Val
165 170 175
Phe Phe Gly Gln Arg Trp Gly Phe Ser Thr Thr Thr Val Asn Asn Tyr
180 185 190
Tyr Asn Asp Leu Thr Glu Gly Ile Ser Thr Tyr Thr Asp Tyr Ala Val
195 200 205
Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg
210 215 220
Asp Trp Val Arg Tyr Asn Gln Phe Arg Arg Glu Leu Thr Leu Thr Val
225 230 235 240
Leu Asp Ile Val Ala Leu Phe Pro Asn Tyr Asp Ser Arg Arg Tyr Pro
245 250 255
Ile Arg Thr Val Ser Gln Leu Thr Arg Glu Ile Tyr Thr Asn Pro Val
260 265 270
Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gln Gly Ile Glu
275 280 285
Arg Ser Ile Arg Ser Pro His Leu Met Asp Ile Leu Asn Ser Ile Thr
290 295 300
Ile Tyr Thr Asp Ala His Arg Gly Tyr Tyr Tyr Trp Ser Gly His Gln
305 310 315 320
Ile Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro
325 330 335
Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gln Gln Arg Ile Val Ala
340 345 350
Gln Leu Gly Gln Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg
355 360 365
Arg Pro Phe Asn Ile Gly Ile Asn Asn Gln Gln Leu Ser Val Leu Asp
370 375 380
Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val
385 390 395 400
Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu Ile Pro Pro Gln
405 410 415
Asn Asn Asn Val Pro Pro Arg Gln Gly Phe Ser His Arg Leu Ser His
420 425 430
Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser Ile Ile
435 440 445
Arg Ala Pro Met Phe Ser Trp Ile His Arg Ser Ala Glu Phe Asn Asn
450 455 460
Ile Ile Ala Ser Asp Ser Ile Asn Gln Ile Pro Leu Val Lys Gly Phe
465 470 475 480
Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly
485 490 495
Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln
500 505 510
Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg
515 520 525
Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala
530 535 540
Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys
545 550 555 560
Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
565 570 575
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly
580 585 590
Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
595 600 605
Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Phe Glu
610 615 620
Ala Glu Ser Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Ala Leu Phe
625 630 635 640
Thr Ser Thr Asn Gln Leu Gly Leu Lys Thr Asn Val Thr Asp Tyr His
645 650 655
Ile Asp Gln Val Ser Asn Leu Val Thr Tyr Leu Ser Asp Glu Phe Cys
660 665 670
Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg
675 680 685
Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Ser Asn Phe Lys Asp Ile
690 695 700
Asn Arg Gln Pro Glu Arg Gly Trp Gly Gly Ser Thr Gly Ile Thr Ile
705 710 715 720
Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Ser Gly
725 730 735
Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile Asp Glu
740 745 750
Ser Lys Leu Lys Ala Phe Thr Arg Tyr Gln Leu Arg Gly Tyr Ile Glu
755 760 765
Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala Lys His
770 775 780
Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala
785 790 795 800
Gln Ser Pro Ile Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro His
805 810 815
Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys
820 825 830
Cys Ala His His Ser His His Phe Ser Leu Asp Ile Asp Val Gly Cys
835 840 845
Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile Lys
850 855 860
Thr Gln Asp Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu
865 870 875 880
Lys Pro Leu Val Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys
885 890 895
Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu Trp Glu Thr Asn Ile Val
900 905 910
Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gln
915 920 925
Tyr Asp Gln Leu Gln Ala Asp Thr Asn Ile Ala Met Ile His Ala Ala
930 935 940
Asp Lys Arg Val His Ser Ile Arg Glu Ala Tyr Leu Pro Glu Leu Ser
945 950 955 960
Val Ile Pro Gly Val Asn Ala Ala Ile Phe Glu Glu Leu Glu Gly Arg
965 970 975
Ile Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn Val Ile Lys Asn
980 985 990
Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn Val Lys Gly His Val
995 1000 1005
Asp Val Glu Glu Gln Asn Asn Gln Arg Ser Val Leu Val Val Pro
1010 1015 1020
Glu Trp Glu Ala Glu Val Ser Gln Glu Val Arg Val Cys Pro Gly
1025 1030 1035
Arg Gly Tyr Ile Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly
1040 1045 1050
Glu Gly Cys Val Thr Ile His Glu Ile Glu Asn Asn Thr Asp Glu
1055 1060 1065
Leu Lys Phe Ser Asn Cys Val Glu Glu Glu Ile Tyr Pro Asn Asn
1070 1075 1080
Thr Val Thr Cys Asn Asp Tyr Thr Val Asn Gln Glu Glu Tyr Gly
1085 1090 1095
Gly Ala Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu Ala Pro Ser
1100 1105 1110
Val Pro Ala Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr
1115 1120 1125
Asp Gly Arg Arg Glu Asn Pro Cys Glu Phe Asn Arg Gly Tyr Arg
1130 1135 1140
Asp Tyr Thr Pro Leu Pro Val Gly Tyr Val Thr Lys Glu Leu Glu
1145 1150 1155
Tyr Phe Pro Glu Thr Asp Lys Val Trp Ile Glu Ile Gly Glu Thr
1160 1165 1170
Glu Gly Thr Phe Ile Val Asp Ser Val Glu Leu Leu Leu Met Glu
1175 1180 1185
Glu
<210> 5
<211> 3672
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC860的重组核苷酸序列。
<400> 5
atgacttcaa ataggaaaaa tgagaatgaa attataaatg ctttatcgat tccaacggta 60
tcgaatcctt ccacgcaaat gaatctatca ccagatgctc gtattgaaga tagcttgtgt 120
gtagccgagg tgaacaatat tgatccattt gttagcgcat caacagtcca aacgggtata 180
aacatagctg gtagaatatt gggcgtatta ggtgtgccgt ttgctggaca actagctagt 240
ttttatagtt ttcttgttgg ggaattatgg cctagtggca gagatccatg ggaaattttc 300
ctggaacatg tagaacaact tataagacaa caagtaacag aaaatactag gaatacggct 360
attgctcgat tagaaggtct aggaagaggc tatagatctt accagcaggc tcttgaaact 420
tggttagata accgaaatga tgcaagatca agaagcatta ttcttgagcg ctatgttgct 480
ttagaacttg acattactac tgctataccg cttttcagaa tacgaaatga agaagttcca 540
ttattaatgg tatatgctca agctgcaaat ttacacctat tattattgag agacgcatcc 600
ctttttggta gtgaatgggg gatggcatct tccgatgtta accaatatta ccaagaacaa 660
atcagatata cagaggaata ttctaaccat tgcgtacaat ggtataatac agggctaaat 720
aacttaagag ggacaaatgc tgaaagttgg ttgcggtata atcaattccg tagagaccta 780
acgttagggg tattagattt agtagcccta ttcccaagct atgatactcg cacttatcca 840
atcaatacga gtgctcagtt aacaagagaa atttatacag atccaattgg gagaacaaat 900
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960
atagaggctg ccattttcag gcctccgcat ctacttgatt ttccagaaca acttacaatt 1020
tacagtgcat caagccgttg gagtagcact caacatatga attattgggt gggacatagg 1080
cttaacttcc gcccaatagg agggacatta aatacctcaa cacaaggact tactaataat 1140
acttcaatta atcctgtaac attacagttt acgtctcgag acgtttatag aacagaatca 1200
aatgcaggga caaatatact atttactact cctgtgaatg gagtaccttg ggctagattt 1260
aattttataa accctcagaa tatttatgaa agaggcgcca ctacctacag tcaaccgtat 1320
cagggagttg ggattcaatt atttgattca gaaactgaat taccaccaga aacaacagaa 1380
cgaccaaatt atgaatcata tagtcataga ttatctcata taggactaat cataggaaac 1440
actttgagag caccagtcta ttcttggacg catcgtagtg cagatcgtac gaatacgatt 1500
ggaccaaata gaattaatca aataccttta gtgaaaggat ttagagtttg ggggggcacc 1560
tctgtcatta caggaccagg atttacagga ggggatatcc ttcgaagaaa tacctttggt 1620
gattttgtat ctctacaagt caatattaat tcaccaatta cccaaagata ccgtttaaga 1680
tttcgttacg cttccagtag ggatgcacga gttatagtat taacaggagc ggcatccaca 1740
ggagtgggag gccaagttag tgtaaatatg cctcttcaga aaactatgga aataggggag 1800
aacttaacat ctagaacatt tagatatacc gattttagta atcctttttc atttagagct 1860
aatccagata taattgggat aagtgaacaa cctctatttg gtgcaggttc tattagtagc 1920
ggtgaacttt atatagataa aattgaaatt attctagcag atgcaacatt tgaagcagaa 1980
tctgatttag aaagagcgca gaaggcggtg aatgcgctgt ttacgtctac aaaccaacta 2040
gggctaaaaa caaatgtaac ggattatcat attgatcaag tgtccaattt agttacgtat 2100
ttatcggatg aattttgtct ggatgaaaag cgagaattgt ccgagaaagt caaacatgcg 2160
aagcgactca gtgatgaacg caatttactc caagattcaa atttcaaaga cattaatagg 2220
caaccagaac gtgggtgggg cggaagtaca gggattacca tccaaggagg ggatgacgta 2280
tttaaagaaa attacgtcac actatcaggt acctttgatg agtgctatcc aacatatttg 2340
tatcaaaaaa tcgatgaatc aaaattaaaa gcctttaccc gttatcaatt aagagggtat 2400
atcgaagata gtcaagactt agaaatctat ttaattcgct acaatgcaaa acatgaaaca 2460
gtaaatgtgc caggtacggg ttccttatgg ccgctttcag cccaaagtcc aatcggaaag 2520
tgtggagagc cgaatcgatg cgcgccacac cttgaatgga atcctgactt agattgttcg 2580
tgtagggatg gagaaaagtg tgcccatcat tcgcatcatt tctccttaga cattgatgta 2640
ggatgtacag acttaaatga ggacctaggt gtatgggtga tctttaagat taagacgcaa 2700
gatgggcacg caagactagg gaatctagag tttctcgaag agaaaccatt agtaggagaa 2760
gcgctagctc gtgtgaaaag agcggagaaa aaatggagag acaaacgtga aaaattggaa 2820
tgggaaacaa atatcgttta taaagaggca aaagaatctg tagatgcttt atttgtaaac 2880
tctcaatatg atcaattaca agcggatacg aatattgcca tgattcatgc ggcagataaa 2940
cgtgttcata gcattcgaga agcttatctg cctgagctgt ctgtgattcc gggtgtcaat 3000
gcggctattt ttgaagaatt agaagggcgt attttcactg cattctccct atatgatgcg 3060
agaaatgtca ttaaaaatgg tgattttaat aatggcttat cctgctggaa cgtgaaaggg 3120
catgtagatg tagaagaaca aaacaaccaa cgttcggtcc ttgttgttcc ggaatgggaa 3180
gcagaagtgt cacaagaagt tcgtgtctgt ccgggtcgtg gctatatcct tcgtgtcaca 3240
gcgtacaagg agggatatgg agaaggttgc gtaaccattc atgagatcga gaacaataca 3300
gacgaactga agtttagcaa ctgcgtagaa gaggaaatct atccaaataa cacggtaacg 3360
tgtaatgatt atactgtaaa tcaagaagaa tacggaggtg cgtacacttc tcgtaatcga 3420
ggatataacg aagctccttc cgtaccagct gattatgcgt cagtctatga agaaaaatcg 3480
tatacagatg gacgaagaga gaatccttgt gaatttaaca gagggtatag ggattacacg 3540
ccactaccag ttggttatgt gacaaaagaa ttagaatact tcccagaaac cgataaggta 3600
tggattgaga ttggagaaac ggaaggaaca tttatcgtgg acagcgtgga attactcctt 3660
atggaggaat ag 3672
<210> 6
<211> 3672
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC860的合成核苷酸序列。
<400> 6
atgaccagca accggaagaa cgagaacgag atcatcaacg ccctgagcat cccgaccgtg 60
agcaacccta gcacccagat gaacctgagc cctgacgctc gcatcgagga ctccctctgc 120
gtggctgagg tgaacaacat cgacccgttc gtgtccgcct ccaccgtgca gaccggcatc 180
aacatcgcgg gccgcatcct cggcgtgctc ggcgtgccct ttgcgggcca gctcgcctcc 240
ttctactcct tcctcgtggg agagctgtgg ccctccggcc gcgacccgtg ggagatcttc 300
ctggagcacg tggagcagct catccgccag caagtcaccg agaacacccg caacaccgcc 360
atcgcccgcc tggagggcct gggccgtggc taccgctcct accagcaagc cctggagacc 420
tggctcgaca accgcaacga cgcccgctcc cgctccatca tcctggagcg ctacgtcgcc 480
ctggaactgg acatcaccac tgccatccca ctcttccgca tcaggaacga ggaggtgcct 540
ctgctgatgg tgtacgccca ggctgcgaac ctgcacctgc tgctgctgcg cgacgcaagc 600
ctgtttggct ccgagtgggg tatggcaagc tccgacgtca accagtacta ccaggagcag 660
atccgctaca ccgaggagta cagcaaccac tgcgtccagt ggtacaacac cggtctgaac 720
aatctcagag ggaccaacgc tgagagctgg ctgcgctaca accagttccg gcgggatctg 780
accctaggtg tcctggatct ggtcgctctg ttcccgagct acgataccag gacgtaccct 840
atcaacacct ctgctcagct taccagggag atctacactg atcctatcgg taggactaac 900
gctcctagtg gtttcgccag cactaactgg ttcaacaaca acgcgcctag tttctctgcc 960
atcgaggcgg cgatcttccg gcctcctcac ctcctcgact tcccggagca gcttactatc 1020
tactctgcgt cttcgcggtg gtcttcgact cagcacatga actactgggt tggtcaccgg 1080
cttaacttcc gcccgattgg aggaactctt aacaccagta cgcaaggtct tacgaacaac 1140
acttccatca acccggttac gttgcagttc acgtctcggg acgtttaccg gacggagtcg 1200
aatgctggga cgaacatcct gttcacgaca ccggtgaatg gtgttccgtg ggcacgtttc 1260
aacttcatca acccgcagaa catctacgag cgtggagcaa cgacatactc gcaaccatac 1320
caaggcgttg gcatccaact gtttgactcg gagacggaac tgccaccaga gacgacagaa 1380
cgtccgaatt acgagtcata ctcacacaga ctatcacaca ttggactcat tatcggaaac 1440
acactgagag caccagtgta ctcatggaca catcggtcag cagatcgtac gaacaccatc 1500
ggacccaatc ggatcaacca gatcccgctc gtgaagggct tccgcgtgtg gggcggcacc 1560
tccgtcatca ccggtccggg cttcaccggc ggcgacatcc tccgccgcaa caccttcggc 1620
gacttcgtgt cactccaagt gaacatcaac agcccgatca cccagcgcta tcgcctccgc 1680
ttccgctacg cctcctcccg cgacgctaga gtgatcgtgc tcaccggagc ggcgtccaca 1740
ggcgtaggcg gccaagtgtc tgtgaacatg ccgctccaga agactatgga gattggtgag 1800
aacctcacct ctcgcacctt ccgctacacc gacttctcca atccgttctc cttcagagcc 1860
aacccagaca tcatcggcat ctccgagcag cctctctttg gcgctggctc catctcctcc 1920
ggcgagctgt acatcgacaa gattgagatc atccttgccg acgccacctt cgaagctgag 1980
tccgatctcg agcgcgccca gaaggccgtg aacgccctct tcactagcac taaccagctc 2040
ggcctcaaga ctaacgtgac cgactaccac attgaccaag tgagcaacct agtgacctac 2100
cttagcgacg agttctgcct tgacgagaag cgtgagctga gcgagaaggt gaagcacgcc 2160
aagcgcctct ccgacgagcg caacctcctc caggactcca acttcaagga catcaaccgc 2220
cagcccgagc gcggctgggg cggtagcacc ggcatcacca tccagggcgg tgacgatgtg 2280
ttcaaggaga actacgtgac cctctccggc accttcgacg agtgctaccc gacctacctc 2340
taccagaaga tcgacgagtc caagctcaag gcgttcaccc gctaccagct tcgcggctac 2400
atcgaggact cccaggatct ggagatctac ctcatccgct acaacgccaa gcacgagacc 2460
gtgaacgtgc ccggcaccgg ctccctctgg ccgctctccg cccagagccc tatcggcaag 2520
tgcggcgagc ccaaccgctg cgcgcctcac ctggagtgga accctgacct cgactgctcc 2580
tgccgcgacg gcgagaagtg cgcccaccat agccaccact tctctctcga catcgacgtg 2640
ggctgcaccg acctcaacga ggatctgggc gtgtgggtga tcttcaagat caagacccag 2700
gacggccacg ccaggctggg caacctggag ttcctggagg agaagcctct ggtgggtgag 2760
gccctggcca gggtcaagag ggctgagaag aaatggaggg acaagaggga gaagctggag 2820
tgggagacca acatcgtgta caaggaggct aaggagtccg tggacgctct gttcgtcaac 2880
tctcagtacg atcagctcca ggctgacacc aacatcgcta tgatccacgc tgcggataag 2940
agggtccact ctatcaggga ggcttacctg cctgagcttt ctgtcatccc tggtgtcaac 3000
gcggcaatct tcgaggaact tgagggccgc atcttcactg cgttctcgct ttacgatgcg 3060
cggaacgtca ttaagaacgg tgacttcaac aatggtcttt cgtgctggaa cgtcaagggt 3120
catgtcgatg tcgaggaaca gaacaaccag cggtcggtcc ttgtcgttcc cgagtgggag 3180
gccgaggtct cgcaagaggt ccgggtctgc cctgggcgcg ggtacattct tcgtgtcact 3240
gcgtacaagg agggctacgg cgagggctgc gttactattc atgagattga gaacaatacg 3300
gatgagctta agtttagtaa ctgtgttgag gaggagatct acccgaacaa tacggttacg 3360
tgcaatgatt acacggtgaa ccaggaggaa tacggcggag catacacctc acgtaataga 3420
gggtacaatg aggcaccgtc agttccggca gattatgcct cagtttatga ggagaagtcc 3480
tacacggatg gaagacgcga gaatccatgt gagtttaata gaggataccg agactacaca 3540
ccactcccag ttggatacgt tacaaaggag ttggaatact tcccagaaac agataaagtt 3600
tggatagaga tcggagaaac agaaggaacc ttcatcgtgg acagtgtaga actgctgctg 3660
atggaagaat ga 3672
<210> 7
<211> 1223
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质TIC860的氨基酸序列。
<400> 7
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Thr Val Ser Asn Pro Ser Thr Gln Met Asn Leu Ser Pro Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Val Ala Glu Val Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Leu Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Ser Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asn Thr Ala Ile Ala Arg Leu Glu Gly Leu Gly
115 120 125
Arg Gly Tyr Arg Ser Tyr Gln Gln Ala Leu Glu Thr Trp Leu Asp Asn
130 135 140
Arg Asn Asp Ala Arg Ser Arg Ser Ile Ile Leu Glu Arg Tyr Val Ala
145 150 155 160
Leu Glu Leu Asp Ile Thr Thr Ala Ile Pro Leu Phe Arg Ile Arg Asn
165 170 175
Glu Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Trp Gly Met
195 200 205
Ala Ser Ser Asp Val Asn Gln Tyr Tyr Gln Glu Gln Ile Arg Tyr Thr
210 215 220
Glu Glu Tyr Ser Asn His Cys Val Gln Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Thr Tyr Pro Ile Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Ile Phe Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Tyr Ser Ala Ser Ser Arg Trp Ser Ser Thr Gln His
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Asn Phe Arg Pro Ile Gly Gly
355 360 365
Thr Leu Asn Thr Ser Thr Gln Gly Leu Thr Asn Asn Thr Ser Ile Asn
370 375 380
Pro Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser
385 390 395 400
Asn Ala Gly Thr Asn Ile Leu Phe Thr Thr Pro Val Asn Gly Val Pro
405 410 415
Trp Ala Arg Phe Asn Phe Ile Asn Pro Gln Asn Ile Tyr Glu Arg Gly
420 425 430
Ala Thr Thr Tyr Ser Gln Pro Tyr Gln Gly Val Gly Ile Gln Leu Phe
435 440 445
Asp Ser Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr
450 455 460
Glu Ser Tyr Ser His Arg Leu Ser His Ile Gly Leu Ile Ile Gly Asn
465 470 475 480
Thr Leu Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg
485 490 495
Thr Asn Thr Ile Gly Pro Asn Arg Ile Asn Gln Ile Pro Leu Val Lys
500 505 510
Gly Phe Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe
515 520 525
Thr Gly Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser
530 535 540
Leu Gln Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg
545 550 555 560
Phe Arg Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly
565 570 575
Ala Ala Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu
580 585 590
Gln Lys Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg
595 600 605
Tyr Thr Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile
610 615 620
Ile Gly Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser
625 630 635 640
Gly Glu Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr
645 650 655
Phe Glu Ala Glu Ser Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Ala
660 665 670
Leu Phe Thr Ser Thr Asn Gln Leu Gly Leu Lys Thr Asn Val Thr Asp
675 680 685
Tyr His Ile Asp Gln Val Ser Asn Leu Val Thr Tyr Leu Ser Asp Glu
690 695 700
Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys His Ala
705 710 715 720
Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Ser Asn Phe Lys
725 730 735
Asp Ile Asn Arg Gln Pro Glu Arg Gly Trp Gly Gly Ser Thr Gly Ile
740 745 750
Thr Ile Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu
755 760 765
Ser Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile
770 775 780
Asp Glu Ser Lys Leu Lys Ala Phe Thr Arg Tyr Gln Leu Arg Gly Tyr
785 790 795 800
Ile Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala
805 810 815
Lys His Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu
820 825 830
Ser Ala Gln Ser Pro Ile Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala
835 840 845
Pro His Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg Asp Gly
850 855 860
Glu Lys Cys Ala His His Ser His His Phe Ser Leu Asp Ile Asp Val
865 870 875 880
Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val Ile Phe Lys
885 890 895
Ile Lys Thr Gln Asp Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu
900 905 910
Glu Glu Lys Pro Leu Val Gly Glu Ala Leu Ala Arg Val Lys Arg Ala
915 920 925
Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu Trp Glu Thr Asn
930 935 940
Ile Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn
945 950 955 960
Ser Gln Tyr Asp Gln Leu Gln Ala Asp Thr Asn Ile Ala Met Ile His
965 970 975
Ala Ala Asp Lys Arg Val His Ser Ile Arg Glu Ala Tyr Leu Pro Glu
980 985 990
Leu Ser Val Ile Pro Gly Val Asn Ala Ala Ile Phe Glu Glu Leu Glu
995 1000 1005
Gly Arg Ile Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn Val
1010 1015 1020
Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn Val
1025 1030 1035
Lys Gly His Val Asp Val Glu Glu Gln Asn Asn Gln Arg Ser Val
1040 1045 1050
Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser Gln Glu Val Arg
1055 1060 1065
Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr Ala Tyr Lys
1070 1075 1080
Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu Ile Glu Asn
1085 1090 1095
Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu Glu Ile
1100 1105 1110
Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Thr Val Asn Gln
1115 1120 1125
Glu Glu Tyr Gly Gly Ala Tyr Thr Ser Arg Asn Arg Gly Tyr Asn
1130 1135 1140
Glu Ala Pro Ser Val Pro Ala Asp Tyr Ala Ser Val Tyr Glu Glu
1145 1150 1155
Lys Ser Tyr Thr Asp Gly Arg Arg Glu Asn Pro Cys Glu Phe Asn
1160 1165 1170
Arg Gly Tyr Arg Asp Tyr Thr Pro Leu Pro Val Gly Tyr Val Thr
1175 1180 1185
Lys Glu Leu Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp Ile Glu
1190 1195 1200
Ile Gly Glu Thr Glu Gly Thr Phe Ile Val Asp Ser Val Glu Leu
1205 1210 1215
Leu Leu Met Glu Glu
1220
<210> 8
<211> 3564
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC867的重组核苷酸序列。
<400> 8
atgacttcaa ataggaaaaa tgagaatgaa attataaatg ctttatcgat tccagctgta 60
tcgaatcatt ccgcacaaat gaatctatca accgatgctc gtattgagga tagcttgtgt 120
atagccgagg ggaacaatat cgatccattt gttagcgcat caacagtcca aacgggtatt 180
aacatagctg gtagaatact aggtgtatta ggcgtaccgt ttgctggaca aatagctagt 240
ttttatagtt ttcttgttgg tgaattatgg ccccgcggca gagatccttg ggaaattttc 300
ctagaacatg tcgaacaact tataagacaa caagtaacag aaaatactag ggatacggct 360
cttgctcgat tacaaggttt aggaaattcc tttagagcct atcaacagtc acttgaagat 420
tggctagaaa accgtgatga tgcaagaacg agaagtgttc tttataccca atatatagcc 480
ttagaacttg attttcttaa tgcgatgccg cttttcgcaa ttagaaacca agaagttcca 540
ttattaatgg tatatgctca agctgcaaat ttacacctat tattattgag agatgcctct 600
ctttttggta gtgaatttgg gcttacatcc caagaaattc aacgttatta tgagcgccaa 660
gtggaaaaaa cgagagaata ttctgattat tgcgcaagat ggtataatac gggtttaaat 720
aatttgagag ggacaaatgc tgaaagttgg ttgcgatata atcaattccg tagagactta 780
acgctaggag tattagatct agtggcacta ttcccaagct atgacacgcg tgtttatcca 840
atgaatacca gtgctcaatt aacaagagaa atttatacag atccaattgg gagaacaaat 900
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960
atagaggctg ccgttattag gcctccgcat ctacttgatt ttccagaaca gcttacaatt 1020
ttcagcgtat taagtcgatg gagtaatact caatatatga attactgggt gggacataga 1080
cttgaatcgc gaacaataag ggggtcatta agtacctcga cacacggaaa taccaatact 1140
tctattaatc ctgtaacatt acagttcaca tctcgagacg tttatagaac agaatcattt 1200
gcagggataa atatacttct aactactcct gtgaatggag taccttgggc tagatttaat 1260
tggagaaatc ccctgaattc tcttagaggt agccttctct atactatagg gtatactgga 1320
gtggggacac aactatttga ttcagaaact gaattaccac cagaaacaac agaacgacca 1380
aattatgaat cttacagtca tagattatct aatataagac taatatcagg aaacactttg 1440
agagcaccag tatattcttg gacgcaccgt agtgcagatc gtacaaatac cattagttca 1500
gatagcatta cacaaatacc attggtaaag gcgcataccc tccaatcggg taccactgta 1560
gtaaaagggc cagggtttac aggaggggat atcctccgtc gaacaagtgg aggaccattt 1620
gcttttagta atgttaatct agattttaac ttgtcacaaa ggtatcgtgc tagaattcgt 1680
tatgcctcta ctactaacct aagaatttac gtaacggttg caggtgaacg aatttttgct 1740
ggtcaatttg acaaaactat ggatgctggt gccccattaa cattccaatc ttttagttac 1800
gcaactatta atacagcttt tacattccca gaaagatcga gcagcttgac tgtaggtgcc 1860
gatacgttta gttcaggtaa tgaagtttat gtagatagat ttgaattaat cccagttact 1920
gcaaccttcg aggcagaatc tgatttagaa agagcacaaa aggcggtgaa tgagctgttt 1980
acttcttcca atcaaatcgg gttaaaaaca gatgtgacgg attatcatat tgatcaagta 2040
tccaatttag ttgagtgttt atctgatgaa ttttgtctgg atgaaaaaaa agaattgtcc 2100
gagaaagtca aacatgcgaa gcgacttagt gatgagcgga atttacttca agatccaaac 2160
tttagaggga tcaatagaca actagaccgt ggctggagag gaagtacgga tattaccatc 2220
caaggaggcg atgacgtatt caaagagaat tacgttacgc tattgggtac ctttgatgag 2280
tgctatccaa cgtatttata tcaaaaaata gatgagtcga aattaaaagc ctatacccgt 2340
taccaattaa gagggtatat cgaagatagt caagacttag aaatctattt aattcgctac 2400
aatgccaaac acgaaacagt aaatgtgcca ggtacgggtt ccttatggcc gctttcagcc 2460
ccaagtccaa tcggaaaatg tgcccatcat tcccatcatt tctccttgga cattgatgtt 2520
ggatgtacag acttaaatga ggacttaggt gtatgggtga tattcaagat taagacgcaa 2580
gatggccatg caagactagg aaatctagaa tttctcgaag agaaaccatt agtaggagaa 2640
gcactagctc gtgtgaaaag agcggagaaa aaatggagag acaaacgtga aaaattggaa 2700
tgggaaacaa atattgttta taaagaggca aaagaatctg tagatgcttt atttgtaaac 2760
tctcaatatg atagattaca agcggatacc aacatcgcga tgattcatgc ggcagataaa 2820
cgcgttcata gcattcgaga agcttatctg cctgagctgt ctgtgattcc gggtgtcaat 2880
gcggctattt ttgaagaatt agaagggcgt attttcactg cattctccct atatgatgcg 2940
agaaatgtca ttaaaaatgg tgattttaat aatggcttat cctgctggaa cgtgaaaggg 3000
catgtagatg tagaagaaca aaacaaccac cgttcggtcc ttgttgttcc ggaatgggaa 3060
gcagaagtgt cacaagaagt tcgtgtctgt ccgggtcgtg gctatatcct tcgtgtcaca 3120
gcgtacaagg agggatatgg agaaggttgc gtaaccattc atgagatcga gaacaataca 3180
gacgaactga agtttagcaa ctgtgtagaa gaggaagtat atccaaacaa cacggtaacg 3240
tgtaatgatt atactgcgac tcaagaagaa tatgagggta cgtacacttc tcgtaatcga 3300
ggatatgacg gagcctatga aagcaattct tctgtaccag ctgattatgc atcagcctat 3360
gaagaaaaag catatacaga tggacgaaga gacaatcctt gtgaatctaa cagaggatat 3420
ggggattaca caccactacc agctggctat gtgacaaaag aattagagta cttcccagaa 3480
accgataagg tatggattga gatcggagaa acggaaggaa cattcatcgt ggacagcgtg 3540
gaattacttc ttatggagga atag 3564
<210> 9
<211> 3564
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC867的合成核苷酸序列。
<400> 9
atgaccagca accgaaagaa cgagaacgag atcatcaacg ccctgtccat accggccgtg 60
tcaaaccact ccgcccagat gaacctctcc accgacgcga ggatcgagga ctccctctgc 120
atcgccgagg gcaacaacat cgacccgttc gtgtctgcaa gcacggtcca gaccggcatc 180
aacatcgcgg gccgcatcct gggcgtgctc ggcgtgccct tcgcgggtca aatcgcctct 240
ttctactcat tcctcgtggg cgagctgtgg ccgcgcggac gtgacccgtg ggaaatcttc 300
ctggagcacg ttgagcagct catccggcag caagtgaccg agaacaccag ggacaccgca 360
ctggcacggc tccagggcct tggcaacagc ttccgcgcct accagcagtc gctggaggac 420
tggctggaga accgagacga cgccagaacc cgctcagttc tgtacacaca gtacatcgcc 480
ctagagctgg acttcctcaa cgctatgccg ctcttcgcca tccgtaacca ggaagtaccg 540
cttctgatgg tgtacgcaca agcagcgaac ctccatctgc tcctgctgcg agacgcatct 600
ctgttcggca gtgagttcgg gctgacgagc caggagatcc agcgctacta cgagcgccaa 660
gtggagaaga ctcgtgagta cagcgactac tgcgcgcgct ggtacaacac gggcttgaac 720
aaccttcgcg ggacaaacgc cgaatcctgg cttcgctaca accagttccg ccgcgacctc 780
acgctgggtg tgctggacct ggtcgcgctc ttcccgtcct acgacacacg ggtgtaccca 840
atgaacacga gcgcacagct cacccgtgag atctacacag atcccatcgg ccgcaccaac 900
gctcccagtg gcttcgcaag cacgaattgg ttcaacaata acgctccttc tttctctgcc 960
atcgaggccg ctgtcatcag accgccgcac ttactcgatt tcccggagca gctcactatc 1020
ttctctgtgt tgtcccggtg gtcgaacacg cagtacatga actactgggt gggccacagg 1080
ctagagagcc ggaccatccg tggcagtctc tcaacctcga cccacggcaa cacgaacacg 1140
agcatcaacc ctgtcactct ccagtttaca tctagggacg tttacaggac agagtcgttc 1200
gctggcatta acattctgtt gaccactccg gtgaacggcg tcccttgggc ccgcttcaac 1260
tggaggaatc ctctgaactc actgcgcggc agccttctct acactatcgg ctacaccggc 1320
gttgggacgc aactcttcga ctcggagacc gagctgccgc ccgagaccac cgagcggcct 1380
aactacgaga gttattcaca caggctctcc aacatccgct tgatttctgg gaacaccttg 1440
cgggctccgg tgtactcctg gacgcaccgc agcgccgaca gaactaatac catcagctcc 1500
gactcgatca cccagatccc gctggtgaag gctcacacgc ttcagtcggg caccacagtc 1560
gtcaagggcc ctggcttcac cggcggcgac atcctgcgtc gcacatctgg cggacccttc 1620
gccttcagca acgtgaactt ggacttcaat ttgtcacagc ggtatcgtgc cagaatccgg 1680
tacgccagca ctacgaacct gcgaatctat gttactgtgg cgggcgagcg gatcttcgcc 1740
gggcaattcg acaagacgat ggacgcggga gcacctctga cattccagtc attctcttac 1800
gccacgatca acacggcatt cacgtttccg gagcgttcca gtagcctgac cgtgggcgct 1860
gataccttca gtagcgggaa cgaggtgtac gttgaccgtt tcgagctgat cccggtcacc 1920
gccaccttcg aagctgagtc ggacctggag cgtgcacaga aggcagtcaa cgagctgttc 1980
acctctagca accagatcgg cctcaagacc gacgtcacag actaccacat cgaccaagtg 2040
tccaacctgg tcgagtgcct tagcgacgag ttctgcctag acgagaagaa ggagctgtcg 2100
gagaaggtca aacacgccaa gcgtctgagc gatgagcgca acctgctcca agaccctaac 2160
ttccgtggca tcaacaggca gcttgaccgt ggctggcgcg gctcgacgga catcacgatc 2220
cagggtggcg acgacgtatt caaggagaat tacgtgacct tgcttgggac gtttgacgag 2280
tgctatccca cctacctcta ccagaagatt gatgaatcga aattgaaggc gtacacgaga 2340
taccagctcc gtggctacat cgaggacagc caggacttgg agatctacct catacgctac 2400
aacgctaaac atgagaccgt gaacgtccct gggacgggca gtctgtggcc actctctgct 2460
cctagcccta tcggcaagtg cgctcaccac tcgcaccact tcagccttga catcgacgtg 2520
ggatgtactg acctcaacga agacctgggc gtctgggtta tcttcaagat caagacccag 2580
gacggccacg cccgactcgg caacctggag ttcctggagg agaaaccact ggtgggcgag 2640
gcgctcgccc gcgtgaagcg tgccgagaag aagtggcggg acaagaggga gaagctagaa 2700
tgggagacga acatcgtgta caaggaggcc aaggaaagcg tcgatgccct gttcgtgaac 2760
tcacagtacg accgtctcca ggcggacacg aacatcgcca tgatccacgc ggctgacaag 2820
cgcgtccact ccatccgcga ggcgtactta ccggagctgt cggtgatccc aggcgtaaac 2880
gcggcgatct tcgaggagct agagggacgc atcttcacag cgttcagcct gtacgacgca 2940
cgcaacgtca tcaagaacgg cgatttcaac aacggactgt cctgctggaa cgtgaagggc 3000
cacgtcgatg tcgaggaaca gaacaaccac cgctctgtcc tggtggtccc agagtgggag 3060
gccgaggtct cccaggaggt ccgcgtgtgc cctgggcgtg gctacatcct ccgtgtgaca 3120
gcctacaagg agggctacgg tgagggctgc gtcaccattc acgagatcga gaacaacact 3180
gacgaactca agttctcgaa ttgcgtggag gaggaggtgt acccgaacaa tacggtgacg 3240
tgcaacgact acacggcaac ccaagaggag tacgagggca cctacaccag taggaaccgt 3300
ggctacgacg gtgcctacga gtcgaactcc agcgtccctg cggactacgc cagcgcgtac 3360
gaggagaagg cttacaccga cggacgccgg gacaacccat gcgagagcaa ccgtggctac 3420
ggcgactaca ctcctctccc ggccggatac gtcacaaagg agctggagta tttcccagag 3480
acggacaagg tgtggatcga aatcggagag acagagggaa ccttcatcgt ggacagcgtg 3540
gagctgctcc tcatggagga gtga 3564
<210> 10
<211> 1187
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质TIC867的氨基酸序列。
<400> 10
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Thr Gln Ile Pro Leu Val Lys Ala His
500 505 510
Thr Leu Gln Ser Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Phe Ser Asn
530 535 540
Val Asn Leu Asp Phe Asn Leu Ser Gln Arg Tyr Arg Ala Arg Ile Arg
545 550 555 560
Tyr Ala Ser Thr Thr Asn Leu Arg Ile Tyr Val Thr Val Ala Gly Glu
565 570 575
Arg Ile Phe Ala Gly Gln Phe Asp Lys Thr Met Asp Ala Gly Ala Pro
580 585 590
Leu Thr Phe Gln Ser Phe Ser Tyr Ala Thr Ile Asn Thr Ala Phe Thr
595 600 605
Phe Pro Glu Arg Ser Ser Ser Leu Thr Val Gly Ala Asp Thr Phe Ser
610 615 620
Ser Gly Asn Glu Val Tyr Val Asp Arg Phe Glu Leu Ile Pro Val Thr
625 630 635 640
Ala Thr Phe Glu Ala Glu Ser Asp Leu Glu Arg Ala Gln Lys Ala Val
645 650 655
Asn Glu Leu Phe Thr Ser Ser Asn Gln Ile Gly Leu Lys Thr Asp Val
660 665 670
Thr Asp Tyr His Ile Asp Gln Val Ser Asn Leu Val Glu Cys Leu Ser
675 680 685
Asp Glu Phe Cys Leu Asp Glu Lys Lys Glu Leu Ser Glu Lys Val Lys
690 695 700
His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn
705 710 715 720
Phe Arg Gly Ile Asn Arg Gln Leu Asp Arg Gly Trp Arg Gly Ser Thr
725 730 735
Asp Ile Thr Ile Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val
740 745 750
Thr Leu Leu Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln
755 760 765
Lys Ile Asp Glu Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Gln Leu Arg
770 775 780
Gly Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr
785 790 795 800
Asn Ala Lys His Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu Trp
805 810 815
Pro Leu Ser Ala Pro Ser Pro Ile Gly Lys Cys Ala His His Ser His
820 825 830
His Phe Ser Leu Asp Ile Asp Val Gly Cys Thr Asp Leu Asn Glu Asp
835 840 845
Leu Gly Val Trp Val Ile Phe Lys Ile Lys Thr Gln Asp Gly His Ala
850 855 860
Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Val Gly Glu
865 870 875 880
Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg
885 890 895
Glu Lys Leu Glu Trp Glu Thr Asn Ile Val Tyr Lys Glu Ala Lys Glu
900 905 910
Ser Val Asp Ala Leu Phe Val Asn Ser Gln Tyr Asp Arg Leu Gln Ala
915 920 925
Asp Thr Asn Ile Ala Met Ile His Ala Ala Asp Lys Arg Val His Ser
930 935 940
Ile Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val Ile Pro Gly Val Asn
945 950 955 960
Ala Ala Ile Phe Glu Glu Leu Glu Gly Arg Ile Phe Thr Ala Phe Ser
965 970 975
Leu Tyr Asp Ala Arg Asn Val Ile Lys Asn Gly Asp Phe Asn Asn Gly
980 985 990
Leu Ser Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gln Asn
995 1000 1005
Asn His Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val
1010 1015 1020
Ser Gln Glu Val Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg
1025 1030 1035
Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile
1040 1045 1050
His Glu Ile Glu Asn Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys
1055 1060 1065
Val Glu Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp
1070 1075 1080
Tyr Thr Ala Thr Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg
1085 1090 1095
Asn Arg Gly Tyr Asp Gly Ala Tyr Glu Ser Asn Ser Ser Val Pro
1100 1105 1110
Ala Asp Tyr Ala Ser Ala Tyr Glu Glu Lys Ala Tyr Thr Asp Gly
1115 1120 1125
Arg Arg Asp Asn Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr
1130 1135 1140
Thr Pro Leu Pro Ala Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe
1145 1150 1155
Pro Glu Thr Asp Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly
1160 1165 1170
Thr Phe Ile Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1175 1180 1185
<210> 11
<211> 3642
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC867_20的重组核苷酸序列。
<400> 11
atgacttcaa ataggaaaaa tgagaatgaa attataaatg ctttatcgat tccagctgta 60
tcgaatcatt ccgcacaaat gaatctatca accgatgctc gtattgagga tagcttgtgt 120
atagccgagg ggaacaatat cgatccattt gttagcgcat caacagtcca aacgggtatt 180
aacatagctg gtagaatact aggtgtatta ggcgtaccgt ttgctggaca aatagctagt 240
ttttatagtt ttcttgttgg tgaattatgg ccccgcggca gagatccttg ggaaattttc 300
ctagaacatg tcgaacaact tataagacaa caagtaacag aaaatactag ggatacggct 360
cttgctcgat tacaaggttt aggaaattcc tttagagcct atcaacagtc acttgaagat 420
tggctagaaa accgtgatga tgcaagaacg agaagtgttc tttataccca atatatagcc 480
ttagaacttg attttcttaa tgcgatgccg cttttcgcaa ttagaaacca agaagttcca 540
ttattaatgg tatatgctca agctgcaaat ttacacctat tattattgag agatgcctct 600
ctttttggta gtgaatttgg gcttacatcc caagaaattc aacgttatta tgagcgccaa 660
gtggaaaaaa cgagagaata ttctgattat tgcgcaagat ggtataatac gggtttaaat 720
aatttgagag ggacaaatgc tgaaagttgg ttgcgatata atcaattccg tagagactta 780
acgctaggag tattagatct agtggcacta ttcccaagct atgacacgcg tgtttatcca 840
atgaatacca gtgctcaatt aacaagagaa atttatacag atccaattgg gagaacaaat 900
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960
atagaggctg ccgttattag gcctccgcat ctacttgatt ttccagaaca gcttacaatt 1020
ttcagcgtat taagtcgatg gagtaatact caatatatga attactgggt gggacataga 1080
cttgaatcgc gaacaataag ggggtcatta agtacctcga cacacggaaa taccaatact 1140
tctattaatc ctgtaacatt acagttcaca tctcgagacg tttatagaac agaatcattt 1200
gcagggataa atatacttct aactactcct gtgaatggag taccttgggc tagatttaat 1260
tggagaaatc ccctgaattc tcttagaggt agccttctct atactatagg gtatactgga 1320
gtggggacac aactatttga ttcagaaact gaattaccac cagaaacaac agaacgacca 1380
aattatgaat cttacagtca tagattatct aatataagac taatatcagg aaacactttg 1440
agagcaccag tatattcttg gacgcaccgt agtgcagatc gtacaaatac cattagttca 1500
gatagcatta cacaaatacc attggtaaag gcgcataccc tccaatcggg taccactgta 1560
gtaaaagggc cagggtttac aggaggggat atcctccgtc gaacaagtgg aggaccattt 1620
gcttttagta atgttaatct agattttaac ttgtcacaaa ggtatcgtgc tagaattcgt 1680
tatgcctcta ctactaacct aagaatttac gtaacggttg caggtgaacg aatttttgct 1740
ggtcaatttg acaaaactat ggatgctggt gccccattaa cattccaatc ttttagttac 1800
gcaactatta atacagcttt tacattccca gaaagatcga gcagcttgac tgtaggtgcc 1860
gatacgttta gttcaggtaa tgaagtttat gtagatagat ttgaattaat cccagttact 1920
gcaacctttg aggcagaata tgatttagaa agagcgcaaa aggtggtgaa tgccctgttt 1980
acgtctacaa accaactagg gctaaaaaca gatgtgacgg attatcatat tgatcaggta 2040
tccaatctag ttgcgtgttt atcggatgaa ttttgtctgg atgaaaagag agaattgtcc 2100
gagaaagtta aacatgcaaa gcgactcagt gatgagcgga atttacttca agatccaaac 2160
ttcagaggga tcaataggca accagaccgt ggctggagag gaagtacgga tattactatc 2220
caaggaggag atgacgtatt caaagagaat tacgttacgc taccgggtac ctttgatgag 2280
tgctatccaa cgtatttata tcaaaaaata gatgagtcga aattaaaagc ctatacccgt 2340
tatcaattaa gagggtatat cgaagatagt caagacttag aaatctattt aattcgttac 2400
aatgcaaaac acgaaatagt aaatgtacca ggtacaggaa gtttatggcc tctttctgta 2460
gaaaatcaaa ttggaccttg tggagaaccg aatcgatgcg cgccacacct tgaatggaat 2520
cctgatttac actgttcctg cagagacggg gaaaaatgtg cacatcattc tcatcatttc 2580
tctttggaca ttgatgttgg atgtacagac ttaaatgagg acttaggtgt atgggtgata 2640
ttcaagatta agacgcaaga tggccacgca cgactaggga atctagagtt tctcgaagag 2700
aaaccattat taggagaagc actagctcgt gtgaaaagag cggagaaaaa atggagagac 2760
aaacgcgaaa cattacaatt ggaaacaact atcgtttata aagaggcaaa agaatctgta 2820
gatgctttat ttgtaaactc tcaatatgat agattacaag cggatacgaa catcgcgatg 2880
attcatgcgg cagataaacg cgttcataga attcgagaag cgtatctgcc ggagctgtct 2940
gtgattccgg gtgtcaatgc ggctattttt gaagaattag aagagcgtat tttcactgca 3000
ttttccctat atgatgcgag aaatattatt aaaaatggcg atttcaataa tggcttatta 3060
tgctggaacg tgaaagggca tgtagaggta gaagaacaaa acaatcaccg ttcagtcctg 3120
gttatcccag aatgggaggc agaagtgtca caagaggttc gtgtctgtcc aggtcgtggc 3180
tatatccttc gtgttacagc gtacaaagag ggatatggag aaggttgcgt aacgatccat 3240
gagatcgaga acaatacaga cgaactgaaa ttcaacaact gtgtagaaga ggaagtatat 3300
ccaaacaaca cggtaacgtg tattaattat actgcgactc aagaagaata tgagggtacg 3360
tacacttctc gtaatcgagg atatgacgaa gcctatggta ataacccttc cgtaccagct 3420
gattatgcgt cagtctatga agaaaaatcg tatacagata gacgaagaga gaatccttgt 3480
gaatctaaca gaggatatgg agattacaca ccactaccag ctggttatgt aacaaaggaa 3540
ttagagtact tcccagagac cgataaggta tggattgaga ttggagaaac agaaggaaca 3600
ttcatcgtgg acagcgtgga attactcctt atggaggaat ag 3642
<210> 12
<211> 3642
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC867_20的合成核苷酸序列。
<400> 12
atgaccagca accgaaagaa cgagaacgag atcatcaacg ccctgtccat accggccgtg 60
tcaaaccact ccgcccagat gaacctctcc accgacgcga ggatcgagga ctccctctgc 120
atcgccgagg gcaacaacat cgacccgttc gtgtctgcaa gcacggtcca gaccggcatc 180
aacatcgcgg gccgcatcct gggcgtgctc ggcgtgccct tcgcgggtca aatcgcctct 240
ttctactcat tcctcgtggg cgagctgtgg ccgcgcggac gtgacccgtg ggaaatcttc 300
ctggagcacg ttgagcagct catccggcag caagtgaccg agaacaccag ggacaccgca 360
ctggcacggc tccagggcct tggcaacagc ttccgcgcct accagcagtc gctggaggac 420
tggctggaga accgagacga cgccagaacc cgctcagttc tgtacacaca gtacatcgcc 480
ctagagctgg acttcctcaa cgctatgccg ctcttcgcca tccgtaacca ggaagtaccg 540
cttctgatgg tgtacgcaca agcagcgaac ctccatctgc tcctgctgcg agacgcatct 600
ctgttcggca gtgagttcgg gctgacgagc caggagatcc agcgctacta cgagcgccaa 660
gtggagaaga ctcgtgagta cagcgactac tgcgcgcgct ggtacaacac gggcttgaac 720
aaccttcgcg ggacaaacgc cgaatcctgg cttcgctaca accagttccg ccgcgacctc 780
acgctgggtg tgctggacct ggtcgcgctc ttcccgtcct acgacacacg ggtgtaccca 840
atgaacacga gcgcacagct cacccgtgag atctacacag atcccatcgg ccgcaccaac 900
gctcccagtg gcttcgcaag cacgaattgg ttcaacaata acgctccttc tttctctgcc 960
atcgaggccg ctgtcatcag accgccgcac ttactcgatt tcccggagca gctcactatc 1020
ttctctgtgt tgtcccggtg gtcgaacacg cagtacatga actactgggt gggccacagg 1080
ctagagagcc ggaccatccg tggcagtctc tcaacctcga cccacggcaa cacgaacacg 1140
agcatcaacc ctgtcactct ccagtttaca tctagggacg tttacaggac agagtcgttc 1200
gctggcatta acattctgtt gaccactccg gtgaacggcg tcccttgggc ccgcttcaac 1260
tggaggaatc ctctgaactc actgcgcggc agccttctct acactatcgg ctacaccggc 1320
gttgggacgc aactcttcga ctcggagacc gagctgccgc ccgagaccac cgagcggcct 1380
aactacgaga gttattcaca caggctctcc aacatccgct tgatttctgg gaacaccttg 1440
cgggctccgg tgtactcctg gacgcaccgc agcgccgaca gaactaatac catcagctcc 1500
gactcgatca cccagatccc gctggtgaag gctcacacgc ttcagtcggg caccacagtc 1560
gtcaagggcc ctggcttcac cggcggcgac atcctgcgtc gcacatctgg cggacccttc 1620
gccttcagca acgtgaactt ggacttcaat ttgtcacagc ggtatcgtgc cagaatccgg 1680
tacgccagca ctacgaacct gcgaatctat gttactgtgg cgggcgagcg gatcttcgcc 1740
gggcaattcg acaagacgat ggacgcggga gcacctctga cattccagtc attctcttac 1800
gccacgatca acacggcatt cacgtttccg gagcgttcca gtagcctgac cgtgggcgct 1860
gataccttca gtagcgggaa cgaggtgtac gttgaccgtt tcgagctgat cccggtcacc 1920
gccaccttcg aggccgagta cgaccttgag cgcgcccaga aggtggtgaa cgccctcttc 1980
actagcacta accagctagg cctgaagact gacgtgaccg actaccacat cgaccaagtg 2040
agcaacctag tggcctgcct ctccgacgag ttctgcctcg acgagaagcg cgagctgtcc 2100
gagaaggtga agcacgccaa gcgcctctcc gacgagcgca acctgctcca ggaccccaac 2160
ttcaggggca tcaacaggca gcccgaccgc ggctggcgcg gctccaccga catcaccatc 2220
cagggcggtg acgacgtatt caaggagaac tacgttaccc tccccggcac cttcgacgag 2280
tgttacccca cctacctcta ccagaagatc gacgagtcca agctgaaggc ctacacccgc 2340
taccagctcc gcggctacat cgaggactcc caggacctgg aaatctacct catccgctac 2400
aacgccaagc acgagatcgt gaacgtgcct ggcaccggca gcctctggcc tctcagcgtg 2460
gagaaccaga tcggcccttg cggcgagcct aaccgctgcg cccctcacct cgagtggaac 2520
cctgacctcc actgctcgtg cagggacggc gagaagtgcg cccaccatag ccaccacttc 2580
tctctggaca tcgacgtggg ctgcaccgac ctgaacgagg acctgggcgt gtgggttatc 2640
ttcaagatca agacccagga cggtcacgcc aggctgggta acctggagtt ccttgaggaa 2700
aagcctctgc tgggtgaggc cctggccagg gtcaagaggg ctgagaagaa atggagggat 2760
aagagggaga ccctgcagct ggagaccact atcgtctaca aggaggctaa ggagtctgtc 2820
gatgctctgt tcgtcaactc tcagtacgat agactgcaag ctgataccaa catcgctatg 2880
atccacgctg cggataagcg ggtccaccgg atccgggagg cttaccttcc ggagctttct 2940
gtcatcccgg gtgtcaacgc tgcgatcttc gaggaacttg aggaacggat cttcactgcg 3000
tttagtcttt acgatgcgcg gaacatcatc aagaacgggg acttcaacaa tggtctgctg 3060
tgctggaacg tcaagggtca tgtcgaggtc gaggaacaaa acaatcatcg tagtgtcctt 3120
gtcattcctg agtgggaggc ggaggtctct caagaggtcc gtgtttgccc ggggcgtggg 3180
tacattcttc gtgttactgc gtacaaggag gggtacgggg aggggtgcgt tactattcat 3240
gagattgaga acaatactga tgagcttaag ttcaacaatt gtgttgagga ggaggtttac 3300
ccgaacaata ctgttacgtg catcaactac acggcaacgc aagaggaata cgaggggacg 3360
tacacctcgc gtaatagagg gtatgatgag gcgtacggaa acaacccgtc ggttccagca 3420
gattatgcct cggtttatga ggagaagtcg tacacggata gacgacgcga gaatccatgt 3480
gagtcaaatc gaggatacgg agattacaca ccattaccag caggatacgt tacaaaggag 3540
ttggaatact tcccggaaac agataaagtt tggattgaaa tcggagaaac agaaggaaca 3600
ttcatcgtcg actcagtaga attgttgttg atggaagaat ga 3642
<210> 13
<211> 1213
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC867_20的氨基酸序列。
<400> 13
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Thr Gln Ile Pro Leu Val Lys Ala His
500 505 510
Thr Leu Gln Ser Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Phe Ser Asn
530 535 540
Val Asn Leu Asp Phe Asn Leu Ser Gln Arg Tyr Arg Ala Arg Ile Arg
545 550 555 560
Tyr Ala Ser Thr Thr Asn Leu Arg Ile Tyr Val Thr Val Ala Gly Glu
565 570 575
Arg Ile Phe Ala Gly Gln Phe Asp Lys Thr Met Asp Ala Gly Ala Pro
580 585 590
Leu Thr Phe Gln Ser Phe Ser Tyr Ala Thr Ile Asn Thr Ala Phe Thr
595 600 605
Phe Pro Glu Arg Ser Ser Ser Leu Thr Val Gly Ala Asp Thr Phe Ser
610 615 620
Ser Gly Asn Glu Val Tyr Val Asp Arg Phe Glu Leu Ile Pro Val Thr
625 630 635 640
Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gln Lys Val Val
645 650 655
Asn Ala Leu Phe Thr Ser Thr Asn Gln Leu Gly Leu Lys Thr Asp Val
660 665 670
Thr Asp Tyr His Ile Asp Gln Val Ser Asn Leu Val Ala Cys Leu Ser
675 680 685
Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys
690 695 700
His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn
705 710 715 720
Phe Arg Gly Ile Asn Arg Gln Pro Asp Arg Gly Trp Arg Gly Ser Thr
725 730 735
Asp Ile Thr Ile Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val
740 745 750
Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln
755 760 765
Lys Ile Asp Glu Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Gln Leu Arg
770 775 780
Gly Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr
785 790 795 800
Asn Ala Lys His Glu Ile Val Asn Val Pro Gly Thr Gly Ser Leu Trp
805 810 815
Pro Leu Ser Val Glu Asn Gln Ile Gly Pro Cys Gly Glu Pro Asn Arg
820 825 830
Cys Ala Pro His Leu Glu Trp Asn Pro Asp Leu His Cys Ser Cys Arg
835 840 845
Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu Asp Ile
850 855 860
Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val Ile
865 870 875 880
Phe Lys Ile Lys Thr Gln Asp Gly His Ala Arg Leu Gly Asn Leu Glu
885 890 895
Phe Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ala Arg Val Lys
900 905 910
Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Thr Leu Gln Leu Glu
915 920 925
Thr Thr Ile Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe
930 935 940
Val Asn Ser Gln Tyr Asp Arg Leu Gln Ala Asp Thr Asn Ile Ala Met
945 950 955 960
Ile His Ala Ala Asp Lys Arg Val His Arg Ile Arg Glu Ala Tyr Leu
965 970 975
Pro Glu Leu Ser Val Ile Pro Gly Val Asn Ala Ala Ile Phe Glu Glu
980 985 990
Leu Glu Glu Arg Ile Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn
995 1000 1005
Ile Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu Leu Cys Trp Asn
1010 1015 1020
Val Lys Gly His Val Glu Val Glu Glu Gln Asn Asn His Arg Ser
1025 1030 1035
Val Leu Val Ile Pro Glu Trp Glu Ala Glu Val Ser Gln Glu Val
1040 1045 1050
Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr Ala Tyr
1055 1060 1065
Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu Ile Glu
1070 1075 1080
Asn Asn Thr Asp Glu Leu Lys Phe Asn Asn Cys Val Glu Glu Glu
1085 1090 1095
Val Tyr Pro Asn Asn Thr Val Thr Cys Ile Asn Tyr Thr Ala Thr
1100 1105 1110
Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg Asn Arg Gly Tyr
1115 1120 1125
Asp Glu Ala Tyr Gly Asn Asn Pro Ser Val Pro Ala Asp Tyr Ala
1130 1135 1140
Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Arg Arg Arg Glu Asn
1145 1150 1155
Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu Pro
1160 1165 1170
Ala Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro Glu Thr Asp
1175 1180 1185
Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Thr Phe Ile Val
1190 1195 1200
Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1205 1210
<210> 14
<211> 3690
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC867_21的重组核苷酸序列。
<400> 14
atgacttcaa ataggaaaaa tgagaatgaa attataaatg ctttatcgat tccagctgta 60
tcgaatcatt ccgcacaaat gaatctatca accgatgctc gtattgagga tagcttgtgt 120
atagccgagg ggaacaatat cgatccattt gttagcgcat caacagtcca aacgggtatt 180
aacatagctg gtagaatact aggtgtatta ggcgtaccgt ttgctggaca aatagctagt 240
ttttatagtt ttcttgttgg tgaattatgg ccccgcggca gagatccttg ggaaattttc 300
ctagaacatg tcgaacaact tataagacaa caagtaacag aaaatactag ggatacggct 360
cttgctcgat tacaaggttt aggaaattcc tttagagcct atcaacagtc acttgaagat 420
tggctagaaa accgtgatga tgcaagaacg agaagtgttc tttataccca atatatagcc 480
ttagaacttg attttcttaa tgcgatgccg cttttcgcaa ttagaaacca agaagttcca 540
ttattaatgg tatatgctca agctgcaaat ttacacctat tattattgag agatgcctct 600
ctttttggta gtgaatttgg gcttacatcc caagaaattc aacgttatta tgagcgccaa 660
gtggaaaaaa cgagagaata ttctgattat tgcgcaagat ggtataatac gggtttaaat 720
aatttgagag ggacaaatgc tgaaagttgg ttgcgatata atcaattccg tagagactta 780
acgctaggag tattagatct agtggcacta ttcccaagct atgacacgcg tgtttatcca 840
atgaatacca gtgctcaatt aacaagagaa atttatacag atccaattgg gagaacaaat 900
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960
atagaggctg ccgttattag gcctccgcat ctacttgatt ttccagaaca gcttacaatt 1020
ttcagcgtat taagtcgatg gagtaatact caatatatga attactgggt gggacataga 1080
cttgaatcgc gaacaataag ggggtcatta agtacctcga cacacggaaa taccaatact 1140
tctattaatc ctgtaacatt acagttcaca tctcgagacg tttatagaac agaatcattt 1200
gcagggataa atatacttct aactactcct gtgaatggag taccttgggc tagatttaat 1260
tggagaaatc ccctgaattc tcttagaggt agccttctct atactatagg gtatactgga 1320
gtggggacac aactatttga ttcagaaact gaattaccac cagaaacaac agaacgacca 1380
aattatgaat cttacagtca tagattatct aatataagac taatatcagg aaacactttg 1440
agagcaccag tatattcttg gacgcaccgt agtgcagatc gtacaaatac cattagttca 1500
gatagcatta cacaaatacc attggtaaag gcgcataccc tccaatcggg taccactgta 1560
gtaaaagggc cagggtttac aggaggggat atcctccgtc gaacaagtgg aggaccattt 1620
gcttttagta atgttaatct agattttaac ttgtcacaaa ggtatcgtgc tagaattcgt 1680
tatgcctcta ctactaacct aagaatttac gtaacggttg caggtgaacg aatttttgct 1740
ggtcaatttg acaaaactat ggatgctggt gccccattaa cattccaatc ttttagttac 1800
gcaactatta atacagcttt tacattccca gaaagatcga gcagcttgac tgtaggtgcc 1860
gatacgttta gttcaggtaa tgaagtttat gtagatagat ttgaattaat cccagttact 1920
gcaaccggaa cgacaaccta tgagtatgaa gagaagcaga atctagaaaa agcgcagaaa 1980
gcgttgaacg ctttgtttac ggatggcacg aatggctatc tacaaatgga tgccactgat 2040
tatgatatca atcaaactgc aaacttaata gaatgtgtat cagatgaatt gtatgcaaaa 2100
gaaaagatag ttttattaga tgaagtcaaa tatgcgaagc ggcttagcat atcacgtaac 2160
ctacttttga acgatgattt agaattttca gatggatttg gagaaaacgg atggacgaca 2220
agtgataata tttcaatcca ggcggataat ccccttttta aggggaatta tttaaaaatg 2280
tttggggcaa gagatattga tggaacccta tttccaactt atctctatca aaaaatagat 2340
gagtccaggt taaaaccata tacacgttat cgagtaagag ggtttgtggg aagtagtaaa 2400
aatctaaaat tagtggtaac acgctatgag aaagaaattg atgccattat gaatgttcca 2460
aatgatttgg cacatatgca gcttaaccct tcatgtggag attatcgctg tgaatcatcg 2520
tcccagtttt tggtgaacca agtgcatcct acaccaacag ctggatatgc tcttgatatg 2580
tatgcatgcc cgtcaagttc agataaaaaa catattatgt gtcacgatcg tcatccattt 2640
gattttcata ttgacaccgg agaattaaat ccaaacacaa acctgggtat tgatgtcttg 2700
tttaaaattt ctaatccaaa tggatacgct acattaggga atctagaagt cattgaagaa 2760
ggaccactaa cagatgaagc attggtacat gtaaaacaaa aggaaaagaa atggcgtcag 2820
cacatggaga aaaaacgaat ggaaacacaa caagcctatg atccagcaaa acaagctgta 2880
gatgcattat ttacaaatga acaagagtta gactatcata ctactttaga tcatattcag 2940
aacgccgatc agctggtaca ggcgattccc tatgtacacc atgcttggtt accggatgct 3000
ccaggtatga actatgatgt atatcaaggg ttaaacgcac gtatcatgca ggcgtacaat 3060
ttatatgatg cacgaaatgt cataataaat ggtgacttta cacaaggact acaaggatgg 3120
cacgcaacag gaaaagcagc ggtacaacaa atagatggag cttcagtatt agttctatca 3180
aactggagtg ccgaggtatc tcagaatctg catgcccaag atcatcatgg atatatgtta 3240
cgtgtgattg ccaaaaaaga aggtcctgga aaagggtatg taatgatgat ggattttaat 3300
ggaaagcagg aaacacttac gttcacttct tgtgaagaag gatatataac aaaaacaata 3360
gaggtattcc cggaaagtga tcgaatacga attgaaatgg gagaaacaga gggtacgttt 3420
tatgtagata gcatcgagtt gctttgtatg caaggatatg ctagcgataa taacccgcac 3480
acgggtaata tgtatgagca aagttataat ggaaattata atcaaaatac tagcgatgtg 3540
tatcaccaag gatatataaa caactataac caaaattcta gtagtatgta taatcaaaat 3600
tatattaaca atgatgacct gcattccggt tgcacatgta accaagggca taactctggc 3660
tgtacatgta atcaaggata taaccgttag 3690
<210> 15
<211> 3690
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC867_21的合成核苷酸序列。
<400> 15
atgaccagca accgaaagaa cgagaacgag atcatcaacg ccctgtccat accggccgtg 60
tcaaaccact ccgcccagat gaacctctcc accgacgcga ggatcgagga ctccctctgc 120
atcgccgagg gcaacaacat cgacccgttc gtgtctgcaa gcacggtcca gaccggcatc 180
aacatcgcgg gccgcatcct gggcgtgctc ggcgtgccct tcgcgggtca aatcgcctct 240
ttctactcat tcctcgtggg cgagctgtgg ccgcgcggac gtgacccgtg ggaaatcttc 300
ctggagcacg ttgagcagct catccggcag caagtgaccg agaacaccag ggacaccgca 360
ctggcacggc tccagggcct tggcaacagc ttccgcgcct accagcagtc gctggaggac 420
tggctggaga accgagacga cgccagaacc cgctcagttc tgtacacaca gtacatcgcc 480
ctagagctgg acttcctcaa cgctatgccg ctcttcgcca tccgtaacca ggaagtaccg 540
cttctgatgg tgtacgcaca agcagcgaac ctccatctgc tcctgctgcg agacgcatct 600
ctgttcggca gtgagttcgg gctgacgagc caggagatcc agcgctacta cgagcgccaa 660
gtggagaaga ctcgtgagta cagcgactac tgcgcgcgct ggtacaacac gggcttgaac 720
aaccttcgcg ggacaaacgc cgaatcctgg cttcgctaca accagttccg ccgcgacctc 780
acgctgggtg tgctggacct ggtcgcgctc ttcccgtcct acgacacacg ggtgtaccca 840
atgaacacga gcgcacagct cacccgtgag atctacacag atcccatcgg ccgcaccaac 900
gctcccagtg gcttcgcaag cacgaattgg ttcaacaata acgctccttc tttctctgcc 960
atcgaggccg ctgtcatcag accgccgcac ttactcgatt tcccggagca gctcactatc 1020
ttctctgtgt tgtcccggtg gtcgaacacg cagtacatga actactgggt gggccacagg 1080
ctagagagcc ggaccatccg tggcagtctc tcaacctcga cccacggcaa cacgaacacg 1140
agcatcaacc ctgtcactct ccagtttaca tctagggacg tttacaggac agagtcgttc 1200
gctggcatta acattctgtt gaccactccg gtgaacggcg tcccttgggc ccgcttcaac 1260
tggaggaatc ctctgaactc actgcgcggc agccttctct acactatcgg ctacaccggc 1320
gttgggacgc aactcttcga ctcggagacc gagctgccgc ccgagaccac cgagcggcct 1380
aactacgaga gttattcaca caggctctcc aacatccgct tgatttctgg gaacaccttg 1440
cgggctccgg tgtactcctg gacgcaccgc agcgccgaca gaactaatac catcagctcc 1500
gactcgatca cccagatccc gctggtgaag gctcacacgc ttcagtcggg caccacagtc 1560
gtcaagggcc ctggcttcac cggcggcgac atcctgcgtc gcacatctgg cggacccttc 1620
gccttcagca acgtgaactt ggacttcaat ttgtcacagc ggtatcgtgc cagaatccgg 1680
tacgccagca ctacgaacct gcgaatctat gttactgtgg cgggcgagcg gatcttcgcc 1740
gggcaattcg acaagacgat ggacgcggga gcacctctga cattccagtc attctcttac 1800
gccacgatca acacggcatt cacgtttccg gagcgttcca gtagcctgac cgtgggcgct 1860
gataccttca gtagcgggaa cgaggtgtac gttgaccgtt tcgagctgat cccggtcacc 1920
gccaccggga ctaccaccta cgagtacgag gagaagcaga atctcgagaa ggctcagaag 1980
gctctgaacg ctctgttcac tgacgggacc aacggctacc tccagatgga cgccactgac 2040
tacgacatca accagacagc taacctgatt gagtgtgtga gtgacgaact gtacgctaag 2100
gagaagatcg tactcctgga cgaggtgaag tacgctaagc gcctgagcat tagccgtaac 2160
ctgctgctga acgacgatct ggagttcagc gacggctttg gcgagaacgg ctggaccacc 2220
agcgacaaca tctccatcca ggccgacaat ccactcttca aaggcaacta cctcaagatg 2280
ttcggagcca gggacatcga cggcaccctc tttccgacct acctctacca gaagatcgac 2340
gagtcccgcc tcaaacccta cacccgctac agggtgcgcg gcttcgtggg cagcagcaag 2400
aacctcaagc tcgtggtcac acggtatgag aaggagatcg acgccatcat gaacgtgccc 2460
aacgatctcg cccacatgca gctcaatcca tcctgcggcg actaccggtg cgagtccagc 2520
tcccagttcc tcgtgaacca ggtgcaccct actccgaccg ctggctatgc cctggacatg 2580
tacgcctgcc ctagttcctc cgacaagaag cacatcatgt gccacgaccg tcatccgttc 2640
gacttccaca tcgacaccgg cgaactgaac ccgaacacca acctgggcat cgacgtactg 2700
ttcaagattt ccaacccgaa cgggtacgcc accttgggca acctggaggt catcgaagaa 2760
ggcccgctga ccgacgaggc cctggtccac gtcaaacaga aggagaagaa gtggcggcag 2820
cacatggaga agaagcggat ggagactcaa caagcctacg acccggccaa gcaagctgtg 2880
gacgctctgt tcaccaacga gcaagagctt gactaccaca ctactcttga ccacatccag 2940
aatgctgacc agcttgtcca ggctattccg tacgtccacc acgcttggct accggacgct 3000
ccagggatga actacgatgt gtaccagggt ctgaacgcgc ggatcatgca agcgtacaac 3060
ctgtacgacg cgcgtaacgt catcatcaac ggtgacttca ctcagggtct tcaaggttgg 3120
cacgcgactg gcaaagcggc agtccagcag attgatggtg cgtctgttct tgtgttgagc 3180
aactggtctg cggaggtttc tcagaacctg cacgcacagg atcaccacgg ctacatgctg 3240
agggtgattg ctaagaagga gggccctggc aaaggctacg tcatgatgat ggacttcaac 3300
ggaaagcaag aaaccctgac cttcactagc tgtgaggagg gctacatcac taagaccatt 3360
gaggtctttc cggagtctga ccgcatccgg atcgagatgg gcgagaccga aggcacgttc 3420
tacgtggact ccatcgaact cctctgcatg caaggctacg cctccgacaa caacccacac 3480
acgggcaaca tgtacgagca gtcctacaac gggaactaca accagaacac ctccgatgtg 3540
taccatcagg gctacatcaa caactacaac cagaacagca gcagcatgta caaccagaac 3600
tacatcaaca acgatgactt gcactcgggt tgcacctgca accagggtca caacagtggg 3660
tgcacgtgca accagggata caaccgttga 3690
<210> 16
<211> 1229
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC867_21的氨基酸序列。
<400> 16
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Thr Gln Ile Pro Leu Val Lys Ala His
500 505 510
Thr Leu Gln Ser Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Phe Ser Asn
530 535 540
Val Asn Leu Asp Phe Asn Leu Ser Gln Arg Tyr Arg Ala Arg Ile Arg
545 550 555 560
Tyr Ala Ser Thr Thr Asn Leu Arg Ile Tyr Val Thr Val Ala Gly Glu
565 570 575
Arg Ile Phe Ala Gly Gln Phe Asp Lys Thr Met Asp Ala Gly Ala Pro
580 585 590
Leu Thr Phe Gln Ser Phe Ser Tyr Ala Thr Ile Asn Thr Ala Phe Thr
595 600 605
Phe Pro Glu Arg Ser Ser Ser Leu Thr Val Gly Ala Asp Thr Phe Ser
610 615 620
Ser Gly Asn Glu Val Tyr Val Asp Arg Phe Glu Leu Ile Pro Val Thr
625 630 635 640
Ala Thr Gly Thr Thr Thr Tyr Glu Tyr Glu Glu Lys Gln Asn Leu Glu
645 650 655
Lys Ala Gln Lys Ala Leu Asn Ala Leu Phe Thr Asp Gly Thr Asn Gly
660 665 670
Tyr Leu Gln Met Asp Ala Thr Asp Tyr Asp Ile Asn Gln Thr Ala Asn
675 680 685
Leu Ile Glu Cys Val Ser Asp Glu Leu Tyr Ala Lys Glu Lys Ile Val
690 695 700
Leu Leu Asp Glu Val Lys Tyr Ala Lys Arg Leu Ser Ile Ser Arg Asn
705 710 715 720
Leu Leu Leu Asn Asp Asp Leu Glu Phe Ser Asp Gly Phe Gly Glu Asn
725 730 735
Gly Trp Thr Thr Ser Asp Asn Ile Ser Ile Gln Ala Asp Asn Pro Leu
740 745 750
Phe Lys Gly Asn Tyr Leu Lys Met Phe Gly Ala Arg Asp Ile Asp Gly
755 760 765
Thr Leu Phe Pro Thr Tyr Leu Tyr Gln Lys Ile Asp Glu Ser Arg Leu
770 775 780
Lys Pro Tyr Thr Arg Tyr Arg Val Arg Gly Phe Val Gly Ser Ser Lys
785 790 795 800
Asn Leu Lys Leu Val Val Thr Arg Tyr Glu Lys Glu Ile Asp Ala Ile
805 810 815
Met Asn Val Pro Asn Asp Leu Ala His Met Gln Leu Asn Pro Ser Cys
820 825 830
Gly Asp Tyr Arg Cys Glu Ser Ser Ser Gln Phe Leu Val Asn Gln Val
835 840 845
His Pro Thr Pro Thr Ala Gly Tyr Ala Leu Asp Met Tyr Ala Cys Pro
850 855 860
Ser Ser Ser Asp Lys Lys His Ile Met Cys His Asp Arg His Pro Phe
865 870 875 880
Asp Phe His Ile Asp Thr Gly Glu Leu Asn Pro Asn Thr Asn Leu Gly
885 890 895
Ile Asp Val Leu Phe Lys Ile Ser Asn Pro Asn Gly Tyr Ala Thr Leu
900 905 910
Gly Asn Leu Glu Val Ile Glu Glu Gly Pro Leu Thr Asp Glu Ala Leu
915 920 925
Val His Val Lys Gln Lys Glu Lys Lys Trp Arg Gln His Met Glu Lys
930 935 940
Lys Arg Met Glu Thr Gln Gln Ala Tyr Asp Pro Ala Lys Gln Ala Val
945 950 955 960
Asp Ala Leu Phe Thr Asn Glu Gln Glu Leu Asp Tyr His Thr Thr Leu
965 970 975
Asp His Ile Gln Asn Ala Asp Gln Leu Val Gln Ala Ile Pro Tyr Val
980 985 990
His His Ala Trp Leu Pro Asp Ala Pro Gly Met Asn Tyr Asp Val Tyr
995 1000 1005
Gln Gly Leu Asn Ala Arg Ile Met Gln Ala Tyr Asn Leu Tyr Asp
1010 1015 1020
Ala Arg Asn Val Ile Ile Asn Gly Asp Phe Thr Gln Gly Leu Gln
1025 1030 1035
Gly Trp His Ala Thr Gly Lys Ala Ala Val Gln Gln Ile Asp Gly
1040 1045 1050
Ala Ser Val Leu Val Leu Ser Asn Trp Ser Ala Glu Val Ser Gln
1055 1060 1065
Asn Leu His Ala Gln Asp His His Gly Tyr Met Leu Arg Val Ile
1070 1075 1080
Ala Lys Lys Glu Gly Pro Gly Lys Gly Tyr Val Met Met Met Asp
1085 1090 1095
Phe Asn Gly Lys Gln Glu Thr Leu Thr Phe Thr Ser Cys Glu Glu
1100 1105 1110
Gly Tyr Ile Thr Lys Thr Ile Glu Val Phe Pro Glu Ser Asp Arg
1115 1120 1125
Ile Arg Ile Glu Met Gly Glu Thr Glu Gly Thr Phe Tyr Val Asp
1130 1135 1140
Ser Ile Glu Leu Leu Cys Met Gln Gly Tyr Ala Ser Asp Asn Asn
1145 1150 1155
Pro His Thr Gly Asn Met Tyr Glu Gln Ser Tyr Asn Gly Asn Tyr
1160 1165 1170
Asn Gln Asn Thr Ser Asp Val Tyr His Gln Gly Tyr Ile Asn Asn
1175 1180 1185
Tyr Asn Gln Asn Ser Ser Ser Met Tyr Asn Gln Asn Tyr Ile Asn
1190 1195 1200
Asn Asp Asp Leu His Ser Gly Cys Thr Cys Asn Gln Gly His Asn
1205 1210 1215
Ser Gly Cys Thr Cys Asn Gln Gly Tyr Asn Arg
1220 1225
<210> 17
<211> 3432
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC867_22的重组核苷酸序列。
<400> 17
atgacttcaa ataggaaaaa tgagaatgaa attataaatg ctttatcgat tccagctgta 60
tcgaatcatt ccgcacaaat gaatctatca accgatgctc gtattgagga tagcttgtgt 120
atagccgagg ggaacaatat cgatccattt gttagcgcat caacagtcca aacgggtatt 180
aacatagctg gtagaatact aggtgtatta ggcgtaccgt ttgctggaca aatagctagt 240
ttttatagtt ttcttgttgg tgaattatgg ccccgcggca gagatccttg ggaaattttc 300
ctagaacatg tcgaacaact tataagacaa caagtaacag aaaatactag ggatacggct 360
cttgctcgat tacaaggttt aggaaattcc tttagagcct atcaacagtc acttgaagat 420
tggctagaaa accgtgatga tgcaagaacg agaagtgttc tttataccca atatatagcc 480
ttagaacttg attttcttaa tgcgatgccg cttttcgcaa ttagaaacca agaagttcca 540
ttattaatgg tatatgctca agctgcaaat ttacacctat tattattgag agatgcctct 600
ctttttggta gtgaatttgg gcttacatcc caagaaattc aacgttatta tgagcgccaa 660
gtggaaaaaa cgagagaata ttctgattat tgcgcaagat ggtataatac gggtttaaat 720
aatttgagag ggacaaatgc tgaaagttgg ttgcgatata atcaattccg tagagactta 780
acgctaggag tattagatct agtggcacta ttcccaagct atgacacgcg tgtttatcca 840
atgaatacca gtgctcaatt aacaagagaa atttatacag atccaattgg gagaacaaat 900
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960
atagaggctg ccgttattag gcctccgcat ctacttgatt ttccagaaca gcttacaatt 1020
ttcagcgtat taagtcgatg gagtaatact caatatatga attactgggt gggacataga 1080
cttgaatcgc gaacaataag ggggtcatta agtacctcga cacacggaaa taccaatact 1140
tctattaatc ctgtaacatt acagttcaca tctcgagacg tttatagaac agaatcattt 1200
gcagggataa atatacttct aactactcct gtgaatggag taccttgggc tagatttaat 1260
tggagaaatc ccctgaattc tcttagaggt agccttctct atactatagg gtatactgga 1320
gtggggacac aactatttga ttcagaaact gaattaccac cagaaacaac agaacgacca 1380
aattatgaat cttacagtca tagattatct aatataagac taatatcagg aaacactttg 1440
agagcaccag tatattcttg gacgcaccgt agtgcagatc gtacaaatac cattagttca 1500
gatagcatta cacaaatacc attggtaaag gcgcataccc tccaatcggg taccactgta 1560
gtaaaagggc cagggtttac aggaggggat atcctccgtc gaacaagtgg aggaccattt 1620
gcttttagta atgttaatct agattttaac ttgtcacaaa ggtatcgtgc tagaattcgt 1680
tatgcctcta ctactaacct aagaatttac gtaacggttg caggtgaacg aatttttgct 1740
ggtcaatttg acaaaactat ggatgctggt gccccattaa cattccaatc ttttagttac 1800
gcaactatta atacagcttt tacattccca gaaagatcga gcagcttgac tgtaggtgcc 1860
gatacgttta gttcaggtaa tgaagtttat gtagatagat ttgaattaat cccagttact 1920
gcaaccaatc cgacgcgaga ggcggaagag gatctagaag cagcgaagaa agcggtggcg 1980
agcttgttta cacgtacaag ggacggatta caagtaaatg tgacagatta tcaagtcgat 2040
caagcggcaa atttagtgtc atgcttatca gatgaacaat atgggcatga caaaaagatg 2100
ttattggaag cggtaagagc ggcaaaacgc ctcagccgag aacgcaactt acttcaggat 2160
ccagatttta atacaatcaa tagtacagaa gaaaatggat ggaaagcaag taacggcgtt 2220
actattagcg agggcggtcc attctataaa ggccgtgcgc ttcagctagc aagcgcaaga 2280
gaaaattacc caacatacat ttatcaaaaa gtaaatgcat cagagttaaa gccgtataca 2340
cgttatagac tggatgggtt cgtgaagagt agtcaagatt tagaaattga tctcattcac 2400
catcataaag tccatctcgt gaaaaatgta ccagataatt tagtatccga tacttactcg 2460
gatggttctt gcagtggaat gaatcgatgt gaggaacaac agatggtaaa tgcgcaactg 2520
gaaacagaac atcatcatcc gatggattgc tgtgaagcgg ctcaaacaca tgagttttct 2580
tcctatatta atacaggcga tctaaattca agtgtagatc aaggcatttg ggttgtattg 2640
aaagttcgaa caaccgatgg ttatgcgacg ctaggaaatc ttgaattggt agaggtcgga 2700
ccgttatcgg gtgaatctct agaacgtgaa caaagggata atgcgaaatg gagtgcagag 2760
ctaggaagaa agcgtgcaga aacagatcgc gtgtatcaag atgccaaaca atccatcaat 2820
catttatttg tggattatca agatcaacaa ttaaatccag aaatagggat ggcagatatt 2880
attgacgctc aaaatcttgt cgcatcaatt tcagatgtgt atagcgatgc agtactgcaa 2940
atccctggaa ttaactatga gatttacaca gagctatcca atcgcttaca acaagcatcg 3000
tatctgtata cgtctcgaaa tgcggtgcaa aatggggact ttaacagcgg tctagatagt 3060
tggaatgcaa cagggggggc tacggtacaa caggatggca atacgcattt cttagttctt 3120
tctcattggg atgcacaagt ttctcaacaa tttagagtgc agccgaattg taaatatgta 3180
ttacgtgtaa cagcagagaa agtaggcggc ggagacggat acgtgacaat ccgggatggt 3240
gctcatcata cagaaaagct tacatttaat gcatgtgatt atgatataaa tggcacgtac 3300
gtgactgata atacgtatct aacaaaagaa gtggtattct attcacatac agaacacatg 3360
tgggtagagg taagtgaaac agaaggtgca tttcatatag atagtattga attcgttgaa 3420
acagaaaagt ag 3432
<210> 18
<211> 3432
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC867_22的合成核苷酸序列。
<400> 18
atgaccagca accgaaagaa cgagaacgag atcatcaacg ccctgtccat accggccgtg 60
tcaaaccact ccgcccagat gaacctctcc accgacgcga ggatcgagga ctccctctgc 120
atcgccgagg gcaacaacat cgacccgttc gtgtctgcaa gcacggtcca gaccggcatc 180
aacatcgcgg gccgcatcct gggcgtgctc ggcgtgccct tcgcgggtca aatcgcctct 240
ttctactcat tcctcgtggg cgagctgtgg ccgcgcggac gtgacccgtg ggaaatcttc 300
ctggagcacg ttgagcagct catccggcag caagtgaccg agaacaccag ggacaccgca 360
ctggcacggc tccagggcct tggcaacagc ttccgcgcct accagcagtc gctggaggac 420
tggctggaga accgagacga cgccagaacc cgctcagttc tgtacacaca gtacatcgcc 480
ctagagctgg acttcctcaa cgctatgccg ctcttcgcca tccgtaacca ggaagtaccg 540
cttctgatgg tgtacgcaca agcagcgaac ctccatctgc tcctgctgcg agacgcatct 600
ctgttcggca gtgagttcgg gctgacgagc caggagatcc agcgctacta cgagcgccaa 660
gtggagaaga ctcgtgagta cagcgactac tgcgcgcgct ggtacaacac gggcttgaac 720
aaccttcgcg ggacaaacgc cgaatcctgg cttcgctaca accagttccg ccgcgacctc 780
acgctgggtg tgctggacct ggtcgcgctc ttcccgtcct acgacacacg ggtgtaccca 840
atgaacacga gcgcacagct cacccgtgag atctacacag atcccatcgg ccgcaccaac 900
gctcccagtg gcttcgcaag cacgaattgg ttcaacaata acgctccttc tttctctgcc 960
atcgaggccg ctgtcatcag accgccgcac ttactcgatt tcccggagca gctcactatc 1020
ttctctgtgt tgtcccggtg gtcgaacacg cagtacatga actactgggt gggccacagg 1080
ctagagagcc ggaccatccg tggcagtctc tcaacctcga cccacggcaa cacgaacacg 1140
agcatcaacc ctgtcactct ccagtttaca tctagggacg tttacaggac agagtcgttc 1200
gctggcatta acattctgtt gaccactccg gtgaacggcg tcccttgggc ccgcttcaac 1260
tggaggaatc ctctgaactc actgcgcggc agccttctct acactatcgg ctacaccggc 1320
gttgggacgc aactcttcga ctcggagacc gagctgccgc ccgagaccac cgagcggcct 1380
aactacgaga gttattcaca caggctctcc aacatccgct tgatttctgg gaacaccttg 1440
cgggctccgg tgtactcctg gacgcaccgc agcgccgaca gaactaatac catcagctcc 1500
gactcgatca cccagatccc gctggtgaag gctcacacgc ttcagtcggg caccacagtc 1560
gtcaagggcc ctggcttcac cggcggcgac atcctgcgtc gcacatctgg cggacccttc 1620
gccttcagca acgtgaactt ggacttcaat ttgtcacagc ggtatcgtgc cagaatccgg 1680
tacgccagca ctacgaacct gcgaatctat gttactgtgg cgggcgagcg gatcttcgcc 1740
gggcaattcg acaagacgat ggacgcggga gcacctctga cattccagtc attctcttac 1800
gccacgatca acacggcatt cacgtttccg gagcgttcca gtagcctgac cgtgggcgct 1860
gataccttca gtagcgggaa cgaggtgtac gttgaccgtt tcgagctgat cccggtcacc 1920
gccaccaacc cgacgcggga agctgaggaa gacttggaag ccgccaagaa agcggtcgcc 1980
agcctgttta ctcggacgcg ggacgggctc caagtgaatg tgacggacta tcaagtggat 2040
caggccgcta acctcgtgtc atgcctgagc gacgagcagt acggtcacga caagaaaatg 2100
ctgctggagg ccgtccgggc cgccaagcgg ctgtccaggg agcgtaacct gctacaagat 2160
cccgacttta acacgatcaa cagcacagag gagaatggct ggaaggccag caacggagtt 2220
acgataagcg agggcggtcc gttctacaag ggtcgtgccc tccagctcgc ctctgcaagg 2280
gagaactatc caacctacat ctatcagaag gtgaacgcat ccgagcttaa gccctacaca 2340
cgctaccgcc tggacgggtt cgttaagtcc agtcaagacc tagagataga cctcatccac 2400
caccacaaag tgcatctggt caagaacgtt cccgataatc tcgtgagcga tacctactca 2460
gacggctcat gctctggcat gaacagatgt gaggagcaac agatggttaa tgctcaactc 2520
gaaaccgagc atcatcatcc tatggattgc tgcgaggccg cgcagaccca tgagttcagc 2580
tcttacatca acaccggaga cctcaacagt agcgtggatc agggaatttg ggtggtgctt 2640
aaagtgcgta caaccgacgg ctacgccacc ctcggcaacc ttgagcttgt cgaggtcgga 2700
ccacttagcg gcgagtccct ggaacgtgag cagcgggaca acgccaaatg gagcgcagag 2760
ctagggcgca aacgcgcgga gacggaccgg gtttatcagg acgcgaagca gtccatcaat 2820
cacctcttcg tggattatca ggaccagcag cttaatccag agatcggcat ggccgacatc 2880
atcgacgccc agaacctagt agcgtcgatt tccgatgtct attccgacgc cgtgcttcaa 2940
atacctggca tcaactacga gatctacaca gagttgtcca acaggctcca gcaagcgtca 3000
tacctctaca ccagccgcaa cgccgtccag aatggcgact tcaattccgg actagactcc 3060
tggaacgcca cgggcggagc tacggtgcaa caagacggca acacccactt cctcgtactt 3120
agccactggg acgctcaagt gagtcagcaa ttccgggttc agccgaactg caagtacgtc 3180
ctgcgcgtaa cggccgagaa ggttggaggc ggagacggct acgttaccat ccgcgacggc 3240
gctcaccaca ccgagaaact gacgttcaac gcttgtgact acgacatcaa cggcacttac 3300
gtgacggaca acacctacct gacgaaggag gtggtgttct attctcacac cgagcacatg 3360
tgggttgagg tcagcgagac cgagggagcc ttccacattg acagcatcga gttcgtggag 3420
actgagaagt ga 3432
<210> 19
<211> 1143
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC867_22的氨基酸序列。
<400> 19
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Thr Gln Ile Pro Leu Val Lys Ala His
500 505 510
Thr Leu Gln Ser Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Phe Ser Asn
530 535 540
Val Asn Leu Asp Phe Asn Leu Ser Gln Arg Tyr Arg Ala Arg Ile Arg
545 550 555 560
Tyr Ala Ser Thr Thr Asn Leu Arg Ile Tyr Val Thr Val Ala Gly Glu
565 570 575
Arg Ile Phe Ala Gly Gln Phe Asp Lys Thr Met Asp Ala Gly Ala Pro
580 585 590
Leu Thr Phe Gln Ser Phe Ser Tyr Ala Thr Ile Asn Thr Ala Phe Thr
595 600 605
Phe Pro Glu Arg Ser Ser Ser Leu Thr Val Gly Ala Asp Thr Phe Ser
610 615 620
Ser Gly Asn Glu Val Tyr Val Asp Arg Phe Glu Leu Ile Pro Val Thr
625 630 635 640
Ala Thr Asn Pro Thr Arg Glu Ala Glu Glu Asp Leu Glu Ala Ala Lys
645 650 655
Lys Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gln Val
660 665 670
Asn Val Thr Asp Tyr Gln Val Asp Gln Ala Ala Asn Leu Val Ser Cys
675 680 685
Leu Ser Asp Glu Gln Tyr Gly His Asp Lys Lys Met Leu Leu Glu Ala
690 695 700
Val Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gln Asp
705 710 715 720
Pro Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala
725 730 735
Ser Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe Tyr Lys Gly Arg
740 745 750
Ala Leu Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile Tyr
755 760 765
Gln Lys Val Asn Ala Ser Glu Leu Lys Pro Tyr Thr Arg Tyr Arg Leu
770 775 780
Asp Gly Phe Val Lys Ser Ser Gln Asp Leu Glu Ile Asp Leu Ile His
785 790 795 800
His His Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser
805 810 815
Asp Thr Tyr Ser Asp Gly Ser Cys Ser Gly Met Asn Arg Cys Glu Glu
820 825 830
Gln Gln Met Val Asn Ala Gln Leu Glu Thr Glu His His His Pro Met
835 840 845
Asp Cys Cys Glu Ala Ala Gln Thr His Glu Phe Ser Ser Tyr Ile Asn
850 855 860
Thr Gly Asp Leu Asn Ser Ser Val Asp Gln Gly Ile Trp Val Val Leu
865 870 875 880
Lys Val Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu
885 890 895
Val Glu Val Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg
900 905 910
Asp Asn Ala Lys Trp Ser Ala Glu Leu Gly Arg Lys Arg Ala Glu Thr
915 920 925
Asp Arg Val Tyr Gln Asp Ala Lys Gln Ser Ile Asn His Leu Phe Val
930 935 940
Asp Tyr Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly Met Ala Asp Ile
945 950 955 960
Ile Asp Ala Gln Asn Leu Val Ala Ser Ile Ser Asp Val Tyr Ser Asp
965 970 975
Ala Val Leu Gln Ile Pro Gly Ile Asn Tyr Glu Ile Tyr Thr Glu Leu
980 985 990
Ser Asn Arg Leu Gln Gln Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala
995 1000 1005
Val Gln Asn Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Ala
1010 1015 1020
Thr Gly Gly Ala Thr Val Gln Gln Asp Gly Asn Thr His Phe Leu
1025 1030 1035
Val Leu Ser His Trp Asp Ala Gln Val Ser Gln Gln Phe Arg Val
1040 1045 1050
Gln Pro Asn Cys Lys Tyr Val Leu Arg Val Thr Ala Glu Lys Val
1055 1060 1065
Gly Gly Gly Asp Gly Tyr Val Thr Ile Arg Asp Gly Ala His His
1070 1075 1080
Thr Glu Lys Leu Thr Phe Asn Ala Cys Asp Tyr Asp Ile Asn Gly
1085 1090 1095
Thr Tyr Val Thr Asp Asn Thr Tyr Leu Thr Lys Glu Val Val Phe
1100 1105 1110
Tyr Ser His Thr Glu His Met Trp Val Glu Val Ser Glu Thr Glu
1115 1120 1125
Gly Ala Phe His Ile Asp Ser Ile Glu Phe Val Glu Thr Glu Lys
1130 1135 1140
<210> 20
<211> 3696
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC867_23的合成核苷酸序列。
<400> 20
atgaccagca accgaaagaa cgagaacgag atcatcaacg ccctgtccat accggccgtg 60
tcaaaccact ccgcccagat gaacctctcc accgacgcga ggatcgagga ctccctctgc 120
atcgccgagg gcaacaacat cgacccgttc gtgtctgcaa gcacggtcca gaccggcatc 180
aacatcgcgg gccgcatcct gggcgtgctc ggcgtgccct tcgcgggtca aatcgcctct 240
ttctactcat tcctcgtggg cgagctgtgg ccgcgcggac gtgacccgtg ggaaatcttc 300
ctggagcacg ttgagcagct catccggcag caagtgaccg agaacaccag ggacaccgca 360
ctggcacggc tccagggcct tggcaacagc ttccgcgcct accagcagtc gctggaggac 420
tggctggaga accgagacga cgccagaacc cgctcagttc tgtacacaca gtacatcgcc 480
ctagagctgg acttcctcaa cgctatgccg ctcttcgcca tccgtaacca ggaagtaccg 540
cttctgatgg tgtacgcaca agcagcgaac ctccatctgc tcctgctgcg agacgcatct 600
ctgttcggca gtgagttcgg gctgacgagc caggagatcc agcgctacta cgagcgccaa 660
gtggagaaga ctcgtgagta cagcgactac tgcgcgcgct ggtacaacac gggcttgaac 720
aaccttcgcg ggacaaacgc cgaatcctgg cttcgctaca accagttccg ccgcgacctc 780
acgctgggtg tgctggacct ggtcgcgctc ttcccgtcct acgacacacg ggtgtaccca 840
atgaacacga gcgcacagct cacccgtgag atctacacag atcccatcgg ccgcaccaac 900
gctcccagtg gcttcgcaag cacgaattgg ttcaacaata acgctccttc tttctctgcc 960
atcgaggccg ctgtcatcag accgccgcac ttactcgatt tcccggagca gctcactatc 1020
ttctctgtgt tgtcccggtg gtcgaacacg cagtacatga actactgggt gggccacagg 1080
ctagagagcc ggaccatccg tggcagtctc tcaacctcga cccacggcaa cacgaacacg 1140
agcatcaacc ctgtcactct ccagtttaca tctagggacg tttacaggac agagtcgttc 1200
gctggcatta acattctgtt gaccactccg gtgaacggcg tcccttgggc ccgcttcaac 1260
tggaggaatc ctctgaactc actgcgcggc agccttctct acactatcgg ctacaccggc 1320
gttgggacgc aactcttcga ctcggagacc gagctgccgc ccgagaccac cgagcggcct 1380
aactacgaga gttattcaca caggctctcc aacatccgct tgatttctgg gaacaccttg 1440
cgggctccgg tgtactcctg gacgcaccgc agcgccgaca gaactaatac catcagctcc 1500
gactcgatca cccagatccc gctggtgaag gctcacacgc ttcagtcggg caccacagtc 1560
gtcaagggcc ctggcttcac cggcggcgac atcctgcgtc gcacatctgg cggacccttc 1620
gccttcagca acgtgaactt ggacttcaat ttgtcacagc ggtatcgtgc cagaatccgg 1680
tacgccagca ctacgaacct gcgaatctat gttactgtgg cgggcgagcg gatcttcgcc 1740
gggcaattcg acaagacgat ggacgcggga gcacctctga cattccagtc attctcttac 1800
gccacgatca acacggcatt cacgtttccg gagcgttcca gtagcctgac cgtgggcgct 1860
gataccttca gtagcgggaa cgaggtgtac gttgaccgtt tcgagctgat cccggtcacc 1920
gccaccacgg cgaccttcga ggcggagtat gacttggagc gggctcagga ggccgtcaac 1980
gcgctgttca caaacaccaa tcctcgccgc ctcaagacgg gtgtgactga ttaccacatt 2040
gacgaggtct ccaacttggt cgcgtgtctg tccgatgagt tctgcctgga cgagaagcgg 2100
gaactgctgg agaaggtcaa gtacgccaag cgcctctccg acgaaaggaa cctcctccaa 2160
gatcccaact ttacttccat taacaagcag ccggacttca tctccaccaa cgagcagtcc 2220
aacttcacct caatccacga gcagtcggag cacgggtggt ggggcagcga gaacatcacc 2280
atccaagagg gcaacgacgt cttcaaggag aactacgtga tcctgcccgg caccttcaac 2340
gagtgttacc cgacctatct ctaccagaag attggcgaag cggaactcaa ggcttacacc 2400
cgttaccaac tgagtggcta cattgaggac tcacaagacc tggaaatcta cctgatccgc 2460
tacaacgcca agcacgagac cctcgacgtg cctggcacgg agtccgtctg gcccttgagc 2520
gtggagtctc ctatcggtcg ttgcggcgag cccaatcgct gcgctccgca ctttgagtgg 2580
aatcctgatt tggattgctc ctgccgagac ggtgagaaat gcgcccacca ctcgcaccac 2640
ttcagcctag acatcgacgt gggctgcatc gacctgcacg agaacttggg cgtctgggtc 2700
gtgttcaaga tcaagacaca ggagggccat gctcggcttg ggaacctgga gttcatcgag 2760
gagaagccac tgctgggtga agccttgtca cgggtgaaac gcgccgagaa gaagtggcgg 2820
gacaaacggg agaagctcca gttggagaca aagcgtgtgt acacagaggc caaggaggcc 2880
gtggatgcct tgttcgtgga cagtcagtac gacaggctgc aagcggacac caacatcggg 2940
atgatccacg cggctgataa gcttgttcac agaatccgcg aggcgtacct gtcagagctt 3000
agcgtgatcc caggcgtcaa cgccgaaatc ttcgaggaac tggagggccg cattatcacg 3060
gcaatctcac tttatgacgc gaggaatgtg gtcaagaacg gtgacttcaa caacggcttg 3120
gcgtgttgga acgttaaagg gcacgtggat gtacaacagt cacaccacag aagtgtcttg 3180
gtcatcccgg agtgggaggc ggaagtgagc caggccgtcc gggtctgccc tgggcgcggt 3240
tacatcctcc gcgtgacagc gtacaaggag ggctacggtg agggctgcgt gacgatccac 3300
gagattgaga acaacacgga cgagcttaag ttcaagaact gcgaggagga ggaagtgtac 3360
ccgacagaca ccggcacctg caacgactac accgcccacc aagggaccgc cgcctgcaac 3420
agccgcaacg cgggctatga agatgcgtac gaggttgata ccaccgcctc agtgaactac 3480
aaaccgactt atgaggagga gacatacacg gacgtcaggc gcgacaacca ttgtgagtac 3540
gaccgtggct acgtgaacta tccgccggtg ccagcgggct acatgacgaa ggagctagaa 3600
tacttccctg agacggacaa ggtgtggatt gaaatcggcg agaccgaggg caagtttatc 3660
gtggattctg tcgagctgct gctaatggag gagtag 3696
<210> 21
<211> 1231
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC867_23的氨基酸序列。
<400> 21
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Thr Gln Ile Pro Leu Val Lys Ala His
500 505 510
Thr Leu Gln Ser Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Phe Ser Asn
530 535 540
Val Asn Leu Asp Phe Asn Leu Ser Gln Arg Tyr Arg Ala Arg Ile Arg
545 550 555 560
Tyr Ala Ser Thr Thr Asn Leu Arg Ile Tyr Val Thr Val Ala Gly Glu
565 570 575
Arg Ile Phe Ala Gly Gln Phe Asp Lys Thr Met Asp Ala Gly Ala Pro
580 585 590
Leu Thr Phe Gln Ser Phe Ser Tyr Ala Thr Ile Asn Thr Ala Phe Thr
595 600 605
Phe Pro Glu Arg Ser Ser Ser Leu Thr Val Gly Ala Asp Thr Phe Ser
610 615 620
Ser Gly Asn Glu Val Tyr Val Asp Arg Phe Glu Leu Ile Pro Val Thr
625 630 635 640
Ala Thr Thr Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gln
645 650 655
Glu Ala Val Asn Ala Leu Phe Thr Asn Thr Asn Pro Arg Arg Leu Lys
660 665 670
Thr Gly Val Thr Asp Tyr His Ile Asp Glu Val Ser Asn Leu Val Ala
675 680 685
Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Leu Glu
690 695 700
Lys Val Lys Tyr Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln
705 710 715 720
Asp Pro Asn Phe Thr Ser Ile Asn Lys Gln Pro Asp Phe Ile Ser Thr
725 730 735
Asn Glu Gln Ser Asn Phe Thr Ser Ile His Glu Gln Ser Glu His Gly
740 745 750
Trp Trp Gly Ser Glu Asn Ile Thr Ile Gln Glu Gly Asn Asp Val Phe
755 760 765
Lys Glu Asn Tyr Val Ile Leu Pro Gly Thr Phe Asn Glu Cys Tyr Pro
770 775 780
Thr Tyr Leu Tyr Gln Lys Ile Gly Glu Ala Glu Leu Lys Ala Tyr Thr
785 790 795 800
Arg Tyr Gln Leu Ser Gly Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile
805 810 815
Tyr Leu Ile Arg Tyr Asn Ala Lys His Glu Thr Leu Asp Val Pro Gly
820 825 830
Thr Glu Ser Val Trp Pro Leu Ser Val Glu Ser Pro Ile Gly Arg Cys
835 840 845
Gly Glu Pro Asn Arg Cys Ala Pro His Phe Glu Trp Asn Pro Asp Leu
850 855 860
Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His
865 870 875 880
Phe Ser Leu Asp Ile Asp Val Gly Cys Ile Asp Leu His Glu Asn Leu
885 890 895
Gly Val Trp Val Val Phe Lys Ile Lys Thr Gln Glu Gly His Ala Arg
900 905 910
Leu Gly Asn Leu Glu Phe Ile Glu Glu Lys Pro Leu Leu Gly Glu Ala
915 920 925
Leu Ser Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu
930 935 940
Lys Leu Gln Leu Glu Thr Lys Arg Val Tyr Thr Glu Ala Lys Glu Ala
945 950 955 960
Val Asp Ala Leu Phe Val Asp Ser Gln Tyr Asp Arg Leu Gln Ala Asp
965 970 975
Thr Asn Ile Gly Met Ile His Ala Ala Asp Lys Leu Val His Arg Ile
980 985 990
Arg Glu Ala Tyr Leu Ser Glu Leu Ser Val Ile Pro Gly Val Asn Ala
995 1000 1005
Glu Ile Phe Glu Glu Leu Glu Gly Arg Ile Ile Thr Ala Ile Ser
1010 1015 1020
Leu Tyr Asp Ala Arg Asn Val Val Lys Asn Gly Asp Phe Asn Asn
1025 1030 1035
Gly Leu Ala Cys Trp Asn Val Lys Gly His Val Asp Val Gln Gln
1040 1045 1050
Ser His His Arg Ser Val Leu Val Ile Pro Glu Trp Glu Ala Glu
1055 1060 1065
Val Ser Gln Ala Val Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu
1070 1075 1080
Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr
1085 1090 1095
Ile His Glu Ile Glu Asn Asn Thr Asp Glu Leu Lys Phe Lys Asn
1100 1105 1110
Cys Glu Glu Glu Glu Val Tyr Pro Thr Asp Thr Gly Thr Cys Asn
1115 1120 1125
Asp Tyr Thr Ala His Gln Gly Thr Ala Ala Cys Asn Ser Arg Asn
1130 1135 1140
Ala Gly Tyr Glu Asp Ala Tyr Glu Val Asp Thr Thr Ala Ser Val
1145 1150 1155
Asn Tyr Lys Pro Thr Tyr Glu Glu Glu Thr Tyr Thr Asp Val Arg
1160 1165 1170
Arg Asp Asn His Cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr Pro
1175 1180 1185
Pro Val Pro Ala Gly Tyr Met Thr Lys Glu Leu Glu Tyr Phe Pro
1190 1195 1200
Glu Thr Asp Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Lys
1205 1210 1215
Phe Ile Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1220 1225 1230
<210> 22
<211> 3666
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC867_24的合成核苷酸序列。
<400> 22
atgaccagca accgaaagaa cgagaacgag atcatcaacg ccctgtccat accggccgtg 60
tcaaaccact ccgcccagat gaacctctcc accgacgcga ggatcgagga ctccctctgc 120
atcgccgagg gcaacaacat cgacccgttc gtgtctgcaa gcacggtcca gaccggcatc 180
aacatcgcgg gccgcatcct gggcgtgctc ggcgtgccct tcgcgggtca aatcgcctct 240
ttctactcat tcctcgtggg cgagctgtgg ccgcgcggac gtgacccgtg ggaaatcttc 300
ctggagcacg ttgagcagct catccggcag caagtgaccg agaacaccag ggacaccgca 360
ctggcacggc tccagggcct tggcaacagc ttccgcgcct accagcagtc gctggaggac 420
tggctggaga accgagacga cgccagaacc cgctcagttc tgtacacaca gtacatcgcc 480
ctagagctgg acttcctcaa cgctatgccg ctcttcgcca tccgtaacca ggaagtaccg 540
cttctgatgg tgtacgcaca agcagcgaac ctccatctgc tcctgctgcg agacgcatct 600
ctgttcggca gtgagttcgg gctgacgagc caggagatcc agcgctacta cgagcgccaa 660
gtggagaaga ctcgtgagta cagcgactac tgcgcgcgct ggtacaacac gggcttgaac 720
aaccttcgcg ggacaaacgc cgaatcctgg cttcgctaca accagttccg ccgcgacctc 780
acgctgggtg tgctggacct ggtcgcgctc ttcccgtcct acgacacacg ggtgtaccca 840
atgaacacga gcgcacagct cacccgtgag atctacacag atcccatcgg ccgcaccaac 900
gctcccagtg gcttcgcaag cacgaattgg ttcaacaata acgctccttc tttctctgcc 960
atcgaggccg ctgtcatcag accgccgcac ttactcgatt tcccggagca gctcactatc 1020
ttctctgtgt tgtcccggtg gtcgaacacg cagtacatga actactgggt gggccacagg 1080
ctagagagcc ggaccatccg tggcagtctc tcaacctcga cccacggcaa cacgaacacg 1140
agcatcaacc ctgtcactct ccagtttaca tctagggacg tttacaggac agagtcgttc 1200
gctggcatta acattctgtt gaccactccg gtgaacggcg tcccttgggc ccgcttcaac 1260
tggaggaatc ctctgaactc actgcgcggc agccttctct acactatcgg ctacaccggc 1320
gttgggacgc aactcttcga ctcggagacc gagctgccgc ccgagaccac cgagcggcct 1380
aactacgaga gttattcaca caggctctcc aacatccgct tgatttctgg gaacaccttg 1440
cgggctccgg tgtactcctg gacgcaccgc agcgccgaca gaactaatac catcagctcc 1500
gactcgatca cccagatccc gctggtgaag gctcacacgc ttcagtcggg caccacagtc 1560
gtcaagggcc ctggcttcac cggcggcgac atcctgcgtc gcacatctgg cggacccttc 1620
gccttcagca acgtgaactt ggacttcaat ttgtcacagc ggtatcgtgc cagaatccgg 1680
tacgccagca ctacgaacct gcgaatctat gttactgtgg cgggcgagcg gatcttcgcc 1740
gggcaattcg acaagacgat ggacgcggga gcacctctga cattccagtc attctcttac 1800
gccacgatca acacggcatt cacgtttccg gagcgttcca gtagcctgac cgtgggcgct 1860
gataccttca gtagcgggaa cgaggtgtac gttgaccgtt tcgagctgat cccggtcacc 1920
gccaccaccg cgacgtttga agctgaatcc gacctcgagc gtgcgcgcaa ggcggtgaac 1980
gctctgttca cgagcaccaa ccctcgtggc ttgaagacgg atgtgacgga ctaccacatc 2040
gaccaagtct cgaacctcgt ggagtgcctg agcgacgagt tctgtcttga caagaagcgc 2100
gagctgctgg aggaggtgaa gtacgccaag cgcctctccg atgagcgcaa cctgctccaa 2160
gatcctacct tcacgtcgat ttccggccaa accgaccgtg gatggatcgg ctcgactggc 2220
atctccatcc agggcggcga cgacatcttc aaggagaact atgttcggct gccgggcacg 2280
gtggacgagt gttacccgac gtacctctac cagaagatag acgagagtca actcaagtcc 2340
tacacgcggt atcagttacg tggctacatt gaagactccc aggacttgga aatctatctc 2400
atacggtaca acgccaagca cgagacctta agcgtgccgg gaacggagtc gccctggcca 2460
agctctggcg tgtacccttc cggtaggtgc ggcgagccca accgctgtgc acctcgaatc 2520
gaatggaacc cggaccttga ctgctcttgc cggtacggcg agaagtgcgt ccatcattct 2580
caccacttca gcttggacat tgacgtcggc tgcaccgacc tcaatgaaga cctcggagtg 2640
tgggtcatct tcaagatcaa gacacaggac gggcacgcga aactaggaaa cctggagttc 2700
atcgaggaga agccactcct cggcaaggca ctttccaggg tcaagcgggc cgagaagaaa 2760
tggagggaca agtacgagaa actccagctc gaaacaaagc gggtgtacac ggaggcaaag 2820
gaatccgtgg acgccctgtt cgtggactct cagtacgaca agctccaggc gaacacaaac 2880
attggcatca tccacggtgc ggacaagcaa gtgcacagga tacgggagcc ttacctctcg 2940
gagctgccgg tgattccctc gatcaacgcg gcgatcttcg aggaactgga gggccacatc 3000
ttcaaggcgt attctctgta cgacgcgcgt aacgtcatca agaacggcga cttcaacaat 3060
gggctgtcct gctggaacgt taaaggccac gtcgatgtcc agcagaacca ccataggtca 3120
gtcctggtgc tgagcgagtg ggaggcggag gtgtcccaga aggtgcgcgt gtgcccggat 3180
cgcggctaca tcttgagggt gacagcctac aaggagggct acggcgaggg ctgtgtcacg 3240
atccatgagt tcgaggacaa cacggatgtc ctgaaattcc gtaacttcgt cgaggaggag 3300
gtctatccca acaacaccgt gacctgcaac gactacacga ccaatcagtc ggctgagggc 3360
agtaccgatg cctgcaacag ctacaaccgt ggttacgaag atggatacga gaaccgctac 3420
gagcccaatc cttcggctcc cgtgaattac actcccacgt acgaggaggg catgtacact 3480
gacactcagg gctacaacca ttgcgtcagc gaccgtggct accgcaacca cacgccgctc 3540
ccagcgggct acgtgacgct ggagctggaa tactttcccg agacagaaca agtgtggata 3600
gagatcggcg agaccgaggg cacattcatc gtgggctctg tggaattgct cctcatggag 3660
gagtaa 3666
<210> 23
<211> 1221
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC867_24的氨基酸序列。
<400> 23
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Thr Gln Ile Pro Leu Val Lys Ala His
500 505 510
Thr Leu Gln Ser Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Phe Ser Asn
530 535 540
Val Asn Leu Asp Phe Asn Leu Ser Gln Arg Tyr Arg Ala Arg Ile Arg
545 550 555 560
Tyr Ala Ser Thr Thr Asn Leu Arg Ile Tyr Val Thr Val Ala Gly Glu
565 570 575
Arg Ile Phe Ala Gly Gln Phe Asp Lys Thr Met Asp Ala Gly Ala Pro
580 585 590
Leu Thr Phe Gln Ser Phe Ser Tyr Ala Thr Ile Asn Thr Ala Phe Thr
595 600 605
Phe Pro Glu Arg Ser Ser Ser Leu Thr Val Gly Ala Asp Thr Phe Ser
610 615 620
Ser Gly Asn Glu Val Tyr Val Asp Arg Phe Glu Leu Ile Pro Val Thr
625 630 635 640
Ala Thr Thr Ala Thr Phe Glu Ala Glu Ser Asp Leu Glu Arg Ala Arg
645 650 655
Lys Ala Val Asn Ala Leu Phe Thr Ser Thr Asn Pro Arg Gly Leu Lys
660 665 670
Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val Ser Asn Leu Val Glu
675 680 685
Cys Leu Ser Asp Glu Phe Cys Leu Asp Lys Lys Arg Glu Leu Leu Glu
690 695 700
Glu Val Lys Tyr Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln
705 710 715 720
Asp Pro Thr Phe Thr Ser Ile Ser Gly Gln Thr Asp Arg Gly Trp Ile
725 730 735
Gly Ser Thr Gly Ile Ser Ile Gln Gly Gly Asp Asp Ile Phe Lys Glu
740 745 750
Asn Tyr Val Arg Leu Pro Gly Thr Val Asp Glu Cys Tyr Pro Thr Tyr
755 760 765
Leu Tyr Gln Lys Ile Asp Glu Ser Gln Leu Lys Ser Tyr Thr Arg Tyr
770 775 780
Gln Leu Arg Gly Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu
785 790 795 800
Ile Arg Tyr Asn Ala Lys His Glu Thr Leu Ser Val Pro Gly Thr Glu
805 810 815
Ser Pro Trp Pro Ser Ser Gly Val Tyr Pro Ser Gly Arg Cys Gly Glu
820 825 830
Pro Asn Arg Cys Ala Pro Arg Ile Glu Trp Asn Pro Asp Leu Asp Cys
835 840 845
Ser Cys Arg Tyr Gly Glu Lys Cys Val His His Ser His His Phe Ser
850 855 860
Leu Asp Ile Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val
865 870 875 880
Trp Val Ile Phe Lys Ile Lys Thr Gln Asp Gly His Ala Lys Leu Gly
885 890 895
Asn Leu Glu Phe Ile Glu Glu Lys Pro Leu Leu Gly Lys Ala Leu Ser
900 905 910
Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Tyr Glu Lys Leu
915 920 925
Gln Leu Glu Thr Lys Arg Val Tyr Thr Glu Ala Lys Glu Ser Val Asp
930 935 940
Ala Leu Phe Val Asp Ser Gln Tyr Asp Lys Leu Gln Ala Asn Thr Asn
945 950 955 960
Ile Gly Ile Ile His Gly Ala Asp Lys Gln Val His Arg Ile Arg Glu
965 970 975
Pro Tyr Leu Ser Glu Leu Pro Val Ile Pro Ser Ile Asn Ala Ala Ile
980 985 990
Phe Glu Glu Leu Glu Gly His Ile Phe Lys Ala Tyr Ser Leu Tyr Asp
995 1000 1005
Ala Arg Asn Val Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser
1010 1015 1020
Cys Trp Asn Val Lys Gly His Val Asp Val Gln Gln Asn His His
1025 1030 1035
Arg Ser Val Leu Val Leu Ser Glu Trp Glu Ala Glu Val Ser Gln
1040 1045 1050
Lys Val Arg Val Cys Pro Asp Arg Gly Tyr Ile Leu Arg Val Thr
1055 1060 1065
Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu
1070 1075 1080
Phe Glu Asp Asn Thr Asp Val Leu Lys Phe Arg Asn Phe Val Glu
1085 1090 1095
Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Thr
1100 1105 1110
Thr Asn Gln Ser Ala Glu Gly Ser Thr Asp Ala Cys Asn Ser Tyr
1115 1120 1125
Asn Arg Gly Tyr Glu Asp Gly Tyr Glu Asn Arg Tyr Glu Pro Asn
1130 1135 1140
Pro Ser Ala Pro Val Asn Tyr Thr Pro Thr Tyr Glu Glu Gly Met
1145 1150 1155
Tyr Thr Asp Thr Gln Gly Tyr Asn His Cys Val Ser Asp Arg Gly
1160 1165 1170
Tyr Arg Asn His Thr Pro Leu Pro Ala Gly Tyr Val Thr Leu Glu
1175 1180 1185
Leu Glu Tyr Phe Pro Glu Thr Glu Gln Val Trp Ile Glu Ile Gly
1190 1195 1200
Glu Thr Glu Gly Thr Phe Ile Val Gly Ser Val Glu Leu Leu Leu
1205 1210 1215
Met Glu Glu
1220
<210> 24
<211> 3651
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC867_25的合成核苷酸序列。
<400> 24
atgaccagca accgaaagaa cgagaacgag atcatcaacg ccctgtccat accggccgtg 60
tcaaaccact ccgcccagat gaacctctcc accgacgcga ggatcgagga ctccctctgc 120
atcgccgagg gcaacaacat cgacccgttc gtgtctgcaa gcacggtcca gaccggcatc 180
aacatcgcgg gccgcatcct gggcgtgctc ggcgtgccct tcgcgggtca aatcgcctct 240
ttctactcat tcctcgtggg cgagctgtgg ccgcgcggac gtgacccgtg ggaaatcttc 300
ctggagcacg ttgagcagct catccggcag caagtgaccg agaacaccag ggacaccgca 360
ctggcacggc tccagggcct tggcaacagc ttccgcgcct accagcagtc gctggaggac 420
tggctggaga accgagacga cgccagaacc cgctcagttc tgtacacaca gtacatcgcc 480
ctagagctgg acttcctcaa cgctatgccg ctcttcgcca tccgtaacca ggaagtaccg 540
cttctgatgg tgtacgcaca agcagcgaac ctccatctgc tcctgctgcg agacgcatct 600
ctgttcggca gtgagttcgg gctgacgagc caggagatcc agcgctacta cgagcgccaa 660
gtggagaaga ctcgtgagta cagcgactac tgcgcgcgct ggtacaacac gggcttgaac 720
aaccttcgcg ggacaaacgc cgaatcctgg cttcgctaca accagttccg ccgcgacctc 780
acgctgggtg tgctggacct ggtcgcgctc ttcccgtcct acgacacacg ggtgtaccca 840
atgaacacga gcgcacagct cacccgtgag atctacacag atcccatcgg ccgcaccaac 900
gctcccagtg gcttcgcaag cacgaattgg ttcaacaata acgctccttc tttctctgcc 960
atcgaggccg ctgtcatcag accgccgcac ttactcgatt tcccggagca gctcactatc 1020
ttctctgtgt tgtcccggtg gtcgaacacg cagtacatga actactgggt gggccacagg 1080
ctagagagcc ggaccatccg tggcagtctc tcaacctcga cccacggcaa cacgaacacg 1140
agcatcaacc ctgtcactct ccagtttaca tctagggacg tttacaggac agagtcgttc 1200
gctggcatta acattctgtt gaccactccg gtgaacggcg tcccttgggc ccgcttcaac 1260
tggaggaatc ctctgaactc actgcgcggc agccttctct acactatcgg ctacaccggc 1320
gttgggacgc aactcttcga ctcggagacc gagctgccgc ccgagaccac cgagcggcct 1380
aactacgaga gttattcaca caggctctcc aacatccgct tgatttctgg gaacaccttg 1440
cgggctccgg tgtactcctg gacgcaccgc agcgccgaca gaactaatac catcagctcc 1500
gactcgatca cccagatccc gctggtgaag gctcacacgc ttcagtcggg caccacagtc 1560
gtcaagggcc ctggcttcac cggcggcgac atcctgcgtc gcacatctgg cggacccttc 1620
gccttcagca acgtgaactt ggacttcaat ttgtcacagc ggtatcgtgc cagaatccgg 1680
tacgccagca ctacgaacct gcgaatctat gttactgtgg cgggcgagcg gatcttcgcc 1740
gggcaattcg acaagacgat ggacgcggga gcacctctga cattccagtc attctcttac 1800
gccacgatca acacggcatt cacgtttccg gagcgttcca gtagcctgac cgtgggcgct 1860
gataccttca gtagcgggaa cgaggtgtac gttgaccgtt tcgagctgat cccggtcacc 1920
gccaccgatg ctacctttga agcagagtcc gacttggaac gtgcacagaa ggcagtgaac 1980
gcactcttca cctcaagcaa ccagatcgga ttgaagacag atgtgacaga ttaccacatc 2040
gaccaagtga gcaacttggt ggattgcttg tcagatgagt tctgcttgga tgagaagcgt 2100
gaactctccg agaaggtgaa gcacgcaaag cgtctctcag atgaacgtaa tctccttcaa 2160
gaccctaact ttcgtggtat caatcgtcag ccagatcgtg gatggcgtgg atcaacagac 2220
atcaccatcc agggaggcga tgatgtgttc aaggagaact acgtgaccct cccaggaacc 2280
gtggatgaat gctacccaac ctacctctac cagaagatcg acgagtcaaa gctcaaggct 2340
tacacccgtt atgaactccg tggctacatc gaagatagcc aggatctcga aatctatctc 2400
atccgttaca atgctaagca cgaaatcgtg aatgtgccag gaaccggctc actctggcca 2460
ctctcagcac agtcaccaat cggcaagtgc ggcgaaccca atcgctgcgc tcctcatctc 2520
gaatggaatc ccgatctcga ctgctcctgc cgagacggcg agaagtgtgc acatcactca 2580
caccacttca ccctcgacat cgacgtgggc tgcaccgacc tcaatgaaga cctgggcgtg 2640
tgggtgatct tcaagatcaa gacccaggac ggccacgcac gactgggcaa tctggagttt 2700
ctggaggaga agccactgct tggcgaggca ctggcacgag tgaaacgagc cgagaagaaa 2760
tggcgagaca aacgtgagaa gctgcaactg gagaccaaca tcgtgtacaa agaggccaaa 2820
gagtcagttg acgccctgtt tgtcaatagc cagtatgacc gactgcaagt tgacaccaac 2880
atcgccatga tccacgctgc ggacaagcgc gtccaccgca tccgcgaggc ttatctgccc 2940
gagctgagcg tcattcccgg cgtcaatgcc gcgatcttcg aggagttaga gggccgcatc 3000
ttcaccgcct acagcctcta tgacgcccgc aatgtcatta agaatggcga cttcaacaat 3060
ggcttactat gctggaatgt caaagggcac gttgacgtcg aggagcagaa caatcaccgc 3120
agcgtcttag tcatacccga gtgggaggcc gaagtcagcc aggaagtccg cgtctgtcca 3180
gggcgcgggt acatcctgcg ggtcaccgcc tacaaagagg gatacggcga gggttgtgtc 3240
accatacacg agatagagga caataccgac gaactcaagt tcagcaattg tgtcgaggag 3300
gaagtctatc ccaacaatac cgtaacctgc aacaactaca ccggaaccca ggaggagtat 3360
gaagggacgt acacctcgcg gaaccagggc tatgacgaag cctatgggaa caacccgtcg 3420
gtgcctgctg actatgcgtc ggtctatgag gagaaatcgt acacggacgg gcggcgggag 3480
aatccgtgtg agtcgaatcg cgggtatggt gactacacgc cgctaccggc gggctatgta 3540
acgaaagacc tggaatactt cccggagacg gacaaagtat ggatagagat aggcgagacg 3600
gagggaacgt tcatcgtgga ctcggtagag ctgctgctca tggaggagtg a 3651
<210> 25
<211> 1216
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC867_25的氨基酸序列。
<400> 25
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Thr Gln Ile Pro Leu Val Lys Ala His
500 505 510
Thr Leu Gln Ser Gly Thr Thr Val Val Lys Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Thr Ser Gly Gly Pro Phe Ala Phe Ser Asn
530 535 540
Val Asn Leu Asp Phe Asn Leu Ser Gln Arg Tyr Arg Ala Arg Ile Arg
545 550 555 560
Tyr Ala Ser Thr Thr Asn Leu Arg Ile Tyr Val Thr Val Ala Gly Glu
565 570 575
Arg Ile Phe Ala Gly Gln Phe Asp Lys Thr Met Asp Ala Gly Ala Pro
580 585 590
Leu Thr Phe Gln Ser Phe Ser Tyr Ala Thr Ile Asn Thr Ala Phe Thr
595 600 605
Phe Pro Glu Arg Ser Ser Ser Leu Thr Val Gly Ala Asp Thr Phe Ser
610 615 620
Ser Gly Asn Glu Val Tyr Val Asp Arg Phe Glu Leu Ile Pro Val Thr
625 630 635 640
Ala Thr Asp Ala Thr Phe Glu Ala Glu Ser Asp Leu Glu Arg Ala Gln
645 650 655
Lys Ala Val Asn Ala Leu Phe Thr Ser Ser Asn Gln Ile Gly Leu Lys
660 665 670
Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val Ser Asn Leu Val Asp
675 680 685
Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu
690 695 700
Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln
705 710 715 720
Asp Pro Asn Phe Arg Gly Ile Asn Arg Gln Pro Asp Arg Gly Trp Arg
725 730 735
Gly Ser Thr Asp Ile Thr Ile Gln Gly Gly Asp Asp Val Phe Lys Glu
740 745 750
Asn Tyr Val Thr Leu Pro Gly Thr Val Asp Glu Cys Tyr Pro Thr Tyr
755 760 765
Leu Tyr Gln Lys Ile Asp Glu Ser Lys Leu Lys Ala Tyr Thr Arg Tyr
770 775 780
Glu Leu Arg Gly Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu
785 790 795 800
Ile Arg Tyr Asn Ala Lys His Glu Ile Val Asn Val Pro Gly Thr Gly
805 810 815
Ser Leu Trp Pro Leu Ser Ala Gln Ser Pro Ile Gly Lys Cys Gly Glu
820 825 830
Pro Asn Arg Cys Ala Pro His Leu Glu Trp Asn Pro Asp Leu Asp Cys
835 840 845
Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe Thr
850 855 860
Leu Asp Ile Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val
865 870 875 880
Trp Val Ile Phe Lys Ile Lys Thr Gln Asp Gly His Ala Arg Leu Gly
885 890 895
Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ala
900 905 910
Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu
915 920 925
Gln Leu Glu Thr Asn Ile Val Tyr Lys Glu Ala Lys Glu Ser Val Asp
930 935 940
Ala Leu Phe Val Asn Ser Gln Tyr Asp Arg Leu Gln Val Asp Thr Asn
945 950 955 960
Ile Ala Met Ile His Ala Ala Asp Lys Arg Val His Arg Ile Arg Glu
965 970 975
Ala Tyr Leu Pro Glu Leu Ser Val Ile Pro Gly Val Asn Ala Ala Ile
980 985 990
Phe Glu Glu Leu Glu Gly Arg Ile Phe Thr Ala Tyr Ser Leu Tyr Asp
995 1000 1005
Ala Arg Asn Val Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu Leu
1010 1015 1020
Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gln Asn Asn
1025 1030 1035
His Arg Ser Val Leu Val Ile Pro Glu Trp Glu Ala Glu Val Ser
1040 1045 1050
Gln Glu Val Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val
1055 1060 1065
Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His
1070 1075 1080
Glu Ile Glu Asp Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val
1085 1090 1095
Glu Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asn Tyr
1100 1105 1110
Thr Gly Thr Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg Asn
1115 1120 1125
Gln Gly Tyr Asp Glu Ala Tyr Gly Asn Asn Pro Ser Val Pro Ala
1130 1135 1140
Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg
1145 1150 1155
Arg Glu Asn Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr
1160 1165 1170
Pro Leu Pro Ala Gly Tyr Val Thr Lys Asp Leu Glu Tyr Phe Pro
1175 1180 1185
Glu Thr Asp Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Thr
1190 1195 1200
Phe Ile Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1205 1210 1215
<210> 26
<211> 3600
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC868的重组核苷酸序列。
<400> 26
atgacttcaa ataggaaaaa tgagaatgaa attataaatg ctttatcgat tccagctgta 60
tcgaatcatt ccgcacaaat gaatctatca accgatgctc gtattgagga tagcttgtgt 120
atagccgagg ggaacaatat cgatccattt gttagcgcat caacagtcca aacgggtatt 180
aacatagctg gtagaatact aggtgtatta ggcgtaccgt ttgctggaca aatagctagt 240
ttttatagtt ttcttgttgg tgaattatgg ccccgcggca gagatccttg ggaaattttc 300
ctagaacatg tcgaacaact tataagacaa caagtaacag aaaatactag ggatacggct 360
cttgctcgat tacaaggttt aggaaattcc tttagagcct atcaacagtc acttgaagat 420
tggctagaaa accgtgatga tgcaagaacg agaagtgttc tttataccca atatatagcc 480
ttagaacttg attttcttaa tgcgatgccg cttttcgcaa ttagaaacca agaagttcca 540
ttattaatgg tatatgctca agctgcaaat ttacacctat tattattgag agatgcctct 600
ctttttggta gtgaatttgg gcttacatcc caagaaattc aacgttatta tgagcgccaa 660
gtggaaaaaa cgagagaata ttctgattat tgcgcaagat ggtataatac gggtttaaat 720
aatttgagag ggacaaatgc tgaaagttgg ttgcgatata atcaattccg tagagactta 780
acgctaggag tattagatct agtggcacta ttcccaagct atgacacgcg tgtttatcca 840
atgaatacca gtgctcaatt aacaagagaa atttatacag atccaattgg gagaacaaat 900
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960
atagaggctg ccgttattag gcctccgcat ctacttgatt ttccagaaca gcttacaatt 1020
ttcagcgtat taagtcgatg gagtaatact caatatatga attactgggt gggacataga 1080
cttgaatcgc gaacaataag ggggtcatta agtacctcga cacacggaaa taccaatact 1140
tctattaatc ctgtaacatt acagttcaca tctcgagacg tttatagaac agaatcattt 1200
gcagggataa atatacttct aactactcct gtgaatggag taccttgggc tagatttaat 1260
tggagaaatc ccctgaattc tcttagaggt agccttctct atactatagg gtatactgga 1320
gtggggacac aactatttga ttcagaaact gaattaccac cagaaacaac agaacgacca 1380
aattatgaat cttacagtca tagattatct aatataagac taatatcagg aaacactttg 1440
agagcaccag tatattcttg gacgcaccgt agtgcagatc gtacaaatac cattagttca 1500
gatagcatta atcaaatacc tttagtgaaa ggatttagag tttggggggg cacctctgtc 1560
attacaggac caggatttac aggaggggat atccttcgaa gaaatacctt tggtgatttt 1620
gtatctctac aagtcaatat taattcacca attacccaaa gataccgttt aagatttcgt 1680
tacgcttcca gtagggatgc acgagttata gtattaacag gagcggcatc cacaggagtg 1740
ggaggccaag ttagtgtaaa tatgcctctt cagaaaacta tggaaatagg ggagaactta 1800
acatctagaa catttagata taccgatttt agtaatcctt tttcatttag agctaatcca 1860
gatataattg ggataagtga acaacctcta tttggtgcag gttctattag tagcggtgaa 1920
ctttatatag ataaaattga aattattcta gcagatgcaa catttgaagc agaatctgat 1980
ttagaaagag cacaaaaggc ggtgaatgag ctgtttactt cttccaatca aatcgggtta 2040
aaaacagatg tgacggatta tcatattgat caagtatcca atttagttga gtgtttatct 2100
gatgaatttt gtctggatga aaaaaaagaa ttgtccgaga aagtcaaaca tgcgaagcga 2160
cttagtgatg agcggaattt acttcaagat ccaaacttta gagggatcaa tagacaacta 2220
gaccgtggct ggagaggaag tacggatatt accatccaag gaggcgatga cgtattcaaa 2280
gagaattacg ttacgctatt gggtaccttt gatgagtgct atccaacgta tttatatcaa 2340
aaaatagatg agtcgaaatt aaaagcctat acccgttacc aattaagagg gtatatcgaa 2400
gatagtcaag acttagaaat ctatttaatt cgctacaatg ccaaacacga aacagtaaat 2460
gtgccaggta cgggttcctt atggccgctt tcagccccaa gtccaatcgg aaaatgtgcc 2520
catcattccc atcatttctc cttggacatt gatgttggat gtacagactt aaatgaggac 2580
ttaggtgtat gggtgatatt caagattaag acgcaagatg gccatgcaag actaggaaat 2640
ctagaatttc tcgaagagaa accattagta ggagaagcac tagctcgtgt gaaaagagcg 2700
gagaaaaaat ggagagacaa acgtgaaaaa ttggaatggg aaacaaatat tgtttataaa 2760
gaggcaaaag aatctgtaga tgctttattt gtaaactctc aatatgatag attacaagcg 2820
gataccaaca tcgcgatgat tcatgcggca gataaacgcg ttcatagcat tcgagaagct 2880
tatctgcctg agctgtctgt gattccgggt gtcaatgcgg ctatttttga agaattagaa 2940
gggcgtattt tcactgcatt ctccctatat gatgcgagaa atgtcattaa aaatggtgat 3000
tttaataatg gcttatcctg ctggaacgtg aaagggcatg tagatgtaga agaacaaaac 3060
aaccaccgtt cggtccttgt tgttccggaa tgggaagcag aagtgtcaca agaagttcgt 3120
gtctgtccgg gtcgtggcta tatccttcgt gtcacagcgt acaaggaggg atatggagaa 3180
ggttgcgtaa ccattcatga gatcgagaac aatacagacg aactgaagtt tagcaactgt 3240
gtagaagagg aagtatatcc aaacaacacg gtaacgtgta atgattatac tgcgactcaa 3300
gaagaatatg agggtacgta cacttctcgt aatcgaggat atgacggagc ctatgaaagc 3360
aattcttctg taccagctga ttatgcatca gcctatgaag aaaaagcata tacagatgga 3420
cgaagagaca atccttgtga atctaacaga ggatatgggg attacacacc actaccagct 3480
ggctatgtga caaaagaatt agagtacttc ccagaaaccg ataaggtatg gattgagatc 3540
ggagaaacgg aaggaacatt catcgtggac agcgtggaat tacttcttat ggaggaatag 3600
<210> 27
<211> 3600
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC868的合成核苷酸序列。
<400> 27
atgacgagca accggaagaa cgagaacgag atcatcaacg ccctctcgat ccctgctgtt 60
tcaaaccact ccgcgcagat gaacctgtcc accgacgcgc gcatcgagga ctccctctgc 120
atagccgagg gcaacaacat cgacccattc gtgtcggcca gcacggttca gaccggcatc 180
aacatcgcgg gccgtatcct cggcgtcctc ggtgtcccat tcgccggtca gatcgcgtcc 240
ttctactcgt tccttgtggg cgagctgtgg cctcgcggtc gtgacccgtg ggagatcttc 300
ctggagcatg tggagcagtt gatccggcag caagtcacgg agaacacccg cgatactgct 360
ctggccaggc tacagggcct gggaaactcc tttcgggcat accagcagtc actggaggac 420
tggttggaga acagggatga cgcgcgaaca cgctcggtac tctacaccca gtacatcgct 480
ctcgaactcg acttcctgaa cgctatgccg ctgttcgcca tcaggaacca ggaagttcca 540
ctccttatgg tgtacgccca ggccgccaac ttacatctgc tcctgctgcg ggacgccagc 600
ctgttcggct ccgagttcgg actcacatct caagaaatcc agcgttacta cgagcgccaa 660
gtggagaaga cccgtgagta cagtgactac tgcgctcgat ggtacaacac agggctcaac 720
aacctgcgcg gcaccaacgc tgagtcatgg ctccgttaca accagttccg ccgcgacttg 780
actttgggtg tcctagacct ggtggcgcta ttcccgtctt acgacacacg ggtgtaccca 840
atgaacacta gcgcgcaact cacgcgggag atctacacag acccaatcgg ccggacgaac 900
gcaccctccg gtttcgcatc cacgaattgg ttcaacaaca acgcaccctc cttctcggca 960
atcgaggccg ccgtcatccg ccctcctcac ctgctcgact ttcccgagca gctcacgatc 1020
ttctccgtgc tctcacgctg gtccaacaca cagtacatga actactgggt cgggcaccga 1080
ttggagagta ggacgatccg tggcagcttg agcaccagta cccacggcaa caccaacacc 1140
tccatcaacc cagttacgct acagttcacg agccgcgacg tttaccggac tgagtcgttc 1200
gcgggcatta acatccttct gacaacgccc gtcaacggcg tcccgtgggc ccggttcaac 1260
tggcgtaacc cgttgaactc cctgcgcggg tcattgctct acaccatcgg gtacacgggc 1320
gtcggcaccc agctcttcga cagtgaaact gagctgccgc ccgagaccac ggaacgcccg 1380
aactacgagt cctacagcca ccgcctgtcc aacatccggc tcatctctgg caacacgctg 1440
cgtgcgccgg tgtactcctg gacacaccgc agcgccgacc ggaccaacac gatctcttcc 1500
gactccatta accagatccc gctcgtgaag ggcttccgtg tgtggggtgg cacgagcgtc 1560
atcaccggtc cgggcttcac cggtggagac atactgcggc gcaacacttt cggcgacttc 1620
gtttcgttgc aagtgaacat caactcgccg atcacccagc gttaccgtct gaggttccgc 1680
tacgcttcaa gccgcgacgc gagggtcatt gtcctgaccg gagccgcgtc cacaggcgtg 1740
ggaggccaag tctcagtcaa catgcctctc cagaagacga tggagatagg cgagaacttg 1800
actagccgaa ccttccggta cactgatttc tcgaaccctt tctcattcag agcgaaccct 1860
gacatcattg ggatctccga gcaaccgctg ttcggtgctg gctccatcag ctctggcgaa 1920
ctgtacatcg acaagattga gatcatcctg gcggatgcga cgttcgaggc cgagtctgac 1980
ctggagcggg ctcagaaggc tgtcaacgaa ctgttcacca gcagcaacca gattgggctc 2040
aagaccgacg tcacggacta tcacattgac caagtgtcca accttgtgga gtgcctgtcc 2100
gacgagttct gcctcgacga gaagaaggag ctgtccgaga aggtcaaaca cgcgaagcgt 2160
ctgagtgacg agcggaattt gctccaggac ccgaacttcc gtggcatcaa ccgccagctc 2220
gaccgtggtt ggcgcgggag tacagacatc accatccagg gaggcgacga tgtgttcaag 2280
gagaactatg tgacgctgct cgggactttc gacgaatgct acccgacgta tctctaccag 2340
aagatagacg agagtaaatt gaaggcgtac acccgctacc agcttcgcgg gtacatcgag 2400
gatagtcagg acctggaaat ctacctgatc cgatacaacg ccaagcacga gacagtgaac 2460
gtgccaggca cgggctcact ttggccattg agcgctccct ctccaatcgg aaagtgcgct 2520
caccactcgc accacttctc tctggacatc gacgtgggct gcaccgacct caacgaggac 2580
ctgggtgtct gggttatctt caagattaag acccaggacg gacatgcccg cctcggcaac 2640
ctggagttcc ttgaggagaa gcctctcgtg ggcgaggccc tcgctcgtgt gaagcgcgcc 2700
gagaagaaat ggcgagacaa gcgggagaag ctggagtggg agaccaacat cgtgtacaag 2760
gaggccaagg agtcagtgga cgcactcttc gtcaacagcc agtacgaccg cctccaggct 2820
gacaccaaca tcgccatgat ccacgcggct gacaagcggg tccacagcat ccgtgaggcg 2880
tacctgcccg agctgtcagt gatccctggt gtgaacgcgg cgatcttcga ggaactggag 2940
ggccgcatct tcacagcatt cagcctgtac gatgccagga atgttattaa gaacggtgac 3000
ttcaacaacg ggctgagttg ctggaacgtc aagggccatg tggacgtcga ggagcagaac 3060
aaccaccggt ccgtgctggt cgtgccggag tgggaggcag aggtgagcca ggaggtccgc 3120
gtctgccctg gtcgcggcta catcctccgt gtgactgcgt acaaggaagg ctacggtgaa 3180
ggctgcgtga ctatccacga gatcgagaac aacaccgacg agctcaagtt ctcgaactgt 3240
gtggaggagg aggtgtaccc gaacaacacc gttacttgca acgactacac tgccacgcaa 3300
gaggagtacg agggcactta cacttcccgg aatcgcggct atgatggcgc gtacgagtcc 3360
aacagcagcg tgcctgcgga ttatgcgtcc gcttacgagg agaaggcgta caccgacgga 3420
cggagggaca acccttgcga gtccaaccgt ggctacggtg actacactcc gctgcccgcc 3480
gggtacgtca ccaaggagct ggagtacttc ccggagaccg acaaagtctg gatcgagatc 3540
ggcgagacgg agggcacttt catcgtggac tcggtcgagc tgctactgat ggaggagtga 3600
<210> 28
<211> 1199
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质TIC868的氨基酸序列。
<400> 28
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Asn Gln Ile Pro Leu Val Lys Gly Phe
500 505 510
Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln
530 535 540
Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg
545 550 555 560
Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala
565 570 575
Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys
580 585 590
Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
595 600 605
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly
610 615 620
Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
625 630 635 640
Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Phe Glu
645 650 655
Ala Glu Ser Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Glu Leu Phe
660 665 670
Thr Ser Ser Asn Gln Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr His
675 680 685
Ile Asp Gln Val Ser Asn Leu Val Glu Cys Leu Ser Asp Glu Phe Cys
690 695 700
Leu Asp Glu Lys Lys Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg
705 710 715 720
Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe Arg Gly Ile
725 730 735
Asn Arg Gln Leu Asp Arg Gly Trp Arg Gly Ser Thr Asp Ile Thr Ile
740 745 750
Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Leu Gly
755 760 765
Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile Asp Glu
770 775 780
Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Gln Leu Arg Gly Tyr Ile Glu
785 790 795 800
Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala Lys His
805 810 815
Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala
820 825 830
Pro Ser Pro Ile Gly Lys Cys Ala His His Ser His His Phe Ser Leu
835 840 845
Asp Ile Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp
850 855 860
Val Ile Phe Lys Ile Lys Thr Gln Asp Gly His Ala Arg Leu Gly Asn
865 870 875 880
Leu Glu Phe Leu Glu Glu Lys Pro Leu Val Gly Glu Ala Leu Ala Arg
885 890 895
Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu
900 905 910
Trp Glu Thr Asn Ile Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala
915 920 925
Leu Phe Val Asn Ser Gln Tyr Asp Arg Leu Gln Ala Asp Thr Asn Ile
930 935 940
Ala Met Ile His Ala Ala Asp Lys Arg Val His Ser Ile Arg Glu Ala
945 950 955 960
Tyr Leu Pro Glu Leu Ser Val Ile Pro Gly Val Asn Ala Ala Ile Phe
965 970 975
Glu Glu Leu Glu Gly Arg Ile Phe Thr Ala Phe Ser Leu Tyr Asp Ala
980 985 990
Arg Asn Val Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp
995 1000 1005
Asn Val Lys Gly His Val Asp Val Glu Glu Gln Asn Asn His Arg
1010 1015 1020
Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser Gln Glu
1025 1030 1035
Val Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr Ala
1040 1045 1050
Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu Ile
1055 1060 1065
Glu Asn Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu
1070 1075 1080
Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Thr Ala
1085 1090 1095
Thr Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg Asn Arg Gly
1100 1105 1110
Tyr Asp Gly Ala Tyr Glu Ser Asn Ser Ser Val Pro Ala Asp Tyr
1115 1120 1125
Ala Ser Ala Tyr Glu Glu Lys Ala Tyr Thr Asp Gly Arg Arg Asp
1130 1135 1140
Asn Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu
1145 1150 1155
Pro Ala Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro Glu Thr
1160 1165 1170
Asp Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Thr Phe Ile
1175 1180 1185
Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1190 1195
<210> 29
<211> 3600
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC868_9的合成核苷酸序列。
<400> 29
atgacgagca accggaagaa cgagaacgag atcatcaacg ccctctcgat ccctgctgtt 60
tcaaaccact ccgcgcagat gaacctgtcc accgacgcgc gcatcgagga ctccctctgc 120
atagccgagg gcaacaacat cgacccattc gtgtcggcca gcacggttca gaccggcatc 180
aacatcgcgg gccgtatcct cggcgtcctc ggtgtcccat tcgccggtca gatcgcgtcc 240
ttctactcgt tccttgtggg cgagctgtgg cctcgcggtc gtgacccgtg ggagatcttc 300
ctggagcatg tggagcagtt gatccggcag caagtcacgg agaacacccg cgatactgct 360
ctggccaggc tacagggcct gggaaactcc tttcgggcat accagcagtc actggaggac 420
tggttggaga acagggatga cgcgcgaaca cgctcggtac tctacaccca gtacatcgct 480
ctcgaactcg acttcctgaa cgctatgccg ctgttcgcca tcaggaacca ggaagttcca 540
ctccttatgg tgtacgccca ggccgccaac ttacatctgc tcctgctgcg ggacgccagc 600
ctgttcggct ccgagttcgg actcacatct caagaaatcc agcgttacta cgagcgccaa 660
gtggagaaga cccgtgagta cagtgactac tgcgctcgat ggtacaacac agggctcaac 720
agcctgcgcg gcaccaacgc tgagtcatgg ctccgttaca accagttccg ccgcgacttg 780
actttgggtg tcctagacct ggtggcgcta ttcccgtctt acgacacacg ggtgtaccca 840
atgaacacta gcgcgcaact cacgcgggag atctacacag acccaatcgg ccggacgaac 900
gcaccctccg gtttcgcatc cacgaattgg ttcaacaaca acgcaccctc cttctcggca 960
atcgaggccg ccgtcatccg ccctcctcac ctgctcgact ttcccgagca gctcacgatc 1020
ttctccgtgc tctcacgctg gtccaacaca cagtacatga actactgggt cgggcaccga 1080
ttggagagta ggacgatccg tggcagcttg agcaccagta cccacggcaa caccaacacc 1140
tccatcaacc cagttacgct acagttcacg agccgcgacg tttaccggac tgagtcgcag 1200
gcgggcatta acatccttat gacaacgccc gtcaacggcg tcccgtgggc ccggttcaac 1260
tggcgtaacc cgaagaactc cctgcgcggg tcattgctct acaccatcgg gtacacgggc 1320
gtcggcaccc agctcttcga cagtgaaact gagctgccgc ccgagaccac ggaacgcccg 1380
aactacgagt cctacagcca ccgcctgtcc aacatccggc tcatctctgg caacacgctg 1440
cgtgcgccgg tgtactcctg gacacaccgc agcgccgacc ggaccaacac gatctcttcc 1500
gactccatta accagatccc gctcgtgaag ggcttccgtg tgtggggtgg cacgagcgtc 1560
atcaccggtc cgggcttcac cggtggagac atactgcggc gcaacacttt cggcgacttc 1620
gtttcgttgc aagtgaacat caactcgccg atcacccagc gttaccgtct gaggttccgc 1680
tacgcttcaa gccgcgacgc gagggtcatt gtcctgaccg gagccgcgtc cacaggcgtg 1740
ggaggccaag tctcagtcaa catgcctctc cagaagacga tggagatagg cgagaacttg 1800
actagccgaa ccttccggta cactgatttc tcgaaccctt tctcattcag agcgaaccct 1860
gacatcattg ggatctccga gcaaccgctg ttcggtgctg gctccatcag ctctggcgaa 1920
ctgtacatcg acaagattga gatcatcctg gcggatgcga cgttcgaggc cgagtctgac 1980
ctggagcggg ctcagaaggc tgtcaacgaa ctgttcacca gcagcaacca gattgggctc 2040
aagaccgacg tcacggacta tcacattgac caagtgtcca accttgtgga gtgcctgtcc 2100
gacgagttct gcctcgacga gaagaaggag ctgtccgaga aggtcaaaca cgcgaagcgt 2160
ctgagtgacg agcggaattt gctccaggac ccgaacttcc gtggcatcaa ccgccagctc 2220
gaccgtggtt ggcgcgggag tacagacatc accatccagg gaggcgacga tgtgttcaag 2280
gagaactatg tgacgctgct cgggactttc gacgaatgct acccgacgta tctctaccag 2340
aagatagacg agagtaaatt gaaggcgtac acccgctacc agcttcgcgg gtacatcgag 2400
gatagtcagg acctggaaat ctacctgatc cgatacaacg ccaagcacga gacagtgaac 2460
gtgccaggca cgggctcact ttggccattg agcgctccct ctccaatcgg aaagtgcgct 2520
caccactcgc accacttctc tctggacatc gacgtgggct gcaccgacct caacgaggac 2580
ctgggtgtct gggttatctt caagattaag acccaggacg gacatgcccg cctcggcaac 2640
ctggagttcc ttgaggagaa gcctctcgtg ggcgaggccc tcgctcgtgt gaagcgcgcc 2700
gagaagaaat ggcgagacaa gcgggagaag ctggagtggg agaccaacat cgtgtacaag 2760
gaggccaagg agtcagtgga cgcactcttc gtcaacagcc agtacgaccg cctccaggct 2820
gacaccaaca tcgccatgat ccacgcggct gacaagcggg tccacagcat ccgtgaggcg 2880
tacctgcccg agctgtcagt gatccctggt gtgaacgcgg cgatcttcga ggaactggag 2940
ggccgcatct tcacagcatt cagcctgtac gatgccagga atgttattaa gaacggtgac 3000
ttcaacaacg ggctgagttg ctggaacgtc aagggccatg tggacgtcga ggagcagaac 3060
aaccaccggt ccgtgctggt cgtgccggag tgggaggcag aggtgagcca ggaggtccgc 3120
gtctgccctg gtcgcggcta catcctccgt gtgactgcgt acaaggaagg ctacggtgaa 3180
ggctgcgtga ctatccacga gatcgagaac aacaccgacg agctcaagtt ctcgaactgt 3240
gtggaggagg aggtgtaccc gaacaacacc gttacttgca acgactacac tgccacgcaa 3300
gaggagtacg agggcactta cacttcccgg aatcgcggct atgatggcgc gtacgagtcc 3360
aacagcagcg tgcctgcgga ttatgcgtcc gcttacgagg agaaggcgta caccgacgga 3420
cggagggaca acccttgcga gtccaaccgt ggctacggtg actacactcc gctgcccgcc 3480
gggtacgtca ccaaggagct ggagtacttc ccggagaccg acaaagtctg gatcgagatc 3540
ggcgagacgg agggcacttt catcgtggac tcggtcgagc tgctactgat ggaggagtga 3600
<210> 30
<211> 1199
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC868_9的氨基酸序列。
<400> 30
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Ser Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Gln
385 390 395 400
Ala Gly Ile Asn Ile Leu Met Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Lys Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Asn Gln Ile Pro Leu Val Lys Gly Phe
500 505 510
Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln
530 535 540
Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg
545 550 555 560
Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala
565 570 575
Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys
580 585 590
Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
595 600 605
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly
610 615 620
Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
625 630 635 640
Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Phe Glu
645 650 655
Ala Glu Ser Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Glu Leu Phe
660 665 670
Thr Ser Ser Asn Gln Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr His
675 680 685
Ile Asp Gln Val Ser Asn Leu Val Glu Cys Leu Ser Asp Glu Phe Cys
690 695 700
Leu Asp Glu Lys Lys Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg
705 710 715 720
Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe Arg Gly Ile
725 730 735
Asn Arg Gln Leu Asp Arg Gly Trp Arg Gly Ser Thr Asp Ile Thr Ile
740 745 750
Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Leu Gly
755 760 765
Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile Asp Glu
770 775 780
Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Gln Leu Arg Gly Tyr Ile Glu
785 790 795 800
Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala Lys His
805 810 815
Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala
820 825 830
Pro Ser Pro Ile Gly Lys Cys Ala His His Ser His His Phe Ser Leu
835 840 845
Asp Ile Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp
850 855 860
Val Ile Phe Lys Ile Lys Thr Gln Asp Gly His Ala Arg Leu Gly Asn
865 870 875 880
Leu Glu Phe Leu Glu Glu Lys Pro Leu Val Gly Glu Ala Leu Ala Arg
885 890 895
Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu
900 905 910
Trp Glu Thr Asn Ile Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala
915 920 925
Leu Phe Val Asn Ser Gln Tyr Asp Arg Leu Gln Ala Asp Thr Asn Ile
930 935 940
Ala Met Ile His Ala Ala Asp Lys Arg Val His Ser Ile Arg Glu Ala
945 950 955 960
Tyr Leu Pro Glu Leu Ser Val Ile Pro Gly Val Asn Ala Ala Ile Phe
965 970 975
Glu Glu Leu Glu Gly Arg Ile Phe Thr Ala Phe Ser Leu Tyr Asp Ala
980 985 990
Arg Asn Val Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp
995 1000 1005
Asn Val Lys Gly His Val Asp Val Glu Glu Gln Asn Asn His Arg
1010 1015 1020
Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser Gln Glu
1025 1030 1035
Val Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr Ala
1040 1045 1050
Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu Ile
1055 1060 1065
Glu Asn Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu
1070 1075 1080
Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Thr Ala
1085 1090 1095
Thr Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg Asn Arg Gly
1100 1105 1110
Tyr Asp Gly Ala Tyr Glu Ser Asn Ser Ser Val Pro Ala Asp Tyr
1115 1120 1125
Ala Ser Ala Tyr Glu Glu Lys Ala Tyr Thr Asp Gly Arg Arg Asp
1130 1135 1140
Asn Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu
1145 1150 1155
Pro Ala Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro Glu Thr
1160 1165 1170
Asp Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Thr Phe Ile
1175 1180 1185
Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1190 1195
<210> 31
<211> 3678
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC868_10的重组核苷酸序列。
<400> 31
atgacttcaa ataggaaaaa tgagaatgaa attataaatg ctttatcgat tccagctgta 60
tcgaatcatt ccgcacaaat gaatctatca accgatgctc gtattgagga tagcttgtgt 120
atagccgagg ggaacaatat cgatccattt gttagcgcat caacagtcca aacgggtatt 180
aacatagctg gtagaatact aggtgtatta ggcgtaccgt ttgctggaca aatagctagt 240
ttttatagtt ttcttgttgg tgaattatgg ccccgcggca gagatccttg ggaaattttc 300
ctagaacatg tcgaacaact tataagacaa caagtaacag aaaatactag ggatacggct 360
cttgctcgat tacaaggttt aggaaattcc tttagagcct atcaacagtc acttgaagat 420
tggctagaaa accgtgatga tgcaagaacg agaagtgttc tttataccca atatatagcc 480
ttagaacttg attttcttaa tgcgatgccg cttttcgcaa ttagaaacca agaagttcca 540
ttattaatgg tatatgctca agctgcaaat ttacacctat tattattgag agatgcctct 600
ctttttggta gtgaatttgg gcttacatcc caagaaattc aacgttatta tgagcgccaa 660
gtggaaaaaa cgagagaata ttctgattat tgcgcaagat ggtataatac gggtttaaat 720
aatttgagag ggacaaatgc tgaaagttgg ttgcgatata atcaattccg tagagactta 780
acgctaggag tattagatct agtggcacta ttcccaagct atgacacgcg tgtttatcca 840
atgaatacca gtgctcaatt aacaagagaa atttatacag atccaattgg gagaacaaat 900
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960
atagaggctg ccgttattag gcctccgcat ctacttgatt ttccagaaca gcttacaatt 1020
ttcagcgtat taagtcgatg gagtaatact caatatatga attactgggt gggacataga 1080
cttgaatcgc gaacaataag ggggtcatta agtacctcga cacacggaaa taccaatact 1140
tctattaatc ctgtaacatt acagttcaca tctcgagacg tttatagaac agaatcattt 1200
gcagggataa atatacttct aactactcct gtgaatggag taccttgggc tagatttaat 1260
tggagaaatc ccctgaattc tcttagaggt agccttctct atactatagg gtatactgga 1320
gtggggacac aactatttga ttcagaaact gaattaccac cagaaacaac agaacgacca 1380
aattatgaat cttacagtca tagattatct aatataagac taatatcagg aaacactttg 1440
agagcaccag tatattcttg gacgcaccgt agtgcagatc gtacaaatac cattagttca 1500
gatagcatta atcaaatacc tttagtgaaa ggatttagag tttggggggg cacctctgtc 1560
attacaggac caggatttac aggaggggat atccttcgaa gaaatacctt tggtgatttt 1620
gtatctctac aagtcaatat taattcacca attacccaaa gataccgttt aagatttcgt 1680
tacgcttcca gtagggatgc acgagttata gtattaacag gagcggcatc cacaggagtg 1740
ggaggccaag ttagtgtaaa tatgcctctt cagaaaacta tggaaatagg ggagaactta 1800
acatctagaa catttagata taccgatttt agtaatcctt tttcatttag agctaatcca 1860
gatataattg ggataagtga acaacctcta tttggtgcag gttctattag tagcggtgaa 1920
ctttatatag ataaaattga aattattcta gcagatgcaa catttgaggc agaatatgat 1980
ttagaaagag cgcaaaaggt ggtgaatgcc ctgtttacgt ctacaaacca actagggcta 2040
aaaacagatg tgacggatta tcatattgat caggtatcca atctagttgc gtgtttatcg 2100
gatgaatttt gtctggatga aaagagagaa ttgtccgaga aagttaaaca tgcaaagcga 2160
ctcagtgatg agcggaattt acttcaagat ccaaacttca gagggatcaa taggcaacca 2220
gaccgtggct ggagaggaag tacggatatt actatccaag gaggagatga cgtattcaaa 2280
gagaattacg ttacgctacc gggtaccttt gatgagtgct atccaacgta tttatatcaa 2340
aaaatagatg agtcgaaatt aaaagcctat acccgttatc aattaagagg gtatatcgaa 2400
gatagtcaag acttagaaat ctatttaatt cgttacaatg caaaacacga aatagtaaat 2460
gtaccaggta caggaagttt atggcctctt tctgtagaaa atcaaattgg accttgtgga 2520
gaaccgaatc gatgcgcgcc acaccttgaa tggaatcctg atttacactg ttcctgcaga 2580
gacggggaaa aatgtgcaca tcattctcat catttctctt tggacattga tgttggatgt 2640
acagacttaa atgaggactt aggtgtatgg gtgatattca agattaagac gcaagatggc 2700
cacgcacgac tagggaatct agagtttctc gaagagaaac cattattagg agaagcacta 2760
gctcgtgtga aaagagcgga gaaaaaatgg agagacaaac gcgaaacatt acaattggaa 2820
acaactatcg tttataaaga ggcaaaagaa tctgtagatg ctttatttgt aaactctcaa 2880
tatgatagat tacaagcgga tacgaacatc gcgatgattc atgcggcaga taaacgcgtt 2940
catagaattc gagaagcgta tctgccggag ctgtctgtga ttccgggtgt caatgcggct 3000
atttttgaag aattagaaga gcgtattttc actgcatttt ccctatatga tgcgagaaat 3060
attattaaaa atggcgattt caataatggc ttattatgct ggaacgtgaa agggcatgta 3120
gaggtagaag aacaaaacaa tcaccgttca gtcctggtta tcccagaatg ggaggcagaa 3180
gtgtcacaag aggttcgtgt ctgtccaggt cgtggctata tccttcgtgt tacagcgtac 3240
aaagagggat atggagaagg ttgcgtaacg atccatgaga tcgagaacaa tacagacgaa 3300
ctgaaattca acaactgtgt agaagaggaa gtatatccaa acaacacggt aacgtgtatt 3360
aattatactg cgactcaaga agaatatgag ggtacgtaca cttctcgtaa tcgaggatat 3420
gacgaagcct atggtaataa cccttccgta ccagctgatt atgcgtcagt ctatgaagaa 3480
aaatcgtata cagatagacg aagagagaat ccttgtgaat ctaacagagg atatggagat 3540
tacacaccac taccagctgg ttatgtaaca aaggaattag agtacttccc agagaccgat 3600
aaggtatgga ttgagattgg agaaacagaa ggaacattca tcgtggacag cgtggaatta 3660
ctccttatgg aggaatag 3678
<210> 32
<211> 3678
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC868_10的合成核苷酸序列。
<400> 32
atgacgagca accggaagaa cgagaacgag atcatcaacg ccctctcgat ccctgctgtt 60
tcaaaccact ccgcgcagat gaacctgtcc accgacgcgc gcatcgagga ctccctctgc 120
atagccgagg gcaacaacat cgacccattc gtgtcggcca gcacggttca gaccggcatc 180
aacatcgcgg gccgtatcct cggcgtcctc ggtgtcccat tcgccggtca gatcgcgtcc 240
ttctactcgt tccttgtggg cgagctgtgg cctcgcggtc gtgacccgtg ggagatcttc 300
ctggagcatg tggagcagtt gatccggcag caagtcacgg agaacacccg cgatactgct 360
ctggccaggc tacagggcct gggaaactcc tttcgggcat accagcagtc actggaggac 420
tggttggaga acagggatga cgcgcgaaca cgctcggtac tctacaccca gtacatcgct 480
ctcgaactcg acttcctgaa cgctatgccg ctgttcgcca tcaggaacca ggaagttcca 540
ctccttatgg tgtacgccca ggccgccaac ttacatctgc tcctgctgcg ggacgccagc 600
ctgttcggct ccgagttcgg actcacatct caagaaatcc agcgttacta cgagcgccaa 660
gtggagaaga cccgtgagta cagtgactac tgcgctcgat ggtacaacac agggctcaac 720
aacctgcgcg gcaccaacgc tgagtcatgg ctccgttaca accagttccg ccgcgacttg 780
actttgggtg tcctagacct ggtggcgcta ttcccgtctt acgacacacg ggtgtaccca 840
atgaacacta gcgcgcaact cacgcgggag atctacacag acccaatcgg ccggacgaac 900
gcaccctccg gtttcgcatc cacgaattgg ttcaacaaca acgcaccctc cttctcggca 960
atcgaggccg ccgtcatccg ccctcctcac ctgctcgact ttcccgagca gctcacgatc 1020
ttctccgtgc tctcacgctg gtccaacaca cagtacatga actactgggt cgggcaccga 1080
ttggagagta ggacgatccg tggcagcttg agcaccagta cccacggcaa caccaacacc 1140
tccatcaacc cagttacgct acagttcacg agccgcgacg tttaccggac tgagtcgttc 1200
gcgggcatta acatccttct gacaacgccc gtcaacggcg tcccgtgggc ccggttcaac 1260
tggcgtaacc cgttgaactc cctgcgcggg tcattgctct acaccatcgg gtacacgggc 1320
gtcggcaccc agctcttcga cagtgaaact gagctgccgc ccgagaccac ggaacgcccg 1380
aactacgagt cctacagcca ccgcctgtcc aacatccggc tcatctctgg caacacgctg 1440
cgtgcgccgg tgtactcctg gacacaccgc agcgccgacc ggaccaacac gatctcttcc 1500
gactccatta accagatccc gctcgtgaag ggcttccgtg tgtggggtgg cacgagcgtc 1560
atcaccggtc cgggcttcac cggtggagac atactgcggc gcaacacttt cggcgacttc 1620
gtttcgttgc aagtgaacat caactcgccg atcacccagc gttaccgtct gaggttccgc 1680
tacgcttcaa gccgcgacgc gagggtcatt gtcctgaccg gagccgcgtc cacaggcgtg 1740
ggaggccaag tctcagtcaa catgcctctc cagaagacga tggagatagg cgagaacttg 1800
actagccgaa ccttccggta cactgatttc tcgaaccctt tctcattcag agcgaaccct 1860
gacatcattg ggatctccga gcaaccgctg ttcggtgctg gctccatcag ctctggcgaa 1920
ctgtacatcg acaagattga gatcatcctg gcggatgcga cgttcgaggc cgagtacgac 1980
cttgagcgcg cccagaaggt ggtgaacgcc ctcttcacta gcactaacca gctaggcctg 2040
aagactgacg tgaccgacta ccacatcgac caagtgagca acctagtggc ctgcctctcc 2100
gacgagttct gcctcgacga gaagcgcgag ctgtccgaga aggtgaagca cgccaagcgc 2160
ctctccgacg agcgcaacct gctccaggac cccaacttca ggggcatcaa caggcagccc 2220
gaccgcggct ggcgcggctc caccgacatc accatccagg gcggtgacga cgtattcaag 2280
gagaactacg ttaccctccc cggcaccttc gacgagtgtt accccaccta cctctaccag 2340
aagatcgacg agtccaagct gaaggcctac acccgctacc agctccgcgg ctacatcgag 2400
gactcccagg acctggaaat ctacctcatc cgctacaacg ccaagcacga gatcgtgaac 2460
gtgcctggca ccggcagcct ctggcctctc agcgtggaga accagatcgg cccttgcggc 2520
gagcctaacc gctgcgcccc tcacctcgag tggaaccctg acctccactg ctcgtgcagg 2580
gacggcgaga agtgcgccca ccatagccac cacttctctc tggacatcga cgtgggctgc 2640
accgacctga acgaggacct gggcgtgtgg gttatcttca agatcaagac ccaggacggt 2700
cacgccaggc tgggtaacct ggagttcctt gaggaaaagc ctctgctggg tgaggccctg 2760
gccagggtca agagggctga gaagaaatgg agggataaga gggagaccct gcagctggag 2820
accactatcg tctacaagga ggctaaggag tctgtcgatg ctctgttcgt caactctcag 2880
tacgatagac tgcaagctga taccaacatc gctatgatcc acgctgcgga taagcgggtc 2940
caccggatcc gggaggctta ccttccggag ctttctgtca tcccgggtgt caacgctgcg 3000
atcttcgagg aacttgagga acggatcttc actgcgttta gtctttacga tgcgcggaac 3060
atcatcaaga acggggactt caacaatggt ctgctgtgct ggaacgtcaa gggtcatgtc 3120
gaggtcgagg aacaaaacaa tcatcgtagt gtccttgtca ttcctgagtg ggaggcggag 3180
gtctctcaag aggtccgtgt ttgcccgggg cgtgggtaca ttcttcgtgt tactgcgtac 3240
aaggaggggt acggggaggg gtgcgttact attcatgaga ttgagaacaa tactgatgag 3300
cttaagttca acaattgtgt tgaggaggag gtttacccga acaatactgt tacgtgcatc 3360
aactacacgg caacgcaaga ggaatacgag gggacgtaca cctcgcgtaa tagagggtat 3420
gatgaggcgt acggaaacaa cccgtcggtt ccagcagatt atgcctcggt ttatgaggag 3480
aagtcgtaca cggatagacg acgcgagaat ccatgtgagt caaatcgagg atacggagat 3540
tacacaccat taccagcagg atacgttaca aaggagttgg aatacttccc ggaaacagat 3600
aaagtttgga ttgaaatcgg agaaacagaa ggaacattca tcgtcgactc agtagaattg 3660
ttgttgatgg aagaatga 3678
<210> 33
<211> 1225
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC868_10的氨基酸序列。
<400> 33
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Asn Gln Ile Pro Leu Val Lys Gly Phe
500 505 510
Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln
530 535 540
Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg
545 550 555 560
Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala
565 570 575
Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys
580 585 590
Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
595 600 605
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly
610 615 620
Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
625 630 635 640
Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Phe Glu
645 650 655
Ala Glu Tyr Asp Leu Glu Arg Ala Gln Lys Val Val Asn Ala Leu Phe
660 665 670
Thr Ser Thr Asn Gln Leu Gly Leu Lys Thr Asp Val Thr Asp Tyr His
675 680 685
Ile Asp Gln Val Ser Asn Leu Val Ala Cys Leu Ser Asp Glu Phe Cys
690 695 700
Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg
705 710 715 720
Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe Arg Gly Ile
725 730 735
Asn Arg Gln Pro Asp Arg Gly Trp Arg Gly Ser Thr Asp Ile Thr Ile
740 745 750
Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly
755 760 765
Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile Asp Glu
770 775 780
Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Gln Leu Arg Gly Tyr Ile Glu
785 790 795 800
Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala Lys His
805 810 815
Glu Ile Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Val
820 825 830
Glu Asn Gln Ile Gly Pro Cys Gly Glu Pro Asn Arg Cys Ala Pro His
835 840 845
Leu Glu Trp Asn Pro Asp Leu His Cys Ser Cys Arg Asp Gly Glu Lys
850 855 860
Cys Ala His His Ser His His Phe Ser Leu Asp Ile Asp Val Gly Cys
865 870 875 880
Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile Lys
885 890 895
Thr Gln Asp Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu
900 905 910
Lys Pro Leu Leu Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys
915 920 925
Lys Trp Arg Asp Lys Arg Glu Thr Leu Gln Leu Glu Thr Thr Ile Val
930 935 940
Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gln
945 950 955 960
Tyr Asp Arg Leu Gln Ala Asp Thr Asn Ile Ala Met Ile His Ala Ala
965 970 975
Asp Lys Arg Val His Arg Ile Arg Glu Ala Tyr Leu Pro Glu Leu Ser
980 985 990
Val Ile Pro Gly Val Asn Ala Ala Ile Phe Glu Glu Leu Glu Glu Arg
995 1000 1005
Ile Phe Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn Ile Ile Lys
1010 1015 1020
Asn Gly Asp Phe Asn Asn Gly Leu Leu Cys Trp Asn Val Lys Gly
1025 1030 1035
His Val Glu Val Glu Glu Gln Asn Asn His Arg Ser Val Leu Val
1040 1045 1050
Ile Pro Glu Trp Glu Ala Glu Val Ser Gln Glu Val Arg Val Cys
1055 1060 1065
Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr Ala Tyr Lys Glu Gly
1070 1075 1080
Tyr Gly Glu Gly Cys Val Thr Ile His Glu Ile Glu Asn Asn Thr
1085 1090 1095
Asp Glu Leu Lys Phe Asn Asn Cys Val Glu Glu Glu Val Tyr Pro
1100 1105 1110
Asn Asn Thr Val Thr Cys Ile Asn Tyr Thr Ala Thr Gln Glu Glu
1115 1120 1125
Tyr Glu Gly Thr Tyr Thr Ser Arg Asn Arg Gly Tyr Asp Glu Ala
1130 1135 1140
Tyr Gly Asn Asn Pro Ser Val Pro Ala Asp Tyr Ala Ser Val Tyr
1145 1150 1155
Glu Glu Lys Ser Tyr Thr Asp Arg Arg Arg Glu Asn Pro Cys Glu
1160 1165 1170
Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu Pro Ala Gly Tyr
1175 1180 1185
Val Thr Lys Glu Leu Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp
1190 1195 1200
Ile Glu Ile Gly Glu Thr Glu Gly Thr Phe Ile Val Asp Ser Val
1205 1210 1215
Glu Leu Leu Leu Met Glu Glu
1220 1225
<210> 34
<211> 3726
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC868_11的重组核苷酸序列。
<400> 34
atgacttcaa ataggaaaaa tgagaatgaa attataaatg ctttatcgat tccagctgta 60
tcgaatcatt ccgcacaaat gaatctatca accgatgctc gtattgagga tagcttgtgt 120
atagccgagg ggaacaatat cgatccattt gttagcgcat caacagtcca aacgggtatt 180
aacatagctg gtagaatact aggtgtatta ggcgtaccgt ttgctggaca aatagctagt 240
ttttatagtt ttcttgttgg tgaattatgg ccccgcggca gagatccttg ggaaattttc 300
ctagaacatg tcgaacaact tataagacaa caagtaacag aaaatactag ggatacggct 360
cttgctcgat tacaaggttt aggaaattcc tttagagcct atcaacagtc acttgaagat 420
tggctagaaa accgtgatga tgcaagaacg agaagtgttc tttataccca atatatagcc 480
ttagaacttg attttcttaa tgcgatgccg cttttcgcaa ttagaaacca agaagttcca 540
ttattaatgg tatatgctca agctgcaaat ttacacctat tattattgag agatgcctct 600
ctttttggta gtgaatttgg gcttacatcc caagaaattc aacgttatta tgagcgccaa 660
gtggaaaaaa cgagagaata ttctgattat tgcgcaagat ggtataatac gggtttaaat 720
aatttgagag ggacaaatgc tgaaagttgg ttgcgatata atcaattccg tagagactta 780
acgctaggag tattagatct agtggcacta ttcccaagct atgacacgcg tgtttatcca 840
atgaatacca gtgctcaatt aacaagagaa atttatacag atccaattgg gagaacaaat 900
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960
atagaggctg ccgttattag gcctccgcat ctacttgatt ttccagaaca gcttacaatt 1020
ttcagcgtat taagtcgatg gagtaatact caatatatga attactgggt gggacataga 1080
cttgaatcgc gaacaataag ggggtcatta agtacctcga cacacggaaa taccaatact 1140
tctattaatc ctgtaacatt acagttcaca tctcgagacg tttatagaac agaatcattt 1200
gcagggataa atatacttct aactactcct gtgaatggag taccttgggc tagatttaat 1260
tggagaaatc ccctgaattc tcttagaggt agccttctct atactatagg gtatactgga 1320
gtggggacac aactatttga ttcagaaact gaattaccac cagaaacaac agaacgacca 1380
aattatgaat cttacagtca tagattatct aatataagac taatatcagg aaacactttg 1440
agagcaccag tatattcttg gacgcaccgt agtgcagatc gtacaaatac cattagttca 1500
gatagcatta atcaaatacc tttagtgaaa ggatttagag tttggggggg cacctctgtc 1560
attacaggac caggatttac aggaggggat atccttcgaa gaaatacctt tggtgatttt 1620
gtatctctac aagtcaatat taattcacca attacccaaa gataccgttt aagatttcgt 1680
tacgcttcca gtagggatgc acgagttata gtattaacag gagcggcatc cacaggagtg 1740
ggaggccaag ttagtgtaaa tatgcctctt cagaaaacta tggaaatagg ggagaactta 1800
acatctagaa catttagata taccgatttt agtaatcctt tttcatttag agctaatcca 1860
gatataattg ggataagtga acaacctcta tttggtgcag gttctattag tagcggtgaa 1920
ctttatatag ataaaattga aattattcta gcagatgcaa caggaacgac aacctatgag 1980
tatgaagaga agcagaatct agaaaaagcg cagaaagcgt tgaacgcttt gtttacggat 2040
ggcacgaatg gctatctaca aatggatgcc actgattatg atatcaatca aactgcaaac 2100
ttaatagaat gtgtatcaga tgaattgtat gcaaaagaaa agatagtttt attagatgaa 2160
gtcaaatatg cgaagcggct tagcatatca cgtaacctac ttttgaacga tgatttagaa 2220
ttttcagatg gatttggaga aaacggatgg acgacaagtg ataatatttc aatccaggcg 2280
gataatcccc tttttaaggg gaattattta aaaatgtttg gggcaagaga tattgatgga 2340
accctatttc caacttatct ctatcaaaaa atagatgagt ccaggttaaa accatataca 2400
cgttatcgag taagagggtt tgtgggaagt agtaaaaatc taaaattagt ggtaacacgc 2460
tatgagaaag aaattgatgc cattatgaat gttccaaatg atttggcaca tatgcagctt 2520
aacccttcat gtggagatta tcgctgtgaa tcatcgtccc agtttttggt gaaccaagtg 2580
catcctacac caacagctgg atatgctctt gatatgtatg catgcccgtc aagttcagat 2640
aaaaaacata ttatgtgtca cgatcgtcat ccatttgatt ttcatattga caccggagaa 2700
ttaaatccaa acacaaacct gggtattgat gtcttgttta aaatttctaa tccaaatgga 2760
tacgctacat tagggaatct agaagtcatt gaagaaggac cactaacaga tgaagcattg 2820
gtacatgtaa aacaaaagga aaagaaatgg cgtcagcaca tggagaaaaa acgaatggaa 2880
acacaacaag cctatgatcc agcaaaacaa gctgtagatg cattatttac aaatgaacaa 2940
gagttagact atcatactac tttagatcat attcagaacg ccgatcagct ggtacaggcg 3000
attccctatg tacaccatgc ttggttaccg gatgctccag gtatgaacta tgatgtatat 3060
caagggttaa acgcacgtat catgcaggcg tacaatttat atgatgcacg aaatgtcata 3120
ataaatggtg actttacaca aggactacaa ggatggcacg caacaggaaa agcagcggta 3180
caacaaatag atggagcttc agtattagtt ctatcaaact ggagtgccga ggtatctcag 3240
aatctgcatg cccaagatca tcatggatat atgttacgtg tgattgccaa aaaagaaggt 3300
cctggaaaag ggtatgtaat gatgatggat tttaatggaa agcaggaaac acttacgttc 3360
acttcttgtg aagaaggata tataacaaaa acaatagagg tattcccgga aagtgatcga 3420
atacgaattg aaatgggaga aacagagggt acgttttatg tagatagcat cgagttgctt 3480
tgtatgcaag gatatgctag cgataataac ccgcacacgg gtaatatgta tgagcaaagt 3540
tataatggaa attataatca aaatactagc gatgtgtatc accaaggata tataaacaac 3600
tataaccaaa attctagtag tatgtataat caaaattata ttaacaatga tgacctgcat 3660
tccggttgca catgtaacca agggcataac tctggctgta catgtaatca aggatataac 3720
cgttag 3726
<210> 35
<211> 3726
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC868_11的合成核苷酸序列。
<400> 35
atgacgagca accggaagaa cgagaacgag atcatcaacg ccctctcgat ccctgctgtt 60
tcaaaccact ccgcgcagat gaacctgtcc accgacgcgc gcatcgagga ctccctctgc 120
atagccgagg gcaacaacat cgacccattc gtgtcggcca gcacggttca gaccggcatc 180
aacatcgcgg gccgtatcct cggcgtcctc ggtgtcccat tcgccggtca gatcgcgtcc 240
ttctactcgt tccttgtggg cgagctgtgg cctcgcggtc gtgacccgtg ggagatcttc 300
ctggagcatg tggagcagtt gatccggcag caagtcacgg agaacacccg cgatactgct 360
ctggccaggc tacagggcct gggaaactcc tttcgggcat accagcagtc actggaggac 420
tggttggaga acagggatga cgcgcgaaca cgctcggtac tctacaccca gtacatcgct 480
ctcgaactcg acttcctgaa cgctatgccg ctgttcgcca tcaggaacca ggaagttcca 540
ctccttatgg tgtacgccca ggccgccaac ttacatctgc tcctgctgcg ggacgccagc 600
ctgttcggct ccgagttcgg actcacatct caagaaatcc agcgttacta cgagcgccaa 660
gtggagaaga cccgtgagta cagtgactac tgcgctcgat ggtacaacac agggctcaac 720
aacctgcgcg gcaccaacgc tgagtcatgg ctccgttaca accagttccg ccgcgacttg 780
actttgggtg tcctagacct ggtggcgcta ttcccgtctt acgacacacg ggtgtaccca 840
atgaacacta gcgcgcaact cacgcgggag atctacacag acccaatcgg ccggacgaac 900
gcaccctccg gtttcgcatc cacgaattgg ttcaacaaca acgcaccctc cttctcggca 960
atcgaggccg ccgtcatccg ccctcctcac ctgctcgact ttcccgagca gctcacgatc 1020
ttctccgtgc tctcacgctg gtccaacaca cagtacatga actactgggt cgggcaccga 1080
ttggagagta ggacgatccg tggcagcttg agcaccagta cccacggcaa caccaacacc 1140
tccatcaacc cagttacgct acagttcacg agccgcgacg tttaccggac tgagtcgttc 1200
gcgggcatta acatccttct gacaacgccc gtcaacggcg tcccgtgggc ccggttcaac 1260
tggcgtaacc cgttgaactc cctgcgcggg tcattgctct acaccatcgg gtacacgggc 1320
gtcggcaccc agctcttcga cagtgaaact gagctgccgc ccgagaccac ggaacgcccg 1380
aactacgagt cctacagcca ccgcctgtcc aacatccggc tcatctctgg caacacgctg 1440
cgtgcgccgg tgtactcctg gacacaccgc agcgccgacc ggaccaacac gatctcttcc 1500
gactccatta accagatccc gctcgtgaag ggcttccgtg tgtggggtgg cacgagcgtc 1560
atcaccggtc cgggcttcac cggtggagac atactgcggc gcaacacttt cggcgacttc 1620
gtttcgttgc aagtgaacat caactcgccg atcacccagc gttaccgtct gaggttccgc 1680
tacgcttcaa gccgcgacgc gagggtcatt gtcctgaccg gagccgcgtc cacaggcgtg 1740
ggaggccaag tctcagtcaa catgcctctc cagaagacga tggagatagg cgagaacttg 1800
actagccgaa ccttccggta cactgatttc tcgaaccctt tctcattcag agcgaaccct 1860
gacatcattg ggatctccga gcaaccgctg ttcggtgctg gctccatcag ctctggcgaa 1920
ctgtacatcg acaagattga gatcatcctg gcggatgcga cggggactac cacctacgag 1980
tacgaggaga agcagaatct cgagaaggct cagaaggctc tgaacgctct gttcactgac 2040
gggaccaacg gctacctcca gatggacgcc actgactacg acatcaacca gacagctaac 2100
ctgattgagt gtgtgagtga cgaactgtac gctaaggaga agatcgtact cctggacgag 2160
gtgaagtacg ctaagcgcct gagcattagc cgtaacctgc tgctgaacga cgatctggag 2220
ttcagcgacg gctttggcga gaacggctgg accaccagcg acaacatctc catccaggcc 2280
gacaatccac tcttcaaagg caactacctc aagatgttcg gagccaggga catcgacggc 2340
accctctttc cgacctacct ctaccagaag atcgacgagt cccgcctcaa accctacacc 2400
cgctacaggg tgcgcggctt cgtgggcagc agcaagaacc tcaagctcgt ggtcacacgg 2460
tatgagaagg agatcgacgc catcatgaac gtgcccaacg atctcgccca catgcagctc 2520
aatccatcct gcggcgacta ccggtgcgag tccagctccc agttcctcgt gaaccaggtg 2580
caccctactc cgaccgctgg ctatgccctg gacatgtacg cctgccctag ttcctccgac 2640
aagaagcaca tcatgtgcca cgaccgtcat ccgttcgact tccacatcga caccggcgaa 2700
ctgaacccga acaccaacct gggcatcgac gtactgttca agatttccaa cccgaacggg 2760
tacgccacct tgggcaacct ggaggtcatc gaagaaggcc cgctgaccga cgaggccctg 2820
gtccacgtca aacagaagga gaagaagtgg cggcagcaca tggagaagaa gcggatggag 2880
actcaacaag cctacgaccc ggccaagcaa gctgtggacg ctctgttcac caacgagcaa 2940
gagcttgact accacactac tcttgaccac atccagaatg ctgaccagct tgtccaggct 3000
attccgtacg tccaccacgc ttggctaccg gacgctccag ggatgaacta cgatgtgtac 3060
cagggtctga acgcgcggat catgcaagcg tacaacctgt acgacgcgcg taacgtcatc 3120
atcaacggtg acttcactca gggtcttcaa ggttggcacg cgactggcaa agcggcagtc 3180
cagcagattg atggtgcgtc tgttcttgtg ttgagcaact ggtctgcgga ggtttctcag 3240
aacctgcacg cacaggatca ccacggctac atgctgaggg tgattgctaa gaaggagggc 3300
cctggcaaag gctacgtcat gatgatggac ttcaacggaa agcaagaaac cctgaccttc 3360
actagctgtg aggagggcta catcactaag accattgagg tctttccgga gtctgaccgc 3420
atccggatcg agatgggcga gaccgaaggc acgttctacg tggactccat cgaactcctc 3480
tgcatgcaag gctacgcctc cgacaacaac ccacacacgg gcaacatgta cgagcagtcc 3540
tacaacggga actacaacca gaacacctcc gatgtgtacc atcagggcta catcaacaac 3600
tacaaccaga acagcagcag catgtacaac cagaactaca tcaacaacga tgacttgcac 3660
tcgggttgca cctgcaacca gggtcacaac agtgggtgca cgtgcaacca gggatacaac 3720
cgttga 3726
<210> 36
<211> 1241
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC868_11的氨基酸序列。
<400> 36
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Asn Gln Ile Pro Leu Val Lys Gly Phe
500 505 510
Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln
530 535 540
Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg
545 550 555 560
Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala
565 570 575
Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys
580 585 590
Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
595 600 605
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly
610 615 620
Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
625 630 635 640
Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Gly Thr
645 650 655
Thr Thr Tyr Glu Tyr Glu Glu Lys Gln Asn Leu Glu Lys Ala Gln Lys
660 665 670
Ala Leu Asn Ala Leu Phe Thr Asp Gly Thr Asn Gly Tyr Leu Gln Met
675 680 685
Asp Ala Thr Asp Tyr Asp Ile Asn Gln Thr Ala Asn Leu Ile Glu Cys
690 695 700
Val Ser Asp Glu Leu Tyr Ala Lys Glu Lys Ile Val Leu Leu Asp Glu
705 710 715 720
Val Lys Tyr Ala Lys Arg Leu Ser Ile Ser Arg Asn Leu Leu Leu Asn
725 730 735
Asp Asp Leu Glu Phe Ser Asp Gly Phe Gly Glu Asn Gly Trp Thr Thr
740 745 750
Ser Asp Asn Ile Ser Ile Gln Ala Asp Asn Pro Leu Phe Lys Gly Asn
755 760 765
Tyr Leu Lys Met Phe Gly Ala Arg Asp Ile Asp Gly Thr Leu Phe Pro
770 775 780
Thr Tyr Leu Tyr Gln Lys Ile Asp Glu Ser Arg Leu Lys Pro Tyr Thr
785 790 795 800
Arg Tyr Arg Val Arg Gly Phe Val Gly Ser Ser Lys Asn Leu Lys Leu
805 810 815
Val Val Thr Arg Tyr Glu Lys Glu Ile Asp Ala Ile Met Asn Val Pro
820 825 830
Asn Asp Leu Ala His Met Gln Leu Asn Pro Ser Cys Gly Asp Tyr Arg
835 840 845
Cys Glu Ser Ser Ser Gln Phe Leu Val Asn Gln Val His Pro Thr Pro
850 855 860
Thr Ala Gly Tyr Ala Leu Asp Met Tyr Ala Cys Pro Ser Ser Ser Asp
865 870 875 880
Lys Lys His Ile Met Cys His Asp Arg His Pro Phe Asp Phe His Ile
885 890 895
Asp Thr Gly Glu Leu Asn Pro Asn Thr Asn Leu Gly Ile Asp Val Leu
900 905 910
Phe Lys Ile Ser Asn Pro Asn Gly Tyr Ala Thr Leu Gly Asn Leu Glu
915 920 925
Val Ile Glu Glu Gly Pro Leu Thr Asp Glu Ala Leu Val His Val Lys
930 935 940
Gln Lys Glu Lys Lys Trp Arg Gln His Met Glu Lys Lys Arg Met Glu
945 950 955 960
Thr Gln Gln Ala Tyr Asp Pro Ala Lys Gln Ala Val Asp Ala Leu Phe
965 970 975
Thr Asn Glu Gln Glu Leu Asp Tyr His Thr Thr Leu Asp His Ile Gln
980 985 990
Asn Ala Asp Gln Leu Val Gln Ala Ile Pro Tyr Val His His Ala Trp
995 1000 1005
Leu Pro Asp Ala Pro Gly Met Asn Tyr Asp Val Tyr Gln Gly Leu
1010 1015 1020
Asn Ala Arg Ile Met Gln Ala Tyr Asn Leu Tyr Asp Ala Arg Asn
1025 1030 1035
Val Ile Ile Asn Gly Asp Phe Thr Gln Gly Leu Gln Gly Trp His
1040 1045 1050
Ala Thr Gly Lys Ala Ala Val Gln Gln Ile Asp Gly Ala Ser Val
1055 1060 1065
Leu Val Leu Ser Asn Trp Ser Ala Glu Val Ser Gln Asn Leu His
1070 1075 1080
Ala Gln Asp His His Gly Tyr Met Leu Arg Val Ile Ala Lys Lys
1085 1090 1095
Glu Gly Pro Gly Lys Gly Tyr Val Met Met Met Asp Phe Asn Gly
1100 1105 1110
Lys Gln Glu Thr Leu Thr Phe Thr Ser Cys Glu Glu Gly Tyr Ile
1115 1120 1125
Thr Lys Thr Ile Glu Val Phe Pro Glu Ser Asp Arg Ile Arg Ile
1130 1135 1140
Glu Met Gly Glu Thr Glu Gly Thr Phe Tyr Val Asp Ser Ile Glu
1145 1150 1155
Leu Leu Cys Met Gln Gly Tyr Ala Ser Asp Asn Asn Pro His Thr
1160 1165 1170
Gly Asn Met Tyr Glu Gln Ser Tyr Asn Gly Asn Tyr Asn Gln Asn
1175 1180 1185
Thr Ser Asp Val Tyr His Gln Gly Tyr Ile Asn Asn Tyr Asn Gln
1190 1195 1200
Asn Ser Ser Ser Met Tyr Asn Gln Asn Tyr Ile Asn Asn Asp Asp
1205 1210 1215
Leu His Ser Gly Cys Thr Cys Asn Gln Gly His Asn Ser Gly Cys
1220 1225 1230
Thr Cys Asn Gln Gly Tyr Asn Arg
1235 1240
<210> 37
<211> 3468
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC868_12的重组核苷酸序列。
<400> 37
atgacttcaa ataggaaaaa tgagaatgaa attataaatg ctttatcgat tccagctgta 60
tcgaatcatt ccgcacaaat gaatctatca accgatgctc gtattgagga tagcttgtgt 120
atagccgagg ggaacaatat cgatccattt gttagcgcat caacagtcca aacgggtatt 180
aacatagctg gtagaatact aggtgtatta ggcgtaccgt ttgctggaca aatagctagt 240
ttttatagtt ttcttgttgg tgaattatgg ccccgcggca gagatccttg ggaaattttc 300
ctagaacatg tcgaacaact tataagacaa caagtaacag aaaatactag ggatacggct 360
cttgctcgat tacaaggttt aggaaattcc tttagagcct atcaacagtc acttgaagat 420
tggctagaaa accgtgatga tgcaagaacg agaagtgttc tttataccca atatatagcc 480
ttagaacttg attttcttaa tgcgatgccg cttttcgcaa ttagaaacca agaagttcca 540
ttattaatgg tatatgctca agctgcaaat ttacacctat tattattgag agatgcctct 600
ctttttggta gtgaatttgg gcttacatcc caagaaattc aacgttatta tgagcgccaa 660
gtggaaaaaa cgagagaata ttctgattat tgcgcaagat ggtataatac gggtttaaat 720
aatttgagag ggacaaatgc tgaaagttgg ttgcgatata atcaattccg tagagactta 780
acgctaggag tattagatct agtggcacta ttcccaagct atgacacgcg tgtttatcca 840
atgaatacca gtgctcaatt aacaagagaa atttatacag atccaattgg gagaacaaat 900
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960
atagaggctg ccgttattag gcctccgcat ctacttgatt ttccagaaca gcttacaatt 1020
ttcagcgtat taagtcgatg gagtaatact caatatatga attactgggt gggacataga 1080
cttgaatcgc gaacaataag ggggtcatta agtacctcga cacacggaaa taccaatact 1140
tctattaatc ctgtaacatt acagttcaca tctcgagacg tttatagaac agaatcattt 1200
gcagggataa atatacttct aactactcct gtgaatggag taccttgggc tagatttaat 1260
tggagaaatc ccctgaattc tcttagaggt agccttctct atactatagg gtatactgga 1320
gtggggacac aactatttga ttcagaaact gaattaccac cagaaacaac agaacgacca 1380
aattatgaat cttacagtca tagattatct aatataagac taatatcagg aaacactttg 1440
agagcaccag tatattcttg gacgcaccgt agtgcagatc gtacaaatac cattagttca 1500
gatagcatta atcaaatacc tttagtgaaa ggatttagag tttggggggg cacctctgtc 1560
attacaggac caggatttac aggaggggat atccttcgaa gaaatacctt tggtgatttt 1620
gtatctctac aagtcaatat taattcacca attacccaaa gataccgttt aagatttcgt 1680
tacgcttcca gtagggatgc acgagttata gtattaacag gagcggcatc cacaggagtg 1740
ggaggccaag ttagtgtaaa tatgcctctt cagaaaacta tggaaatagg ggagaactta 1800
acatctagaa catttagata taccgatttt agtaatcctt tttcatttag agctaatcca 1860
gatataattg ggataagtga acaacctcta tttggtgcag gttctattag tagcggtgaa 1920
ctttatatag ataaaattga aattattcta gcagatgcaa caaatccgac gcgagaggcg 1980
gaagaggatc tagaagcagc gaagaaagcg gtggcgagct tgtttacacg tacaagggac 2040
ggattacaag taaatgtgac agattatcaa gtcgatcaag cggcaaattt agtgtcatgc 2100
ttatcagatg aacaatatgg gcatgacaaa aagatgttat tggaagcggt aagagcggca 2160
aaacgcctca gccgagaacg caacttactt caggatccag attttaatac aatcaatagt 2220
acagaagaaa atggatggaa agcaagtaac ggcgttacta ttagcgaggg cggtccattc 2280
tataaaggcc gtgcgcttca gctagcaagc gcaagagaaa attacccaac atacatttat 2340
caaaaagtaa atgcatcaga gttaaagccg tatacacgtt atagactgga tgggttcgtg 2400
aagagtagtc aagatttaga aattgatctc attcaccatc ataaagtcca tctcgtgaaa 2460
aatgtaccag ataatttagt atccgatact tactcggatg gttcttgcag tggaatgaat 2520
cgatgtgagg aacaacagat ggtaaatgcg caactggaaa cagaacatca tcatccgatg 2580
gattgctgtg aagcggctca aacacatgag ttttcttcct atattaatac aggcgatcta 2640
aattcaagtg tagatcaagg catttgggtt gtattgaaag ttcgaacaac cgatggttat 2700
gcgacgctag gaaatcttga attggtagag gtcggaccgt tatcgggtga atctctagaa 2760
cgtgaacaaa gggataatgc gaaatggagt gcagagctag gaagaaagcg tgcagaaaca 2820
gatcgcgtgt atcaagatgc caaacaatcc atcaatcatt tatttgtgga ttatcaagat 2880
caacaattaa atccagaaat agggatggca gatattattg acgctcaaaa tcttgtcgca 2940
tcaatttcag atgtgtatag cgatgcagta ctgcaaatcc ctggaattaa ctatgagatt 3000
tacacagagc tatccaatcg cttacaacaa gcatcgtatc tgtatacgtc tcgaaatgcg 3060
gtgcaaaatg gggactttaa cagcggtcta gatagttgga atgcaacagg gggggctacg 3120
gtacaacagg atggcaatac gcatttctta gttctttctc attgggatgc acaagtttct 3180
caacaattta gagtgcagcc gaattgtaaa tatgtattac gtgtaacagc agagaaagta 3240
ggcggcggag acggatacgt gacaatccgg gatggtgctc atcatacaga aaagcttaca 3300
tttaatgcat gtgattatga tataaatggc acgtacgtga ctgataatac gtatctaaca 3360
aaagaagtgg tattctattc acatacagaa cacatgtggg tagaggtaag tgaaacagaa 3420
ggtgcatttc atatagatag tattgaattc gttgaaacag aaaagtag 3468
<210> 38
<211> 3468
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC868_12的合成核苷酸序列。
<400> 38
atgacgagca accggaagaa cgagaacgag atcatcaacg ccctctcgat ccctgctgtt 60
tcaaaccact ccgcgcagat gaacctgtcc accgacgcgc gcatcgagga ctccctctgc 120
atagccgagg gcaacaacat cgacccattc gtgtcggcca gcacggttca gaccggcatc 180
aacatcgcgg gccgtatcct cggcgtcctc ggtgtcccat tcgccggtca gatcgcgtcc 240
ttctactcgt tccttgtggg cgagctgtgg cctcgcggtc gtgacccgtg ggagatcttc 300
ctggagcatg tggagcagtt gatccggcag caagtcacgg agaacacccg cgatactgct 360
ctggccaggc tacagggcct gggaaactcc tttcgggcat accagcagtc actggaggac 420
tggttggaga acagggatga cgcgcgaaca cgctcggtac tctacaccca gtacatcgct 480
ctcgaactcg acttcctgaa cgctatgccg ctgttcgcca tcaggaacca ggaagttcca 540
ctccttatgg tgtacgccca ggccgccaac ttacatctgc tcctgctgcg ggacgccagc 600
ctgttcggct ccgagttcgg actcacatct caagaaatcc agcgttacta cgagcgccaa 660
gtggagaaga cccgtgagta cagtgactac tgcgctcgat ggtacaacac agggctcaac 720
aacctgcgcg gcaccaacgc tgagtcatgg ctccgttaca accagttccg ccgcgacttg 780
actttgggtg tcctagacct ggtggcgcta ttcccgtctt acgacacacg ggtgtaccca 840
atgaacacta gcgcgcaact cacgcgggag atctacacag acccaatcgg ccggacgaac 900
gcaccctccg gtttcgcatc cacgaattgg ttcaacaaca acgcaccctc cttctcggca 960
atcgaggccg ccgtcatccg ccctcctcac ctgctcgact ttcccgagca gctcacgatc 1020
ttctccgtgc tctcacgctg gtccaacaca cagtacatga actactgggt cgggcaccga 1080
ttggagagta ggacgatccg tggcagcttg agcaccagta cccacggcaa caccaacacc 1140
tccatcaacc cagttacgct acagttcacg agccgcgacg tttaccggac tgagtcgttc 1200
gcgggcatta acatccttct gacaacgccc gtcaacggcg tcccgtgggc ccggttcaac 1260
tggcgtaacc cgttgaactc cctgcgcggg tcattgctct acaccatcgg gtacacgggc 1320
gtcggcaccc agctcttcga cagtgaaact gagctgccgc ccgagaccac ggaacgcccg 1380
aactacgagt cctacagcca ccgcctgtcc aacatccggc tcatctctgg caacacgctg 1440
cgtgcgccgg tgtactcctg gacacaccgc agcgccgacc ggaccaacac gatctcttcc 1500
gactccatta accagatccc gctcgtgaag ggcttccgtg tgtggggtgg cacgagcgtc 1560
atcaccggtc cgggcttcac cggtggagac atactgcggc gcaacacttt cggcgacttc 1620
gtttcgttgc aagtgaacat caactcgccg atcacccagc gttaccgtct gaggttccgc 1680
tacgcttcaa gccgcgacgc gagggtcatt gtcctgaccg gagccgcgtc cacaggcgtg 1740
ggaggccaag tctcagtcaa catgcctctc cagaagacga tggagatagg cgagaacttg 1800
actagccgaa ccttccggta cactgatttc tcgaaccctt tctcattcag agcgaaccct 1860
gacatcattg ggatctccga gcaaccgctg ttcggtgctg gctccatcag ctctggcgaa 1920
ctgtacatcg acaagattga gatcatcctg gcggatgcga cgaacccgac gcgggaagct 1980
gaggaagact tggaagccgc caagaaagcg gtcgccagcc tgtttactcg gacgcgggac 2040
gggctccaag tgaatgtgac ggactatcaa gtggatcagg ccgctaacct cgtgtcatgc 2100
ctgagcgacg agcagtacgg tcacgacaag aaaatgctgc tggaggccgt ccgggccgcc 2160
aagcggctgt ccagggagcg taacctgcta caagatcccg actttaacac gatcaacagc 2220
acagaggaga atggctggaa ggccagcaac ggagttacga taagcgaggg cggtccgttc 2280
tacaagggtc gtgccctcca gctcgcctct gcaagggaga actatccaac ctacatctat 2340
cagaaggtga acgcatccga gcttaagccc tacacacgct accgcctgga cgggttcgtt 2400
aagtccagtc aagacctaga gatagacctc atccaccacc acaaagtgca tctggtcaag 2460
aacgttcccg ataatctcgt gagcgatacc tactcagacg gctcatgctc tggcatgaac 2520
agatgtgagg agcaacagat ggttaatgct caactcgaaa ccgagcatca tcatcctatg 2580
gattgctgcg aggccgcgca gacccatgag ttcagctctt acatcaacac cggagacctc 2640
aacagtagcg tggatcaggg aatttgggtg gtgcttaaag tgcgtacaac cgacggctac 2700
gccaccctcg gcaaccttga gcttgtcgag gtcggaccac ttagcggcga gtccctggaa 2760
cgtgagcagc gggacaacgc caaatggagc gcagagctag ggcgcaaacg cgcggagacg 2820
gaccgggttt atcaggacgc gaagcagtcc atcaatcacc tcttcgtgga ttatcaggac 2880
cagcagctta atccagagat cggcatggcc gacatcatcg acgcccagaa cctagtagcg 2940
tcgatttccg atgtctattc cgacgccgtg cttcaaatac ctggcatcaa ctacgagatc 3000
tacacagagt tgtccaacag gctccagcaa gcgtcatacc tctacaccag ccgcaacgcc 3060
gtccagaatg gcgacttcaa ttccggacta gactcctgga acgccacggg cggagctacg 3120
gtgcaacaag acggcaacac ccacttcctc gtacttagcc actgggacgc tcaagtgagt 3180
cagcaattcc gggttcagcc gaactgcaag tacgtcctgc gcgtaacggc cgagaaggtt 3240
ggaggcggag acggctacgt taccatccgc gacggcgctc accacaccga gaaactgacg 3300
ttcaacgctt gtgactacga catcaacggc acttacgtga cggacaacac ctacctgacg 3360
aaggaggtgg tgttctattc tcacaccgag cacatgtggg ttgaggtcag cgagaccgag 3420
ggagccttcc acattgacag catcgagttc gtggagactg agaagtga 3468
<210> 39
<211> 1155
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC868_12的氨基酸序列。
<400> 39
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Asn Gln Ile Pro Leu Val Lys Gly Phe
500 505 510
Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln
530 535 540
Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg
545 550 555 560
Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala
565 570 575
Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys
580 585 590
Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
595 600 605
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly
610 615 620
Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
625 630 635 640
Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Asn Pro
645 650 655
Thr Arg Glu Ala Glu Glu Asp Leu Glu Ala Ala Lys Lys Ala Val Ala
660 665 670
Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gln Val Asn Val Thr Asp
675 680 685
Tyr Gln Val Asp Gln Ala Ala Asn Leu Val Ser Cys Leu Ser Asp Glu
690 695 700
Gln Tyr Gly His Asp Lys Lys Met Leu Leu Glu Ala Val Arg Ala Ala
705 710 715 720
Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gln Asp Pro Asp Phe Asn
725 730 735
Thr Ile Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser Asn Gly Val
740 745 750
Thr Ile Ser Glu Gly Gly Pro Phe Tyr Lys Gly Arg Ala Leu Gln Leu
755 760 765
Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile Tyr Gln Lys Val Asn
770 775 780
Ala Ser Glu Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp Gly Phe Val
785 790 795 800
Lys Ser Ser Gln Asp Leu Glu Ile Asp Leu Ile His His His Lys Val
805 810 815
His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser Asp Thr Tyr Ser
820 825 830
Asp Gly Ser Cys Ser Gly Met Asn Arg Cys Glu Glu Gln Gln Met Val
835 840 845
Asn Ala Gln Leu Glu Thr Glu His His His Pro Met Asp Cys Cys Glu
850 855 860
Ala Ala Gln Thr His Glu Phe Ser Ser Tyr Ile Asn Thr Gly Asp Leu
865 870 875 880
Asn Ser Ser Val Asp Gln Gly Ile Trp Val Val Leu Lys Val Arg Thr
885 890 895
Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu Val Gly
900 905 910
Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg Asp Asn Ala Lys
915 920 925
Trp Ser Ala Glu Leu Gly Arg Lys Arg Ala Glu Thr Asp Arg Val Tyr
930 935 940
Gln Asp Ala Lys Gln Ser Ile Asn His Leu Phe Val Asp Tyr Gln Asp
945 950 955 960
Gln Gln Leu Asn Pro Glu Ile Gly Met Ala Asp Ile Ile Asp Ala Gln
965 970 975
Asn Leu Val Ala Ser Ile Ser Asp Val Tyr Ser Asp Ala Val Leu Gln
980 985 990
Ile Pro Gly Ile Asn Tyr Glu Ile Tyr Thr Glu Leu Ser Asn Arg Leu
995 1000 1005
Gln Gln Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val Gln Asn
1010 1015 1020
Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Ala Thr Gly Gly
1025 1030 1035
Ala Thr Val Gln Gln Asp Gly Asn Thr His Phe Leu Val Leu Ser
1040 1045 1050
His Trp Asp Ala Gln Val Ser Gln Gln Phe Arg Val Gln Pro Asn
1055 1060 1065
Cys Lys Tyr Val Leu Arg Val Thr Ala Glu Lys Val Gly Gly Gly
1070 1075 1080
Asp Gly Tyr Val Thr Ile Arg Asp Gly Ala His His Thr Glu Lys
1085 1090 1095
Leu Thr Phe Asn Ala Cys Asp Tyr Asp Ile Asn Gly Thr Tyr Val
1100 1105 1110
Thr Asp Asn Thr Tyr Leu Thr Lys Glu Val Val Phe Tyr Ser His
1115 1120 1125
Thr Glu His Met Trp Val Glu Val Ser Glu Thr Glu Gly Ala Phe
1130 1135 1140
His Ile Asp Ser Ile Glu Phe Val Glu Thr Glu Lys
1145 1150 1155
<210> 40
<211> 3732
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC868_13的合成核苷酸序列。
<400> 40
atgacgagca accggaagaa cgagaacgag atcatcaacg ccctctcgat ccctgctgtt 60
tcaaaccact ccgcgcagat gaacctgtcc accgacgcgc gcatcgagga ctccctctgc 120
atagccgagg gcaacaacat cgacccattc gtgtcggcca gcacggttca gaccggcatc 180
aacatcgcgg gccgtatcct cggcgtcctc ggtgtcccat tcgccggtca gatcgcgtcc 240
ttctactcgt tccttgtggg cgagctgtgg cctcgcggtc gtgacccgtg ggagatcttc 300
ctggagcatg tggagcagtt gatccggcag caagtcacgg agaacacccg cgatactgct 360
ctggccaggc tacagggcct gggaaactcc tttcgggcat accagcagtc actggaggac 420
tggttggaga acagggatga cgcgcgaaca cgctcggtac tctacaccca gtacatcgct 480
ctcgaactcg acttcctgaa cgctatgccg ctgttcgcca tcaggaacca ggaagttcca 540
ctccttatgg tgtacgccca ggccgccaac ttacatctgc tcctgctgcg ggacgccagc 600
ctgttcggct ccgagttcgg actcacatct caagaaatcc agcgttacta cgagcgccaa 660
gtggagaaga cccgtgagta cagtgactac tgcgctcgat ggtacaacac agggctcaac 720
aacctgcgcg gcaccaacgc tgagtcatgg ctccgttaca accagttccg ccgcgacttg 780
actttgggtg tcctagacct ggtggcgcta ttcccgtctt acgacacacg ggtgtaccca 840
atgaacacta gcgcgcaact cacgcgggag atctacacag acccaatcgg ccggacgaac 900
gcaccctccg gtttcgcatc cacgaattgg ttcaacaaca acgcaccctc cttctcggca 960
atcgaggccg ccgtcatccg ccctcctcac ctgctcgact ttcccgagca gctcacgatc 1020
ttctccgtgc tctcacgctg gtccaacaca cagtacatga actactgggt cgggcaccga 1080
ttggagagta ggacgatccg tggcagcttg agcaccagta cccacggcaa caccaacacc 1140
tccatcaacc cagttacgct acagttcacg agccgcgacg tttaccggac tgagtcgttc 1200
gcgggcatta acatccttct gacaacgccc gtcaacggcg tcccgtgggc ccggttcaac 1260
tggcgtaacc cgttgaactc cctgcgcggg tcattgctct acaccatcgg gtacacgggc 1320
gtcggcaccc agctcttcga cagtgaaact gagctgccgc ccgagaccac ggaacgcccg 1380
aactacgagt cctacagcca ccgcctgtcc aacatccggc tcatctctgg caacacgctg 1440
cgtgcgccgg tgtactcctg gacacaccgc agcgccgacc ggaccaacac gatctcttcc 1500
gactccatta accagatccc gctcgtgaag ggcttccgtg tgtggggtgg cacgagcgtc 1560
atcaccggtc cgggcttcac cggtggagac atactgcggc gcaacacttt cggcgacttc 1620
gtttcgttgc aagtgaacat caactcgccg atcacccagc gttaccgtct gaggttccgc 1680
tacgcttcaa gccgcgacgc gagggtcatt gtcctgaccg gagccgcgtc cacaggcgtg 1740
ggaggccaag tctcagtcaa catgcctctc cagaagacga tggagatagg cgagaacttg 1800
actagccgaa ccttccggta cactgatttc tcgaaccctt tctcattcag agcgaaccct 1860
gacatcattg ggatctccga gcaaccgctg ttcggtgctg gctccatcag ctctggcgaa 1920
ctgtacatcg acaagattga gatcatcctg gcggatgcga cgacggcgac cttcgaggcg 1980
gagtatgact tggagcgggc tcaggaggcc gtcaacgcgc tgttcacaaa caccaatcct 2040
cgccgcctca agacgggtgt gactgattac cacattgacg aggtctccaa cttggtcgcg 2100
tgtctgtccg atgagttctg cctggacgag aagcgggaac tgctggagaa ggtcaagtac 2160
gccaagcgcc tctccgacga aaggaacctc ctccaagatc ccaactttac ttccattaac 2220
aagcagccgg acttcatctc caccaacgag cagtccaact tcacctcaat ccacgagcag 2280
tcggagcacg ggtggtgggg cagcgagaac atcaccatcc aagagggcaa cgacgtcttc 2340
aaggagaact acgtgatcct gcccggcacc ttcaacgagt gttacccgac ctatctctac 2400
cagaagattg gcgaagcgga actcaaggct tacacccgtt accaactgag tggctacatt 2460
gaggactcac aagacctgga aatctacctg atccgctaca acgccaagca cgagaccctc 2520
gacgtgcctg gcacggagtc cgtctggccc ttgagcgtgg agtctcctat cggtcgttgc 2580
ggcgagccca atcgctgcgc tccgcacttt gagtggaatc ctgatttgga ttgctcctgc 2640
cgagacggtg agaaatgcgc ccaccactcg caccacttca gcctagacat cgacgtgggc 2700
tgcatcgacc tgcacgagaa cttgggcgtc tgggtcgtgt tcaagatcaa gacacaggag 2760
ggccatgctc ggcttgggaa cctggagttc atcgaggaga agccactgct gggtgaagcc 2820
ttgtcacggg tgaaacgcgc cgagaagaag tggcgggaca aacgggagaa gctccagttg 2880
gagacaaagc gtgtgtacac agaggccaag gaggccgtgg atgccttgtt cgtggacagt 2940
cagtacgaca ggctgcaagc ggacaccaac atcgggatga tccacgcggc tgataagctt 3000
gttcacagaa tccgcgaggc gtacctgtca gagcttagcg tgatcccagg cgtcaacgcc 3060
gaaatcttcg aggaactgga gggccgcatt atcacggcaa tctcacttta tgacgcgagg 3120
aatgtggtca agaacggtga cttcaacaac ggcttggcgt gttggaacgt taaagggcac 3180
gtggatgtac aacagtcaca ccacagaagt gtcttggtca tcccggagtg ggaggcggaa 3240
gtgagccagg ccgtccgggt ctgccctggg cgcggttaca tcctccgcgt gacagcgtac 3300
aaggagggct acggtgaggg ctgcgtgacg atccacgaga ttgagaacaa cacggacgag 3360
cttaagttca agaactgcga ggaggaggaa gtgtacccga cagacaccgg cacctgcaac 3420
gactacaccg cccaccaagg gaccgccgcc tgcaacagcc gcaacgcggg ctatgaagat 3480
gcgtacgagg ttgataccac cgcctcagtg aactacaaac cgacttatga ggaggagaca 3540
tacacggacg tcaggcgcga caaccattgt gagtacgacc gtggctacgt gaactatccg 3600
ccggtgccag cgggctacat gacgaaggag ctagaatact tccctgagac ggacaaggtg 3660
tggattgaaa tcggcgagac cgagggcaag tttatcgtgg attctgtcga gctgctgcta 3720
atggaggagt ag 3732
<210> 41
<211> 1243
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC868_13的氨基酸序列。
<400> 41
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Asn Gln Ile Pro Leu Val Lys Gly Phe
500 505 510
Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln
530 535 540
Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg
545 550 555 560
Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala
565 570 575
Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys
580 585 590
Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
595 600 605
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly
610 615 620
Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
625 630 635 640
Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Thr Ala
645 650 655
Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gln Glu Ala Val Asn
660 665 670
Ala Leu Phe Thr Asn Thr Asn Pro Arg Arg Leu Lys Thr Gly Val Thr
675 680 685
Asp Tyr His Ile Asp Glu Val Ser Asn Leu Val Ala Cys Leu Ser Asp
690 695 700
Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Leu Glu Lys Val Lys Tyr
705 710 715 720
Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe
725 730 735
Thr Ser Ile Asn Lys Gln Pro Asp Phe Ile Ser Thr Asn Glu Gln Ser
740 745 750
Asn Phe Thr Ser Ile His Glu Gln Ser Glu His Gly Trp Trp Gly Ser
755 760 765
Glu Asn Ile Thr Ile Gln Glu Gly Asn Asp Val Phe Lys Glu Asn Tyr
770 775 780
Val Ile Leu Pro Gly Thr Phe Asn Glu Cys Tyr Pro Thr Tyr Leu Tyr
785 790 795 800
Gln Lys Ile Gly Glu Ala Glu Leu Lys Ala Tyr Thr Arg Tyr Gln Leu
805 810 815
Ser Gly Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg
820 825 830
Tyr Asn Ala Lys His Glu Thr Leu Asp Val Pro Gly Thr Glu Ser Val
835 840 845
Trp Pro Leu Ser Val Glu Ser Pro Ile Gly Arg Cys Gly Glu Pro Asn
850 855 860
Arg Cys Ala Pro His Phe Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys
865 870 875 880
Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu Asp
885 890 895
Ile Asp Val Gly Cys Ile Asp Leu His Glu Asn Leu Gly Val Trp Val
900 905 910
Val Phe Lys Ile Lys Thr Gln Glu Gly His Ala Arg Leu Gly Asn Leu
915 920 925
Glu Phe Ile Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ser Arg Val
930 935 940
Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gln Leu
945 950 955 960
Glu Thr Lys Arg Val Tyr Thr Glu Ala Lys Glu Ala Val Asp Ala Leu
965 970 975
Phe Val Asp Ser Gln Tyr Asp Arg Leu Gln Ala Asp Thr Asn Ile Gly
980 985 990
Met Ile His Ala Ala Asp Lys Leu Val His Arg Ile Arg Glu Ala Tyr
995 1000 1005
Leu Ser Glu Leu Ser Val Ile Pro Gly Val Asn Ala Glu Ile Phe
1010 1015 1020
Glu Glu Leu Glu Gly Arg Ile Ile Thr Ala Ile Ser Leu Tyr Asp
1025 1030 1035
Ala Arg Asn Val Val Lys Asn Gly Asp Phe Asn Asn Gly Leu Ala
1040 1045 1050
Cys Trp Asn Val Lys Gly His Val Asp Val Gln Gln Ser His His
1055 1060 1065
Arg Ser Val Leu Val Ile Pro Glu Trp Glu Ala Glu Val Ser Gln
1070 1075 1080
Ala Val Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr
1085 1090 1095
Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu
1100 1105 1110
Ile Glu Asn Asn Thr Asp Glu Leu Lys Phe Lys Asn Cys Glu Glu
1115 1120 1125
Glu Glu Val Tyr Pro Thr Asp Thr Gly Thr Cys Asn Asp Tyr Thr
1130 1135 1140
Ala His Gln Gly Thr Ala Ala Cys Asn Ser Arg Asn Ala Gly Tyr
1145 1150 1155
Glu Asp Ala Tyr Glu Val Asp Thr Thr Ala Ser Val Asn Tyr Lys
1160 1165 1170
Pro Thr Tyr Glu Glu Glu Thr Tyr Thr Asp Val Arg Arg Asp Asn
1175 1180 1185
His Cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr Pro Pro Val Pro
1190 1195 1200
Ala Gly Tyr Met Thr Lys Glu Leu Glu Tyr Phe Pro Glu Thr Asp
1205 1210 1215
Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Lys Phe Ile Val
1220 1225 1230
Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1235 1240
<210> 42
<211> 3702
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC868_14的合成核苷酸序列。
<400> 42
atgacgagca accggaagaa cgagaacgag atcatcaacg ccctctcgat ccctgctgtt 60
tcaaaccact ccgcgcagat gaacctgtcc accgacgcgc gcatcgagga ctccctctgc 120
atagccgagg gcaacaacat cgacccattc gtgtcggcca gcacggttca gaccggcatc 180
aacatcgcgg gccgtatcct cggcgtcctc ggtgtcccat tcgccggtca gatcgcgtcc 240
ttctactcgt tccttgtggg cgagctgtgg cctcgcggtc gtgacccgtg ggagatcttc 300
ctggagcatg tggagcagtt gatccggcag caagtcacgg agaacacccg cgatactgct 360
ctggccaggc tacagggcct gggaaactcc tttcgggcat accagcagtc actggaggac 420
tggttggaga acagggatga cgcgcgaaca cgctcggtac tctacaccca gtacatcgct 480
ctcgaactcg acttcctgaa cgctatgccg ctgttcgcca tcaggaacca ggaagttcca 540
ctccttatgg tgtacgccca ggccgccaac ttacatctgc tcctgctgcg ggacgccagc 600
ctgttcggct ccgagttcgg actcacatct caagaaatcc agcgttacta cgagcgccaa 660
gtggagaaga cccgtgagta cagtgactac tgcgctcgat ggtacaacac agggctcaac 720
aacctgcgcg gcaccaacgc tgagtcatgg ctccgttaca accagttccg ccgcgacttg 780
actttgggtg tcctagacct ggtggcgcta ttcccgtctt acgacacacg ggtgtaccca 840
atgaacacta gcgcgcaact cacgcgggag atctacacag acccaatcgg ccggacgaac 900
gcaccctccg gtttcgcatc cacgaattgg ttcaacaaca acgcaccctc cttctcggca 960
atcgaggccg ccgtcatccg ccctcctcac ctgctcgact ttcccgagca gctcacgatc 1020
ttctccgtgc tctcacgctg gtccaacaca cagtacatga actactgggt cgggcaccga 1080
ttggagagta ggacgatccg tggcagcttg agcaccagta cccacggcaa caccaacacc 1140
tccatcaacc cagttacgct acagttcacg agccgcgacg tttaccggac tgagtcgttc 1200
gcgggcatta acatccttct gacaacgccc gtcaacggcg tcccgtgggc ccggttcaac 1260
tggcgtaacc cgttgaactc cctgcgcggg tcattgctct acaccatcgg gtacacgggc 1320
gtcggcaccc agctcttcga cagtgaaact gagctgccgc ccgagaccac ggaacgcccg 1380
aactacgagt cctacagcca ccgcctgtcc aacatccggc tcatctctgg caacacgctg 1440
cgtgcgccgg tgtactcctg gacacaccgc agcgccgacc ggaccaacac gatctcttcc 1500
gactccatta accagatccc gctcgtgaag ggcttccgtg tgtggggtgg cacgagcgtc 1560
atcaccggtc cgggcttcac cggtggagac atactgcggc gcaacacttt cggcgacttc 1620
gtttcgttgc aagtgaacat caactcgccg atcacccagc gttaccgtct gaggttccgc 1680
tacgcttcaa gccgcgacgc gagggtcatt gtcctgaccg gagccgcgtc cacaggcgtg 1740
ggaggccaag tctcagtcaa catgcctctc cagaagacga tggagatagg cgagaacttg 1800
actagccgaa ccttccggta cactgatttc tcgaaccctt tctcattcag agcgaaccct 1860
gacatcattg ggatctccga gcaaccgctg ttcggtgctg gctccatcag ctctggcgaa 1920
ctgtacatcg acaagattga gatcatcctg gcggatgcga cgaccgcgac gtttgaagct 1980
gaatccgacc tcgagcgtgc gcgcaaggcg gtgaacgctc tgttcacgag caccaaccct 2040
cgtggcttga agacggatgt gacggactac cacatcgacc aagtctcgaa cctcgtggag 2100
tgcctgagcg acgagttctg tcttgacaag aagcgcgagc tgctggagga ggtgaagtac 2160
gccaagcgcc tctccgatga gcgcaacctg ctccaagatc ctaccttcac gtcgatttcc 2220
ggccaaaccg accgtggatg gatcggctcg actggcatct ccatccaggg cggcgacgac 2280
atcttcaagg agaactatgt tcggctgccg ggcacggtgg acgagtgtta cccgacgtac 2340
ctctaccaga agatagacga gagtcaactc aagtcctaca cgcggtatca gttacgtggc 2400
tacattgaag actcccagga cttggaaatc tatctcatac ggtacaacgc caagcacgag 2460
accttaagcg tgccgggaac ggagtcgccc tggccaagct ctggcgtgta cccttccggt 2520
aggtgcggcg agcccaaccg ctgtgcacct cgaatcgaat ggaacccgga ccttgactgc 2580
tcttgccggt acggcgagaa gtgcgtccat cattctcacc acttcagctt ggacattgac 2640
gtcggctgca ccgacctcaa tgaagacctc ggagtgtggg tcatcttcaa gatcaagaca 2700
caggacgggc acgcgaaact aggaaacctg gagttcatcg aggagaagcc actcctcggc 2760
aaggcacttt ccagggtcaa gcgggccgag aagaaatgga gggacaagta cgagaaactc 2820
cagctcgaaa caaagcgggt gtacacggag gcaaaggaat ccgtggacgc cctgttcgtg 2880
gactctcagt acgacaagct ccaggcgaac acaaacattg gcatcatcca cggtgcggac 2940
aagcaagtgc acaggatacg ggagccttac ctctcggagc tgccggtgat tccctcgatc 3000
aacgcggcga tcttcgagga actggagggc cacatcttca aggcgtattc tctgtacgac 3060
gcgcgtaacg tcatcaagaa cggcgacttc aacaatgggc tgtcctgctg gaacgttaaa 3120
ggccacgtcg atgtccagca gaaccaccat aggtcagtcc tggtgctgag cgagtgggag 3180
gcggaggtgt cccagaaggt gcgcgtgtgc ccggatcgcg gctacatctt gagggtgaca 3240
gcctacaagg agggctacgg cgagggctgt gtcacgatcc atgagttcga ggacaacacg 3300
gatgtcctga aattccgtaa cttcgtcgag gaggaggtct atcccaacaa caccgtgacc 3360
tgcaacgact acacgaccaa tcagtcggct gagggcagta ccgatgcctg caacagctac 3420
aaccgtggtt acgaagatgg atacgagaac cgctacgagc ccaatccttc ggctcccgtg 3480
aattacactc ccacgtacga ggagggcatg tacactgaca ctcagggcta caaccattgc 3540
gtcagcgacc gtggctaccg caaccacacg ccgctcccag cgggctacgt gacgctggag 3600
ctggaatact ttcccgagac agaacaagtg tggatagaga tcggcgagac cgagggcaca 3660
ttcatcgtgg gctctgtgga attgctcctc atggaggagt aa 3702
<210> 43
<211> 1200
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC868_14的氨基酸序列。
<400> 43
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Asn Gln Ile Pro Leu Val Lys Gly Phe
500 505 510
Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln
530 535 540
Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg
545 550 555 560
Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala
565 570 575
Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys
580 585 590
Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
595 600 605
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly
610 615 620
Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
625 630 635 640
Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Thr Ala
645 650 655
Thr Phe Glu Ala Glu Ser Asp Leu Glu Arg Ala Arg Lys Ala Val Asn
660 665 670
Ala Leu Phe Thr Ser Thr Asn Pro Arg Gly Leu Lys Thr Asp Val Thr
675 680 685
Asp Tyr His Ile Asp Gln Val Ser Asn Leu Val Glu Cys Leu Ser Asp
690 695 700
Glu Phe Cys Leu Asp Lys Lys Arg Glu Leu Leu Glu Glu Val Lys Tyr
705 710 715 720
Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Thr Phe
725 730 735
Thr Ser Ile Ser Gly Gln Thr Asp Arg Gly Trp Ile Gly Ser Thr Gly
740 745 750
Ile Ser Ile Gln Gly Gly Asp Asp Ile Phe Lys Glu Asn Tyr Val Arg
755 760 765
Leu Pro Gly Thr Val Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys
770 775 780
Ile Asp Glu Ser Gln Leu Lys Ser Tyr Thr Arg Tyr Gln Leu Arg Gly
785 790 795 800
Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn
805 810 815
Ala Lys His Glu Thr Leu Ser Val Pro Gly Thr Glu Ser Pro Trp Pro
820 825 830
Ser Ser Gly Val Tyr Pro Ser Gly Arg Cys Gly Glu Pro Asn Arg Cys
835 840 845
Ala Pro Arg Ile Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg Tyr
850 855 860
Gly Glu Lys Cys Val His His Ser His His Phe Ser Leu Asp Ile Asp
865 870 875 880
Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val Ile Phe
885 890 895
Lys Ile Lys Thr Gln Asp Gly His Ala Lys Leu Gly Asn Leu Glu Phe
900 905 910
Ile Glu Glu Lys Pro Leu Leu Gly Lys Ala Leu Ser Arg Val Lys Arg
915 920 925
Ala Glu Lys Lys Trp Arg Asp Lys Tyr Glu Lys Leu Gln Leu Glu Thr
930 935 940
Lys Arg Val Tyr Thr Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val
945 950 955 960
Asp Ser Gln Tyr Asp Lys Leu Gln Ala Asn Thr Asn Ile Gly Ile Ile
965 970 975
His Gly Ala Asp Lys Gln Val His Arg Ile Arg Glu Pro Tyr Leu Ser
980 985 990
Glu Leu Pro Val Ile Pro Ser Ile Asn Ala Ala Ile Phe Glu Glu Leu
995 1000 1005
Glu Gly His Ile Phe Lys Ala Tyr Ser Leu Tyr Asp Ala Arg Asn
1010 1015 1020
Val Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp Asn
1025 1030 1035
Val Lys Gly His Val Asp Val Gln Gln Asn His His Arg Ser Val
1040 1045 1050
Leu Val Leu Ser Glu Trp Glu Ala Glu Val Ser Gln Lys Val Arg
1055 1060 1065
Val Cys Pro Asp Arg Gly Tyr Ile Leu Arg Val Thr Ala Tyr Lys
1070 1075 1080
Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu Phe Glu Asp
1085 1090 1095
Asn Thr Asp Val Leu Lys Phe Arg Asn Phe Val Glu Glu Glu Val
1100 1105 1110
Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Thr Thr Asn Gln
1115 1120 1125
Ser Ala Glu Gly Ser Thr Asp Ala Cys Asn Ser Tyr Asn Arg Gly
1130 1135 1140
Tyr Glu Asp Gly Tyr Glu Asn Arg Tyr Glu Pro Asn Pro Ser Ala
1145 1150 1155
Pro Val Asn Tyr Thr Pro Thr Tyr Glu Glu Gly Met Tyr Thr Asp
1160 1165 1170
Thr Gln Gly Tyr Asn His Cys Val Ser Asp Arg Gly Tyr Arg Asn
1175 1180 1185
His Thr Pro Leu Pro Ala Gly Tyr Val Thr Leu Glu
1190 1195 1200
<210> 44
<211> 3687
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC868_15的合成核苷酸序列。
<400> 44
atgacgagca accggaagaa cgagaacgag atcatcaacg ccctctcgat ccctgctgtt 60
tcaaaccact ccgcgcagat gaacctgtcc accgacgcgc gcatcgagga ctccctctgc 120
atagccgagg gcaacaacat cgacccattc gtgtcggcca gcacggttca gaccggcatc 180
aacatcgcgg gccgtatcct cggcgtcctc ggtgtcccat tcgccggtca gatcgcgtcc 240
ttctactcgt tccttgtggg cgagctgtgg cctcgcggtc gtgacccgtg ggagatcttc 300
ctggagcatg tggagcagtt gatccggcag caagtcacgg agaacacccg cgatactgct 360
ctggccaggc tacagggcct gggaaactcc tttcgggcat accagcagtc actggaggac 420
tggttggaga acagggatga cgcgcgaaca cgctcggtac tctacaccca gtacatcgct 480
ctcgaactcg acttcctgaa cgctatgccg ctgttcgcca tcaggaacca ggaagttcca 540
ctccttatgg tgtacgccca ggccgccaac ttacatctgc tcctgctgcg ggacgccagc 600
ctgttcggct ccgagttcgg actcacatct caagaaatcc agcgttacta cgagcgccaa 660
gtggagaaga cccgtgagta cagtgactac tgcgctcgat ggtacaacac agggctcaac 720
aacctgcgcg gcaccaacgc tgagtcatgg ctccgttaca accagttccg ccgcgacttg 780
actttgggtg tcctagacct ggtggcgcta ttcccgtctt acgacacacg ggtgtaccca 840
atgaacacta gcgcgcaact cacgcgggag atctacacag acccaatcgg ccggacgaac 900
gcaccctccg gtttcgcatc cacgaattgg ttcaacaaca acgcaccctc cttctcggca 960
atcgaggccg ccgtcatccg ccctcctcac ctgctcgact ttcccgagca gctcacgatc 1020
ttctccgtgc tctcacgctg gtccaacaca cagtacatga actactgggt cgggcaccga 1080
ttggagagta ggacgatccg tggcagcttg agcaccagta cccacggcaa caccaacacc 1140
tccatcaacc cagttacgct acagttcacg agccgcgacg tttaccggac tgagtcgttc 1200
gcgggcatta acatccttct gacaacgccc gtcaacggcg tcccgtgggc ccggttcaac 1260
tggcgtaacc cgttgaactc cctgcgcggg tcattgctct acaccatcgg gtacacgggc 1320
gtcggcaccc agctcttcga cagtgaaact gagctgccgc ccgagaccac ggaacgcccg 1380
aactacgagt cctacagcca ccgcctgtcc aacatccggc tcatctctgg caacacgctg 1440
cgtgcgccgg tgtactcctg gacacaccgc agcgccgacc ggaccaacac gatctcttcc 1500
gactccatta accagatccc gctcgtgaag ggcttccgtg tgtggggtgg cacgagcgtc 1560
atcaccggtc cgggcttcac cggtggagac atactgcggc gcaacacttt cggcgacttc 1620
gtttcgttgc aagtgaacat caactcgccg atcacccagc gttaccgtct gaggttccgc 1680
tacgcttcaa gccgcgacgc gagggtcatt gtcctgaccg gagccgcgtc cacaggcgtg 1740
ggaggccaag tctcagtcaa catgcctctc cagaagacga tggagatagg cgagaacttg 1800
actagccgaa ccttccggta cactgatttc tcgaaccctt tctcattcag agcgaaccct 1860
gacatcattg ggatctccga gcaaccgctg ttcggtgctg gctccatcag ctctggcgaa 1920
ctgtacatcg acaagattga gatcatcctg gcggatgcga cggatgctac ctttgaagca 1980
gagtccgact tggaacgtgc acagaaggca gtgaacgcac tcttcacctc aagcaaccag 2040
atcggattga agacagatgt gacagattac cacatcgacc aagtgagcaa cttggtggat 2100
tgcttgtcag atgagttctg cttggatgag aagcgtgaac tctccgagaa ggtgaagcac 2160
gcaaagcgtc tctcagatga acgtaatctc cttcaagacc ctaactttcg tggtatcaat 2220
cgtcagccag atcgtggatg gcgtggatca acagacatca ccatccaggg aggcgatgat 2280
gtgttcaagg agaactacgt gaccctccca ggaaccgtgg atgaatgcta cccaacctac 2340
ctctaccaga agatcgacga gtcaaagctc aaggcttaca cccgttatga actccgtggc 2400
tacatcgaag atagccagga tctcgaaatc tatctcatcc gttacaatgc taagcacgaa 2460
atcgtgaatg tgccaggaac cggctcactc tggccactct cagcacagtc accaatcggc 2520
aagtgcggcg aacccaatcg ctgcgctcct catctcgaat ggaatcccga tctcgactgc 2580
tcctgccgag acggcgagaa gtgtgcacat cactcacacc acttcaccct cgacatcgac 2640
gtgggctgca ccgacctcaa tgaagacctg ggcgtgtggg tgatcttcaa gatcaagacc 2700
caggacggcc acgcacgact gggcaatctg gagtttctgg aggagaagcc actgcttggc 2760
gaggcactgg cacgagtgaa acgagccgag aagaaatggc gagacaaacg tgagaagctg 2820
caactggaga ccaacatcgt gtacaaagag gccaaagagt cagttgacgc cctgtttgtc 2880
aatagccagt atgaccgact gcaagttgac accaacatcg ccatgatcca cgctgcggac 2940
aagcgcgtcc accgcatccg cgaggcttat ctgcccgagc tgagcgtcat tcccggcgtc 3000
aatgccgcga tcttcgagga gttagagggc cgcatcttca ccgcctacag cctctatgac 3060
gcccgcaatg tcattaagaa tggcgacttc aacaatggct tactatgctg gaatgtcaaa 3120
gggcacgttg acgtcgagga gcagaacaat caccgcagcg tcttagtcat acccgagtgg 3180
gaggccgaag tcagccagga agtccgcgtc tgtccagggc gcgggtacat cctgcgggtc 3240
accgcctaca aagagggata cggcgagggt tgtgtcacca tacacgagat agaggacaat 3300
accgacgaac tcaagttcag caattgtgtc gaggaggaag tctatcccaa caataccgta 3360
acctgcaaca actacaccgg aacccaggag gagtatgaag ggacgtacac ctcgcggaac 3420
cagggctatg acgaagccta tgggaacaac ccgtcggtgc ctgctgacta tgcgtcggtc 3480
tatgaggaga aatcgtacac ggacgggcgg cgggagaatc cgtgtgagtc gaatcgcggg 3540
tatggtgact acacgccgct accggcgggc tatgtaacga aagacctgga atacttcccg 3600
gagacggaca aagtatggat agagataggc gagacggagg gaacgttcat cgtggactcg 3660
gtagagctgc tgctcatgga ggagtga 3687
<210> 45
<211> 1228
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC868_15的氨基酸序列。
<400> 45
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Gln Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Asn Gln Ile Pro Leu Val Lys Gly Phe
500 505 510
Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln
530 535 540
Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg
545 550 555 560
Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala
565 570 575
Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys
580 585 590
Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
595 600 605
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly
610 615 620
Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
625 630 635 640
Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Asp Ala
645 650 655
Thr Phe Glu Ala Glu Ser Asp Leu Glu Arg Ala Gln Lys Ala Val Asn
660 665 670
Ala Leu Phe Thr Ser Ser Asn Gln Ile Gly Leu Lys Thr Asp Val Thr
675 680 685
Asp Tyr His Ile Asp Gln Val Ser Asn Leu Val Asp Cys Leu Ser Asp
690 695 700
Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys His
705 710 715 720
Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe
725 730 735
Arg Gly Ile Asn Arg Gln Pro Asp Arg Gly Trp Arg Gly Ser Thr Asp
740 745 750
Ile Thr Ile Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr
755 760 765
Leu Pro Gly Thr Val Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys
770 775 780
Ile Asp Glu Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Glu Leu Arg Gly
785 790 795 800
Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn
805 810 815
Ala Lys His Glu Ile Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro
820 825 830
Leu Ser Ala Gln Ser Pro Ile Gly Lys Cys Gly Glu Pro Asn Arg Cys
835 840 845
Ala Pro His Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg Asp
850 855 860
Gly Glu Lys Cys Ala His His Ser His His Phe Thr Leu Asp Ile Asp
865 870 875 880
Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val Ile Phe
885 890 895
Lys Ile Lys Thr Gln Asp Gly His Ala Arg Leu Gly Asn Leu Glu Phe
900 905 910
Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ala Arg Val Lys Arg
915 920 925
Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gln Leu Glu Thr
930 935 940
Asn Ile Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val
945 950 955 960
Asn Ser Gln Tyr Asp Arg Leu Gln Val Asp Thr Asn Ile Ala Met Ile
965 970 975
His Ala Ala Asp Lys Arg Val His Arg Ile Arg Glu Ala Tyr Leu Pro
980 985 990
Glu Leu Ser Val Ile Pro Gly Val Asn Ala Ala Ile Phe Glu Glu Leu
995 1000 1005
Glu Gly Arg Ile Phe Thr Ala Tyr Ser Leu Tyr Asp Ala Arg Asn
1010 1015 1020
Val Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu Leu Cys Trp Asn
1025 1030 1035
Val Lys Gly His Val Asp Val Glu Glu Gln Asn Asn His Arg Ser
1040 1045 1050
Val Leu Val Ile Pro Glu Trp Glu Ala Glu Val Ser Gln Glu Val
1055 1060 1065
Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr Ala Tyr
1070 1075 1080
Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu Ile Glu
1085 1090 1095
Asp Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu Glu
1100 1105 1110
Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asn Tyr Thr Gly Thr
1115 1120 1125
Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg Asn Gln Gly Tyr
1130 1135 1140
Asp Glu Ala Tyr Gly Asn Asn Pro Ser Val Pro Ala Asp Tyr Ala
1145 1150 1155
Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg Arg Glu Asn
1160 1165 1170
Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu Pro
1175 1180 1185
Ala Gly Tyr Val Thr Lys Asp Leu Glu Tyr Phe Pro Glu Thr Asp
1190 1195 1200
Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Thr Phe Ile Val
1205 1210 1215
Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1220 1225
<210> 46
<211> 3600
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC868_29的合成核苷酸序列。
<400> 46
atgacgagca accggaagaa cgagaacgag atcatcaacg ccctctcgat ccctgctgtt 60
tcaaaccact ccgcgcagat gaacctgtcc accgacgcgc gcatcgagga ctccctctgc 120
atagccgagg gcaacaacat cgacccattc gtgtcggcca gcacggttca gaccggcatc 180
aacatcgcgg gccgtatcct cggcgtcctc ggtgtcccat tcgccggtca gatcgcgtcc 240
ttctactcgt tccttgtggg cgagctgtgg cctcgcggtc gtgacccgtg ggagatcttc 300
ctggagcatg tggagcagtt gatccggcag caagtcacgg agaacacccg cgatactgct 360
ctggccaggc tacagggcct gggaaactcc tttcgggcat accagtactc actggaggac 420
tggttggaga acagggatga cgcgcgaaca cgctcggtac tctacaccca gtacatcgct 480
ctcgaactcg acttcctgaa cgctatgccg ctgttcgcca tcaggaacca ggaagttcca 540
ctccttatgg tgtacgccca ggccgccaac ttacatctgc tcctgctgcg ggacgccagc 600
ctgttcggct ccgagttcgg actcacatct caagaaatcc agcgttacta cgagcgccaa 660
gtggagaaga cccgtgagta cagtgactac tgcgctcgat ggtacaacac agggctcaac 720
aacctgcgcg gcaccaacgc tgagtcatgg ctccgttaca accagttccg ccgcgacttg 780
actttgggtg tcctagacct ggtggcgcta ttcccgtctt acgacacacg ggtgtaccca 840
atgaacacta gcgcgcaact cacgcgggag atctacacag acccaatcgg ccggacgaac 900
gcaccctccg gtttcgcatc cacgaattgg ttcaacaaca acgcaccctc cttctcggca 960
atcgaggccg ccgtcatccg ccctcctcac ctgctcgact ttcccgagca gctcacgatc 1020
ttctcccagc tctcacgctg gtcccacaca cagtacatga actactgggt cgggcaccga 1080
ttggagagta ggacgatccg tggcagcttg agcaccagta cccacggcaa caccaacacc 1140
tccatcaacc cagttacgct acagttcacg agccgcgacg tttaccggac tgagtcgttc 1200
gcgggcatta acatccttct gacaacgccc gtcaacggcg tcccgtgggc ccggttcaac 1260
tggcgtaacc cgttgaactc cctgcgcggg tcattgctct acaccatcgg gtacacgggc 1320
gtcggcaccc agctcttcga cagtgaaact gagctgccgc ccgagaccac ggaacgcccg 1380
aactacgagt cctacagcca ccgcctgtcc aacatccggc tcatctctgg caacacgctg 1440
cgtgcgccgg tgtactcctg gacacaccgc agcgccgacc ggaccaacac gatctcttcc 1500
gactccatta accagatccc gctcgtgaag ggcttccgtg tgtggggtgg cacgagcgtc 1560
atcaccggtc cgggcttcac cggtggagac atactgcggc gcaacacttt cggcgacttc 1620
gtttcgttgc aagtgaacat caactcgccg atcacccagc gttaccgtct gaggttccgc 1680
tacgcttcaa gccgcgacgc gagggtcatt gtcctgaccg gagccgcgtc cacaggcgtg 1740
ggaggccaag tctcagtcaa catgcctctc cagaagacga tggagatagg cgagaacttg 1800
actagccgaa ccttccggta cactgatttc tcgaaccctt tctcattcag agcgaaccct 1860
gacatcattg ggatctccga gcaaccgctg ttcggtgctg gctccatcag ctctggcgaa 1920
ctgtacatcg acaagattga gatcatcctg gcggatgcga cgttcgaggc cgagtctgac 1980
ctggagcggg ctcagaaggc tgtcaacgaa ctgttcacca gcagcaacca gattgggctc 2040
aagaccgacg tcacggacta tcacattgac caagtgtcca accttgtgga gtgcctgtcc 2100
gacgagttct gcctcgacga gaagaaggag ctgtccgaga aggtcaaaca cgcgaagcgt 2160
ctgagtgacg agcggaattt gctccaggac ccgaacttcc gtggcatcaa ccgccagctc 2220
gaccgtggtt ggcgcgggag tacagacatc accatccagg gaggcgacga tgtgttcaag 2280
gagaactatg tgacgctgct cgggactttc gacgaatgct acccgacgta tctctaccag 2340
aagatagacg agagtaaatt gaaggcgtac acccgctacc agcttcgcgg gtacatcgag 2400
gatagtcagg acctggaaat ctacctgatc cgatacaacg ccaagcacga gacagtgaac 2460
gtgccaggca cgggctcact ttggccattg agcgctccct ctccaatcgg aaagtgcgct 2520
caccactcgc accacttctc tctggacatc gacgtgggct gcaccgacct caacgaggac 2580
ctgggtgtct gggttatctt caagattaag acccaggacg gacatgcccg cctcggcaac 2640
ctggagttcc ttgaggagaa gcctctcgtg ggcgaggccc tcgctcgtgt gaagcgcgcc 2700
gagaagaaat ggcgagacaa gcgggagaag ctggagtggg agaccaacat cgtgtacaag 2760
gaggccaagg agtcagtgga cgcactcttc gtcaacagcc agtacgaccg cctccaggct 2820
gacaccaaca tcgccatgat ccacgcggct gacaagcggg tccacagcat ccgtgaggcg 2880
tacctgcccg agctgtcagt gatccctggt gtgaacgcgg cgatcttcga ggaactggag 2940
ggccgcatct tcacagcatt cagcctgtac gatgccagga atgttattaa gaacggtgac 3000
ttcaacaacg ggctgagttg ctggaacgtc aagggccatg tggacgtcga ggagcagaac 3060
aaccaccggt ccgtgctggt cgtgccggag tgggaggcag aggtgagcca ggaggtccgc 3120
gtctgccctg gtcgcggcta catcctccgt gtgactgcgt acaaggaagg ctacggtgaa 3180
ggctgcgtga ctatccacga gatcgagaac aacaccgacg agctcaagtt ctcgaactgt 3240
gtggaggagg aggtgtaccc gaacaacacc gttacttgca acgactacac tgccacgcaa 3300
gaggagtacg agggcactta cacttcccgg aatcgcggct atgatggcgc gtacgagtcc 3360
aacagcagcg tgcctgcgga ttatgcgtcc gcttacgagg agaaggcgta caccgacgga 3420
cggagggaca acccttgcga gtccaaccgt ggctacggtg actacactcc gctgcccgcc 3480
gggtacgtca ccaaggagct ggagtacttc ccggagaccg acaaagtctg gatcgagatc 3540
ggcgagacgg agggcacttt catcgtggac tcggtcgagc tgctactgat ggaggagtga 3600
<210> 47
<211> 1199
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质变体TIC868_29的氨基酸序列。
<400> 47
Met Thr Ser Asn Arg Lys Asn Glu Asn Glu Ile Ile Asn Ala Leu Ser
1 5 10 15
Ile Pro Ala Val Ser Asn His Ser Ala Gln Met Asn Leu Ser Thr Asp
20 25 30
Ala Arg Ile Glu Asp Ser Leu Cys Ile Ala Glu Gly Asn Asn Ile Asp
35 40 45
Pro Phe Val Ser Ala Ser Thr Val Gln Thr Gly Ile Asn Ile Ala Gly
50 55 60
Arg Ile Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gln Ile Ala Ser
65 70 75 80
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro
85 90 95
Trp Glu Ile Phe Leu Glu His Val Glu Gln Leu Ile Arg Gln Gln Val
100 105 110
Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gln Gly Leu Gly
115 120 125
Asn Ser Phe Arg Ala Tyr Gln Tyr Ser Leu Glu Asp Trp Leu Glu Asn
130 135 140
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gln Tyr Ile Ala
145 150 155 160
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala Ile Arg Asn
165 170 175
Gln Glu Val Pro Leu Leu Met Val Tyr Ala Gln Ala Ala Asn Leu His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu
195 200 205
Thr Ser Gln Glu Ile Gln Arg Tyr Tyr Glu Arg Gln Val Glu Lys Thr
210 215 220
Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn
225 230 235 240
Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gln Phe
245 250 255
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro
260 265 270
Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gln Leu Thr
275 280 285
Arg Glu Ile Tyr Thr Asp Pro Ile Gly Arg Thr Asn Ala Pro Ser Gly
290 295 300
Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala
305 310 315 320
Ile Glu Ala Ala Val Ile Arg Pro Pro His Leu Leu Asp Phe Pro Glu
325 330 335
Gln Leu Thr Ile Phe Ser Gln Leu Ser Arg Trp Ser His Thr Gln Tyr
340 345 350
Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr Ile Arg Gly
355 360 365
Ser Leu Ser Thr Ser Thr His Gly Asn Thr Asn Thr Ser Ile Asn Pro
370 375 380
Val Thr Leu Gln Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe
385 390 395 400
Ala Gly Ile Asn Ile Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp
405 410 415
Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu
420 425 430
Leu Tyr Thr Ile Gly Tyr Thr Gly Val Gly Thr Gln Leu Phe Asp Ser
435 440 445
Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser
450 455 460
Tyr Ser His Arg Leu Ser Asn Ile Arg Leu Ile Ser Gly Asn Thr Leu
465 470 475 480
Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn
485 490 495
Thr Ile Ser Ser Asp Ser Ile Asn Gln Ile Pro Leu Val Lys Gly Phe
500 505 510
Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly
515 520 525
Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln
530 535 540
Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg
545 550 555 560
Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala
565 570 575
Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys
580 585 590
Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
595 600 605
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly
610 615 620
Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
625 630 635 640
Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Phe Glu
645 650 655
Ala Glu Ser Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Glu Leu Phe
660 665 670
Thr Ser Ser Asn Gln Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr His
675 680 685
Ile Asp Gln Val Ser Asn Leu Val Glu Cys Leu Ser Asp Glu Phe Cys
690 695 700
Leu Asp Glu Lys Lys Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg
705 710 715 720
Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe Arg Gly Ile
725 730 735
Asn Arg Gln Leu Asp Arg Gly Trp Arg Gly Ser Thr Asp Ile Thr Ile
740 745 750
Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Leu Gly
755 760 765
Thr Phe Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile Asp Glu
770 775 780
Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Gln Leu Arg Gly Tyr Ile Glu
785 790 795 800
Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala Lys His
805 810 815
Glu Thr Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala
820 825 830
Pro Ser Pro Ile Gly Lys Cys Ala His His Ser His His Phe Ser Leu
835 840 845
Asp Ile Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp
850 855 860
Val Ile Phe Lys Ile Lys Thr Gln Asp Gly His Ala Arg Leu Gly Asn
865 870 875 880
Leu Glu Phe Leu Glu Glu Lys Pro Leu Val Gly Glu Ala Leu Ala Arg
885 890 895
Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Glu
900 905 910
Trp Glu Thr Asn Ile Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala
915 920 925
Leu Phe Val Asn Ser Gln Tyr Asp Arg Leu Gln Ala Asp Thr Asn Ile
930 935 940
Ala Met Ile His Ala Ala Asp Lys Arg Val His Ser Ile Arg Glu Ala
945 950 955 960
Tyr Leu Pro Glu Leu Ser Val Ile Pro Gly Val Asn Ala Ala Ile Phe
965 970 975
Glu Glu Leu Glu Gly Arg Ile Phe Thr Ala Phe Ser Leu Tyr Asp Ala
980 985 990
Arg Asn Val Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu Ser Cys Trp
995 1000 1005
Asn Val Lys Gly His Val Asp Val Glu Glu Gln Asn Asn His Arg
1010 1015 1020
Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser Gln Glu
1025 1030 1035
Val Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr Ala
1040 1045 1050
Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu Ile
1055 1060 1065
Glu Asn Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu
1070 1075 1080
Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Thr Ala
1085 1090 1095
Thr Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg Asn Arg Gly
1100 1105 1110
Tyr Asp Gly Ala Tyr Glu Ser Asn Ser Ser Val Pro Ala Asp Tyr
1115 1120 1125
Ala Ser Ala Tyr Glu Glu Lys Ala Tyr Thr Asp Gly Arg Arg Asp
1130 1135 1140
Asn Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu
1145 1150 1155
Pro Ala Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro Glu Thr
1160 1165 1170
Asp Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Thr Phe Ile
1175 1180 1185
Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1190 1195
<210> 48
<211> 3432
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC869的重组核苷酸序列。
<400> 48
atggagataa ataatcagaa gcaatgcata ccatataatt gcttaagtaa tcctgaggaa 60
gtacttttgg atggggagag gatattacct gatatcgatc cactcgaagt ttctttgtcg 120
cttttgcaat ttcttttgaa taactttgtt ccagggggag gctttatttc aggattagtt 180
gataaaatat ggggggcttt gagaccatct gaatgggact tatttcttgc acagattgaa 240
cggttgattg atcaaagaat agaagcaaca gtaagagcaa aagcaatcac tgaattagaa 300
ggattaggga gaaattatca aatatacgct gaagcattta aagaatggga atcagatcct 360
gataacgaag cggctaaaag tagagtaatt gatcgctttc gtatacttga tggtctaatt 420
gaagcaaata tcccttcatt tcggataatt ggatttgaag tgccactttt atcggtttat 480
gttcaagcag ctaatctaca tctcgctcta ttgagagatt ctgttatttt tggagagaga 540
tggggattga cgacaaaaaa tgtcaatgat atctataata gacaaattag agaaattcat 600
gaatatagca atcattgcgt agatacgtat aacacagaac tagaacgtct agggtttaga 660
tctatagcgc agtggagaat atataatcag tttagaagag aactaacact aactgtatta 720
gatattgtcg ctcttttccc gaactatgac agtagactgt atccgatcca aactttttct 780
caattgacaa gagaaattgt tacatcccca gtaagcgaat tttattatgg tgttattaat 840
agtggtaata taattggtac tcttactgaa cagcagataa ggcgaccaca tcttatggac 900
ttctttaact ccatgatcat gtatacatca gataatagac gggaacatta ttggtcagga 960
cttgaaatga cggcttattt tacaggattt gcaggagctc aagtgtcatt ccctttagtc 1020
gggactagag gggagtcagc tccaccatta actgttagaa gtgttaatga tggaatttat 1080
agaatattat cggcaccgtt ttattcagcg ccttttctag gcaccattgt attgggaagt 1140
cgtggagaaa aatttgattt tgcgcttaat aatatttcac ctccgccatc tacaatatac 1200
agacatcctg gaacagtaga ttcactagtc agtataccgc cacaggataa tagcgtacca 1260
ccgcacaggg gatctagtca tcgattaagt catgttacaa tgcgcgcaag ttcccctata 1320
ttccattgga cgcatcgcag cgcaaccact acaaatacaa ttaatccaaa tgctattatc 1380
caaataccac tagtaaaagc atttaacctt cattcaggtg ccactgttgt tagaggacca 1440
gggtttacag gtggagatct cttacgaaga acgaatactg gtacatttgc agacataaga 1500
gtcaatgttc cttcatcact attttcccaa agatatcgcg taaggattcg ttatgcttct 1560
actaccgatt tacaattttt cacgagaatt aatggaactt ctgttaatca aggtaatttc 1620
tcaaaaacga tggatagagg ggataaactg aaatctgaaa actttagaac tgccggattt 1680
agtactcctt ttagattttc aaattttcaa agtacattca cgttgggtac tcaggctttt 1740
tcaaatcagg aagtttatat agatagaatt gaatttgtcc cggcagaagt aacattcgag 1800
gcagaatctg atttagaaag agcacaaaag gcggtgaatg agctgtttac ttcttccaat 1860
caaatcgggt taaaaacaga tgtgacggat tatcatattg atcaagtatc caatttagtt 1920
gagtgtttat ctgatgaatt ttgtctggat gaaaaaaaag aattgtccga gaaagtcaaa 1980
catgcgaagc gacttagtga tgagcggaat ttacttcaag atccaaactt tagagggatc 2040
aatagacaac tagaccgtgg ctggagagga agtacggata ttaccatcca aggaggcgat 2100
gacgtattca aagagaatta cgttacgcta ttgggtacct ttgatgagtg ctatccaacg 2160
tatttatatc aaaaaataga tgagtcgaaa ttaaaagcct atacccgtta ccaattaaga 2220
gggtatatcg aagatagtca agacttagaa atctatttaa ttcgctacaa tgccaaacac 2280
gaaacagtaa atgtgccagg tacgggttcc ttatggccgc tttcagcccc aagtccaatc 2340
ggaaaatgtg cccatcattc ccatcatttc tccttggaca ttgatgttgg atgtacagac 2400
ttaaatgagg acttaggtgt atgggtgata ttcaagatta agacgcaaga tggccatgca 2460
agactaggaa atctagaatt tctcgaagag aaaccattag taggagaagc actagctcgt 2520
gtgaaaagag cggagaaaaa atggagagac aaacgtgaaa aattggaatg ggaaacaaat 2580
attgtttata aagaggcaaa agaatctgta gatgctttat ttgtaaactc tcaatatgat 2640
agattacaag cggataccaa catcgcgatg attcatgcgg cagataaacg cgttcatagc 2700
attcgagaag cttatctgcc tgagctgtct gtgattccgg gtgtcaatgc ggctattttt 2760
gaagaattag aagggcgtat tttcactgca ttctccctat atgatgcgag aaatgtcatt 2820
aaaaatggtg attttaataa tggcttatcc tgctggaacg tgaaagggca tgtagatgta 2880
gaagaacaaa acaaccaccg ttcggtcctt gttgttccgg aatgggaagc agaagtgtca 2940
caagaagttc gtgtctgtcc gggtcgtggc tatatccttc gtgtcacagc gtacaaggag 3000
ggatatggag aaggttgcgt aaccattcat gagatcgaga acaatacaga cgaactgaag 3060
tttagcaact gtgtagaaga ggaagtatat ccaaacaaca cggtaacgtg taatgattat 3120
actgcgactc aagaagaata tgagggtacg tacacttctc gtaatcgagg atatgacgga 3180
gcctatgaaa gcaattcttc tgtaccagct gattatgcat cagcctatga agaaaaagca 3240
tatacagatg gacgaagaga caatccttgt gaatctaaca gaggatatgg ggattacaca 3300
ccactaccag ctggctatgt gacaaaagaa ttagagtact tcccagaaac cgataaggta 3360
tggattgaga tcggagaaac ggaaggaaca ttcatcgtgg acagcgtgga attacttctt 3420
atggaggaat ag 3432
<210> 49
<211> 3432
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC869的合成核苷酸序列。
<400> 49
atggagataa acaaccagaa gcagtgcatt ccgtacaact gcctcagcaa cccggaggag 60
gtgctgctgg acggcgagcg tatcctccca gacatcgacc cactggaggt cagcctgagc 120
ctcctccagt tcctcctcaa taacttcgtg ccaggcggcg gcttcatctc cggcctggtg 180
gacaagatct ggggcgcact ccggccaagt gagtgggatc tgttcctggc ccaaatcgag 240
cgcctgatcg accagaggat cgaggcgacg gtccgcgcca aggcgataac cgagctggag 300
ggcctcggtc gcaactacca gatctacgca gaggcgttca aggagtggga gagcgacccg 360
gacaacgagg cggccaagtc tcgggtgatt gaccgcttcc gcatcctcga cggcctcatc 420
gaagccaaca tcccttcctt ccggatcata ggcttcgaag tcccgctcct cagcgtgtac 480
gtgcaagcgg ccaatctcca cctcgcgttg ctccgtgaca gcgtcatctt tggcgagaga 540
tggggcctga cgacgaagaa cgtgaacgac atctacaaca ggcagatccg agagattcac 600
gagtacagca accactgcgt ggacacatac aacacggagc tggagcggct cggcttccgc 660
tcaatcgctc agtggcggat ctacaaccag ttccgccgcg agctgaccct caccgtgctc 720
gacatcgtcg cattgtttcc caattacgac tcacgcctct acccaatcca gactttcagc 780
cagctcacac gcgagattgt gaccagcccg gtgtcagagt tctactacgg cgtcatcaac 840
tcaggcaaca tcatcgggac actgactgaa cagcagatca gacgtccgca cttgatggac 900
ttcttcaact ccatgattat gtacacatca gacaacagga gagagcacta ctggtccggg 960
ttggagatga ctgcttactt caccggcttc gccggtgccc aagtgagctt cccactggtc 1020
ggaactcgtg gcgagtcagc tcctccgcta actgtgcgat ctgtcaacga cgggatctac 1080
agaatactgt cggctccctt ctacagtgcg ccgttcctcg gcaccatcgt cctcggctca 1140
cgtggtgaga agttcgactt cgcactgaac aacattagcc cgccgcctag tacaatctac 1200
aggcaccctg gcaccgtgga ctcactggtt tcgatcccgc cacaagacaa cagtgtgccg 1260
ccacatcgtg gttctagcca caggctctcc catgtgacca tgcgcgcctc ttcaccgatc 1320
tttcactgga cccatcggtc cgctacaacc acaaacacca tcaaccctaa cgccatcatc 1380
caaatcccgc tggtgaaggc gtttaacctc cacagcggcg caactgtcgt gcgcggccct 1440
ggattcaccg gtggtgacct gctccgtcgg accaatactg gcacgttcgc agacatccga 1500
gtgaacgtcc cgtcctcgct gttcagtcag cgctaccgtg tccgcattcg gtacgcttcc 1560
accacggatc tccagttctt tactcgcatc aatgggacga gcgtcaacca gggcaacttc 1620
agcaagacga tggaccgtgg agataagctc aagtccgaga acttccgcac ggctggcttc 1680
tcgacaccgt tcagattcag caacttccag agcactttca cgctgggcac acaggcgttc 1740
tccaaccagg aggtgtacat cgaccgcatc gagttcgtgc ctgctgaggt taccttcgag 1800
gcggaaagcg acctcgaaag ggcccagaag gccgtcaacg agctgttcac ctccagcaac 1860
cagatcggtc tcaagaccga cgtcactgac tatcacattg accaagtcag caacctggtg 1920
gagtgcctca gtgatgagtt ctgcctggat gagaagaagg agcttagcga gaaggtcaag 1980
cacgcaaagc gcttgagcga cgagcgcaac cttctccagg acccgaattt ccgtggtatc 2040
aatagacagc ttgaccgtgg gtggcgcggt agtaccgaca taaccatcca gggtggcgac 2100
gatgtgttca aggagaatta tgttacgctg ctcggtacgt tcgacgagtg ctatcccacg 2160
tacttgtacc agaagattga cgagagcaag ctcaaggcgt acacccgtta ccagctccgt 2220
ggctacatcg aggacagcca ggatctggaa atctacctta tccgatacaa tgctaagcac 2280
gagacagtca acgtgcccgg aacagggtcg ctctggccgc tcagtgctcc gtcgcccatt 2340
ggcaagtgcg cgcaccattc gcatcacttc tcacttgaca ttgacgtggg ctgcaccgac 2400
ctgaacgagg atctgggtgt ctgggtcatc ttcaagatca agacccaaga cggccacgcg 2460
cgcctcggga acctggagtt cctggaggag aagcctttgg taggtgaagc cctggcccgc 2520
gtcaagcgcg cggagaagaa gtggcgcgac aagagggaga agctggaatg ggagaccaac 2580
atcgtgtaca aggaggcgaa ggagtcggtg gacgcactat tcgtgaactc ccagtacgac 2640
cgtctccagg ccgacaccaa catcgccatg atccacgccg ctgacaaacg agttcattcc 2700
attcgtgaag cctatcttcc cgagctgtct gtcataccgg gcgtcaacgc ggccatcttc 2760
gaggagttag agggtcggat ctttacagct ttctcactgt acgatgcccg caacgtcatc 2820
aagaacggcg acttcaacaa cggtctctcc tgttggaacg tgaagggcca cgtggatgtc 2880
gaggagcaga acaaccaccg ctctgtgctt gtggtgcccg agtgggaggc cgaggtgagc 2940
caggaggtcc gcgtctgtcc gggtcgcggc tacatcctgc gggtcaccgc ctacaaggag 3000
ggctacggcg aaggctgcgt tactattcac gagattgaga acaataccga cgaactcaag 3060
ttctccaact gtgtcgagga ggaggtgtac ccgaacaaca ccgtgacgtg caacgactac 3120
accgcgacac aggaggaata cgagggcacc tacaccagcc gcaaccgagg ctacgacgga 3180
gcgtacgaga gcaactcgtc cgtgcccgct gattacgcga gtgcgtacga ggagaaggct 3240
tacaccgacg gacggcgcga caatccctgc gagagtaacc gtggatacgg agattacacg 3300
ccgctacccg ctggctacgt cactaaggaa ctggagtact tcccagagac ggacaaggtg 3360
tggatcgaaa tcggcgagac agagggcacg ttcatcgtgg actccgtgga gctgctgctg 3420
atggaggagt ga 3432
<210> 50
<211> 1143
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质TIC869的氨基酸序列。
<400> 50
Met Glu Ile Asn Asn Gln Lys Gln Cys Ile Pro Tyr Asn Cys Leu Ser
1 5 10 15
Asn Pro Glu Glu Val Leu Leu Asp Gly Glu Arg Ile Leu Pro Asp Ile
20 25 30
Asp Pro Leu Glu Val Ser Leu Ser Leu Leu Gln Phe Leu Leu Asn Asn
35 40 45
Phe Val Pro Gly Gly Gly Phe Ile Ser Gly Leu Val Asp Lys Ile Trp
50 55 60
Gly Ala Leu Arg Pro Ser Glu Trp Asp Leu Phe Leu Ala Gln Ile Glu
65 70 75 80
Arg Leu Ile Asp Gln Arg Ile Glu Ala Thr Val Arg Ala Lys Ala Ile
85 90 95
Thr Glu Leu Glu Gly Leu Gly Arg Asn Tyr Gln Ile Tyr Ala Glu Ala
100 105 110
Phe Lys Glu Trp Glu Ser Asp Pro Asp Asn Glu Ala Ala Lys Ser Arg
115 120 125
Val Ile Asp Arg Phe Arg Ile Leu Asp Gly Leu Ile Glu Ala Asn Ile
130 135 140
Pro Ser Phe Arg Ile Ile Gly Phe Glu Val Pro Leu Leu Ser Val Tyr
145 150 155 160
Val Gln Ala Ala Asn Leu His Leu Ala Leu Leu Arg Asp Ser Val Ile
165 170 175
Phe Gly Glu Arg Trp Gly Leu Thr Thr Lys Asn Val Asn Asp Ile Tyr
180 185 190
Asn Arg Gln Ile Arg Glu Ile His Glu Tyr Ser Asn His Cys Val Asp
195 200 205
Thr Tyr Asn Thr Glu Leu Glu Arg Leu Gly Phe Arg Ser Ile Ala Gln
210 215 220
Trp Arg Ile Tyr Asn Gln Phe Arg Arg Glu Leu Thr Leu Thr Val Leu
225 230 235 240
Asp Ile Val Ala Leu Phe Pro Asn Tyr Asp Ser Arg Leu Tyr Pro Ile
245 250 255
Gln Thr Phe Ser Gln Leu Thr Arg Glu Ile Val Thr Ser Pro Val Ser
260 265 270
Glu Phe Tyr Tyr Gly Val Ile Asn Ser Gly Asn Ile Ile Gly Thr Leu
275 280 285
Thr Glu Gln Gln Ile Arg Arg Pro His Leu Met Asp Phe Phe Asn Ser
290 295 300
Met Ile Met Tyr Thr Ser Asp Asn Arg Arg Glu His Tyr Trp Ser Gly
305 310 315 320
Leu Glu Met Thr Ala Tyr Phe Thr Gly Phe Ala Gly Ala Gln Val Ser
325 330 335
Phe Pro Leu Val Gly Thr Arg Gly Glu Ser Ala Pro Pro Leu Thr Val
340 345 350
Arg Ser Val Asn Asp Gly Ile Tyr Arg Ile Leu Ser Ala Pro Phe Tyr
355 360 365
Ser Ala Pro Phe Leu Gly Thr Ile Val Leu Gly Ser Arg Gly Glu Lys
370 375 380
Phe Asp Phe Ala Leu Asn Asn Ile Ser Pro Pro Pro Ser Thr Ile Tyr
385 390 395 400
Arg His Pro Gly Thr Val Asp Ser Leu Val Ser Ile Pro Pro Gln Asp
405 410 415
Asn Ser Val Pro Pro His Arg Gly Ser Ser His Arg Leu Ser His Val
420 425 430
Thr Met Arg Ala Ser Ser Pro Ile Phe His Trp Thr His Arg Ser Ala
435 440 445
Thr Thr Thr Asn Thr Ile Asn Pro Asn Ala Ile Ile Gln Ile Pro Leu
450 455 460
Val Lys Ala Phe Asn Leu His Ser Gly Ala Thr Val Val Arg Gly Pro
465 470 475 480
Gly Phe Thr Gly Gly Asp Leu Leu Arg Arg Thr Asn Thr Gly Thr Phe
485 490 495
Ala Asp Ile Arg Val Asn Val Pro Ser Ser Leu Phe Ser Gln Arg Tyr
500 505 510
Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr Asp Leu Gln Phe Phe Thr
515 520 525
Arg Ile Asn Gly Thr Ser Val Asn Gln Gly Asn Phe Ser Lys Thr Met
530 535 540
Asp Arg Gly Asp Lys Leu Lys Ser Glu Asn Phe Arg Thr Ala Gly Phe
545 550 555 560
Ser Thr Pro Phe Arg Phe Ser Asn Phe Gln Ser Thr Phe Thr Leu Gly
565 570 575
Thr Gln Ala Phe Ser Asn Gln Glu Val Tyr Ile Asp Arg Ile Glu Phe
580 585 590
Val Pro Ala Glu Val Thr Phe Glu Ala Glu Ser Asp Leu Glu Arg Ala
595 600 605
Gln Lys Ala Val Asn Glu Leu Phe Thr Ser Ser Asn Gln Ile Gly Leu
610 615 620
Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val Ser Asn Leu Val
625 630 635 640
Glu Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Lys Glu Leu Ser
645 650 655
Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu
660 665 670
Gln Asp Pro Asn Phe Arg Gly Ile Asn Arg Gln Leu Asp Arg Gly Trp
675 680 685
Arg Gly Ser Thr Asp Ile Thr Ile Gln Gly Gly Asp Asp Val Phe Lys
690 695 700
Glu Asn Tyr Val Thr Leu Leu Gly Thr Phe Asp Glu Cys Tyr Pro Thr
705 710 715 720
Tyr Leu Tyr Gln Lys Ile Asp Glu Ser Lys Leu Lys Ala Tyr Thr Arg
725 730 735
Tyr Gln Leu Arg Gly Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile Tyr
740 745 750
Leu Ile Arg Tyr Asn Ala Lys His Glu Thr Val Asn Val Pro Gly Thr
755 760 765
Gly Ser Leu Trp Pro Leu Ser Ala Pro Ser Pro Ile Gly Lys Cys Ala
770 775 780
His His Ser His His Phe Ser Leu Asp Ile Asp Val Gly Cys Thr Asp
785 790 795 800
Leu Asn Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile Lys Thr Gln
805 810 815
Asp Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro
820 825 830
Leu Val Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp
835 840 845
Arg Asp Lys Arg Glu Lys Leu Glu Trp Glu Thr Asn Ile Val Tyr Lys
850 855 860
Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gln Tyr Asp
865 870 875 880
Arg Leu Gln Ala Asp Thr Asn Ile Ala Met Ile His Ala Ala Asp Lys
885 890 895
Arg Val His Ser Ile Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val Ile
900 905 910
Pro Gly Val Asn Ala Ala Ile Phe Glu Glu Leu Glu Gly Arg Ile Phe
915 920 925
Thr Ala Phe Ser Leu Tyr Asp Ala Arg Asn Val Ile Lys Asn Gly Asp
930 935 940
Phe Asn Asn Gly Leu Ser Cys Trp Asn Val Lys Gly His Val Asp Val
945 950 955 960
Glu Glu Gln Asn Asn His Arg Ser Val Leu Val Val Pro Glu Trp Glu
965 970 975
Ala Glu Val Ser Gln Glu Val Arg Val Cys Pro Gly Arg Gly Tyr Ile
980 985 990
Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr
995 1000 1005
Ile His Glu Ile Glu Asn Asn Thr Asp Glu Leu Lys Phe Ser Asn
1010 1015 1020
Cys Val Glu Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn
1025 1030 1035
Asp Tyr Thr Ala Thr Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser
1040 1045 1050
Arg Asn Arg Gly Tyr Asp Gly Ala Tyr Glu Ser Asn Ser Ser Val
1055 1060 1065
Pro Ala Asp Tyr Ala Ser Ala Tyr Glu Glu Lys Ala Tyr Thr Asp
1070 1075 1080
Gly Arg Arg Asp Asn Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp
1085 1090 1095
Tyr Thr Pro Leu Pro Ala Gly Tyr Val Thr Lys Glu Leu Glu Tyr
1100 1105 1110
Phe Pro Glu Thr Asp Lys Val Trp Ile Glu Ile Gly Glu Thr Glu
1115 1120 1125
Gly Thr Phe Ile Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1130 1135 1140
<210> 51
<211> 3513
<212> DNA
<213> 人工的
<220>
<223> 用于在细菌细胞中表达的编码TIC836的重组核苷酸序列。
<400> 51
atggagaata atattcaaaa tcaatgcgta ccttacaatt gtttaaataa tcctgaagta 60
gaaatattaa atgaagaaag aagtactggc agattaccgt tagatatatc cttatcgctt 120
acacgtttcc ttttgagtga atttgttcca ggtgtgggag ttgcgtttgg attatttgat 180
ttaatatggg gttttataac tccttctgat tggagcttat ttcttttaca gattgaacaa 240
ttgattgagc aaagaataga aacattggaa aggaaccggg caattactac attacgaggg 300
ttagcagata gctatgaaat ttatattgaa gcactaagag agtgggaagc aaatcctaat 360
aatgcacaat taagggaaga tgtgcgtatt cgatttgcta atacagacga cgctttaata 420
acagcaataa ataattttac acttacaagt tttgaaatcc ctcttttatc ggtctatgtt 480
caagcggcga atttacattt atcactatta agagacgctg tatcgtttgg gcagggttgg 540
ggactggata tagctactgt taataatcat tataatagat taataaatct tattcataga 600
tatacgaaac attgtttgga cacatacaat caaggattag aaaacttaag aggtactaat 660
actcgacaat gggcaagatt caatcagttt aggagagatt taacacttac tgtattagat 720
atcgttgctc tttttccgaa ctacgatgtt agaacatatc caattcaaac gtcatcccaa 780
ttaacaaggg aaatttatac aagttcagta attgaggatt ctccagtttc tgctaatata 840
cctaatggtt ttaatagggc ggaatttgga gttagaccgc cccatcttat ggactttatg 900
aattctttgt ttgtaactgc agagactgtt agaagtcaaa ctgtgtgggg aggacactta 960
gttagttcac gaaatacggc tggtaaccgt ataaatttcc ctagttacgg ggtcttcaat 1020
cctggtggcg ccatttggat tgcagatgag gatccacgtc ctttttatcg gacattatca 1080
gatcctgttt ttgtccgagg aggatttggg aatcctcatt atgtactggg gcttagggga 1140
gtagcatttc aacaaactgg tacgaaccac acccgaacat ttagaaatag tgggaccata 1200
gattctctag atgaaatccc acctcaggat aatagtgggg caccttggaa tgattatagt 1260
catgtattaa atcatgttac atttgtacga tggccaggtg agatttcagg aagtgattca 1320
tggagagctc caatgttttc ttggacgcac cgtagtgcaa cccctacaaa tacaattgat 1380
ccggagagga ttacacaaat acctttaaca aaatctacta atcttggctc tggaacttct 1440
gtcgttaaag gaccaggatt tacaggagga gatattcttc gaagaacttc acctggccag 1500
atttcaacct taagagtaaa tattactgca ccattatcac aaagatatcg ggtaagaatt 1560
cgctacgctt ctaccacaaa tttacaattc catacatcaa ttgacggaag acctattaat 1620
caggggaatt tttcagcaac tatgagtagt gggagtaatt tacagtccgg aagctttagg 1680
actgtaggtt ttactactcc gtttaacttt tcaaatggat caagtgtatt tacgttaagt 1740
gctcatgtct tcaattcagg caatgaagtt tatatagatc gaattgaatt tgttccggca 1800
gaagtaacct ttgaggcaga atatgattta gaaagagcgc agaaggcggt gaatgcgctg 1860
tttacgtcta caaaccaact agggctaaaa acaaatgtaa cggattatca tattgatcaa 1920
gtgtccaatt tagttacgta tttatcggat gaattttgtc tggatgaaaa gcgagaattg 1980
tccgagaaag tcaaacatgc gaagcgactc agtgatgaac gcaatttact ccaagattca 2040
aatttcaaag acattaatag gcaaccagaa cgtgggtggg gcggaagtac agggattacc 2100
atccaaggag gggatgacgt atttaaagaa aattacgtca cactatcagg tacctttgat 2160
gagtgctatc caacatattt gtatcaaaaa atcgatgaat caaaattaaa agcctttacc 2220
cgttatcaat taagagggta tatcgaagat agtcaagact tagaaatcta tttaattcgc 2280
tacaatgcaa aacatgaaac agtaaatgtg ccaggtacgg gttccttatg gccgctttca 2340
gcccaaagtc caatcggaaa gtgtggagag ccgaatcgat gcgcgccaca ccttgaatgg 2400
aatcctgact tagattgttc gtgtagggat ggagaaaagt gtgcccatca ttcgcatcat 2460
ttctccttag acattgatgt aggatgtaca gacttaaatg aggacctagg tgtatgggtg 2520
atctttaaga ttaagacgca agatgggcac gcaagactag ggaatctaga gtttctcgaa 2580
gaaaaaccat tagtaggaga agcgctagct cgtgtgaaaa gagcggagaa aaaatggaga 2640
gacaaacgtg aaaaattgga atgggaaaca aatatcgttt ataaagaggc aaaagaatct 2700
gtagatgctt tatttgtaaa ctctcaatat gatcaattac aagcggatac gaatattgcc 2760
atgattcatg cggcagataa acgtgttcat agcattcgag aagcttatct gcctgagctg 2820
tctgtgattc cgggtgtcaa tgcggctatt tttgaagaat tagaagggcg tattttcact 2880
gcattctccc tatatgatgc gagaaatgtc attaaaaatg gtgattttaa taatggctta 2940
tcctgctgga acgtgaaagg gcatgtagat gtagaagaac aaaacaacca acgttcggtc 3000
cttgttgttc cggaatggga agcagaagtg tcacaagaag ttcgtgtctg tccgggtcgt 3060
ggctatatcc ttcgtgtcac agcgtacaag gagggatatg gagaaggttg cgtaaccatt 3120
catgagatcg agaacaatac agacgaactg aagtttagca actgcgtaga ggaggaaatc 3180
tatccaaata acacggtaac gtgtaatgat tatactgtaa atcaagaaga atacggaggt 3240
gcgtacactt ctcgtaatcg aggatataac gaagctcctt ccgtaccagc tgattatgcg 3300
tcagtctatg aagaaaaatc gtatacagat ggacgtagag agaatccttg tgaatttaac 3360
agagggtata gggattacac gccactacca gttggttatg tgacaaaaga attagaatac 3420
ttcccagaaa ccgataaggt atggattgag attggagaaa cggaaggaac atttatcgtg 3480
gacagcgtgg aattactcct tatggaggaa taa 3513
<210> 52
<211> 3513
<212> DNA
<213> 人工的
<220>
<223> 设计用于在植物细胞中表达的编码TIC836的合成核苷酸序列。
<400> 52
atggagaaca acatccagaa ccagtgcgtg ccctacaact gcctgaacaa ccctgaggtt 60
gagatcctga acgaggagcg tagcaccggt aggctcccgc tagacatctc cctgagcctg 120
acccgcttcc tccttagtga gttcgtgccc ggcgtgggcg tggccttcgg cctcttcgac 180
ctcatctggg gcttcatcac tccttccgac tggtccctct tcctccttca gattgagcaa 240
ctgatcgagc agcgcatcga gacccttgag cgcaaccgcg ccatcaccac tctcagaggt 300
ctcgccgact cctacgaaat ctacatcgag gcactccgtg agtgggaggc caacccgaac 360
aatgcccagc tccgcgagga cgtgaggatc agattcgcca acaccgacga tgccctcatc 420
accgccatca acaatttcac cctcacctcc ttcgagatcc ctcttctgtc tgtgtacgtt 480
caagctgcta accttcacct ttccctcctg cgcgacgccg tgagcttcgg ccagggctgg 540
ggcctcgaca tcgccaccgt gaacaatcac tacaaccgcc tcatcaacct catccaccgc 600
tacaccaagc actgccttga cacctacaac cagggccttg agaacctccg tggcaccaac 660
acccgccagt gggcccgctt caaccagttc cgcagagacc tcaccctcac cgtgctcgac 720
atcgtggcac tcttcccaaa ctacgacgtg cgtacctacc ctatccagac ctccagccag 780
ctcaccaggg aaatctacac ctccagcgtg atcgaggact ctcctgtgtc cgccaacatc 840
cctaacggct tcaaccgcgc cgagttcggc gtgcgccctc ctcacctcat ggacttcatg 900
aactccctct tcgtcactgc cgagaccgtg cgctcccaga ccgtgtgggg cggtcacctc 960
gtgtccagcc gtaacaccgc tggcaacagg atcaacttcc cgtcctacgg cgtgttcaac 1020
ccaggcggtg ccatctggat cgccgatgaa gaccctcgtc ctttctaccg taccctgtcc 1080
gaccctgtgt tcgtgcgtgg cggtttcggc aaccctcact acgtgctggg cctgcgtggc 1140
gtggccttcc agcaaaccgg caccaaccac accaggacgt tccgtaactc cggcaccatc 1200
gacagtcttg acgagatccc tccgcaagac aactccggtg caccttggaa cgactactcc 1260
cacgtgctga accacgtgac cttcgtgagg tggcctggcg aaatctccgg ctccgactcc 1320
tggagggctc ctatgttcag ttggacccac aggagcgcta cgcctaccaa caccatcgac 1380
cctgagcgta tcactcagat ccctctgact aagagcacta acctgggcag cggcactagc 1440
gtggtcaagg gccctggctt cactggcggt gacatcctga ggcggactag ccctggccag 1500
atcagcactc tgagggtgaa catcactgct ccgctgagcc agcgttacag ggtcagaatc 1560
cgttacgctt ctactactaa ccttcagttc cacactagca tcgacggccg tccgatcaac 1620
cagggcaact tctctgctac tatgagttct ggcagtaacc tccagtctgg tagtttccgg 1680
actgtcggtt tcactacgcc gttcaacttc tccaacggta gttctgtctt cactctgtct 1740
gctcacgtgt tcaactctgg caacgaggtg tacatcgacc ggatcgagtt cgtccctgct 1800
gaggtgacgt tcgaggccga gtacgacctg gagcgggctc agaaggctgt caacgctctg 1860
ttcacttcta ctaaccagct tggtttgaag actaacgtga ccgactacca cattgatcaa 1920
gtcagtaacc tggtcacgta cctgtctgac gagttctgtc ttgacgagaa gcgggagctg 1980
tctgagaagg tcaagcacgc taagcggctg tctgacgagc ggaacctgct tcaagacagt 2040
aacttcaagg acattaaccg ccagcctgag cgtggttggg gagggtccac gggtattacg 2100
attcaaggag gtgacgatgt ctttaaggag aactatgtga cgctttcggg tacgtttgat 2160
gagtgctatc caacgtacct ttaccagaag attgacgagt cgaagctgaa ggctttcact 2220
cgttaccagc ttcgtggtta cattgaggac tcgcaagacc tcgaaatcta cctcattcgt 2280
tacaacgcta agcacgagac tgtcaacgtc cctggtacgg gtagtctttg gccgctttct 2340
gctcagtcgc cgattggcaa gtgtggcgag ccgaaccgtt gcgctcctca cttggagtgg 2400
aacccggatc tcgattgctc gtgccgtgac ggtgagaagt gcgcgcacca tagtcatcac 2460
tttagccttg acattgatgt cggttgcacg gatcttaacg aggatcttgg agtctgggtg 2520
attttcaaga tcaaaactca ggatgggcac gcgcgtcttg ggaatcttga gttcctggag 2580
gagaagccac ttgtcggtga ggcgcttgcg cgtgtcaagc gtgcggagaa gaaatggcgt 2640
gataagcgtg agaagttgga gtgggagacg aacatcgtgt acaaggaggc gaaggagtcg 2700
gtcgatgcgt tgtttgtcaa tagtcaatac gatcaattgc aagcggatac gaacatcgca 2760
atgattcatg cggcagataa gcgtgtccat tcgattcgtg aggcgtactt gccagagttg 2820
tcggtcatcc caggagttaa tgcggcaatc tttgaggaat tggagggcag aatcttcacg 2880
gcgttctcgt tgtacgatgc aagaaatgtt attaagaatg gagatttcaa caatgggttg 2940
tcatgctgga atgttaaggg tcacgttgat gttgaagaac agaacaacca gagatcagtg 3000
ttggttgtac cagagtggga ggcagaggtt tcacaagagg tgagagtttg cccaggcaga 3060
ggctacatct tgagagttac agcatacaaa gagggatacg gcgagggatg tgttacaatc 3120
cacgaaatcg agaacaatac cgatgagcta aagttctcaa attgtgttga ggaggagatc 3180
tacccgaaca acacggttac ttgtaatgat tacacagtga accaggagga gtatggtggt 3240
gcatacacat caagaaatag aggctacaat gaagcaccat cagttccagc agattatgcc 3300
tcagtttatg aggagaagtc atacacagat ggacgacgtg agaatccatg tgagttcaat 3360
cgaggatacc gagattacac accactacca gttggatacg ttacaaagga actagaatac 3420
ttcccagaaa cagataaagt atggatagag atcggagaaa cagaaggaac attcatcgtt 3480
gattcagtag aactactact tatggaagaa tga 3513
<210> 53
<211> 1170
<212> PRT
<213> 人工的
<220>
<223> 嵌合蛋白质TIC836的氨基酸序列。
<400> 53
Met Glu Asn Asn Ile Gln Asn Gln Cys Val Pro Tyr Asn Cys Leu Asn
1 5 10 15
Asn Pro Glu Val Glu Ile Leu Asn Glu Glu Arg Ser Thr Gly Arg Leu
20 25 30
Pro Leu Asp Ile Ser Leu Ser Leu Thr Arg Phe Leu Leu Ser Glu Phe
35 40 45
Val Pro Gly Val Gly Val Ala Phe Gly Leu Phe Asp Leu Ile Trp Gly
50 55 60
Phe Ile Thr Pro Ser Asp Trp Ser Leu Phe Leu Leu Gln Ile Glu Gln
65 70 75 80
Leu Ile Glu Gln Arg Ile Glu Thr Leu Glu Arg Asn Arg Ala Ile Thr
85 90 95
Thr Leu Arg Gly Leu Ala Asp Ser Tyr Glu Ile Tyr Ile Glu Ala Leu
100 105 110
Arg Glu Trp Glu Ala Asn Pro Asn Asn Ala Gln Leu Arg Glu Asp Val
115 120 125
Arg Ile Arg Phe Ala Asn Thr Asp Asp Ala Leu Ile Thr Ala Ile Asn
130 135 140
Asn Phe Thr Leu Thr Ser Phe Glu Ile Pro Leu Leu Ser Val Tyr Val
145 150 155 160
Gln Ala Ala Asn Leu His Leu Ser Leu Leu Arg Asp Ala Val Ser Phe
165 170 175
Gly Gln Gly Trp Gly Leu Asp Ile Ala Thr Val Asn Asn His Tyr Asn
180 185 190
Arg Leu Ile Asn Leu Ile His Arg Tyr Thr Lys His Cys Leu Asp Thr
195 200 205
Tyr Asn Gln Gly Leu Glu Asn Leu Arg Gly Thr Asn Thr Arg Gln Trp
210 215 220
Ala Arg Phe Asn Gln Phe Arg Arg Asp Leu Thr Leu Thr Val Leu Asp
225 230 235 240
Ile Val Ala Leu Phe Pro Asn Tyr Asp Val Arg Thr Tyr Pro Ile Gln
245 250 255
Thr Ser Ser Gln Leu Thr Arg Glu Ile Tyr Thr Ser Ser Val Ile Glu
260 265 270
Asp Ser Pro Val Ser Ala Asn Ile Pro Asn Gly Phe Asn Arg Ala Glu
275 280 285
Phe Gly Val Arg Pro Pro His Leu Met Asp Phe Met Asn Ser Leu Phe
290 295 300
Val Thr Ala Glu Thr Val Arg Ser Gln Thr Val Trp Gly Gly His Leu
305 310 315 320
Val Ser Ser Arg Asn Thr Ala Gly Asn Arg Ile Asn Phe Pro Ser Tyr
325 330 335
Gly Val Phe Asn Pro Gly Gly Ala Ile Trp Ile Ala Asp Glu Asp Pro
340 345 350
Arg Pro Phe Tyr Arg Thr Leu Ser Asp Pro Val Phe Val Arg Gly Gly
355 360 365
Phe Gly Asn Pro His Tyr Val Leu Gly Leu Arg Gly Val Ala Phe Gln
370 375 380
Gln Thr Gly Thr Asn His Thr Arg Thr Phe Arg Asn Ser Gly Thr Ile
385 390 395 400
Asp Ser Leu Asp Glu Ile Pro Pro Gln Asp Asn Ser Gly Ala Pro Trp
405 410 415
Asn Asp Tyr Ser His Val Leu Asn His Val Thr Phe Val Arg Trp Pro
420 425 430
Gly Glu Ile Ser Gly Ser Asp Ser Trp Arg Ala Pro Met Phe Ser Trp
435 440 445
Thr His Arg Ser Ala Thr Pro Thr Asn Thr Ile Asp Pro Glu Arg Ile
450 455 460
Thr Gln Ile Pro Leu Thr Lys Ser Thr Asn Leu Gly Ser Gly Thr Ser
465 470 475 480
Val Val Lys Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Thr
485 490 495
Ser Pro Gly Gln Ile Ser Thr Leu Arg Val Asn Ile Thr Ala Pro Leu
500 505 510
Ser Gln Arg Tyr Arg Val Arg Ile Arg Tyr Ala Ser Thr Thr Asn Leu
515 520 525
Gln Phe His Thr Ser Ile Asp Gly Arg Pro Ile Asn Gln Gly Asn Phe
530 535 540
Ser Ala Thr Met Ser Ser Gly Ser Asn Leu Gln Ser Gly Ser Phe Arg
545 550 555 560
Thr Val Gly Phe Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val
565 570 575
Phe Thr Leu Ser Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr Ile
580 585 590
Asp Arg Ile Glu Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr
595 600 605
Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Ala Leu Phe Thr Ser Thr
610 615 620
Asn Gln Leu Gly Leu Lys Thr Asn Val Thr Asp Tyr His Ile Asp Gln
625 630 635 640
Val Ser Asn Leu Val Thr Tyr Leu Ser Asp Glu Phe Cys Leu Asp Glu
645 650 655
Lys Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp
660 665 670
Glu Arg Asn Leu Leu Gln Asp Ser Asn Phe Lys Asp Ile Asn Arg Gln
675 680 685
Pro Glu Arg Gly Trp Gly Gly Ser Thr Gly Ile Thr Ile Gln Gly Gly
690 695 700
Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Ser Gly Thr Phe Asp
705 710 715 720
Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile Asp Glu Ser Lys Leu
725 730 735
Lys Ala Phe Thr Arg Tyr Gln Leu Arg Gly Tyr Ile Glu Asp Ser Gln
740 745 750
Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala Lys His Glu Thr Val
755 760 765
Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala Gln Ser Pro
770 775 780
Ile Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp
785 790 795 800
Asn Pro Asp Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His
805 810 815
His Ser His His Phe Ser Leu Asp Ile Asp Val Gly Cys Thr Asp Leu
820 825 830
Asn Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile Lys Thr Gln Asp
835 840 845
Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu
850 855 860
Val Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg
865 870 875 880
Asp Lys Arg Glu Lys Leu Glu Trp Glu Thr Asn Ile Val Tyr Lys Glu
885 890 895
Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gln Tyr Asp Gln
900 905 910
Leu Gln Ala Asp Thr Asn Ile Ala Met Ile His Ala Ala Asp Lys Arg
915 920 925
Val His Ser Ile Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val Ile Pro
930 935 940
Gly Val Asn Ala Ala Ile Phe Glu Glu Leu Glu Gly Arg Ile Phe Thr
945 950 955 960
Ala Phe Ser Leu Tyr Asp Ala Arg Asn Val Ile Lys Asn Gly Asp Phe
965 970 975
Asn Asn Gly Leu Ser Cys Trp Asn Val Lys Gly His Val Asp Val Glu
980 985 990
Glu Gln Asn Asn Gln Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala
995 1000 1005
Glu Val Ser Gln Glu Val Arg Val Cys Pro Gly Arg Gly Tyr Ile
1010 1015 1020
Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val
1025 1030 1035
Thr Ile His Glu Ile Glu Asn Asn Thr Asp Glu Leu Lys Phe Ser
1040 1045 1050
Asn Cys Val Glu Glu Glu Ile Tyr Pro Asn Asn Thr Val Thr Cys
1055 1060 1065
Asn Asp Tyr Thr Val Asn Gln Glu Glu Tyr Gly Gly Ala Tyr Thr
1070 1075 1080
Ser Arg Asn Arg Gly Tyr Asn Glu Ala Pro Ser Val Pro Ala Asp
1085 1090 1095
Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg Arg
1100 1105 1110
Glu Asn Pro Cys Glu Phe Asn Arg Gly Tyr Arg Asp Tyr Thr Pro
1115 1120 1125
Leu Pro Val Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro Glu
1130 1135 1140
Thr Asp Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Thr Phe
1145 1150 1155
Ile Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu
1160 1165 1170
Claims (17)
1.一种嵌合杀昆虫蛋白质,其氨基酸序列如SEQ ID NO:10所示。
2.一种编码嵌合杀昆虫蛋白质的多核苷酸,其中所述多核苷酸可操作地连接至异源启动子,并且所述嵌合杀昆虫蛋白质的氨基酸序列如SEQ ID NO:10所示。
3.一种编码嵌合杀昆虫蛋白质的多核苷酸,其中所述多核苷酸编码氨基酸序列如SEQID NO:10所示的嵌合杀昆虫蛋白质。
4.一种宿主细胞,其包含编码嵌合杀昆虫蛋白质的多核苷酸,其中所述嵌合杀昆虫蛋白质的氨基酸序列如SEQ ID NO:10所示或其中所述多核苷酸序列如SEQ ID NO:9所示,其中所述宿主细胞是选自由细菌宿主细胞和不可再生的植物宿主细胞组成的群组。
5.如权利要求4所述的宿主细胞,其中所述细菌宿主细胞是选自由以下各项组成的群组:土壤杆菌、根瘤菌、芽孢杆菌、短芽孢杆菌、埃希氏杆菌、假单胞菌、克雷伯氏杆菌和欧文氏菌。
6.如权利要求4所述的宿主细胞,其中所述宿主细胞是源自选自由单子叶植物和双子叶植物组成的植物群组的植物的不可再生的植物宿主细胞。
7.一种昆虫抑制性组合物,其包含氨基酸序列如SEQ ID NO:10所示的嵌合杀昆虫蛋白质。
8.如权利要求7所述的昆虫抑制性组合物,其还包含与所述嵌合杀昆虫蛋白质不同的至少一种昆虫抑制剂。
9.如权利要求8所述的昆虫抑制性组合物,其中所述至少一种昆虫抑制剂是选自由以下各项组成的群组:昆虫抑制性蛋白质、昆虫抑制性dsRNA分子和昆虫抑制性化学物质。
10.如权利要求8所述的昆虫抑制性组合物,其中所述至少一种昆虫抑制剂对鳞翅目、鞘翅目、半翅目、同翅目或缨翅目的一个或多个害虫物种表现出活性。
11.一种防治鳞翅目害虫的方法,所述方法包括使所述鳞翅目害虫与抑制量的如权利要求1所述的嵌合杀昆虫蛋白质接触。
12.一种不可再生的转基因植物细胞,其包含嵌合杀昆虫蛋白质,其中:
所述嵌合杀昆虫蛋白质的氨基酸序列如SEQ ID NO:10所示。
13.一种防治鳞翅目害虫的方法,其包括使所述害虫暴露于如权利要求12所述的不可再生的转基因植物细胞,其中所述不可再生的植物细胞表达鳞翅目抑制量的所述嵌合杀昆虫蛋白质。
14.一种来源于如权利要求12所述的不可再生的植物细胞的商品产品,其中所述产品包含可检测量的所述嵌合杀昆虫蛋白质。
15.如权利要求14所述的商品产品,其中所述产品是选自由以下各项组成的群组:油、膳食、动物饲料、面粉、薄片、糠、棉绒和外壳。
16.一种编码如权利要求1所述的嵌合杀昆虫蛋白质的重组多核苷酸分子,其包含编码嵌合杀昆虫蛋白质的多核苷酸,其中所述嵌合杀昆虫蛋白质的氨基酸序列如SEQ ID NO:10所示或其中所述多核苷酸序列如SEQ ID NO:9所示;和编码与所述嵌合杀昆虫蛋白质不同的昆虫抑制剂的多核苷酸序列。
17.一种重组核酸分子,其包含可操作地连接至编码嵌合杀昆虫蛋白质的多核苷酸片段的异源启动子,其中:
所述嵌合杀昆虫蛋白质的氨基酸序列如SEQ ID NO:10所示。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011054128.8A CN112175093B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462064989P | 2014-10-16 | 2014-10-16 | |
US62/064,989 | 2014-10-16 | ||
PCT/US2015/055800 WO2016061391A2 (en) | 2014-10-16 | 2015-10-15 | Novel chimeric insecticidal proteins toxic or inhibitory to lepidopteran pests |
CN202011054128.8A CN112175093B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
CN201580055840.0A CN107074974B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201580055840.0A Division CN107074974B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112175093A CN112175093A (zh) | 2021-01-05 |
CN112175093B true CN112175093B (zh) | 2024-08-27 |
Family
ID=54608929
Family Applications (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011055790.5A Active CN112175094B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
CN202011068643.1A Active CN112142857B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
CN201580055840.0A Active CN107074974B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
CN202011054128.8A Active CN112175093B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
CN202011069672.XA Active CN112142858B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
Family Applications Before (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011055790.5A Active CN112175094B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
CN202011068643.1A Active CN112142857B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
CN201580055840.0A Active CN107074974B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011069672.XA Active CN112142858B (zh) | 2014-10-16 | 2015-10-15 | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 |
Country Status (29)
Country | Link |
---|---|
US (7) | US10233217B2 (zh) |
EP (5) | EP3715362A1 (zh) |
JP (1) | JP6626102B2 (zh) |
KR (5) | KR102208978B1 (zh) |
CN (5) | CN112175094B (zh) |
AR (6) | AR103129A1 (zh) |
AU (6) | AU2015332384B2 (zh) |
BR (4) | BR122020004891B1 (zh) |
CA (3) | CA3151123A1 (zh) |
CL (5) | CL2017000895A1 (zh) |
CO (1) | CO2017004807A2 (zh) |
CR (3) | CR20210269A (zh) |
CU (5) | CU24571B1 (zh) |
EA (5) | EA201892763A1 (zh) |
EC (1) | ECSP17029551A (zh) |
ES (1) | ES2864657T3 (zh) |
IL (1) | IL251570B (zh) |
MX (5) | MX2017004919A (zh) |
MY (1) | MY181627A (zh) |
NI (1) | NI201700044A (zh) |
NZ (3) | NZ768153A (zh) |
PE (5) | PE20220940A1 (zh) |
PH (5) | PH12017500697A1 (zh) |
SG (5) | SG10201913859XA (zh) |
SV (1) | SV2017005422A (zh) |
UA (3) | UA123480C2 (zh) |
UY (1) | UY36360A (zh) |
WO (1) | WO2016061391A2 (zh) |
ZA (6) | ZA201702191B (zh) |
Families Citing this family (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
BR112016012643B1 (pt) * | 2013-12-09 | 2022-10-11 | BASF Agricultural Solutions Seed US LLC | Constructo, vetor, célula hospedeira bacteriana, polipeptídeo, composição, método de controle e exterminação de praga lepidóptera, método de proteção de planta e método de produção de polipeptídeo com atividade pesticida |
US10487123B2 (en) | 2014-10-16 | 2019-11-26 | Monsanto Technology Llc | Chimeric insecticidal proteins toxic or inhibitory to lepidopteran pests |
CU24571B1 (es) | 2014-10-16 | 2022-01-13 | Monsanto Technology Llc | Proteínas de bacillus thuringiensis quiméricas insecticidas tóxicas o inhibidoras de plagas de lepidópteros |
US20170226164A1 (en) | 2014-10-16 | 2017-08-10 | Pioneer Hi-Bred International, Inc. | Insecticidal polypeptides having improved activity spectrum and uses thereof |
CA2986265A1 (en) | 2015-06-16 | 2016-12-22 | Pioneer Hi-Bred International, Inc. | Compositions and methods to control insect pests |
WO2017030808A1 (en) | 2015-08-18 | 2017-02-23 | Monsanto Technology Llc | Novel insect inhibitory proteins |
MY189005A (en) | 2015-08-27 | 2022-01-18 | Monsanto Technology Llc | Novel insect inhibitory proteins |
US10572836B2 (en) | 2015-10-15 | 2020-02-25 | International Business Machines Corporation | Automatic time interval metadata determination for business intelligence and predictive analytics |
CA3022858A1 (en) | 2016-06-16 | 2017-12-21 | Pioneer Hi-Bred International, Inc. | Compositions and methods to control insect pests |
WO2018013333A1 (en) | 2016-07-12 | 2018-01-18 | Pioneer Hi-Bred International, Inc. | Compositions and methods to control insect pests |
US11016730B2 (en) | 2016-07-28 | 2021-05-25 | International Business Machines Corporation | Transforming a transactional data set to generate forecasting and prediction insights |
WO2018075350A1 (en) * | 2016-10-21 | 2018-04-26 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins from plants and methods for their use |
EP3550961A4 (en) * | 2016-12-12 | 2020-11-04 | Syngenta Participations AG | ENGINEERED PROTEIN PESTICIDES AND METHODS FOR CONTROLLING PLANT PESTS |
CN110022676B (zh) | 2017-01-04 | 2023-07-25 | 先正达参股股份有限公司 | 用于控制植物有害生物的组合物和方法 |
CR20190367A (es) * | 2017-01-12 | 2019-09-25 | Monsanto Technology Llc | Proteínas toxinas pesticidas activas contra insectos lepidópteros |
WO2019074598A1 (en) | 2017-10-13 | 2019-04-18 | Pioneer Hi-Bred International, Inc. | VIRUS-INDUCED GENETIC SILENCING TECHNOLOGY FOR THE CONTROL OF INSECTS IN MAIZE |
CN108148841B (zh) * | 2017-12-14 | 2020-12-29 | 云南大学 | 氨基酸序列在用于使昆虫Dip3蛋白失活中的应用 |
EP3728294A4 (en) | 2017-12-19 | 2021-12-29 | Pioneer Hi-Bred International, Inc. | Insecticidal polypeptides and uses thereof |
CN115850420A (zh) * | 2018-03-14 | 2023-03-28 | 先锋国际良种公司 | 来自植物的杀昆虫蛋白及其使用方法 |
CN111867377B (zh) * | 2018-03-14 | 2023-05-23 | 先锋国际良种公司 | 来自植物的杀昆虫蛋白及其使用方法 |
JP2021532744A (ja) * | 2018-07-30 | 2021-12-02 | モンサント テクノロジー エルエルシー | トウモロコシの遺伝子組み換え事象mon95379ならびにその検出及び使用方法 |
CN109198845A (zh) * | 2018-08-21 | 2019-01-15 | 广州杰赛科技股份有限公司 | 全自主甲面彩绘装置、方法、设备及存储介质 |
CA3106444A1 (en) | 2018-08-29 | 2020-03-05 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for conferring pesticidal activity to plants |
CN111100208A (zh) * | 2020-01-16 | 2020-05-05 | 黑龙江大鹏农业有限公司 | 一种人工合成的抗虫蛋白mCry1Ia2及其制备方法和应用 |
WO2022125639A1 (en) | 2020-12-08 | 2022-06-16 | Monsanto Technology Llc | Modified plant-associated bacteria and methods of their use |
CA3206159A1 (en) | 2020-12-21 | 2022-06-30 | Monsanto Technology Llc | Novel insect inhibitory proteins |
UY39585A (es) | 2020-12-23 | 2022-07-29 | Monsanto Technology Llc | Proteínas que exhiben actividad inhibidora de insectos frente a plagas con importancia agrícola de plantas de cultivo y semillas |
US11673922B2 (en) | 2020-12-31 | 2023-06-13 | Monsanto Technology Llc | Insect inhibitory proteins |
WO2022204464A1 (en) | 2021-03-26 | 2022-09-29 | Flagship Pioneering Innovations Vii, Llc | Production of circular polyribonucleotides in a eukaryotic system |
US20240263206A1 (en) | 2021-03-26 | 2024-08-08 | Flagship Pioneering Innovations Vii, Llc | Compositions and methods for producing circular polyribonucleotides |
WO2022204466A1 (en) | 2021-03-26 | 2022-09-29 | Flagship Pioneering Innovations Vii, Llc | Production of circular polyribonucleotides in a prokaryotic system |
MX2024000435A (es) | 2021-07-08 | 2024-01-29 | Monsanto Technology Llc | Proteinas inhibidoras de insectos novedosas. |
CN114134171B (zh) * | 2021-10-29 | 2023-09-15 | 隆平生物技术(海南)有限公司 | 一种抑制或杀灭东方黏虫的方法及其应用 |
EP4426842A1 (en) | 2021-11-01 | 2024-09-11 | Flagship Pioneering Innovations VII, LLC | Polynucleotides for modifying organisms |
MX2024009021A (es) | 2022-01-20 | 2024-08-06 | Flagship Pioneering Innovations Vii Llc | Polinucleotidos para modificar organismos. |
CN114507673A (zh) * | 2022-01-20 | 2022-05-17 | 隆平生物技术(海南)有限公司 | 一种抑制或杀灭小地老虎的方法及应用 |
CN116063431B (zh) * | 2022-09-19 | 2023-11-10 | 隆平生物技术(海南)有限公司 | 一种植物抗虫蛋白质及其应用 |
WO2024092330A1 (pt) * | 2022-11-04 | 2024-05-10 | Empresa Brasileira De Pesquisa Agropecuária - Embrapa | Proteínas inseticidas quiméricas truncadas |
CN117144054B (zh) * | 2023-10-27 | 2024-06-11 | 莱肯生物科技(海南)有限公司 | 一种核酸检测方法及其应用 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101686705A (zh) * | 2007-04-27 | 2010-03-31 | 孟山都技术有限责任公司 | 来自Bacillus thuringiensis的半翅目和鞘翅目活性的毒素蛋白 |
CN102596988A (zh) * | 2009-10-02 | 2012-07-18 | 先正达参股股份有限公司 | 杀虫蛋白 |
Family Cites Families (82)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE93542T1 (de) | 1984-12-28 | 1993-09-15 | Plant Genetic Systems Nv | Rekombinante dna, die in pflanzliche zellen eingebracht werden kann. |
EP0218571B1 (en) | 1985-08-07 | 1993-02-03 | Monsanto Company | Glyphosate-resistant plants |
US5312910A (en) | 1987-05-26 | 1994-05-17 | Monsanto Company | Glyphosate-tolerant 5-enolpyruvyl-3-phosphoshikimate synthase |
ES2164633T3 (es) | 1989-02-24 | 2002-03-01 | Monsanto Technology Llc | Genes vegetales sinteticos y procedimiento para su preparacion. |
US5633435A (en) | 1990-08-31 | 1997-05-27 | Monsanto Company | Glyphosate-tolerant 5-enolpyruvylshikimate-3-phosphate synthases |
FR2673643B1 (fr) | 1991-03-05 | 1993-05-21 | Rhone Poulenc Agrochimie | Peptide de transit pour l'insertion d'un gene etranger dans un gene vegetal et plantes transformees en utilisant ce peptide. |
US5723758A (en) * | 1991-09-13 | 1998-03-03 | Mycogen Corporation | Bacillus thuringiensis genes encoding lepidopteran-active toxins |
US5322687A (en) | 1993-07-29 | 1994-06-21 | Ecogen Inc. | Bacillus thuringiensis cryet4 and cryet5 toxin genes and proteins toxic to lepidopteran insects |
GB9318207D0 (en) | 1993-09-02 | 1993-10-20 | Sandoz Ltd | Improvements in or relating to organic compounds |
US5508264A (en) * | 1994-12-06 | 1996-04-16 | Mycogen Corporation | Pesticidal compositions |
US6063756A (en) * | 1996-09-24 | 2000-05-16 | Monsanto Company | Bacillus thuringiensis cryET33 and cryET34 compositions and uses therefor |
CA2272843C (en) * | 1996-11-20 | 2009-11-10 | Ecogen, Inc. | Broad-spectrum delta-endotoxins |
US6017534A (en) * | 1996-11-20 | 2000-01-25 | Ecogen, Inc. | Hybrid Bacillus thuringiensis δ-endotoxins with novel broad-spectrum insecticidal activity |
US6713063B1 (en) | 1996-11-20 | 2004-03-30 | Monsanto Technology, Llc | Broad-spectrum δ-endotoxins |
US5942664A (en) | 1996-11-27 | 1999-08-24 | Ecogen, Inc. | Bacillus thuringiensis Cry1C compositions toxic to lepidopteran insects and methods for making Cry1C mutants |
US6218188B1 (en) * | 1997-11-12 | 2001-04-17 | Mycogen Corporation | Plant-optimized genes encoding pesticidal toxins |
US6489542B1 (en) | 1998-11-04 | 2002-12-03 | Monsanto Technology Llc | Methods for transforming plants to express Cry2Ab δ-endotoxins targeted to the plastids |
US6283613B1 (en) | 1999-07-29 | 2001-09-04 | Cooper Technologies Company | LED traffic light with individual LED reflectors |
US6501009B1 (en) | 1999-08-19 | 2002-12-31 | Monsanto Technology Llc | Expression of Cry3B insecticidal protein in plants |
AU6702300A (en) * | 1999-08-19 | 2001-03-19 | Syngenta Participations Ag | Hybrid insecticidal toxins and nucleic acid sequences coding therefor |
WO2001019859A2 (en) * | 1999-09-15 | 2001-03-22 | Monsanto Technology Llc | LEPIDOPTERAN-ACTIVE BACILLUS THURINGIENSIS δ-ENDOTOXIN COMPOSITIONS AND METHODS OF USE |
AU2000269030A1 (en) * | 2000-08-11 | 2002-02-25 | Monsanto Technology Llc | Broad-spectrum delta-endotoxins |
JP2004506432A (ja) * | 2000-08-25 | 2004-03-04 | シンジェンタ・パティシペーションズ・アクチェンゲゼルシャフト | Bacillusthuringiensis殺虫性結晶タンパク質由来の新規殺虫性毒素 |
AR035799A1 (es) * | 2001-03-30 | 2004-07-14 | Syngenta Participations Ag | Toxinas insecticidas aisladas de bacillus thuringiensis y sus usos. |
CN101385467B (zh) * | 2001-03-30 | 2014-11-05 | 辛根塔参与股份公司 | 新的杀虫毒素 |
EP2213681A1 (en) * | 2002-03-22 | 2010-08-04 | Bayer BioScience N.V. | Novel Bacillus thuringiensis insecticidal proteins |
US20060112447A1 (en) | 2002-08-29 | 2006-05-25 | Bogdanova Natalia N | Nucleotide sequences encoding cry1bb proteins for enhanced expression in plants |
CA2562022C (en) | 2004-04-09 | 2016-01-26 | Monsanto Technology Llc | Compositions and methods for control of insect infestations in plants |
AR049214A1 (es) | 2004-06-09 | 2006-07-05 | Pioneer Hi Bred Int | Peptidos de transito a plastidos |
CA2601857A1 (en) | 2005-04-01 | 2006-10-12 | Nadine Carozzi | Axmi-027, axmi-036 and axmi-038, a family of delta-endotoxin genes and methods for their use |
NZ566028A (en) | 2005-08-31 | 2011-09-30 | Monsanto Technology Llc | Nucleotide sequences encoding insecticidal proteins |
US8148077B2 (en) | 2006-07-21 | 2012-04-03 | Pioneer Hi-Bred International, Inc. | Method for identifying novel genes |
US7521235B2 (en) | 2006-07-21 | 2009-04-21 | Pioneer Hi-Bred International, Inc. | Unique novel Bacillus thuringiensis gene with Lepidopteran activity |
CA2672036C (en) | 2006-12-08 | 2015-10-13 | Pioneer Hi-Bred International, Inc. | Novel bacillus thuringiensis crystal polypeptides, polynucleotides, and compositions thereof |
EP2126096B1 (en) | 2007-03-09 | 2013-12-18 | Monsanto Technology, LLC | Methods for plant transformation using spectinomycin selection |
ES2601577T3 (es) * | 2007-03-28 | 2017-02-15 | Syngenta Participations Ag | Proteínas insecticidas |
WO2008137985A2 (en) | 2007-05-08 | 2008-11-13 | Monsanto Technology Llc | Methods for inducing cotton embryogenic callus |
US7772465B2 (en) | 2007-06-26 | 2010-08-10 | Pioneer Hi-Bred International, Inc. | Bacillus thuringiensis gene with lepidopteran activity |
WO2009029852A2 (en) | 2007-08-31 | 2009-03-05 | Monsanto Technology Llc | Method and apparatus for substantially isolating plant tissues |
US8283524B2 (en) | 2008-05-15 | 2012-10-09 | Pioneer Hi-Bred International, Inc | Bacillus thuringiensis gene with lepidopteran activity |
US8129594B2 (en) * | 2008-06-11 | 2012-03-06 | Pioneer Hi-Bred International, Inc. | Bacillus thuringiensis gene with lepidopteran activity |
CN102076858B (zh) | 2008-06-11 | 2013-10-30 | 先锋国际良种公司 | 具有鳞翅目活性的新的苏云金芽孢杆菌基因 |
EA020327B1 (ru) | 2008-06-25 | 2014-10-30 | Атеникс Корпорейшн | Гены токсинов и способы их применения |
EA035103B1 (ru) | 2008-07-02 | 2020-04-28 | Атеникс Корпорейшн | Инсектицидный полипептид axmi-115 дельта-эндотоксина и его применение |
US8445749B2 (en) | 2008-09-19 | 2013-05-21 | Pioneer Hi Bred International Inc | Bacillus thuringiensis gene with lepidopteran activity |
US20100077507A1 (en) | 2008-09-22 | 2010-03-25 | Pioneer Hi-Bred International, Inc. | Novel Bacillus Thuringiensis Gene with Lepidopteran Activity |
EP2361307B1 (en) | 2008-12-22 | 2014-09-24 | Athenix Corporation | Pesticidal genes from Brevibacillus and methods for their use |
CA2747826A1 (en) | 2008-12-23 | 2010-07-01 | Athenix Corporation | Axmi-150 delta-endotoxin gene and methods for its use |
US8692065B2 (en) | 2009-01-23 | 2014-04-08 | Pioneer Hi Bred International Inc | Bacillus thuringiensis gene with lepidopteran activity |
AR075371A1 (es) | 2009-02-05 | 2011-03-30 | Athenix Corp | Genes de delta endotoxina variante axmi-r1 y metodos de uso de los mismos |
US8318900B2 (en) | 2009-02-27 | 2012-11-27 | Athenix Corp. | Pesticidal proteins and methods for their use |
WO2010141141A2 (en) | 2009-03-11 | 2010-12-09 | Athenix Corporation | Axmi-001, axmi-002, axmi-030, axmi-035, and axmi-045: toxin genes and methods for their use |
CN102459315B (zh) | 2009-04-17 | 2016-03-02 | 陶氏益农公司 | Dig-3杀虫cry毒素 |
JP2012529911A (ja) | 2009-06-16 | 2012-11-29 | ダウ アグロサイエンシィズ エルエルシー | Dig−10殺虫性cry毒素 |
WO2010147880A2 (en) | 2009-06-16 | 2010-12-23 | Dow Agrosciences Llc | Dig-11 insecticidal cry toxins |
AU2010260302A1 (en) | 2009-06-16 | 2011-12-15 | Dow Agrosciences Llc | DIG-5 insecticidal Cry toxins |
US8461415B2 (en) | 2009-07-31 | 2013-06-11 | Athenix Corp. | AXMI-192 family of pesticidal genes and methods for their use |
US8772577B2 (en) | 2009-11-12 | 2014-07-08 | Pioneer Hi Bred International Inc | Bacillus thuringiensis gene with lepidopteran activity |
EP2512222B1 (en) * | 2009-12-16 | 2018-01-24 | Dow AgroSciences LLC | COMBINED USE OF CRY1Ca AND CRY1Fa PROTEINS FOR INSECT RESISTANCE MANAGEMENT |
WO2011084324A2 (en) | 2009-12-21 | 2011-07-14 | Pioneer Hi-Bred International, Inc. | Novel bacillus thuringiensis gene with lepidopteran activity |
BR122019005253A8 (pt) | 2010-02-18 | 2022-07-05 | Athenix Corp | Molécula de ácido nucleico recombinante, vetor, célula hospedeira, polipeptídeo recombinante com atividade pesticida, composição, bem como métodos para o controle de uma população de pragas para matar uma praga, para a produção de um polipeptídeo com atividade pesticida, para a proteção de uma planta de uma praga, e para aumentar o rendimento em uma planta |
AU2011218130B2 (en) | 2010-02-18 | 2016-03-03 | Athenix Corp. | AXMI218, AXMI219, AXMI220, AXMI226, AXMI227, AXMI228, AXMI229, AXMI230, and AXMI231 delta-endotoxin genes and methods for their use |
WO2012024200A2 (en) | 2010-08-19 | 2012-02-23 | Pioneer Hi-Bred International, Inc. | Novel bacillus thuringiensis gene with lepidopteran activity |
BR112013015515A2 (pt) | 2010-12-28 | 2018-04-24 | Pioneer Hi Bred Int | molécula de ácido nucleico isolada, construto de dna, célula hospedeira, planta transgênica, semente transformada da planta, polipeptídeo isolado com atividade pesticida, composição, método para controlar uma população de praga de lepidóptero, método para matar uma praga de lepidóptero, método para produzir um polipeptídeo com atividade pesticida, planta que tem incorporado de maneira estável em seu genoma um construto de dna, método para proteger uma planta contra uma praga |
CN103328637B (zh) | 2011-01-24 | 2016-01-27 | 先锋国际良种公司 | 具有抗鳞翅目昆虫活性的新型苏云金芽孢杆菌基因 |
CA3048240C (en) | 2011-02-11 | 2021-02-23 | Monsanto Technology Llc | Pesticidal nucleic acids and proteins and uses thereof |
CA2826229A1 (en) | 2011-02-11 | 2012-08-16 | Pioneer Hi-Bred International, Inc. | Synthetic insecticidal proteins active against corn rootworm |
US8878007B2 (en) | 2011-03-10 | 2014-11-04 | Pioneer Hi Bred International Inc | Bacillus thuringiensis gene with lepidopteran activity |
US9321814B2 (en) | 2011-03-30 | 2016-04-26 | Athenix Corp. | AXMI238 toxin gene and methods for its use |
GB201105418D0 (en) | 2011-03-31 | 2011-05-18 | Univ Durham | Pesticide |
MX346662B (es) * | 2011-04-07 | 2017-03-27 | Monsanto Technology Llc | Familia de toxinas inhibidoras de insectos activas contra insectos hemipteros y lepidopteros. |
UY34227A (es) | 2011-07-28 | 2013-02-28 | Athenix Corp | Acido nucleico recombinante que codifica el gen de la toxina axmi270, vectores, células, plantas y sus métodos de empleo |
ES2647596T3 (es) | 2011-07-28 | 2017-12-22 | Athenix Corp. | Variantes de proteínas AXMI205 y métodos para su uso |
UA122657C2 (uk) | 2011-07-29 | 2020-12-28 | Атенікс Корп. | Ген пестициду axmi279 та спосіб його застосування |
CA2866241C (en) | 2012-03-08 | 2021-03-16 | Athenix Corp. | Axmi345 delta-endotoxin gene and methods for its use |
US9567381B2 (en) | 2012-03-09 | 2017-02-14 | Vestaron Corporation | Toxic peptide production, peptide expression in plants and combinations of cysteine rich peptides |
CA2868815C (en) | 2012-04-06 | 2017-10-24 | Monsanto Technology Llc | Proteins toxic to hemipteran insect species |
US9688730B2 (en) | 2012-07-02 | 2017-06-27 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
US9475847B2 (en) | 2012-07-26 | 2016-10-25 | Pioneer Hi-Bred International, Inc. | Insecticidal proteins and methods for their use |
WO2014055881A1 (en) * | 2012-10-05 | 2014-04-10 | Dow Agrosciences Llc | Use of cry1ea in combinations for management of resistant fall armyworm insects |
US10487123B2 (en) | 2014-10-16 | 2019-11-26 | Monsanto Technology Llc | Chimeric insecticidal proteins toxic or inhibitory to lepidopteran pests |
CU24571B1 (es) | 2014-10-16 | 2022-01-13 | Monsanto Technology Llc | Proteínas de bacillus thuringiensis quiméricas insecticidas tóxicas o inhibidoras de plagas de lepidópteros |
-
2015
- 2015-10-15 CU CU2018000053A patent/CU24571B1/es unknown
- 2015-10-15 CR CR20210269A patent/CR20210269A/es unknown
- 2015-10-15 EA EA201892763A patent/EA201892763A1/ru unknown
- 2015-10-15 EP EP20171024.1A patent/EP3715362A1/en active Pending
- 2015-10-15 CR CR20170198A patent/CR20170198A/es unknown
- 2015-10-15 CA CA3151123A patent/CA3151123A1/en active Pending
- 2015-10-15 WO PCT/US2015/055800 patent/WO2016061391A2/en active Application Filing
- 2015-10-15 EA EA201892762A patent/EA201892762A1/ru unknown
- 2015-10-15 AU AU2015332384A patent/AU2015332384B2/en active Active
- 2015-10-15 PE PE2021002059A patent/PE20220940A1/es unknown
- 2015-10-15 UA UAA201909674A patent/UA123480C2/uk unknown
- 2015-10-15 CU CU2018000052A patent/CU24570B1/es unknown
- 2015-10-15 SG SG10201913859XA patent/SG10201913859XA/en unknown
- 2015-10-15 CA CA3151125A patent/CA3151125A1/en active Pending
- 2015-10-15 CA CA2964776A patent/CA2964776A1/en active Pending
- 2015-10-15 PE PE2021002064A patent/PE20220372A1/es unknown
- 2015-10-15 CN CN202011055790.5A patent/CN112175094B/zh active Active
- 2015-10-15 JP JP2017520352A patent/JP6626102B2/ja active Active
- 2015-10-15 BR BR122020004891-3A patent/BR122020004891B1/pt active IP Right Grant
- 2015-10-15 CU CU2017000049A patent/CU24456B1/es unknown
- 2015-10-15 MY MYPI2017701293A patent/MY181627A/en unknown
- 2015-10-15 SG SG10201913849RA patent/SG10201913849RA/en unknown
- 2015-10-15 PE PE2021002063A patent/PE20220374A1/es unknown
- 2015-10-15 KR KR1020197037522A patent/KR102208978B1/ko active IP Right Grant
- 2015-10-15 BR BR122020004875-1A patent/BR122020004875B1/pt active IP Right Grant
- 2015-10-15 KR KR1020197037523A patent/KR102208980B1/ko active IP Right Grant
- 2015-10-15 EP EP15797725.7A patent/EP3207049B1/en active Active
- 2015-10-15 US US14/884,469 patent/US10233217B2/en active Active
- 2015-10-15 KR KR1020177013031A patent/KR102127553B1/ko active IP Right Grant
- 2015-10-15 SG SG11201702749RA patent/SG11201702749RA/en unknown
- 2015-10-15 CR CR20210268A patent/CR20210268A/es unknown
- 2015-10-15 PE PE2021002061A patent/PE20220375A1/es unknown
- 2015-10-15 SG SG10201913870RA patent/SG10201913870RA/en unknown
- 2015-10-15 NZ NZ768153A patent/NZ768153A/en unknown
- 2015-10-15 SG SG10201913879PA patent/SG10201913879PA/en unknown
- 2015-10-15 ES ES15797725T patent/ES2864657T3/es active Active
- 2015-10-15 EA EA201892760A patent/EA201892760A1/ru unknown
- 2015-10-15 CN CN202011068643.1A patent/CN112142857B/zh active Active
- 2015-10-15 KR KR1020197037525A patent/KR102208985B1/ko active IP Right Grant
- 2015-10-15 CU CU2018000051A patent/CU24541B1/es unknown
- 2015-10-15 CN CN201580055840.0A patent/CN107074974B/zh active Active
- 2015-10-15 EA EA201892761A patent/EA201892761A1/ru unknown
- 2015-10-15 MX MX2017004919A patent/MX2017004919A/es unknown
- 2015-10-15 CU CU2018000054A patent/CU24551B1/es unknown
- 2015-10-15 UA UAA201909675A patent/UA123481C2/uk unknown
- 2015-10-15 BR BR122020004897-2A patent/BR122020004897B1/pt active IP Right Grant
- 2015-10-15 BR BR112017007794-9A patent/BR112017007794B1/pt active IP Right Grant
- 2015-10-15 EP EP20171028.2A patent/EP3715364A1/en active Pending
- 2015-10-15 EP EP20171026.6A patent/EP3715363A1/en active Pending
- 2015-10-15 CN CN202011054128.8A patent/CN112175093B/zh active Active
- 2015-10-15 NZ NZ768151A patent/NZ768151A/en unknown
- 2015-10-15 KR KR1020197037524A patent/KR102208984B1/ko active IP Right Grant
- 2015-10-15 EP EP20171022.5A patent/EP3715361A1/en active Pending
- 2015-10-15 CN CN202011069672.XA patent/CN112142858B/zh active Active
- 2015-10-15 UA UAA201909676A patent/UA123482C2/uk unknown
- 2015-10-15 NZ NZ730747A patent/NZ730747A/en unknown
- 2015-10-15 PE PE2017000604A patent/PE20170895A1/es unknown
- 2015-10-15 EA EA201790843A patent/EA034918B1/ru unknown
- 2015-10-16 AR ARP150103361A patent/AR103129A1/es unknown
- 2015-10-16 UY UY0001036360A patent/UY36360A/es active IP Right Grant
-
2017
- 2017-03-29 ZA ZA2017/02191A patent/ZA201702191B/en unknown
- 2017-04-05 IL IL251570A patent/IL251570B/en active IP Right Grant
- 2017-04-07 NI NI201700044A patent/NI201700044A/es unknown
- 2017-04-07 SV SV2017005422A patent/SV2017005422A/es unknown
- 2017-04-11 CL CL2017000895A patent/CL2017000895A1/es unknown
- 2017-04-11 PH PH12017500697A patent/PH12017500697A1/en unknown
- 2017-04-12 MX MX2021009320A patent/MX2021009320A/es unknown
- 2017-04-12 MX MX2021009317A patent/MX2021009317A/es unknown
- 2017-04-12 MX MX2021009318A patent/MX2021009318A/es unknown
- 2017-04-12 MX MX2021009319A patent/MX2021009319A/es unknown
- 2017-05-12 CO CONC2017/0004807A patent/CO2017004807A2/es unknown
- 2017-05-12 EC ECIEPI201729551A patent/ECSP17029551A/es unknown
- 2017-12-20 US US15/849,218 patent/US10611806B2/en active Active
- 2017-12-20 US US15/848,852 patent/US10494409B2/en active Active
- 2017-12-20 US US15/849,012 patent/US10669317B2/en active Active
- 2017-12-20 US US15/848,837 patent/US10494408B2/en active Active
-
2019
- 2019-01-10 ZA ZA2019/00217A patent/ZA201900217B/en unknown
- 2019-01-14 CL CL2019000109A patent/CL2019000109A1/es unknown
- 2019-01-14 CL CL2019000110A patent/CL2019000110A1/es unknown
- 2019-01-15 CL CL2019000112A patent/CL2019000112A1/es unknown
- 2019-04-30 AU AU2019203015A patent/AU2019203015B2/en active Active
- 2019-04-30 AU AU2019203021A patent/AU2019203021B2/en active Active
- 2019-04-30 AU AU2019203014A patent/AU2019203014B2/en active Active
- 2019-04-30 AU AU2019203025A patent/AU2019203025B2/en active Active
- 2019-05-15 CL CL2019001328A patent/CL2019001328A1/es unknown
-
2020
- 2020-03-20 ZA ZA2020/01770A patent/ZA202001770B/en unknown
- 2020-03-20 ZA ZA2020/01768A patent/ZA202001768B/en unknown
- 2020-03-20 ZA ZA2020/01769A patent/ZA202001769B/en unknown
- 2020-03-20 ZA ZA2020/01771A patent/ZA202001771B/en unknown
- 2020-04-05 AU AU2020202394A patent/AU2020202394C1/en active Active
- 2020-05-08 AR ARP200101322A patent/AR118890A2/es unknown
- 2020-05-08 AR ARP200101325A patent/AR118893A2/es unknown
- 2020-05-08 AR ARP200101326A patent/AR118894A2/es unknown
- 2020-05-08 AR ARP200101323A patent/AR118891A2/es unknown
- 2020-05-08 AR ARP200101324A patent/AR118892A2/es unknown
- 2020-05-14 US US16/874,186 patent/US11267849B2/en active Active
-
2021
- 2021-03-04 PH PH12021500020A patent/PH12021500020A1/en unknown
- 2021-03-04 PH PH12021500022A patent/PH12021500022A1/en unknown
- 2021-03-04 PH PH12021500021A patent/PH12021500021A1/en unknown
- 2021-03-04 PH PH12021500019A patent/PH12021500019A1/en unknown
-
2022
- 2022-02-14 US US17/671,011 patent/US20220306703A1/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101686705A (zh) * | 2007-04-27 | 2010-03-31 | 孟山都技术有限责任公司 | 来自Bacillus thuringiensis的半翅目和鞘翅目活性的毒素蛋白 |
CN102596988A (zh) * | 2009-10-02 | 2012-07-18 | 先正达参股股份有限公司 | 杀虫蛋白 |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112175093B (zh) | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 | |
EP2142009B1 (en) | Hemipteran- and coleopteran- active toxin proteins from bacillus thuringiensis | |
EA020327B1 (ru) | Гены токсинов и способы их применения | |
CA2547933C (en) | Secreted insecticidal protein and gene compositions from bacillus thuringiensis and uses therefor | |
US20090068159A1 (en) | Insecticidal Compositions and Methods for Making Insect-Resistant Transgenic Plants | |
JP7297443B2 (ja) | 新規防虫タンパク質 | |
CN107109417B (zh) | 新型昆虫抑制性蛋白 | |
CN109952024B (zh) | 新型昆虫抑制蛋白 | |
JP2020503879A (ja) | Lepidopteran昆虫に対する活性を有する殺虫毒性タンパク質 | |
CN107849571B (zh) | 新型昆虫抑制蛋白 | |
CN110678067A (zh) | 新型昆虫抑制蛋白 | |
CA2972016A1 (en) | Modified cry1ca toxins useful for control of insect pests | |
TW201738380A (zh) | 用於植物蟲害管理的四種vip與cry蛋白質毒素之組合 | |
MXPA04009206A (es) | Nuevas proteinas insecticidas de bacillus thuringiensis. | |
RU2780626C2 (ru) | Пестицидные белковые токсины, активные в отношении чешуекрылых | |
RU2781075C2 (ru) | Новые белки, имеющие ингибирующее действие в отношении насекомых |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |