KR20170101247A - 살충 유전자 및 사용 방법 - Google Patents
살충 유전자 및 사용 방법 Download PDFInfo
- Publication number
- KR20170101247A KR20170101247A KR1020177020203A KR20177020203A KR20170101247A KR 20170101247 A KR20170101247 A KR 20170101247A KR 1020177020203 A KR1020177020203 A KR 1020177020203A KR 20177020203 A KR20177020203 A KR 20177020203A KR 20170101247 A KR20170101247 A KR 20170101247A
- Authority
- KR
- South Korea
- Prior art keywords
- ser
- asn
- leu
- ile
- thr
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 187
- 238000000034 method Methods 0.000 title claims abstract description 69
- 230000000361 pesticidal effect Effects 0.000 title claims description 7
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 144
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 125
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 125
- 229920001184 polypeptide Polymers 0.000 claims abstract description 124
- 241000238631 Hexapoda Species 0.000 claims abstract description 116
- 230000000749 insecticidal effect Effects 0.000 claims abstract description 110
- 241000607479 Yersinia pestis Species 0.000 claims abstract description 70
- 239000000203 mixture Substances 0.000 claims abstract description 62
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 51
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 48
- 239000002773 nucleotide Substances 0.000 claims abstract description 47
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 41
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 41
- 230000009261 transgenic effect Effects 0.000 claims abstract description 21
- 239000013598 vector Substances 0.000 claims abstract description 8
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 56
- 108020004414 DNA Proteins 0.000 claims description 32
- 241000254173 Coleoptera Species 0.000 claims description 29
- -1 dusts Substances 0.000 claims description 22
- 241000894006 Bacteria Species 0.000 claims description 19
- 230000001580 bacterial effect Effects 0.000 claims description 16
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 12
- 238000004519 manufacturing process Methods 0.000 claims description 12
- 230000001965 increasing effect Effects 0.000 claims description 10
- 239000008188 pellet Substances 0.000 claims description 6
- 239000000843 powder Substances 0.000 claims description 5
- 239000008187 granular material Substances 0.000 claims description 4
- 239000007921 spray Substances 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 3
- 239000000839 emulsion Substances 0.000 claims description 3
- 239000000243 solution Substances 0.000 claims description 3
- 239000000084 colloidal system Substances 0.000 claims description 2
- 108091033319 polynucleotide Proteins 0.000 abstract description 47
- 102000040430 polynucleotide Human genes 0.000 abstract description 47
- 239000002157 polynucleotide Substances 0.000 abstract description 47
- 230000009466 transformation Effects 0.000 abstract description 22
- 230000001976 improved effect Effects 0.000 abstract description 9
- 241000196324 Embryophyta Species 0.000 description 165
- 235000018102 proteins Nutrition 0.000 description 139
- 210000004027 cell Anatomy 0.000 description 89
- 108700012359 toxins Proteins 0.000 description 75
- 239000003053 toxin Substances 0.000 description 68
- 231100000765 toxin Toxicity 0.000 description 68
- 235000001014 amino acid Nutrition 0.000 description 33
- 239000012634 fragment Substances 0.000 description 32
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 31
- 229940024606 amino acid Drugs 0.000 description 27
- 150000001413 amino acids Chemical class 0.000 description 27
- 244000068988 Glycine max Species 0.000 description 24
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 23
- 235000010469 Glycine max Nutrition 0.000 description 22
- 241000880493 Leptailurus serval Species 0.000 description 22
- 240000008042 Zea mays Species 0.000 description 22
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 22
- 239000002609 medium Substances 0.000 description 21
- 108010061238 threonyl-glycine Proteins 0.000 description 21
- 108010054155 lysyllysine Proteins 0.000 description 19
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 18
- 235000012054 meals Nutrition 0.000 description 18
- 238000012360 testing method Methods 0.000 description 18
- 108010073969 valyllysine Proteins 0.000 description 18
- 108010047857 aspartylglycine Proteins 0.000 description 17
- 108010057821 leucylproline Proteins 0.000 description 17
- 108010077245 asparaginyl-proline Proteins 0.000 description 16
- 108010050848 glycylleucine Proteins 0.000 description 16
- 108010079364 N-glycylalanine Proteins 0.000 description 15
- 241000244206 Nematoda Species 0.000 description 15
- 108010044940 alanylglutamine Proteins 0.000 description 15
- 238000004166 bioassay Methods 0.000 description 15
- 230000000694 effects Effects 0.000 description 15
- 239000000523 sample Substances 0.000 description 14
- 108010051110 tyrosyl-lysine Proteins 0.000 description 14
- 108010013835 arginine glutamate Proteins 0.000 description 13
- 239000000463 material Substances 0.000 description 13
- 239000012528 membrane Substances 0.000 description 13
- 108010051242 phenylalanylserine Proteins 0.000 description 13
- 231100000419 toxicity Toxicity 0.000 description 13
- 230000001988 toxicity Effects 0.000 description 13
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 12
- PJWCWGXAVIVXQC-STECZYCISA-N Tyr-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PJWCWGXAVIVXQC-STECZYCISA-N 0.000 description 12
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 12
- 230000007613 environmental effect Effects 0.000 description 12
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 12
- 235000009973 maize Nutrition 0.000 description 12
- 108010031719 prolyl-serine Proteins 0.000 description 12
- 241000193388 Bacillus thuringiensis Species 0.000 description 11
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 11
- 108010089804 glycyl-threonine Proteins 0.000 description 11
- 108700003918 Bacillus Thuringiensis insecticidal crystal Proteins 0.000 description 10
- 108020004705 Codon Proteins 0.000 description 10
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 10
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 10
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 10
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 10
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 10
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 10
- 108010038633 aspartylglutamate Proteins 0.000 description 10
- 108010092854 aspartyllysine Proteins 0.000 description 10
- 238000003556 assay Methods 0.000 description 10
- 229940097012 bacillus thuringiensis Drugs 0.000 description 10
- 235000005822 corn Nutrition 0.000 description 10
- 230000005764 inhibitory process Effects 0.000 description 10
- 108010015796 prolylisoleucine Proteins 0.000 description 10
- 239000000126 substance Substances 0.000 description 10
- 241000589158 Agrobacterium Species 0.000 description 9
- 241000193830 Bacillus <bacterium> Species 0.000 description 9
- 241000233866 Fungi Species 0.000 description 9
- 108010060231 Insect Proteins Proteins 0.000 description 9
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 9
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 9
- 240000007594 Oryza sativa Species 0.000 description 9
- 235000007164 Oryza sativa Nutrition 0.000 description 9
- 235000021307 Triticum Nutrition 0.000 description 9
- 241000209140 Triticum Species 0.000 description 9
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 9
- 108010068380 arginylarginine Proteins 0.000 description 9
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 9
- 235000013339 cereals Nutrition 0.000 description 9
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 9
- 208000015181 infectious disease Diseases 0.000 description 9
- 108010017391 lysylvaline Proteins 0.000 description 9
- 239000011159 matrix material Substances 0.000 description 9
- 244000005700 microbiome Species 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 235000009566 rice Nutrition 0.000 description 9
- 108010026333 seryl-proline Proteins 0.000 description 9
- 108010020532 tyrosyl-proline Proteins 0.000 description 9
- 241000238876 Acari Species 0.000 description 8
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 8
- 229920000742 Cotton Polymers 0.000 description 8
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 8
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 8
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 8
- 241000219146 Gossypium Species 0.000 description 8
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 8
- 241000589989 Helicobacter Species 0.000 description 8
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 8
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 8
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 8
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 8
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 8
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 8
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 8
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 8
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 8
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 8
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 8
- 108010047495 alanylglycine Proteins 0.000 description 8
- 108010087924 alanylproline Proteins 0.000 description 8
- 108010093581 aspartyl-proline Proteins 0.000 description 8
- 238000009739 binding Methods 0.000 description 8
- 238000011161 development Methods 0.000 description 8
- 230000018109 developmental process Effects 0.000 description 8
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 8
- 108010081551 glycylphenylalanine Proteins 0.000 description 8
- 230000012010 growth Effects 0.000 description 8
- 108010000761 leucylarginine Proteins 0.000 description 8
- 230000001404 mediated effect Effects 0.000 description 8
- 241000894007 species Species 0.000 description 8
- 231100000331 toxic Toxicity 0.000 description 8
- 230000002588 toxic effect Effects 0.000 description 8
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 7
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 7
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 7
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 7
- 241000489947 Diabrotica virgifera virgifera Species 0.000 description 7
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 7
- 241000208818 Helianthus Species 0.000 description 7
- 235000003222 Helianthus annuus Nutrition 0.000 description 7
- 241000498254 Heterodera glycines Species 0.000 description 7
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 7
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 7
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 7
- 108010062796 arginyllysine Proteins 0.000 description 7
- 108010068265 aspartyltyrosine Proteins 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 238000012217 deletion Methods 0.000 description 7
- 230000037430 deletion Effects 0.000 description 7
- 210000002257 embryonic structure Anatomy 0.000 description 7
- 108010078144 glutaminyl-glycine Proteins 0.000 description 7
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 7
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 7
- 108010064235 lysylglycine Proteins 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 7
- 108010003137 tyrosyltyrosine Proteins 0.000 description 7
- PCDUALPXEOKZPE-DXCABUDRSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O PCDUALPXEOKZPE-DXCABUDRSA-N 0.000 description 6
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 6
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 6
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 6
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 6
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 6
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 6
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 6
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 6
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 6
- 241000194110 Bacillus sp. (in: Bacteria) Species 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 6
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 6
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 6
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 6
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 6
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 6
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 6
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 6
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 6
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 6
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 6
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 6
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 6
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 6
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 6
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 6
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 6
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 6
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 6
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 6
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 6
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 6
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 6
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 6
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 6
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 6
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 6
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 6
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 6
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 6
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 6
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 6
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 6
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 6
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 6
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 6
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 108010011559 alanylphenylalanine Proteins 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 238000009472 formulation Methods 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 6
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 108010077515 glycylproline Proteins 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 238000011081 inoculation Methods 0.000 description 6
- 239000002917 insecticide Substances 0.000 description 6
- 238000003780 insertion Methods 0.000 description 6
- 230000037431 insertion Effects 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 239000003550 marker Substances 0.000 description 6
- 239000000575 pesticide Substances 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 108010004914 prolylarginine Proteins 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 108010090894 prolylleucine Proteins 0.000 description 6
- 230000001105 regulatory effect Effects 0.000 description 6
- 108010071207 serylmethionine Proteins 0.000 description 6
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 241000254032 Acrididae Species 0.000 description 5
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 5
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 5
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 5
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 5
- 239000002028 Biomass Substances 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 5
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 5
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 5
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 5
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 5
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 5
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 5
- 241000258916 Leptinotarsa decemlineata Species 0.000 description 5
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 5
- 108010066427 N-valyltryptophan Proteins 0.000 description 5
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 5
- 241001147398 Ostrinia nubilalis Species 0.000 description 5
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 5
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 5
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 5
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 5
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 5
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 5
- 235000002595 Solanum tuberosum Nutrition 0.000 description 5
- 244000061456 Solanum tuberosum Species 0.000 description 5
- 241000256251 Spodoptera frugiperda Species 0.000 description 5
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 5
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 5
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 5
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 5
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 5
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 5
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 5
- 125000000539 amino acid group Chemical group 0.000 description 5
- 108010008355 arginyl-glutamine Proteins 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 5
- 239000013078 crystal Substances 0.000 description 5
- 239000000417 fungicide Substances 0.000 description 5
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 5
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 5
- 230000009467 reduction Effects 0.000 description 5
- 150000003839 salts Chemical class 0.000 description 5
- 238000002864 sequence alignment Methods 0.000 description 5
- 238000003860 storage Methods 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 239000004094 surface-active agent Substances 0.000 description 5
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 101150084623 vip gene Proteins 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 4
- BAAVRTJSLCSMNM-CMOCDZPBSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-4-carboxybutanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]pentanedioic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 BAAVRTJSLCSMNM-CMOCDZPBSA-N 0.000 description 4
- 241000218473 Agrotis Species 0.000 description 4
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 4
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 4
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 description 4
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 4
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 4
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 4
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 4
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 4
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 4
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 4
- 241000272814 Anser sp. Species 0.000 description 4
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 4
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 4
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 4
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 4
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 4
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 4
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 4
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 4
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 4
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 4
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 4
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 4
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 4
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 4
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 4
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 4
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 4
- KEUNWIXNKVWCFL-FXQIFTODSA-N Asn-Met-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O KEUNWIXNKVWCFL-FXQIFTODSA-N 0.000 description 4
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 4
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 4
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 4
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 4
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 4
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 4
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 4
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 4
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 4
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 4
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 4
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 4
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 4
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 4
- 241000223651 Aureobasidium Species 0.000 description 4
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 4
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 4
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 4
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 4
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 4
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 4
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 4
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 4
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 4
- CJWANNXUTOATSJ-DCAQKATOSA-N Glu-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N CJWANNXUTOATSJ-DCAQKATOSA-N 0.000 description 4
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 4
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 4
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 4
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 4
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 4
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 4
- JVZLZVJTIXVIHK-SXNHZJKMSA-N Glu-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N JVZLZVJTIXVIHK-SXNHZJKMSA-N 0.000 description 4
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 4
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 4
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 4
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 4
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 4
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 4
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 4
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 4
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 4
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 4
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 4
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 4
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 4
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 4
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 4
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 4
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 4
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 4
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 4
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 4
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 4
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 4
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 4
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 4
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 4
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 4
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 4
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 4
- 241000256602 Isoptera Species 0.000 description 4
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 4
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 4
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 4
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 4
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 4
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 4
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 4
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 4
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 4
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 4
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 4
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 4
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 4
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 4
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 4
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 4
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 4
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 4
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 4
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 4
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 4
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 4
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 4
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 4
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 4
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 4
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 4
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 4
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 4
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 4
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 4
- MVQGZYIOMXAFQG-GUBZILKMSA-N Met-Ala-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MVQGZYIOMXAFQG-GUBZILKMSA-N 0.000 description 4
- HGKJFNCLOHKEHS-FXQIFTODSA-N Met-Cys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(O)=O HGKJFNCLOHKEHS-FXQIFTODSA-N 0.000 description 4
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 4
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 4
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 4
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 4
- 244000061176 Nicotiana tabacum Species 0.000 description 4
- 101150014068 PPIP5K1 gene Proteins 0.000 description 4
- 108091005804 Peptidases Proteins 0.000 description 4
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 4
- 244000046052 Phaseolus vulgaris Species 0.000 description 4
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 4
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 4
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 4
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 4
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 4
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 4
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 4
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 4
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 4
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 4
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 4
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 4
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 4
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 4
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 4
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 4
- MCPXQHVVCPTRIM-HJOGWXRNSA-N Pro-Trp-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)[C@@H]1CCCN1 MCPXQHVVCPTRIM-HJOGWXRNSA-N 0.000 description 4
- 239000004365 Protease Substances 0.000 description 4
- IDCKUIWEIZYVSO-WFBYXXMGSA-N Ser-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C)C(O)=O)=CNC2=C1 IDCKUIWEIZYVSO-WFBYXXMGSA-N 0.000 description 4
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 4
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 4
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 4
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 4
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 4
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 4
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 4
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 4
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 4
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 4
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 4
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 4
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 4
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 4
- 244000062793 Sorghum vulgare Species 0.000 description 4
- 229930006000 Sucrose Natural products 0.000 description 4
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 4
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 4
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 4
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 4
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 4
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 4
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 4
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 4
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 4
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 4
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 4
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 4
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 4
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 4
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 4
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 4
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 4
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 4
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 4
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 4
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 4
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 4
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 4
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 4
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 4
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 4
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 4
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 4
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 4
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 4
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 4
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 4
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 4
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 4
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 4
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 4
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 4
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 4
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 4
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 4
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 4
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 4
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 4
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 4
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 4
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 4
- 230000009471 action Effects 0.000 description 4
- 239000004480 active ingredient Substances 0.000 description 4
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 239000003795 chemical substances by application Substances 0.000 description 4
- 101150086784 cry gene Proteins 0.000 description 4
- 230000006378 damage Effects 0.000 description 4
- 230000034994 death Effects 0.000 description 4
- 235000014113 dietary fatty acids Nutrition 0.000 description 4
- 108010009297 diglycyl-histidine Proteins 0.000 description 4
- 108010054813 diprotin B Proteins 0.000 description 4
- 235000021186 dishes Nutrition 0.000 description 4
- 239000000194 fatty acid Substances 0.000 description 4
- 229930195729 fatty acid Natural products 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 4
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 230000006698 induction Effects 0.000 description 4
- 210000001161 mammalian embryo Anatomy 0.000 description 4
- 229930182817 methionine Natural products 0.000 description 4
- 108010056582 methionylglutamic acid Proteins 0.000 description 4
- 108010034507 methionyltryptophan Proteins 0.000 description 4
- 244000052769 pathogen Species 0.000 description 4
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 4
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 4
- 108010018625 phenylalanylarginine Proteins 0.000 description 4
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 239000002689 soil Substances 0.000 description 4
- 239000005720 sucrose Substances 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- 108010032276 tyrosyl-glutamyl-tyrosyl-glutamic acid Proteins 0.000 description 4
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 3
- 102000007469 Actins Human genes 0.000 description 3
- 108010085238 Actins Proteins 0.000 description 3
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 3
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 3
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 3
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 3
- 235000017060 Arachis glabrata Nutrition 0.000 description 3
- 244000105624 Arachis hypogaea Species 0.000 description 3
- 235000010777 Arachis hypogaea Nutrition 0.000 description 3
- 235000018262 Arachis monticola Nutrition 0.000 description 3
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 3
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 3
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 3
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 3
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 3
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 3
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 3
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 3
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 3
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 3
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 3
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 3
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 3
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 3
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 3
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 3
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 3
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 3
- 241000589151 Azotobacter Species 0.000 description 3
- 241000929635 Blissus Species 0.000 description 3
- 240000002791 Brassica napus Species 0.000 description 3
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 3
- 241000254171 Curculionidae Species 0.000 description 3
- 241001634817 Cydia Species 0.000 description 3
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 3
- 241001609607 Delia platura Species 0.000 description 3
- 241000489975 Diabrotica Species 0.000 description 3
- 241000489976 Diabrotica undecimpunctata howardi Species 0.000 description 3
- 241000255925 Diptera Species 0.000 description 3
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 3
- 241000237858 Gastropoda Species 0.000 description 3
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 3
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 3
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 3
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 3
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 3
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 3
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 3
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 3
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 3
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 3
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 3
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 3
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 3
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 3
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 3
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 3
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 3
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 3
- 241000255967 Helicoverpa zea Species 0.000 description 3
- 235000007340 Hordeum vulgare Nutrition 0.000 description 3
- 240000005979 Hordeum vulgare Species 0.000 description 3
- 241000257303 Hymenoptera Species 0.000 description 3
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 3
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 3
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 3
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 3
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 3
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 3
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 3
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 3
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 3
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 3
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 3
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 3
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 3
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 3
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 3
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 3
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 3
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 3
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 3
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 3
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 3
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 3
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 3
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 3
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 3
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 3
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 3
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 3
- 241001502129 Mullus Species 0.000 description 3
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 3
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 3
- FINLZXKJWTYYLC-ACRUOGEOSA-N Phe-His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FINLZXKJWTYYLC-ACRUOGEOSA-N 0.000 description 3
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 3
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 3
- 241000517946 Phyllotreta nemorum Species 0.000 description 3
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 3
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 3
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 3
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 3
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 3
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 3
- 241000589516 Pseudomonas Species 0.000 description 3
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 3
- 241000589180 Rhizobium Species 0.000 description 3
- 241000167882 Rhopalosiphum maidis Species 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 240000000111 Saccharum officinarum Species 0.000 description 3
- 235000007201 Saccharum officinarum Nutrition 0.000 description 3
- 241000607142 Salmonella Species 0.000 description 3
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 3
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 3
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 3
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 3
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 3
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 3
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 3
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 3
- 241000258242 Siphonaptera Species 0.000 description 3
- 240000003768 Solanum lycopersicum Species 0.000 description 3
- 241000256247 Spodoptera exigua Species 0.000 description 3
- 241000222068 Sporobolomyces <Sporidiobolaceae> Species 0.000 description 3
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 3
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 3
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 3
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 3
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 3
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 3
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 3
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 3
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 3
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 3
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 3
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 150000001412 amines Chemical class 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 230000015556 catabolic process Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 239000007859 condensation product Substances 0.000 description 3
- 230000001276 controlling effect Effects 0.000 description 3
- 244000038559 crop plants Species 0.000 description 3
- 208000031513 cyst Diseases 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 108010054812 diprotin A Proteins 0.000 description 3
- 235000013601 eggs Nutrition 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 239000003337 fertilizer Substances 0.000 description 3
- 230000037406 food intake Effects 0.000 description 3
- 108020002326 glutamine synthetase Proteins 0.000 description 3
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 3
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 230000009036 growth inhibition Effects 0.000 description 3
- 239000004009 herbicide Substances 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 210000003000 inclusion body Anatomy 0.000 description 3
- 229910052500 inorganic mineral Inorganic materials 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 235000020232 peanut Nutrition 0.000 description 3
- 239000011148 porous material Substances 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 3
- 108020001580 protein domains Proteins 0.000 description 3
- 230000006798 recombination Effects 0.000 description 3
- 238000005215 recombination Methods 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 3
- 239000013605 shuttle vector Substances 0.000 description 3
- 239000000377 silicon dioxide Substances 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 210000002784 stomach Anatomy 0.000 description 3
- 230000001052 transient effect Effects 0.000 description 3
- 238000011282 treatment Methods 0.000 description 3
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 3
- 235000013311 vegetables Nutrition 0.000 description 3
- 238000004383 yellowing Methods 0.000 description 3
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- 241001143309 Acanthoscelides obtectus Species 0.000 description 2
- 241000589220 Acetobacter Species 0.000 description 2
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 2
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 2
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 2
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 2
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 2
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 2
- 108010076441 Ala-His-His Proteins 0.000 description 2
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 2
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 2
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 2
- RUQBGIMJQUWXPP-CYDGBPFRSA-N Ala-Leu-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O RUQBGIMJQUWXPP-CYDGBPFRSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 2
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 2
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- CUOMGDPDITUMIJ-HZZBMVKVSA-N Ala-Phe-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 CUOMGDPDITUMIJ-HZZBMVKVSA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 2
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 2
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 2
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 2
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 2
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 2
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 2
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 2
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 2
- 241000588986 Alcaligenes Species 0.000 description 2
- 244000291564 Allium cepa Species 0.000 description 2
- 235000002732 Allium cepa var. cepa Nutrition 0.000 description 2
- 241000272525 Anas platyrhynchos Species 0.000 description 2
- 241001124076 Aphididae Species 0.000 description 2
- 241001600408 Aphis gossypii Species 0.000 description 2
- 241000256844 Apis mellifera Species 0.000 description 2
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 2
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 2
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 2
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 2
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 2
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 2
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 2
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 2
- JQFJNGVSGOUQDH-XIRDDKMYSA-N Arg-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JQFJNGVSGOUQDH-XIRDDKMYSA-N 0.000 description 2
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 2
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 2
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 2
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 2
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 2
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 2
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 2
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 2
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 2
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 2
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 2
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 2
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 2
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 2
- 241000238421 Arthropoda Species 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 2
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 2
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 2
- CZIXHXIJJZLYRJ-SRVKXCTJSA-N Asn-Cys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CZIXHXIJJZLYRJ-SRVKXCTJSA-N 0.000 description 2
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 2
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 2
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 2
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 2
- IHUJUZBUOFTIOB-QEJZJMRPSA-N Asn-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N IHUJUZBUOFTIOB-QEJZJMRPSA-N 0.000 description 2
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 2
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 2
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 2
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 2
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 2
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 2
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 2
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 2
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 2
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 2
- XFJKRRCWLTZIQA-XIRDDKMYSA-N Asn-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N XFJKRRCWLTZIQA-XIRDDKMYSA-N 0.000 description 2
- ICDDSTLEMLGSTB-GUBZILKMSA-N Asn-Met-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ICDDSTLEMLGSTB-GUBZILKMSA-N 0.000 description 2
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 2
- WCRQQIPFSXFIRN-LPEHRKFASA-N Asn-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N WCRQQIPFSXFIRN-LPEHRKFASA-N 0.000 description 2
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 2
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 2
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 2
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 2
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 2
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 2
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 2
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 2
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 2
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 2
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 2
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 2
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 2
- YHXNKGKUDJCAHB-PBCZWWQYSA-N Asn-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YHXNKGKUDJCAHB-PBCZWWQYSA-N 0.000 description 2
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 2
- ULZOQOKFYMXHPZ-AQZXSJQPSA-N Asn-Trp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ULZOQOKFYMXHPZ-AQZXSJQPSA-N 0.000 description 2
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 2
- YNQMEIJEWSHOEO-SRVKXCTJSA-N Asn-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YNQMEIJEWSHOEO-SRVKXCTJSA-N 0.000 description 2
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 2
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 2
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 2
- CGYKCTPUGXFPMG-IHPCNDPISA-N Asn-Tyr-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CGYKCTPUGXFPMG-IHPCNDPISA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 2
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 2
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 2
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 2
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 2
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 2
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 2
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 2
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 2
- LXKLDWVHXNZQGB-SRVKXCTJSA-N Asp-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O LXKLDWVHXNZQGB-SRVKXCTJSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 2
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 2
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 2
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 2
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 2
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 2
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 2
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 2
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 2
- YTXCCDCOHIYQFC-GUBZILKMSA-N Asp-Met-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTXCCDCOHIYQFC-GUBZILKMSA-N 0.000 description 2
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 2
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 2
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 2
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 2
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 2
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 2
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- 241000271566 Aves Species 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 235000017166 Bambusa arundinacea Nutrition 0.000 description 2
- 235000017491 Bambusa tulda Nutrition 0.000 description 2
- 241000219310 Beta vulgaris subsp. vulgaris Species 0.000 description 2
- 102100021277 Beta-secretase 2 Human genes 0.000 description 2
- 101710150190 Beta-secretase 2 Proteins 0.000 description 2
- 241001629132 Blissus leucopterus Species 0.000 description 2
- 241000255789 Bombyx mori Species 0.000 description 2
- 241001444260 Brassicogethes aeneus Species 0.000 description 2
- 241001414201 Bruchus pisorum Species 0.000 description 2
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 description 2
- 244000025254 Cannabis sativa Species 0.000 description 2
- 235000002566 Capsicum Nutrition 0.000 description 2
- 241000877105 Chaetocnema confinis Species 0.000 description 2
- KZBUYRJDOAKODT-UHFFFAOYSA-N Chlorine Chemical compound ClCl KZBUYRJDOAKODT-UHFFFAOYSA-N 0.000 description 2
- 241001529599 Colaspis brunnea Species 0.000 description 2
- 241000870659 Crassula perfoliata var. minor Species 0.000 description 2
- 101150102464 Cry1 gene Proteins 0.000 description 2
- 241000195493 Cryptophyta Species 0.000 description 2
- 101710151559 Crystal protein Proteins 0.000 description 2
- 244000241257 Cucumis melo Species 0.000 description 2
- 235000009847 Cucumis melo var cantalupensis Nutrition 0.000 description 2
- 240000008067 Cucumis sativus Species 0.000 description 2
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 2
- 241001090151 Cyrtopeltis Species 0.000 description 2
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 2
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 2
- WZZGXXNRSZIQFC-VGDYDELISA-N Cys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N WZZGXXNRSZIQFC-VGDYDELISA-N 0.000 description 2
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 2
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 2
- IRKLTAKLAFUTLA-KATARQTJSA-N Cys-Thr-Lys Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCCCN)C(O)=O IRKLTAKLAFUTLA-KATARQTJSA-N 0.000 description 2
- IWVNIQXKTIQXCT-SRVKXCTJSA-N Cys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IWVNIQXKTIQXCT-SRVKXCTJSA-N 0.000 description 2
- LHRCZIRWNFRIRG-SRVKXCTJSA-N Cys-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O LHRCZIRWNFRIRG-SRVKXCTJSA-N 0.000 description 2
- ZAKOWWREFLAJOT-CEFNRUSXSA-N D-alpha-tocopherylacetate Chemical compound CC(=O)OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C ZAKOWWREFLAJOT-CEFNRUSXSA-N 0.000 description 2
- 241001585354 Delia coarctata Species 0.000 description 2
- 241000489972 Diabrotica barberi Species 0.000 description 2
- 241000462639 Epilachna varivestis Species 0.000 description 2
- 241001183323 Epitrix cucumeris Species 0.000 description 2
- 241000738498 Epitrix pubescens Species 0.000 description 2
- 241000588698 Erwinia Species 0.000 description 2
- IAYPIBMASNFSPL-UHFFFAOYSA-N Ethylene oxide Chemical compound C1CO1 IAYPIBMASNFSPL-UHFFFAOYSA-N 0.000 description 2
- 241000726221 Gemma Species 0.000 description 2
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 2
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 2
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 2
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 2
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 2
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 2
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 2
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 2
- IPHGBVYWRKCGKG-FXQIFTODSA-N Gln-Cys-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O IPHGBVYWRKCGKG-FXQIFTODSA-N 0.000 description 2
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 2
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 2
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 2
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 2
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 2
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 2
- GIVHPCWYVWUUSG-HVTMNAMFSA-N Gln-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GIVHPCWYVWUUSG-HVTMNAMFSA-N 0.000 description 2
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 2
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 2
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 2
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 2
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 2
- MSHXWFKYXJTLEZ-CIUDSAMLSA-N Gln-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MSHXWFKYXJTLEZ-CIUDSAMLSA-N 0.000 description 2
- AQPZYBSRDRZBAG-AVGNSLFASA-N Gln-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N AQPZYBSRDRZBAG-AVGNSLFASA-N 0.000 description 2
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 2
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 2
- YJSCHRBERYWPQL-DCAQKATOSA-N Gln-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N YJSCHRBERYWPQL-DCAQKATOSA-N 0.000 description 2
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 2
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 2
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 2
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 2
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 2
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 2
- XMWNHGKDDIFXQJ-NWLDYVSISA-N Gln-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XMWNHGKDDIFXQJ-NWLDYVSISA-N 0.000 description 2
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 2
- BETSEXMYBWCDAE-SZMVWBNQSA-N Gln-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BETSEXMYBWCDAE-SZMVWBNQSA-N 0.000 description 2
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 2
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 2
- 241001442498 Globodera Species 0.000 description 2
- 241000482313 Globodera ellingtonae Species 0.000 description 2
- 241001442497 Globodera rostochiensis Species 0.000 description 2
- 101710186901 Globulin 1 Proteins 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 2
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 2
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 2
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 2
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 2
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 2
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 2
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 2
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 2
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- ZTVGZOIBLRPQNR-KKUMJFAQSA-N Glu-Met-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZTVGZOIBLRPQNR-KKUMJFAQSA-N 0.000 description 2
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 2
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 2
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 2
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 2
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 2
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 2
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 2
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 2
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 2
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 2
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 2
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 2
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 2
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 2
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 2
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 2
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 2
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- YKJUITHASJAGHO-HOTGVXAUSA-N Gly-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN YKJUITHASJAGHO-HOTGVXAUSA-N 0.000 description 2
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 2
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 2
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 2
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 2
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 2
- NIOPEYHPOBWLQO-KBPBESRZSA-N Gly-Trp-Glu Chemical compound NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOPEYHPOBWLQO-KBPBESRZSA-N 0.000 description 2
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 2
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 2
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 2
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 241000825556 Halyomorpha halys Species 0.000 description 2
- 241001147381 Helicoverpa armigera Species 0.000 description 2
- 241000256257 Heliothis Species 0.000 description 2
- 241000256244 Heliothis virescens Species 0.000 description 2
- 241000258937 Hemiptera Species 0.000 description 2
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 2
- MJICNEVRDVQXJH-WDSOQIARSA-N His-Arg-Trp Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O MJICNEVRDVQXJH-WDSOQIARSA-N 0.000 description 2
- LYSVCKOXIDKEEL-SRVKXCTJSA-N His-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYSVCKOXIDKEEL-SRVKXCTJSA-N 0.000 description 2
- UOAVQQRILDGZEN-SRVKXCTJSA-N His-Asp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UOAVQQRILDGZEN-SRVKXCTJSA-N 0.000 description 2
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 2
- MWXBCJKQRQFVOO-DCAQKATOSA-N His-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N MWXBCJKQRQFVOO-DCAQKATOSA-N 0.000 description 2
- NNBWMLHQXBTIIT-HVTMNAMFSA-N His-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N NNBWMLHQXBTIIT-HVTMNAMFSA-N 0.000 description 2
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 2
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 2
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 2
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 2
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 2
- BKOVCRUIXDIWFV-IXOXFDKPSA-N His-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 BKOVCRUIXDIWFV-IXOXFDKPSA-N 0.000 description 2
- BCZFOHDMCDXPDA-BZSNNMDCSA-N His-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)O BCZFOHDMCDXPDA-BZSNNMDCSA-N 0.000 description 2
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 2
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 2
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 2
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 2
- 241000785682 Hispania Species 0.000 description 2
- 241001508566 Hypera postica Species 0.000 description 2
- 241001508564 Hypera punctata Species 0.000 description 2
- 206010020649 Hyperkeratosis Diseases 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 2
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 2
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 2
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 2
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 2
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 2
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 2
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 2
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 2
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 2
- QYOGJYIRKACXEP-SLBDDTMCSA-N Ile-Asn-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N QYOGJYIRKACXEP-SLBDDTMCSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 2
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 2
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 2
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 2
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 2
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 2
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 2
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 2
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 2
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 2
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 2
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 2
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 2
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 2
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 2
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 2
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 2
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 2
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 2
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- 108020005350 Initiator Codon Proteins 0.000 description 2
- 108700001097 Insect Genes Proteins 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- 244000017020 Ipomoea batatas Species 0.000 description 2
- 235000002678 Ipomoea batatas Nutrition 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- 108010025815 Kanamycin Kinase Proteins 0.000 description 2
- 241000235649 Kluyveromyces Species 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- 241001177117 Lasioderma serricorne Species 0.000 description 2
- 241000255777 Lepidoptera Species 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 2
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 2
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 2
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 2
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 2
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 2
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 2
- 241000254022 Locusta migratoria Species 0.000 description 2
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 2
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 2
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 2
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 2
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 2
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 2
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 2
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 2
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 2
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 2
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 2
- DAOSYIZXRCOKII-SRVKXCTJSA-N Lys-His-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O DAOSYIZXRCOKII-SRVKXCTJSA-N 0.000 description 2
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 2
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 2
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 2
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 2
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 2
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 2
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 2
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 2
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 2
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 2
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 2
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 2
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 2
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 2
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 2
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 2
- PSVAVKGDUAKZKU-BZSNNMDCSA-N Lys-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N)O PSVAVKGDUAKZKU-BZSNNMDCSA-N 0.000 description 2
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 2
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 2
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- 241000193386 Lysinibacillus sphaericus Species 0.000 description 2
- 101000739632 Lysinibacillus sphaericus Binary larvicide subunit BinB Proteins 0.000 description 2
- 241001130335 Maladera castanea Species 0.000 description 2
- 241000555303 Mamestra brassicae Species 0.000 description 2
- 241001422926 Mayetiola hordei Species 0.000 description 2
- 241000766511 Meligethes Species 0.000 description 2
- 241001143352 Meloidogyne Species 0.000 description 2
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 2
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 2
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 2
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 2
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 2
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 2
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 2
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 2
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 2
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 2
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 2
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 2
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 2
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 2
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 2
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 2
- AOFZWWDTTJLHOU-ULQDDVLXSA-N Met-Lys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AOFZWWDTTJLHOU-ULQDDVLXSA-N 0.000 description 2
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 2
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 2
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 2
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 2
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 2
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 2
- OOXVBECOTYHTCK-WDSOQIARSA-N Met-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCSC)N OOXVBECOTYHTCK-WDSOQIARSA-N 0.000 description 2
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 2
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 241000721621 Myzus persicae Species 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- SUZRRICLUFMAQD-UHFFFAOYSA-N N-Methyltaurine Chemical compound CNCCS(O)(=O)=O SUZRRICLUFMAQD-UHFFFAOYSA-N 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 241000346285 Ostrinia furnacalis Species 0.000 description 2
- 239000006002 Pepper Substances 0.000 description 2
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 2
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 2
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 2
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 2
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 2
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 2
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 2
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 2
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 2
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 2
- PHJUFDQVVKVOPU-ULQDDVLXSA-N Phe-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=CC=C1)N PHJUFDQVVKVOPU-ULQDDVLXSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- OAOLATANIHTNCZ-IHRRRGAJSA-N Phe-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N OAOLATANIHTNCZ-IHRRRGAJSA-N 0.000 description 2
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 2
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 2
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 2
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 2
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 2
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 2
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 2
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 2
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 2
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 2
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 2
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 2
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 2
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 2
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 2
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 235000015334 Phyllostachys viridis Nutrition 0.000 description 2
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 2
- 235000011613 Pinus brutia Nutrition 0.000 description 2
- 241000018646 Pinus brutia Species 0.000 description 2
- 235000016761 Piper aduncum Nutrition 0.000 description 2
- 240000003889 Piper guineense Species 0.000 description 2
- 235000017804 Piper guineense Nutrition 0.000 description 2
- 235000008184 Piper nigrum Nutrition 0.000 description 2
- 241000500437 Plutella xylostella Species 0.000 description 2
- 241000254101 Popillia japonica Species 0.000 description 2
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 2
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 2
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 2
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 2
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- LQZZPNDMYNZPFT-KKUMJFAQSA-N Pro-Gln-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LQZZPNDMYNZPFT-KKUMJFAQSA-N 0.000 description 2
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 2
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 2
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 2
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 2
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 2
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 2
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 2
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 2
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 2
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 2
- BVRBCQBUNGAWFP-KKUMJFAQSA-N Pro-Tyr-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O BVRBCQBUNGAWFP-KKUMJFAQSA-N 0.000 description 2
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 2
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 2
- NWUIBMXICBBZQQ-DWRORGKVSA-N Pro-Val-Asn-Phe Chemical compound N([C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 NWUIBMXICBBZQQ-DWRORGKVSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- 241000590524 Protaphis middletonii Species 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 241000721694 Pseudatomoscelis seriatus Species 0.000 description 2
- 241000223252 Rhodotorula Species 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- 241000130993 Scarabaeus <genus> Species 0.000 description 2
- 241000332477 Scutellonema bradys Species 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 2
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 2
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 2
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 2
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 2
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 2
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 2
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 2
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 2
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 2
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 2
- ZGFRMNZZTOVBOU-CIUDSAMLSA-N Ser-Met-Gln Chemical compound N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)O ZGFRMNZZTOVBOU-CIUDSAMLSA-N 0.000 description 2
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 2
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 2
- AXOHAHIUJHCLQR-IHRRRGAJSA-N Ser-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CO)N AXOHAHIUJHCLQR-IHRRRGAJSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 2
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 2
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 2
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 2
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 2
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 241000607720 Serratia Species 0.000 description 2
- 241001279786 Sipha flava Species 0.000 description 2
- 241000254152 Sitophilus oryzae Species 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 2
- 241000256248 Spodoptera Species 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 235000021536 Sugar beet Nutrition 0.000 description 2
- 241001454293 Tetranychus urticae Species 0.000 description 2
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 2
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 2
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 2
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 2
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 2
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 2
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 2
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 2
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 2
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 2
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 2
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 2
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 2
- DCRHJDRLCFMEBI-RHYQMDGZSA-N Thr-Lys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O DCRHJDRLCFMEBI-RHYQMDGZSA-N 0.000 description 2
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 2
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 2
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 2
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 2
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 2
- SOUPNXUJAJENFU-SWRJLBSHSA-N Thr-Trp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O SOUPNXUJAJENFU-SWRJLBSHSA-N 0.000 description 2
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 2
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 2
- VGNKUXWYFFDWDH-BEMMVCDISA-N Thr-Trp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N)O VGNKUXWYFFDWDH-BEMMVCDISA-N 0.000 description 2
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 2
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 2
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 2
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 2
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- 241001414989 Thysanoptera Species 0.000 description 2
- HOJPPPKZWFRTHJ-PJODQICGSA-N Trp-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HOJPPPKZWFRTHJ-PJODQICGSA-N 0.000 description 2
- AOAMKFFPFOPMLX-BVSLBCMMSA-N Trp-Arg-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AOAMKFFPFOPMLX-BVSLBCMMSA-N 0.000 description 2
- KZTLJLFVOIMRAQ-IHPCNDPISA-N Trp-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZTLJLFVOIMRAQ-IHPCNDPISA-N 0.000 description 2
- DVAAUUVLDFKTAQ-VHWLVUOQSA-N Trp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DVAAUUVLDFKTAQ-VHWLVUOQSA-N 0.000 description 2
- YVXIAOOYAKBAAI-SZMVWBNQSA-N Trp-Leu-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 YVXIAOOYAKBAAI-SZMVWBNQSA-N 0.000 description 2
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 2
- NFVQCNMGJILYMI-SZMVWBNQSA-N Trp-Met-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NFVQCNMGJILYMI-SZMVWBNQSA-N 0.000 description 2
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 2
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 2
- XKTWZYNTLXITCY-QRTARXTBSA-N Trp-Val-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 XKTWZYNTLXITCY-QRTARXTBSA-N 0.000 description 2
- BABINGWMZBWXIX-BPUTZDHNSA-N Trp-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BABINGWMZBWXIX-BPUTZDHNSA-N 0.000 description 2
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 2
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 2
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 2
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 2
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 2
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 2
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 2
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 2
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 2
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 2
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 2
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 2
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 2
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 2
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 2
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 2
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 2
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 2
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 2
- GFJXBLSZOFWHAW-JYJNAYRXSA-N Tyr-His-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GFJXBLSZOFWHAW-JYJNAYRXSA-N 0.000 description 2
- LFCQXIXJQXWZJI-BZSNNMDCSA-N Tyr-His-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O LFCQXIXJQXWZJI-BZSNNMDCSA-N 0.000 description 2
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 2
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 2
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 2
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 2
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- DAOREBHZAKCOEN-ULQDDVLXSA-N Tyr-Leu-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O DAOREBHZAKCOEN-ULQDDVLXSA-N 0.000 description 2
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 2
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 2
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 2
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 2
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 2
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 2
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 2
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 2
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 2
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 2
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 2
- PYJKETPLFITNKS-IHRRRGAJSA-N Tyr-Pro-Asn Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O PYJKETPLFITNKS-IHRRRGAJSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 2
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 2
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 2
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 2
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 2
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- LVILBTSHPTWDGE-PMVMPFDFSA-N Tyr-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CC=C(O)C=C1 LVILBTSHPTWDGE-PMVMPFDFSA-N 0.000 description 2
- JRMCISZDVLOTLR-BVSLBCMMSA-N Tyr-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N JRMCISZDVLOTLR-BVSLBCMMSA-N 0.000 description 2
- AFWXOGHZEKARFH-ACRUOGEOSA-N Tyr-Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 AFWXOGHZEKARFH-ACRUOGEOSA-N 0.000 description 2
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 2
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- WKWJJQZZZBBWKV-JYJNAYRXSA-N Val-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WKWJJQZZZBBWKV-JYJNAYRXSA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 2
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 2
- DHINLYMWMXQGMQ-IHRRRGAJSA-N Val-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 DHINLYMWMXQGMQ-IHRRRGAJSA-N 0.000 description 2
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 2
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 2
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 2
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 2
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 241000314934 Zygogramma exclamationis Species 0.000 description 2
- 239000000443 aerosol Substances 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 235000021405 artificial diet Nutrition 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 239000011230 binding agent Substances 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 150000007942 carboxylates Chemical class 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 239000012707 chemical precursor Substances 0.000 description 2
- 239000011248 coating agent Substances 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 239000012141 concentrate Substances 0.000 description 2
- 239000007799 cork Substances 0.000 description 2
- 101150060198 cyt gene Proteins 0.000 description 2
- 230000001461 cytolytic effect Effects 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 239000003599 detergent Substances 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 210000005069 ears Anatomy 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 150000002191 fatty alcohols Chemical class 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 210000002816 gill Anatomy 0.000 description 2
- 102000005396 glutamine synthetase Human genes 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 239000010903 husk Substances 0.000 description 2
- 108010002685 hygromycin-B kinase Proteins 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 230000010534 mechanism of action Effects 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 235000019713 millet Nutrition 0.000 description 2
- 239000011707 mineral Substances 0.000 description 2
- 230000004899 motility Effects 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 210000003739 neck Anatomy 0.000 description 2
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 2
- 230000003204 osmotic effect Effects 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 210000002706 plastid Anatomy 0.000 description 2
- 229920000768 polyamine Polymers 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 231100000654 protein toxin Toxicity 0.000 description 2
- 210000001938 protoplast Anatomy 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 239000012882 rooting medium Substances 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000007226 seed germination Effects 0.000 description 2
- QFJCIRLUMZQUOT-HPLJOQBZSA-N sirolimus Chemical compound C1C[C@@H](O)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 QFJCIRLUMZQUOT-HPLJOQBZSA-N 0.000 description 2
- 230000000392 somatic effect Effects 0.000 description 2
- BDHFUVZGWQCTTF-UHFFFAOYSA-M sulfonate Chemical compound [O-]S(=O)=O BDHFUVZGWQCTTF-UHFFFAOYSA-M 0.000 description 2
- 235000020238 sunflower seed Nutrition 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 230000002195 synergetic effect Effects 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 238000011426 transformation method Methods 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 108010071141 valyl-valyl-tyrosyl-prolyl-aspartic acid Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- SQAINHDHICKHLX-UHFFFAOYSA-N 1-naphthaldehyde Chemical class C1=CC=C2C(C=O)=CC=CC2=C1 SQAINHDHICKHLX-UHFFFAOYSA-N 0.000 description 1
- OVSKIKFHRZPJSS-UHFFFAOYSA-N 2,4-D Chemical compound OC(=O)COC1=CC=C(Cl)C=C1Cl OVSKIKFHRZPJSS-UHFFFAOYSA-N 0.000 description 1
- 239000005631 2,4-Dichlorophenoxyacetic acid Substances 0.000 description 1
- 229940087195 2,4-dichlorophenoxyacetate Drugs 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- 101710106459 29 kDa protein Proteins 0.000 description 1
- CAAMSDWKXXPUJR-UHFFFAOYSA-N 3,5-dihydro-4H-imidazol-4-one Chemical compound O=C1CNC=N1 CAAMSDWKXXPUJR-UHFFFAOYSA-N 0.000 description 1
- 101710093560 34 kDa protein Proteins 0.000 description 1
- 101710134681 40 kDa protein Proteins 0.000 description 1
- XDRVGXCIPIURSL-UHFFFAOYSA-N 5,8-diethyl-3,10-dimethyldodec-6-yne-5,8-diol Chemical compound CCC(C)CC(O)(CC)C#CC(O)(CC)CC(C)CC XDRVGXCIPIURSL-UHFFFAOYSA-N 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- 241001558877 Aceria tulipae Species 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 244000235858 Acetobacter xylinum Species 0.000 description 1
- 235000002837 Acetobacter xylinum Nutrition 0.000 description 1
- 241000495828 Acleris gloverana Species 0.000 description 1
- 241000834107 Acleris variana Species 0.000 description 1
- 241001014341 Acrosternum hilare Species 0.000 description 1
- 241000693815 Adelphocoris rapidus Species 0.000 description 1
- 241000175828 Adoxophyes orana Species 0.000 description 1
- 241000673185 Aeolus Species 0.000 description 1
- 241000607534 Aeromonas Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 241000993143 Agromyza Species 0.000 description 1
- 241000566547 Agrotis ipsilon Species 0.000 description 1
- 241000001996 Agrotis orthogonia Species 0.000 description 1
- 241001652650 Agrotis subterranea Species 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- SIGTYDNEPYEXGK-ZANVPECISA-N Ala-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 SIGTYDNEPYEXGK-ZANVPECISA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- 241000449794 Alabama argillacea Species 0.000 description 1
- 241001116389 Aloe Species 0.000 description 1
- 241001367806 Alsophila pometaria Species 0.000 description 1
- 239000005995 Aluminium silicate Substances 0.000 description 1
- 241001259789 Amyelois transitella Species 0.000 description 1
- 244000144725 Amygdalus communis Species 0.000 description 1
- 244000226021 Anacardium occidentale Species 0.000 description 1
- 244000099147 Ananas comosus Species 0.000 description 1
- 235000007119 Ananas comosus Nutrition 0.000 description 1
- 241000318389 Anaphothrips Species 0.000 description 1
- 241001198505 Anarsia lineatella Species 0.000 description 1
- 241000153204 Anisota senatoria Species 0.000 description 1
- 241000519878 Anomala rufocuprea Species 0.000 description 1
- 241001427556 Anoplura Species 0.000 description 1
- 241000255978 Antheraea pernyi Species 0.000 description 1
- 241000254175 Anthonomus grandis Species 0.000 description 1
- 241000360281 Anthonomus quadrigibbus Species 0.000 description 1
- 241000625764 Anticarsia gemmatalis Species 0.000 description 1
- 241001332254 Araecerus fasciculatus Species 0.000 description 1
- 241001002469 Archips Species 0.000 description 1
- 241000534456 Arenaria <Aves> Species 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- JKRPBTQDPJSQIT-RCWTZXSCSA-N Arg-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O JKRPBTQDPJSQIT-RCWTZXSCSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- 241001554566 Argyria Species 0.000 description 1
- 208000002109 Argyria Diseases 0.000 description 1
- 241000384127 Argyrotaenia Species 0.000 description 1
- 241000186063 Arthrobacter Species 0.000 description 1
- 241000235349 Ascomycota Species 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- ALKWEXBKAHPJAQ-NAKRPEOUSA-N Asn-Leu-Asp-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ALKWEXBKAHPJAQ-NAKRPEOUSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- QTKYFZCMSQLYHI-UBHSHLNASA-N Asn-Trp-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O QTKYFZCMSQLYHI-UBHSHLNASA-N 0.000 description 1
- MLJZMGIXXMTEPO-UBHSHLNASA-N Asn-Trp-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O MLJZMGIXXMTEPO-UBHSHLNASA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- 235000007319 Avena orientalis Nutrition 0.000 description 1
- 244000075850 Avena orientalis Species 0.000 description 1
- 241000589154 Azotobacter group Species 0.000 description 1
- 241001112741 Bacillaceae Species 0.000 description 1
- 108020000946 Bacterial DNA Proteins 0.000 description 1
- 108700003860 Bacterial Genes Proteins 0.000 description 1
- 241000209128 Bambusa Species 0.000 description 1
- 241000221198 Basidiomycota Species 0.000 description 1
- 235000016068 Berberis vulgaris Nutrition 0.000 description 1
- 240000000724 Berberis vulgaris Species 0.000 description 1
- 101000679617 Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251) Pertussis toxin subunit 1 Proteins 0.000 description 1
- 241000131971 Bradyrhizobiaceae Species 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 description 1
- 240000003259 Brassica oleracea var. botrytis Species 0.000 description 1
- 235000010149 Brassica rapa subsp chinensis Nutrition 0.000 description 1
- 235000000536 Brassica rapa subsp pekinensis Nutrition 0.000 description 1
- 241000499436 Brassica rapa subsp. pekinensis Species 0.000 description 1
- 241000982105 Brevicoryne brassicae Species 0.000 description 1
- 235000004936 Bromus mango Nutrition 0.000 description 1
- 241001325378 Bruchus Species 0.000 description 1
- 241001388466 Bruchus rufimanus Species 0.000 description 1
- 241001517925 Bucculatrix Species 0.000 description 1
- 101100394003 Butyrivibrio fibrisolvens end1 gene Proteins 0.000 description 1
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 1
- 241000219357 Cactaceae Species 0.000 description 1
- 241000726760 Cadra cautella Species 0.000 description 1
- 101100458634 Caenorhabditis elegans mtx-2 gene Proteins 0.000 description 1
- 241000906761 Calocoris Species 0.000 description 1
- 244000045232 Canavalia ensiformis Species 0.000 description 1
- 206010007134 Candida infections Diseases 0.000 description 1
- 235000012766 Cannabis sativa ssp. sativa var. sativa Nutrition 0.000 description 1
- 235000012765 Cannabis sativa ssp. sativa var. spontanea Nutrition 0.000 description 1
- 235000002568 Capsicum frutescens Nutrition 0.000 description 1
- 229920000049 Carbon (fiber) Polymers 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- 235000009467 Carica papaya Nutrition 0.000 description 1
- 240000006432 Carica papaya Species 0.000 description 1
- 235000003255 Carthamus tinctorius Nutrition 0.000 description 1
- 244000020518 Carthamus tinctorius Species 0.000 description 1
- 241001536086 Cephus cinctus Species 0.000 description 1
- 240000001817 Cereus hexagonus Species 0.000 description 1
- 241000343781 Chaetocnema pulicaria Species 0.000 description 1
- 241000661337 Chilo partellus Species 0.000 description 1
- 241000392215 Chinavia Species 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 1
- 241000195628 Chlorophyta Species 0.000 description 1
- 241000255945 Choristoneura Species 0.000 description 1
- 235000007516 Chrysanthemum Nutrition 0.000 description 1
- 240000005250 Chrysanthemum indicum Species 0.000 description 1
- 241001367803 Chrysodeixis includens Species 0.000 description 1
- 244000223760 Cinnamomum zeylanicum Species 0.000 description 1
- 241000132536 Cirsium Species 0.000 description 1
- 241000207199 Citrus Species 0.000 description 1
- DBPRUZCKPFOVDV-UHFFFAOYSA-N Clorprenaline hydrochloride Chemical compound O.Cl.CC(C)NCC(O)C1=CC=CC=C1Cl DBPRUZCKPFOVDV-UHFFFAOYSA-N 0.000 description 1
- 241000193468 Clostridium perfringens Species 0.000 description 1
- 235000013162 Cocos nucifera Nutrition 0.000 description 1
- 244000060011 Cocos nucifera Species 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 240000007154 Coffea arabica Species 0.000 description 1
- 241000143939 Colias eurytheme Species 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 241000218631 Coniferophyta Species 0.000 description 1
- 241000683561 Conoderus Species 0.000 description 1
- 241000532642 Conotrachelus nenuphar Species 0.000 description 1
- 241001663470 Contarinia <gall midge> Species 0.000 description 1
- 241000993412 Corcyra cephalonica Species 0.000 description 1
- 241001337994 Cryptococcus <scale insect> Species 0.000 description 1
- 241000256057 Culex quinquefasciatus Species 0.000 description 1
- 241000256113 Culicidae Species 0.000 description 1
- 241001641310 Cunea Species 0.000 description 1
- 241001156075 Cyclocephala Species 0.000 description 1
- 241001587738 Cyclocephala borealis Species 0.000 description 1
- 241001635274 Cydia pomonella Species 0.000 description 1
- OCEHKDFAWQIBHH-FXQIFTODSA-N Cys-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N OCEHKDFAWQIBHH-FXQIFTODSA-N 0.000 description 1
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 1
- 206010011732 Cyst Diseases 0.000 description 1
- 101710112752 Cytotoxin Proteins 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241001351082 Datana integerrima Species 0.000 description 1
- 244000000626 Daucus carota Species 0.000 description 1
- 235000002767 Daucus carota Nutrition 0.000 description 1
- 241001631715 Dendrolimus Species 0.000 description 1
- 208000006558 Dental Calculus Diseases 0.000 description 1
- 241001124144 Dermaptera Species 0.000 description 1
- 241000605716 Desulfovibrio Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241000916723 Diabrotica longicornis Species 0.000 description 1
- 241000489977 Diabrotica virgifera Species 0.000 description 1
- 241000381325 Diabrotica virgifera zeae Species 0.000 description 1
- 235000009355 Dianthus caryophyllus Nutrition 0.000 description 1
- 240000006497 Dianthus caryophyllus Species 0.000 description 1
- 241001000394 Diaphania hyalinata Species 0.000 description 1
- 241001012951 Diaphania nitidalis Species 0.000 description 1
- 241000879145 Diatraea grandiosella Species 0.000 description 1
- 241000122106 Diatraea saccharalis Species 0.000 description 1
- 241001279823 Diuraphis noxia Species 0.000 description 1
- 241001057636 Dracaena deremensis Species 0.000 description 1
- 229920005682 EO-PO block copolymer Polymers 0.000 description 1
- 241000400698 Elasmopalpus lignosellus Species 0.000 description 1
- 241001105160 Eleodes Species 0.000 description 1
- 241000995027 Empoasca fabae Species 0.000 description 1
- 241001608224 Ennomos subsignaria Species 0.000 description 1
- 241000588921 Enterobacteriaceae Species 0.000 description 1
- 241000661448 Eoreuma loftini Species 0.000 description 1
- 241000122098 Ephestia kuehniella Species 0.000 description 1
- 108050004280 Epsilon toxin Proteins 0.000 description 1
- 241001491718 Erannis Species 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241000567412 Estigmene acrea Species 0.000 description 1
- 241001201696 Eulia Species 0.000 description 1
- 240000002395 Euphorbia pulcherrima Species 0.000 description 1
- 241000060469 Eupoecilia ambiguella Species 0.000 description 1
- 241000483001 Euproctis chrysorrhoea Species 0.000 description 1
- 241000515838 Eurygaster Species 0.000 description 1
- 241000167999 Euscepes Species 0.000 description 1
- 241001619920 Euschistus servus Species 0.000 description 1
- 241001368778 Euxoa messoria Species 0.000 description 1
- 240000000731 Fagus sylvatica Species 0.000 description 1
- 235000010099 Fagus sylvatica Nutrition 0.000 description 1
- 235000016623 Fragaria vesca Nutrition 0.000 description 1
- 240000009088 Fragaria x ananassa Species 0.000 description 1
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 1
- 241000654868 Frankliniella fusca Species 0.000 description 1
- 241000255896 Galleria mellonella Species 0.000 description 1
- 241000702463 Geminiviridae Species 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- VCUNGPMMPNJSGS-JYJNAYRXSA-N Gln-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VCUNGPMMPNJSGS-JYJNAYRXSA-N 0.000 description 1
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 1
- 108700037728 Glycine max beta-conglycinin Proteins 0.000 description 1
- 241001441330 Grapholita molesta Species 0.000 description 1
- 241000190714 Gymnosporangium clavipes Species 0.000 description 1
- 241001352371 Harrisina americana Species 0.000 description 1
- 241000255990 Helicoverpa Species 0.000 description 1
- 241000413128 Hemileuca oliviae Species 0.000 description 1
- 241001480224 Heterodera Species 0.000 description 1
- 241001481225 Heterodera avenae Species 0.000 description 1
- 241000379510 Heterodera schachtii Species 0.000 description 1
- 235000005206 Hibiscus Nutrition 0.000 description 1
- 235000007185 Hibiscus lunariifolius Nutrition 0.000 description 1
- 244000284380 Hibiscus rosa sinensis Species 0.000 description 1
- 240000004153 Hibiscus sabdariffa Species 0.000 description 1
- 235000001018 Hibiscus sabdariffa Nutrition 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- CTJHHEQNUNIYNN-SRVKXCTJSA-N His-His-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O CTJHHEQNUNIYNN-SRVKXCTJSA-N 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000742054 Homo sapiens Protein phosphatase 1D Proteins 0.000 description 1
- 241000526466 Homoeosoma Species 0.000 description 1
- 241000630740 Homoeosoma electellum Species 0.000 description 1
- 235000014486 Hydrangea macrophylla Nutrition 0.000 description 1
- 241001091442 Hydrangeaceae Species 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 1
- 241000370523 Hypena scabra Species 0.000 description 1
- 241001508558 Hypera Species 0.000 description 1
- 241001508570 Hypera brunneipennis Species 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- GLLAUPMJCGKPFY-BLMTYFJBSA-N Ile-Ile-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 GLLAUPMJCGKPFY-BLMTYFJBSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- IMQLKJBTEOYOSI-GPIVLXJGSA-N Inositol-hexakisphosphate Chemical compound OP(O)(=O)O[C@H]1[C@H](OP(O)(O)=O)[C@@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@@H]1OP(O)(O)=O IMQLKJBTEOYOSI-GPIVLXJGSA-N 0.000 description 1
- 102000004310 Ion Channels Human genes 0.000 description 1
- 241001495069 Ischnocera Species 0.000 description 1
- 240000007049 Juglans regia Species 0.000 description 1
- 235000009496 Juglans regia Nutrition 0.000 description 1
- 241000400431 Keiferia lycopersicella Species 0.000 description 1
- 241000588748 Klebsiella Species 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- 241001468155 Lactobacillaceae Species 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 235000003228 Lactuca sativa Nutrition 0.000 description 1
- 240000008415 Lactuca sativa Species 0.000 description 1
- 241001658022 Lambdina fiscellaria fiscellaria Species 0.000 description 1
- 241001658020 Lambdina fiscellaria lugubrosa Species 0.000 description 1
- 241001425930 Latina Species 0.000 description 1
- 241000611348 Leifsonia xyli subsp. xyli Species 0.000 description 1
- 241000258915 Leptinotarsa Species 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- QONKWXNJRRNTBV-AVGNSLFASA-N Leu-Pro-Met Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N QONKWXNJRRNTBV-AVGNSLFASA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- 241001352367 Leucoma salicis Species 0.000 description 1
- 241000192132 Leuconostoc Species 0.000 description 1
- 229920001732 Lignosulfonate Polymers 0.000 description 1
- 241000209510 Liliopsida Species 0.000 description 1
- 239000000232 Lipid Bilayer Substances 0.000 description 1
- 241000966204 Lissorhoptrus oryzophilus Species 0.000 description 1
- 241000186781 Listeria Species 0.000 description 1
- 241001261104 Lobesia botrana Species 0.000 description 1
- 241001106041 Lycium Species 0.000 description 1
- UPYKUZBSLRQECL-UKMVMLAPSA-N Lycopene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1C(=C)CCCC1(C)C)C=CC=C(/C)C=CC2C(=C)CCCC2(C)C UPYKUZBSLRQECL-UKMVMLAPSA-N 0.000 description 1
- 241000283636 Lygocoris pabulinus Species 0.000 description 1
- 241001414823 Lygus hesperus Species 0.000 description 1
- 241000501345 Lygus lineolaris Species 0.000 description 1
- 241000721703 Lymantria dispar Species 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- 241001472591 Lysinibacillus sp. Species 0.000 description 1
- 101150050813 MPI gene Proteins 0.000 description 1
- 241000208467 Macadamia Species 0.000 description 1
- 241000081125 Macalla Species 0.000 description 1
- 241001155765 Macrodactylus Species 0.000 description 1
- 241000255676 Malacosoma Species 0.000 description 1
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 1
- 241000732113 Mamestra configurata Species 0.000 description 1
- 241000256010 Manduca Species 0.000 description 1
- 241000369513 Manduca quinquemaculata Species 0.000 description 1
- 241000255908 Manduca sexta Species 0.000 description 1
- 235000014826 Mangifera indica Nutrition 0.000 description 1
- 240000007228 Mangifera indica Species 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 1
- 241001232130 Maruca testulalis Species 0.000 description 1
- 240000004658 Medicago sativa Species 0.000 description 1
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 1
- 241001367645 Melanchra picta Species 0.000 description 1
- 241001415015 Melanoplus differentialis Species 0.000 description 1
- 241001478965 Melanoplus femurrubrum Species 0.000 description 1
- 241000922538 Melanoplus sanguinipes Species 0.000 description 1
- 241001062280 Melanotus <basidiomycete fungus> Species 0.000 description 1
- 241000537142 Meligethes nigrescens Species 0.000 description 1
- 241000042079 Meligethes viridescens Species 0.000 description 1
- 241001608711 Melo Species 0.000 description 1
- 241000831628 Meloidogyne enterolobii Species 0.000 description 1
- 241000243787 Meloidogyne hapla Species 0.000 description 1
- 241000243786 Meloidogyne incognita Species 0.000 description 1
- 241000243785 Meloidogyne javanica Species 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 241000088587 Meromyza Species 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 1
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 1
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- 239000004909 Moisturizer Substances 0.000 description 1
- 244000111261 Mucuna pruriens Species 0.000 description 1
- 235000008540 Mucuna pruriens var utilis Nutrition 0.000 description 1
- 240000005561 Musa balbisiana Species 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- 102000018463 Myo-Inositol-1-Phosphate Synthase Human genes 0.000 description 1
- 108091000020 Myo-Inositol-1-Phosphate Synthase Proteins 0.000 description 1
- 241001477931 Mythimna unipuncta Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 1
- 241001443590 Naganishia albida Species 0.000 description 1
- 241000033319 Naganishia diffluens Species 0.000 description 1
- 101710202365 Napin Proteins 0.000 description 1
- 241000234479 Narcissus Species 0.000 description 1
- 241000084931 Neohydatothrips variabilis Species 0.000 description 1
- 241000912288 Neolasioptera Species 0.000 description 1
- 241000615716 Nephotettix nigropictus Species 0.000 description 1
- 241001671709 Nezara viridula Species 0.000 description 1
- 241001548845 Nysius ericae Species 0.000 description 1
- 241001666448 Nysius raphanus Species 0.000 description 1
- 108010079246 OMPA outer membrane proteins Proteins 0.000 description 1
- 240000007817 Olea europaea Species 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 241001491877 Operophtera brumata Species 0.000 description 1
- 208000007027 Oral Candidiasis Diseases 0.000 description 1
- 241001465800 Orgyia Species 0.000 description 1
- 241000906034 Orthops Species 0.000 description 1
- 241000238814 Orthoptera Species 0.000 description 1
- 241001160353 Oulema melanopus Species 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241001585671 Paleacrita vernata Species 0.000 description 1
- 241001300993 Papilio cresphontes Species 0.000 description 1
- 241000222051 Papiliotrema laurentii Species 0.000 description 1
- 241000721452 Pectinophora Species 0.000 description 1
- 241000721451 Pectinophora gossypiella Species 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 241001387976 Pera Species 0.000 description 1
- 244000025272 Persea americana Species 0.000 description 1
- 235000008673 Persea americana Nutrition 0.000 description 1
- 244000170788 Persicaria vulgaris Species 0.000 description 1
- 235000017845 Persicaria vulgaris Nutrition 0.000 description 1
- 241000316608 Petrobia latens Species 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- 235000010617 Phaseolus lunatus Nutrition 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 1
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 1
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- 241000607568 Photobacterium Species 0.000 description 1
- 241001674048 Phthiraptera Species 0.000 description 1
- 241001517955 Phyllonorycter blancardella Species 0.000 description 1
- 241000286134 Phyllophaga crinita Species 0.000 description 1
- 244000082204 Phyllostachys viridis Species 0.000 description 1
- 241000275069 Phyllotreta cruciferae Species 0.000 description 1
- 241000437063 Phyllotreta striolata Species 0.000 description 1
- 241000738916 Phyllotreta undulata Species 0.000 description 1
- 241001313099 Pieris napi Species 0.000 description 1
- 241000907661 Pieris rapae Species 0.000 description 1
- 235000010582 Pisum sativum Nutrition 0.000 description 1
- 240000004713 Pisum sativum Species 0.000 description 1
- 241000242594 Platyhelminthes Species 0.000 description 1
- 241001608845 Platynota Species 0.000 description 1
- 241001456328 Platynota stultana Species 0.000 description 1
- 241000495716 Platyptilia carduidactyla Species 0.000 description 1
- 241000595629 Plodia interpunctella Species 0.000 description 1
- 241001662912 Poecilocapsus lineatus Species 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- 229920001214 Polysorbate 60 Polymers 0.000 description 1
- 241000143945 Pontia protodice Species 0.000 description 1
- 241000193943 Pratylenchus Species 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- OCYROESYHWUPBP-CIUDSAMLSA-N Pro-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 OCYROESYHWUPBP-CIUDSAMLSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- XZBYTHCRAVAXQQ-DCAQKATOSA-N Pro-Met-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XZBYTHCRAVAXQQ-DCAQKATOSA-N 0.000 description 1
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- 102100038675 Protein phosphatase 1D Human genes 0.000 description 1
- 241000588769 Proteus <enterobacteria> Species 0.000 description 1
- 241001657916 Proxenus mindara Species 0.000 description 1
- 241000947836 Pseudomonadaceae Species 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 241000589540 Pseudomonas fluorescens Species 0.000 description 1
- 241000589615 Pseudomonas syringae Species 0.000 description 1
- 241000508269 Psidium Species 0.000 description 1
- 241000442474 Pulsatilla vulgaris Species 0.000 description 1
- 244000184734 Pyrus japonica Species 0.000 description 1
- 101150075111 ROLB gene Proteins 0.000 description 1
- 101150013395 ROLC gene Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000158450 Rhodobacter sp. KYW73 Species 0.000 description 1
- 241000191043 Rhodobacter sphaeroides Species 0.000 description 1
- 241000208422 Rhododendron Species 0.000 description 1
- 241000190932 Rhodopseudomonas Species 0.000 description 1
- 241000223253 Rhodotorula glutinis Species 0.000 description 1
- 241000223254 Rhodotorula mucilaginosa Species 0.000 description 1
- 241000220317 Rosa Species 0.000 description 1
- 241000109329 Rosa xanthina Species 0.000 description 1
- 235000004789 Rosa xanthina Nutrition 0.000 description 1
- 235000005291 Rumex acetosa Nutrition 0.000 description 1
- 241000004261 Sabulodes Species 0.000 description 1
- 206010039509 Scab Diseases 0.000 description 1
- 241000722027 Schizaphis graminum Species 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- 241001351292 Schizura concinna Species 0.000 description 1
- 241000209056 Secale Species 0.000 description 1
- 235000007238 Secale cereale Nutrition 0.000 description 1
- 108010016634 Seed Storage Proteins Proteins 0.000 description 1
- 241001479507 Senecio odorus Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 1
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- 241000607715 Serratia marcescens Species 0.000 description 1
- 241000533293 Sesbania emerus Species 0.000 description 1
- 241000607768 Shigella Species 0.000 description 1
- 241000180219 Sitobion avenae Species 0.000 description 1
- 241000068648 Sitodiplosis mosellana Species 0.000 description 1
- 241000254181 Sitophilus Species 0.000 description 1
- 241000254154 Sitophilus zeamais Species 0.000 description 1
- 241000753145 Sitotroga cerealella Species 0.000 description 1
- 241001153355 Smicronyx Species 0.000 description 1
- 241001153341 Smicronyx sordidus Species 0.000 description 1
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 1
- 101000611441 Solanum lycopersicum Pathogenesis-related leaf protein 6 Proteins 0.000 description 1
- 241001492664 Solenopsis <angiosperm> Species 0.000 description 1
- 241000421631 Spanagonicus albofasciatus Species 0.000 description 1
- 241000532885 Sphenophorus Species 0.000 description 1
- 241000253368 Spirillaceae Species 0.000 description 1
- 241000605008 Spirillum Species 0.000 description 1
- 235000009184 Spondias indica Nutrition 0.000 description 1
- 241000950030 Sternechus Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241001575047 Suleima Species 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- 241000254109 Tenebrio molitor Species 0.000 description 1
- 241001124191 Tenebrio obscurus Species 0.000 description 1
- 241001349589 Tethys Species 0.000 description 1
- 241000344246 Tetranychus cinnabarinus Species 0.000 description 1
- 241000916142 Tetranychus turkestani Species 0.000 description 1
- 244000269722 Thea sinensis Species 0.000 description 1
- 244000299461 Theobroma cacao Species 0.000 description 1
- 235000009470 Theobroma cacao Nutrition 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- CKHWEVXPLJBEOZ-VQVTYTSYSA-N Thr-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O CKHWEVXPLJBEOZ-VQVTYTSYSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- 241000339374 Thrips tabaci Species 0.000 description 1
- ATJFFYVFTNAWJD-UHFFFAOYSA-N Tin Chemical compound [Sn] ATJFFYVFTNAWJD-UHFFFAOYSA-N 0.000 description 1
- 244000002460 Tithonia rotundifolia Species 0.000 description 1
- 244000288561 Torulaspora delbrueckii Species 0.000 description 1
- 235000014681 Torulaspora delbrueckii Nutrition 0.000 description 1
- 241001495125 Torulaspora pretoriensis Species 0.000 description 1
- 101710182223 Toxin B Proteins 0.000 description 1
- 235000004220 Tradescantia virginiana Nutrition 0.000 description 1
- 240000000958 Tradescantia virginiana Species 0.000 description 1
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- 241000750338 Trialeurodes abutilonea Species 0.000 description 1
- 241000254086 Tribolium <beetle> Species 0.000 description 1
- 241000254113 Tribolium castaneum Species 0.000 description 1
- 241000254112 Tribolium confusum Species 0.000 description 1
- 241000255985 Trichoplusia Species 0.000 description 1
- 241001414983 Trichoptera Species 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- YEGMNOHLZNGOCG-UBHSHLNASA-N Trp-Asn-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YEGMNOHLZNGOCG-UBHSHLNASA-N 0.000 description 1
- DXHHCIYKHRKBOC-BHYGNILZSA-N Trp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O DXHHCIYKHRKBOC-BHYGNILZSA-N 0.000 description 1
- BYSKNUASOAGJSS-NQCBNZPSSA-N Trp-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N BYSKNUASOAGJSS-NQCBNZPSSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- VOCHZIJXPRBVSI-XIRDDKMYSA-N Trp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N VOCHZIJXPRBVSI-XIRDDKMYSA-N 0.000 description 1
- ZHDQRPWESGUDST-JBACZVJFSA-N Trp-Phe-Gln Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ZHDQRPWESGUDST-JBACZVJFSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- VNRTXOUAOUZCFW-WDSOQIARSA-N Trp-Val-His Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O VNRTXOUAOUZCFW-WDSOQIARSA-N 0.000 description 1
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 1
- 241000722921 Tulipa gesneriana Species 0.000 description 1
- 241000287411 Turdidae Species 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- DXUVJJRTVACXSO-KKUMJFAQSA-N Tyr-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DXUVJJRTVACXSO-KKUMJFAQSA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- CVXURBLRELTJKO-BWAGICSOSA-N Tyr-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O CVXURBLRELTJKO-BWAGICSOSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- KUXCBJFJURINGF-PXDAIIFMSA-N Tyr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N KUXCBJFJURINGF-PXDAIIFMSA-N 0.000 description 1
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 1
- NXPDPYYCIRDUHO-ULQDDVLXSA-N Tyr-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 NXPDPYYCIRDUHO-ULQDDVLXSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- 102000003425 Tyrosinase Human genes 0.000 description 1
- 108060008724 Tyrosinase Proteins 0.000 description 1
- 241001351286 Udea rubigalis Species 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- WHVSJHJTMUHYBT-SRVKXCTJSA-N Val-Met-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N WHVSJHJTMUHYBT-SRVKXCTJSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 241000607598 Vibrio Species 0.000 description 1
- 235000009754 Vitis X bourquina Nutrition 0.000 description 1
- 235000012333 Vitis X labruscana Nutrition 0.000 description 1
- 240000006365 Vitis vinifera Species 0.000 description 1
- 235000014787 Vitis vinifera Nutrition 0.000 description 1
- 241000589634 Xanthomonas Species 0.000 description 1
- 241000589636 Xanthomonas campestris Species 0.000 description 1
- 241000064240 Yponomeuta padellus Species 0.000 description 1
- 241000532815 Zabrotes subfasciatus Species 0.000 description 1
- 241000588901 Zymomonas Species 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 238000012271 agricultural production Methods 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 150000003973 alkyl amines Chemical class 0.000 description 1
- 150000008055 alkyl aryl sulfonates Chemical class 0.000 description 1
- 235000020224 almond Nutrition 0.000 description 1
- 235000011399 aloe vera Nutrition 0.000 description 1
- HSFWRNGVRCDJHI-UHFFFAOYSA-N alpha-acetylene Natural products C#C HSFWRNGVRCDJHI-UHFFFAOYSA-N 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 235000012211 aluminium silicate Nutrition 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 150000001449 anionic compounds Chemical class 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 229960005475 antiinfective agent Drugs 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 239000012062 aqueous buffer Substances 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 239000005667 attractant Substances 0.000 description 1
- 239000011425 bamboo Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- XKGUZGHMWUIYDR-UHFFFAOYSA-N benzyl n-(3-fluoro-4-morpholin-4-ylphenyl)carbamate Chemical compound C=1C=C(N2CCOCC2)C(F)=CC=1NC(=O)OCC1=CC=CC=C1 XKGUZGHMWUIYDR-UHFFFAOYSA-N 0.000 description 1
- 235000021028 berry Nutrition 0.000 description 1
- 230000000035 biogenic effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 239000007844 bleaching agent Substances 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 125000001246 bromo group Chemical group Br* 0.000 description 1
- 235000021329 brown rice Nutrition 0.000 description 1
- JIJAYWGYIDJVJI-UHFFFAOYSA-N butyl naphthalene-1-sulfonate Chemical compound C1=CC=C2C(S(=O)(=O)OCCCC)=CC=CC2=C1 JIJAYWGYIDJVJI-UHFFFAOYSA-N 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 201000003984 candidiasis Diseases 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 1
- 229960003669 carbenicillin Drugs 0.000 description 1
- 239000004917 carbon fiber Substances 0.000 description 1
- 150000001735 carboxylic acids Chemical class 0.000 description 1
- 235000012730 carminic acid Nutrition 0.000 description 1
- 150000001746 carotenes Chemical class 0.000 description 1
- 235000005473 carotenes Nutrition 0.000 description 1
- 235000020226 cashew nut Nutrition 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 235000017803 cinnamon Nutrition 0.000 description 1
- 235000020971 citrus fruits Nutrition 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000011260 co-administration Methods 0.000 description 1
- 235000016213 coffee Nutrition 0.000 description 1
- 235000013353 coffee beverage Nutrition 0.000 description 1
- 230000002016 colloidosmotic effect Effects 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 239000013065 commercial product Substances 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 210000003464 cuspid Anatomy 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 125000004093 cyano group Chemical group *C#N 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 239000004062 cytokinin Substances 0.000 description 1
- UQHKFADEQIVWID-UHFFFAOYSA-N cytokinin Natural products C1=NC=2C(NCC=C(CO)C)=NC=NC=2N1C1CC(O)C(CO)O1 UQHKFADEQIVWID-UHFFFAOYSA-N 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 210000004292 cytoskeleton Anatomy 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 231100000599 cytotoxic agent Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 239000002619 cytotoxin Substances 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 150000005690 diesters Chemical class 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- KWABLUYIOFEZOY-UHFFFAOYSA-N dioctyl butanedioate Chemical compound CCCCCCCCOC(=O)CCC(=O)OCCCCCCCC KWABLUYIOFEZOY-UHFFFAOYSA-N 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- 239000002270 dispersing agent Substances 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 238000010410 dusting Methods 0.000 description 1
- 244000013123 dwarf bean Species 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 239000012877 elongation medium Substances 0.000 description 1
- 239000004495 emulsifiable concentrate Substances 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 239000008393 encapsulating agent Substances 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 150000002170 ethers Chemical class 0.000 description 1
- LYCAIKOWRPUZTN-UHFFFAOYSA-N ethylene glycol Natural products OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 1
- 241001233957 eudicotyledons Species 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 150000002194 fatty esters Chemical class 0.000 description 1
- YYJNOYZRYGDPNH-MFKUBSTISA-N fenpyroximate Chemical compound C=1C=C(C(=O)OC(C)(C)C)C=CC=1CO/N=C/C=1C(C)=NN(C)C=1OC1=CC=CC=C1 YYJNOYZRYGDPNH-MFKUBSTISA-N 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- SLGWESQGEUXWJQ-UHFFFAOYSA-N formaldehyde;phenol Chemical class O=C.OC1=CC=CC=C1 SLGWESQGEUXWJQ-UHFFFAOYSA-N 0.000 description 1
- 239000003517 fume Substances 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 101150091511 glb-1 gene Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 235000021331 green beans Nutrition 0.000 description 1
- 230000003779 hair growth Effects 0.000 description 1
- 230000002140 halogenating effect Effects 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 230000002949 hemolytic effect Effects 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 229930186900 holotoxin Natural products 0.000 description 1
- 235000012907 honey Nutrition 0.000 description 1
- 230000005745 host immune response Effects 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 210000001144 hymen Anatomy 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 229910052738 indium Inorganic materials 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- NLYAJNPCOHFWQQ-UHFFFAOYSA-N kaolin Chemical compound O.O.O=[Al]O[Si](=O)O[Si](=O)O[Al]=O NLYAJNPCOHFWQQ-UHFFFAOYSA-N 0.000 description 1
- 230000002147 killing effect Effects 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 230000001418 larval effect Effects 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 125000005647 linker group Chemical group 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 150000004668 long chain fatty acids Chemical class 0.000 description 1
- 210000004705 lumbosacral region Anatomy 0.000 description 1
- 206010025135 lupus erythematosus Diseases 0.000 description 1
- 239000003120 macrolide antibiotic agent Substances 0.000 description 1
- 108010083942 mannopine synthase Proteins 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- WABYCCJHARSRBH-UHFFFAOYSA-N metaclazepam Chemical compound C12=CC(Br)=CC=C2N(C)C(COC)CN=C1C1=CC=CC=C1Cl WABYCCJHARSRBH-UHFFFAOYSA-N 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 150000002739 metals Chemical class 0.000 description 1
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Chemical compound C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 238000009629 microbiological culture Methods 0.000 description 1
- 210000003632 microfilament Anatomy 0.000 description 1
- 230000001333 moisturizer Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 125000005609 naphthenate group Chemical group 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 239000005645 nematicide Substances 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- 229940049964 oleate Drugs 0.000 description 1
- 238000006384 oligomerization reaction Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000003950 pathogenic mechanism Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 238000009527 percussion Methods 0.000 description 1
- 230000010412 perfusion Effects 0.000 description 1
- 239000003208 petroleum Substances 0.000 description 1
- 150000002989 phenols Chemical class 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 1
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 1
- 229910052615 phyllosilicate Inorganic materials 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 235000002949 phytic acid Nutrition 0.000 description 1
- 230000008635 plant growth Effects 0.000 description 1
- 239000005648 plant growth regulator Substances 0.000 description 1
- 230000037039 plant physiology Effects 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 235000012015 potatoes Nutrition 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 238000013138 pruning Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 150000003242 quaternary ammonium salts Chemical class 0.000 description 1
- ZAHRKKWIAAJSAO-UHFFFAOYSA-N rapamycin Natural products COCC(O)C(=C/C(C)C(=O)CC(OC(=O)C1CCCCN1C(=O)C(=O)C2(O)OC(CC(OC)C(=CC=CC=CC(C)CC(C)C(=O)C)C)CCC2C)C(C)CC3CCC(O)C(C3)OC)C ZAHRKKWIAAJSAO-UHFFFAOYSA-N 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 229940071089 sarcosinate Drugs 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 230000008117 seed development Effects 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 235000003513 sheep sorrel Nutrition 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 229960002930 sirolimus Drugs 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 238000012868 site-directed mutagenesis technique Methods 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 229940080236 sodium cetyl sulfate Drugs 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- GGHPAKFFUZUEKL-UHFFFAOYSA-M sodium;hexadecyl sulfate Chemical compound [Na+].CCCCCCCCCCCCCCCCOS([O-])(=O)=O GGHPAKFFUZUEKL-UHFFFAOYSA-M 0.000 description 1
- NWZBFJYXRGSRGD-UHFFFAOYSA-M sodium;octadecyl sulfate Chemical compound [Na+].CCCCCCCCCCCCCCCCCCOS([O-])(=O)=O NWZBFJYXRGSRGD-UHFFFAOYSA-M 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 108010048090 soybean lectin Proteins 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 238000002798 spectrophotometry method Methods 0.000 description 1
- 230000028070 sporulation Effects 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 239000010907 stover Substances 0.000 description 1
- 150000005846 sugar alcohols Polymers 0.000 description 1
- 150000003871 sulfonates Chemical class 0.000 description 1
- 150000003467 sulfuric acid derivatives Chemical class 0.000 description 1
- 238000004381 surface treatment Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 235000013616 tea Nutrition 0.000 description 1
- 239000002562 thickening agent Substances 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 238000002604 ultrasonography Methods 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 239000005418 vegetable material Substances 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- NCYCYZXNIZJOKI-UHFFFAOYSA-N vitamin A aldehyde Natural products O=CC=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C NCYCYZXNIZJOKI-UHFFFAOYSA-N 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 238000003260 vortexing Methods 0.000 description 1
- 235000020234 walnut Nutrition 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/32—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
- C07K14/325—Bacillus thuringiensis crystal peptides, i.e. delta-endotoxins
-
- A01N63/02—
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01N—PRESERVATION OF BODIES OF HUMANS OR ANIMALS OR PLANTS OR PARTS THEREOF; BIOCIDES, e.g. AS DISINFECTANTS, AS PESTICIDES OR AS HERBICIDES; PEST REPELLANTS OR ATTRACTANTS; PLANT GROWTH REGULATORS
- A01N63/00—Biocides, pest repellants or attractants, or plant growth regulators containing microorganisms, viruses, microbial fungi, animals or substances produced by, or obtained from, microorganisms, viruses, microbial fungi or animals, e.g. enzymes or fermentates
- A01N63/50—Isolated enzymes; Isolated proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8286—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for insect resistance
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Pest Control & Pesticides (AREA)
- Plant Pathology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Insects & Arthropods (AREA)
- Gastroenterology & Hepatology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Virology (AREA)
- Agronomy & Crop Science (AREA)
- Dentistry (AREA)
- Environmental Sciences (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Agricultural Chemicals And Associated Chemicals (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
살충 활성을 갖는 조성물 및 그의 사용 방법이 제공된다. 조성물은 살충 활성을 갖는 단리된 및 재조합 폴리펩티드, 폴리펩티드를 코딩하는 재조합 및 합성 핵산 분자, 핵산 분자를 포함하는 DNA 구축물 및 벡터, 벡터를 포함하는 숙주 세포, 및 폴리펩티드에 대한 항체를 포함한다. 폴리펩티드를 코딩하는 폴리뉴클레오티드 서열은 관심의 유기체에서의 형질전환 및 발현을 위한 DNA 구축물 또는 발현 카세트에서 사용될 수 있다. 제공된 조성물 및 방법은 향상된 해충 저항성 또는 내성을 갖는 유기체를 제조하는 데 유용하다. 본 발명의 살충 단백질을 코딩하는 뉴클레오티드 서열을 포함하는 트랜스제닉 식물 및 종자가 또한 제공된다. 이러한 식물은 곤충 및 다른 해충에 대해 저항성이다. 본원에 개시된 다양한 폴리펩티드를 제조하는 방법, 및 해충을 방제하거나 죽이기 위해 그들 폴리펩티드를 사용하는 방법이 제공된다. 샘플에서 본 발명의 폴리펩티드를 검출하기 위한 방법 및 키트가 또한 포함된다.
Description
관련 출원의 상호 참조
이 출원은 2014년 12월 22일에 출원된 미국 가출원 일련 번호 62/095,524의 우선권을 주장하며, 이 출원의 내용은 그 전문이 본원에 참조로 포함된다.
기술 분야
본 발명은 해충, 특히 식물 해충을 방제하기 위한 방법 및 조성물에 관한 것이다.
EFS-WEB을 통한 텍스트 파일로서 제출된 서열 목록에 대한 언급
서열 목록의 공식적 카피는 2015년 12월 14일에 생성되고, 956 KB의 크기를 갖는 파일명 AgB006.PCT_SEQLIST.txt를 갖는 ASCII 포맷화된 서열 목록으로서 EFS-Web을 통해 전자적으로 제출되며, 본 명세서와 공동으로 출원된다. 이 ASCII 포맷화된 문서에 함유된 서열 목록은 본 명세서의 일부이며, 그 전문이 본원에 참조로 포함된다.
해충, 식물 질환, 및 잡초는 작물에게 심각한 위협일 수 있다. 해충 및 질환으로 인한 손실은 전세계적으로 농업 생산의 37%로 추정되며, 13%는 곤충, 박테리아 및 다른 유기체로 인한 것이다.
독소는 미생물 병원성 및/또는 숙주 면역 반응의 회피에서 중요한 역할을 하는 독성 결정자이다. 그람-양성 박테리아 바실루스 (Bacillus), 특히 바실루스 투린기엔시스로부터의 독소는 살곤충 단백질 (통상적으로 Bt 독소로 지칭됨)로서 사용되었다. 현재 전략은 트랜스제닉 작물을 생산하기 위해 이들 독소를 발현시키는 유전자를 사용한다. 살곤충 단백질 독소를 발현하는 트랜스제닉 작물은 곤충으로부터의 작물 손상을 방지하는 데 사용된다.
바실루스 독소의 사용은 곤충을 방제하는 데 성공적이었던 반면, 이러한 독소가 집중적으로 사용된 권역의 많은 부분에서 Bt 독소에 대한 저항성이 일부 표적 해충에서 발달되었다. 이 문제를 해결하는 한 가지 방법은 Bt 작물을 규칙적인 비 Bt 작물 (은신처)과 교대하는 열로 심는 것이다. 곤충 저항성의 발달을 회피하거나 감속시키는 대안적인 방법은 트랜스제닉 식물에서 곤충에 대한 상이한 작용 방식을 갖는 살곤충 유전자를 스태킹하는 것이다. 살곤충 단백질 독소를 발현하는 트랜스제닉 작물을 사용하는 현재 전략은 박테리아 비. 투린기엔시스로부터 이미 유래된 것들을 넘어, 신규 독소의 발견에 더욱 강조를 두고 있다. 신규 독소는 곤충- 및 해충-저항성 트랜스제닉 식물의 개발을 위한 비. 투린기엔시스로부터 유래된 것들에 대한 대안으로서 유용한 것으로 입증될 수 있다. 따라서, 새로운 독소 단백질이 필요하다.
도 1은 서열식별번호 (SEQ ID NO): 8, 9, 123, 및 124의 아미노산 정렬을 제공한다. 표시된 영역은 4개의 폴리펩티드 사이에 아미노산이 상이한 영역을 나타낸다.
도 2는 서열식별번호: 25, 26, 125 및 126의 아미노산 정렬을 제공한다. 표시된 영역은 4개의 폴리펩티드 사이에 아미노산이 상이한 영역을 나타낸다.
도 3은 서열식별번호: 72, 94, 127, 및 128의 아미노산 정렬을 제공한다. 표시된 영역은 4개의 폴리펩티드 사이에 아미노산이 상이한 영역을 나타낸다.
도 4는 서열식별번호: 70, 71, 129, 130 및 131의 아미노산 정렬을 제공한다. 표시된 영역은 5개의 폴리펩티드 사이에 아미노산이 상이한 영역을 나타낸다.
도 5a 내지 5h는 서열식별번호: 6, 7, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및 159의 아미노산 정렬을 제공한다. 표시된 영역은 폴리펩티드 각각에서 보존된 영역을 나타낸다. 이 정렬에 존재하는 보존된 영역은 표 4에 제시된다.
도 6은 서열식별번호: 6 및 132 내지 159 각각의 퍼센트 서열 동일성 관계를 제공한다.
도 7은 서부 옥수수 뿌리벌레 생물검정에서 채용된 검정 점수화 지침 (크기 x 사멸률 행렬)을 제공한다.
요약
살충 활성을 갖는 조성물 및 그의 사용 방법이 제공된다. 조성물은 살충 활성을 갖는 단리된 및 재조합 폴리펩티드, 폴리펩티드를 코딩하는 재조합 및 합성 핵산 분자, 핵산 분자를 포함하는 DNA 구축물, 핵산 분자를 포함하는 벡터, 벡터를 포함하는 숙주 세포, 및 살충 폴리펩티드에 대한 항체를 포함한다. 본원에 제공된 폴리펩티드를 코딩하는 뉴클레오티드 서열은 미생물 및 식물을 비롯한 관심의 유기체에서의 형질전환 및 발현을 위한 DNA 구축물 또는 발현 카세트에서 사용될 수 있다.
본원에 제공된 조성물 및 방법은 향상된 해충 저항성 또는 내성을 갖는 유기체의 제조에 유용하다. 이들 유기체 및 유기체를 포함하는 조성물은 농업적 목적에 바람직하다. 본 발명의 살충 단백질을 코딩하는 뉴클레오티드 서열을 포함하는 트랜스제닉 식물 및 종자가 또한 제공된다. 이러한 식물은 곤충 및 다른 해충에 대해 저항성이다.
본원에 개시된 다양한 폴리펩티드를 제조하는 방법, 및 해충을 방제하거나 죽이기 위해 그들 폴리펩티드를 사용하는 방법이 제공된다. 샘플에서 본 발명의 폴리펩티드를 검출하기 위한 방법 및 키트가 또한 포함된다.
도 2는 서열식별번호: 25, 26, 125 및 126의 아미노산 정렬을 제공한다. 표시된 영역은 4개의 폴리펩티드 사이에 아미노산이 상이한 영역을 나타낸다.
도 3은 서열식별번호: 72, 94, 127, 및 128의 아미노산 정렬을 제공한다. 표시된 영역은 4개의 폴리펩티드 사이에 아미노산이 상이한 영역을 나타낸다.
도 4는 서열식별번호: 70, 71, 129, 130 및 131의 아미노산 정렬을 제공한다. 표시된 영역은 5개의 폴리펩티드 사이에 아미노산이 상이한 영역을 나타낸다.
도 5a 내지 5h는 서열식별번호: 6, 7, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및 159의 아미노산 정렬을 제공한다. 표시된 영역은 폴리펩티드 각각에서 보존된 영역을 나타낸다. 이 정렬에 존재하는 보존된 영역은 표 4에 제시된다.
도 6은 서열식별번호: 6 및 132 내지 159 각각의 퍼센트 서열 동일성 관계를 제공한다.
도 7은 서부 옥수수 뿌리벌레 생물검정에서 채용된 검정 점수화 지침 (크기 x 사멸률 행렬)을 제공한다.
요약
살충 활성을 갖는 조성물 및 그의 사용 방법이 제공된다. 조성물은 살충 활성을 갖는 단리된 및 재조합 폴리펩티드, 폴리펩티드를 코딩하는 재조합 및 합성 핵산 분자, 핵산 분자를 포함하는 DNA 구축물, 핵산 분자를 포함하는 벡터, 벡터를 포함하는 숙주 세포, 및 살충 폴리펩티드에 대한 항체를 포함한다. 본원에 제공된 폴리펩티드를 코딩하는 뉴클레오티드 서열은 미생물 및 식물을 비롯한 관심의 유기체에서의 형질전환 및 발현을 위한 DNA 구축물 또는 발현 카세트에서 사용될 수 있다.
본원에 제공된 조성물 및 방법은 향상된 해충 저항성 또는 내성을 갖는 유기체의 제조에 유용하다. 이들 유기체 및 유기체를 포함하는 조성물은 농업적 목적에 바람직하다. 본 발명의 살충 단백질을 코딩하는 뉴클레오티드 서열을 포함하는 트랜스제닉 식물 및 종자가 또한 제공된다. 이러한 식물은 곤충 및 다른 해충에 대해 저항성이다.
본원에 개시된 다양한 폴리펩티드를 제조하는 방법, 및 해충을 방제하거나 죽이기 위해 그들 폴리펩티드를 사용하는 방법이 제공된다. 샘플에서 본 발명의 폴리펩티드를 검출하기 위한 방법 및 키트가 또한 포함된다.
이제, 본 발명을 첨부된 도면을 참조로 이하에서 보다 충분하게 설명할 것이며, 본 발명의 모두는 아니지만 일부 실시양태를 나타낸다. 사실, 이들 발명은 많은 상이한 형태로 구현될 수 있으며, 본원에 제시된 실시양태에 제한되는 것으로 간주되지 않아야 하고, 오히려, 이들 실시양태는 이 개시내용이 적용가능한 법적 요건을 충족시킬 것이도록 제공된다. 전반에 걸쳐 동일한 숫자는 동일한 요소를 지칭한다.
본원에 제시된 본 발명의 많은 변형 및 다른 실시양태는 상기 설명 및 연관된 도면에 제시되는 교시의 이익을 갖는 이들 발명이 속하는 관련 기술분야의 통상의 기술자에게 생각날 것이다. 따라서, 본 발명은 개시된 구체적인 실시양태에 제한되지 않으며, 변형 및 다른 실시양태는 첨부된 청구범위의 범위 내에 포함되는 것으로 의도됨이 이해되어야 한다. 특정 용어가 본원에서 채용되지만, 이들은 단지 포괄적이고 설명적인 의미로만 사용되며, 제한의 목적을 위한 것이 아니다.
I.
폴리뉴클레오티드 및 폴리펩티드
유기체에 살충 활성을 부여하기 위한 조성물 및 방법이 제공된다. 변형된 유기체는 해충에 대해 저항성 또는 내성을 나타낸다. 살충 활성을 보유하는 재조합 살충 단백질, 또는 폴리펩티드 및 그의 단편 및 변이체가 제공되며, 이는 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159에 제시된 것들을 포함한다. 살충 단백질은 곤충, 진균, 선충 등을 비롯한 해충에 대해 생물학적으로 활성이다 (예를 들어, 살충성임). 예를 들어, 서열식별번호: 1 내지 159 또는 그의 활성 단편 또는 변이체를 비롯한, 살충 폴리펩티드를 코딩하는 폴리뉴클레오티드는 트랜스제닉 유기체, 예컨대 식물 및 미생물을 제조하는 데 사용될 수 있다. 형질전환된 유기체는 본원에 개시된 살충 단백질에 대한 코딩 서열을 포함하는 적어도 1개의 안정하게 혼입된 DNA 구축물을 포함하는 게놈을 특징으로 한다. 일부 실시양태에서, 코딩 서열은 코딩된 살충 폴리펩티드의 발현을 유도하는 프로모터에 작동적으로 연결된다. 따라서, 형질전환된 미생물, 식물 세포, 식물 조직, 식물, 종자, 및 식물 부분이 제공된다. 다양한 폴리펩티드, 그의 활성 변이체 및 단편, 및 이를 코딩하는 폴리뉴클레오티드의 요약은 하기 표 1에 제시된다. 표 1에서 주목된 바와 같이, 다양한 형태의 폴리펩티드가 제공된다. 전장 살충 폴리펩티드, 뿐만 아니라, 원래의 전장 서열의 변형된 형태 (변이체로 지칭됨)가 제공된다. 표 1은 추가로 "CryBP1" 서열을 나타낸다. 이러한 서열 (서열식별번호: 24, 67 및 73)은 독소 유전자의 일부와 회합될 수 있는 부속 폴리펩티드를 포함한다. 이러한 예에서, CryBP1 서열은 단독으로 또는 본원에 제공된 살충 폴리펩티드 중 임의의 것과 조합으로 사용될 수 있다. 표 1은 추가로 스플리트 (Split)-Cry C'-말단 폴리펩티드 (서열식별번호: 3, 74, 77, 80 및 91)를 제공한다. 이러한 서열은 독소 유전자의 Cry 부류의 C'-말단과 상동성을 갖는 하류 단백질의 서열을 포함하며, 전장이 아니고 예상되는 C'-말단 영역을 상실하고 있는 Cry 유전자 뒤에서 통상적으로 발견된다.
<표 1>
서열식별번호, 유전자 부류, 및 그의 변이체의 요약
i.
살충 단백질의 부류
본원에 제공된 살충 단백질 및 이를 코딩하는 뉴클레오티드 서열은 해충에 영향을 주는 방법에 유용하다. 즉, 본 발명의 조성물 및 방법은 많은 작물 식물의 해충을 비롯한, 해충을 방제하거나 죽이기 위해 농업에서 용도를 발견한다. 본원에 제공된 살충 단백질은 박테리아로부터의 독소 단백질이며, 특정 해충에 대해 활성을 나타낸다. 살충 단백질은 Cry, Cyt, BIN, Mtx 독소를 비롯한 몇몇 부류의 독소로부터의 것이다. 예를 들어, 본원에 제공된 다양한 서열식별번호의 구체적인 단백질 분류에 대한 표 1을 참조한다. 또한, 이 개시내용 전반에 걸쳐 Pfam 데이터베이스 엔트리에 대해 언급된다. Pfam 데이터베이스는 각각 다중 서열 정렬 및 프로파일 히든 마코브 (Markov) 모델에 의해 대표되는 단백질 패밀리의 데이터베이스이다. 문헌 [Finn et al. (2014) Nucl. Acid Res. Database Issue 42:D222-D230].
바실루스 투린기엔시스 (Bacillus thuringiensis, Bt)는 그의 성장의 포자형성 단계 동안 결정 봉입체로서 살곤충 단백질을 생산하는 그람-양성 박테리아이다. Bt의 단백질성 봉입체는 결정 단백질 또는 δ-내독소 (또는 Cry 단백질)로 지칭되며, 이는 곤충강 및 다른 무척추동물의 구성원에 대해 독성이다. 유사하게, Cyt 단백질은 용혈 (세포용해) 활성을 나타내는 Bt로부터의 부아포 봉입체 단백질이거나, 공지된 Cyt 단백질에 대해 자명한 서열 유사성을 갖는다. 이들 독소는 그들의 표적 유기체에 대해 매우 특이적이며, 인간, 척추동물, 및 식물에게 무해하다.
Cry 독소의 구조는 주로 도메인의 중심에서 또는 도메인 사이의 연접부에 농축된 5개의 보존된 아미노산 블록을 나타낸다. Cry 독소는 각각 특정 기능을 갖는 3개의 도메인으로 이루어진다. 도메인 I은 중심 나선이 6개의 외부 나선에 의해 완전히 둘러싸인 7개의 α-나선 묶음이다. 이 도메인은 막에서의 채널 형성에 관련된다. 도메인 II는 이뮤노글로불린의 항원-결합 영역과 유사한 3개의 역-평행 β-쉬트의 삼각형 컬럼으로서 보인다. 도메인 III은 β 샌드위치 형태의 역-평행 β-가닥을 함유한다. 독소 단백질의 N-말단 부분은 그의 독성 및 특이성을 담당하며, 5개의 보존된 영역을 함유한다. C-말단 부분은 통상적으로 매우 보존되어 있으며, 아마도 결정 형성을 담당한다. 예를 들어, 미국 특허 제8,878,007호를 참조한다.
비. 투린기엔시스의 균주는 상이한 곤충 목 (인시목 (Lepidoptera), 쌍시목 (Diptera), 딱정벌레목 (Coleoptera), 벌목 (Hymenoptera), 동시아목 (Homoptera), 이목 (Phthiraptera) 또는 털이목 (Mallophaga), 및 진드기목 (Acari)) 및 다른 무척추동물 (선형동물 (Nemathelminthes), 편형동물 (Platyhelminthes), 및 사로코마스테브라테스 (Sarocomastebrates))에 대해 광범위한 특이성을 나타낸다. Cry 단백질은 다양한 곤충 및 무척추동물 군에 대한 독성에 기초하여 군으로 분류되었다. 일반적으로, Cry I 단백질은 인시목에 대해 독성을 입증하고, Cry II 단백질은 인시목 및 쌍시목에 대해 독성을 입증하고, CryIII 단백질은 딱정벌레목에 대해 독성을 입증하고, Cry IV 단백질은 쌍시목에 대해 독성을 입증하고, Cry V 및 Cry VI 단백질은 선충에 대해 독성을 입증한다. 새로운 Cry 단백질이 아미노산 동일성에 기초하여 확인되고, Cry 군에 할당될 수 있다. 예를 들어, 본원에 참조로 포함되는 문헌 [Bravo, A. (1997) J. of Bacteriol. 179:2793-2801]; [Bravo et al. (2013) Microb. Biotechnol. 6:17-26]을 참조한다.
750개 초과의 상이한 cry 유전자 서열이 73개의 군 (Cry1 내지 Cry73)으로 분류되었으며, 이 유전자 패밀리의 새로운 구성원이 계속 발견되고 있다 (Crickmore et al. (2014) www.btnomenclature.info/). cry 유전자 패밀리는 상이한 작용 방식을 가질 수 있는 몇몇 계통적으로 비-관련된 단백질 패밀리로 이루어진다: 3-도메인 Cry 독소의 패밀리, 살모기 Cry 독소의 패밀리, 바이너리-유사 독소의 패밀리, 및 Cyt 패밀리의 독소 (Bravo et al., 2005). 일부 Bt 균주는 VIP 독소로 지칭되는 추가의 살곤충 독소를 생산한다. 또한, 문헌 [Cohen et al. (2011) J. Mol. Biol. 413:4-814]; 월드 와이드 웹 상에서 lifesci.sussex.ac.uk/home/Neil_Crickmore/Bt/에서 발견되는 문헌 [Crickmore et al. (2014) Bacillus thuringiensis toxin nomenclature]; [Crickmore et al. (1988) Microbiol. Mol. Biol. Rev. 62: 807-813]; [Gill et al. (1992) Ann. Rev. Entomol. 37: 807-636]; [Goldbert et al. (1997) Appl. Environ. Microbiol. 63:2716-2712]; [Knowles et al. (1992) Proc. R. Soc. Ser. B. 248: 1-7]; [Koni et al. (1994) Microbiology 140: 1869-1880]; [Lailak et al. (2013) Biochem. Biophys. Res. Commun. 435: 216-221]; [Lopez-Diaz et al. (2013) Environ. Microbiol. 15: 3030-3039]; [Perez et al. (2007) Cell. Microbiol. 9: 2931-2937]; [Promdonkoy et al. (2003) Biochem. J. 374: 255-259]; [Rigden (2009) FEBS Lett. 583: 1555-1560]; [Schnepf et al. (1998) Microbiol. Mol. Biol. Rev. 62: 775-806]; [Soberon et al. (2013) Peptides 41: 87-93]; [Thiery et al. (1998) J. Am. Mosq. Control Assoc. 14: 472-476]; [Thomas et al. (1983) FEBS Lett. 154: 362-368]; [Wirth et al. (1997) Proc. Natl. Acad. Sci. U.S.A. 94: 10536-10540]; [Wirth et al (2005) Appl. Environ. Microbiol. 71: 185-189]; 및 [Zhang et al. (2006) Biosci. Biotechnol. Biochem. 70: 2199-2204]을 참조하며, 이들 각각은 그 전문이 본원에 참조로 포함된다.
Cyt는 세포용해 활성을 갖는 바실루스 투린기엔시스로부터의 부아포 결정 봉입체 단백질, 또는 공지된 Cyt 단백질과 서열 유사성을 갖는 단백질을 지정한다. (Crickmore et al. (1998) Microbiol. Mol. Biol. Rev. 62: 807-813). 유전자는 cyt로 나타내어진다. 이들 단백질은 Cry 단백질과는 구조 및 활성이 상이하다 (Gill et al. (1992) Annu. Rev. Entomol. 37: 615-636). Cyt 독소는 비. 투린기엔시스 아종 이스라엘렌시스 (B. thuringiensis subspecies israelensis)에서 최초로 발견되었다 (Goldberg et al. (1977) Mosq. News. 37: 355-358). 현재 명명법에서 11개의 홀로타입 독소를 포함하여 3개의 Cyt 독소 패밀리가 있다 (월드 와이드 웹 상에서 lifesci.sussex.ac.uk/home/Neil_Crickmore/Bt/에서 발견되는 문헌 [Crickmore et al. (2014) Bacillus thuringiensis toxin nomenclature]). cyt 유전자를 갖는 대다수의 비. 투린기엔시스 단리체는 쌍시목 곤충 (특히 모기 및 먹파리)에 대해 활성을 나타내지만, 또한 인시목 또는 딱정벌레목 곤충을 표적화하는 비. 투린기엔시스 균주에 기재된 cyt 유전자가 있다 (Guerchicoff et al. (1997) Appl. Environ. Microbiol. 63: 2716-2721).
X-선 결정학에 의해 해결된 Cyt2A의 구조는 α-나선의 2개의 외부 층이 혼합된 β-쉬트 주위를 감싸는 단일 도메인을 나타낸다. Cyt 독소의 추가의 이용가능한 결정 구조는 7 내지 8개의 β-가닥을 함유하는 β-쉬트 코어에 플랭킹된 2개의 α-나선 헤어핀을 갖는 보존된 α-β 구조적 모델을 뒷받침한다 (Cohen et al. (2011) J. Mol. Biol. 413: 80 4-814). 돌연변이유발 연구는 독성에 대해 중요한 것으로서 β-쉬트 잔기를 확인한 반면, 나선 도메인 중의 돌연변이는 독성에 영향을 미치지 않았다 (Adang et al.; Diversity of Bacillus thuringiensis Crystal Toxins and Mechanism of Action. In: T. S. Dhadialla and S. S. Gill, eds, Advances in Insect Physiology, Vol. 47, Oxford: Academic Press, 2014, pp. 39-87.) Cyt 독소의 대표적인 도메인은 δ-내독소, Bac_thur_독소이다 (Pfam PF01338).
Cyt 독소의 작용 방식에 대한 다수의 제안된 모델이 있으며, 이는 여전히 활발한 조사의 영역이다. 일부 Cyt 단백질 (Cyt1A)은 결정화를 위한 부속 단백질의 존재를 요구하는 것으로 나타났다. Cyt1A 및 Cyt2A 프로톡신은 안정한 독소 코어와 N- 및 C-말단 중의 동일한 부위에서 소화 프로테아제에 의해 프로세싱된다. 그 후, Cyt 독소는 비-포화된 막 지질, 예컨대 포스파티딜콜린, 포스파티딜에탄올아민, 및 스핑고미엘린과 상호작용한다. Cyt 독소에 대해, 기공-형성 및 세정제-유사 막 붕괴는 비-배타적 메커니즘으로서 제안되었으며; 둘 다는 독소 농도에 따라 일어날 수 있는 것으로 일반적으로 받아들여지고, 보다 낮은 농도는 막 파손을 초래하는 올리고머성 기공 및 보다 높은 농도를 선호한다 (Butko (2003) Appl. Environ. Microbiol. 69: 2415-2422). 기공-형성 모델에서, Cyt 독소는 세포막에 결합하여 세포의 콜로이드-삼투 용해를 초래하는 막 소포에서의 양이온-선택적 채널의 형성을 유도한다. (Knowles et al. (1989) FEBS Lett. 244: 259-262; Knowles et al. (1992) Proc. R. Soc. Ser. B. 248: 1-7 및 Promdonkoy et al. (2003) Biochem. J. 374: 255-259). 세정제 모델에서, 막 탈어셈블리 및 세포 사멸을 초래하는 지질 이중층의 표면 상에서의 독소의 비특이적 응집이 있다. (Butko (2003) 상기 문헌; Manceva et al. (2005) Biochem. 44: 589-597).
다수의 연구는 Cyt 독소와 다른 비. 투린기엔시스 독소, 특히 Cry, Bin, 및 Mtx 독소 사이의 상승작용적 활성을 보여주었다. 이 상승작용은 다른 독소에 대한 곤충의 저항성을 극복하는 것으로 나타난 적이 있다 (Wirth 1997, Wirth 2005, Thiery 1998, Zhang 2006). Cry 독소에 대한 Cyt 상승작용적 효과는 Cry 독소 예비-기공 올리고머의 형성을 촉진시키는 용액 중 또는 막 평면 상의 Cry 독소의 도메인 II에의 Cyt1A 결합에 관여하는 것으로 제안되어 있다. 이 올리고머의 형성은 Cyt 올리고머화, 결합 또는 삽입에 독립적이다 (Lailak 2013, Perez 2007, Lopez-Diaz 2013).
Cry 단백질과 비관련된 다수의 살충 단백질은 식물 성장 동안 비. 투린기엔시스 및 비. 세레우스 (B. cereus)의 일부 균주에 의해 생산된다 (Estruch et al. (1996) Proc Natl Acad Sci USA 93:5389-5394; Warren et al. (1994) WO 94/21795). 이들 식물 살곤충 단백질, 또는 Vip는 부아포 결정 단백질을 형성하지 않으며, 명백하게 세포로부터 분비된다. Vip는 이들이 결정-형성 단백질이 아니기때문에 현재 Cry 단백질 명명법으로부터 제외되어 있다. 용어 VIP는 일부 비. 투린기엔시스 Cry 단백질이 또한 식물 성장 동안 뿐만 아니라 정지 및 포자형성 단계, 가장 현저하게 Cry3Aa 동안 생산된다는 의미에서 오칭이다. 비. 투린기엔시스 게놈에서 Vip 유전자의 위치는 또한 cry 유전자를 코딩하는 큰 플라스미드 상에 있는 것으로 보고되었다 (Mesrati et al. (2005) FEMS Microbiol. Lett. 244(2):353-8). Bt 독소의 명명법에 대한 웹-사이트는 월드 와이드 웹 상에서 경로 "/home/Neil_Crickmore/Bt/"를 갖는 lifesci.sussex.ac.uk에서 및 "btnomenclature.info/"에서 발견될 수 있다. 또한, 문헌 [Schnepf et al. (1998) Microbiol. Mol. Biol. Rev. 62(3):775-806]을 참조한다. 이러한 참고문헌은 본원에 참조로 포함된다.
지금까지 4개의 범주의 Vip가 확인되었다. 일부 Vip 유전자는 바이너리 2-성분 단백질 복합체를 형성하며; "A" 성분은 통상적으로 "활성" 부분이고, "B" 성분은 통상적으로 "결합" 부분이다 (Pfam _pfam.xfam.org/family/PF03495.). Vip1 및 Vip4 단백질은 일반적으로 바이너리 독소 B 단백질 도메인을 함유한다. Vip2 단백질은 일반적으로 바이너리 독소 A 단백질 도메인을 함유한다.
Vip1 및 Vip2 단백질은 딱정벌레목에 대해 독성을 나타내는 바이너리 독소의 2가지 성분이다. Vip1Aa1 및 Vip2Aa1은 옥수수 뿌리벌레, 특히 디아브로티카 비르기페라 (Diabrotica virgifera) 및 디아브로티카 론기코르니스 (Diabrotica longicornis)에 대해 매우 활성이다 (Han et al. (1999) Nat. Struct. Biol. 6:932-936; Warren GW (1997) "Vegetative insecticidal proteins: novel proteins for control of corn pests" In: Carozzi NB, Koziel M (eds) Advances in insect control, the role of transgenic plants; Taylor & Francis Ltd, London, pp 109-21). 막-결합 95 kDa Vip1 다량체는 52 kDa Vip2 ADP-리보실라제가 표적 서부 옥수수 뿌리벌레 세포의 세포질에 진입하는 경로를 제공한다 (Warren (1997) 상기 문헌). NAD-의존성 ADP-리보실트랜스페라제 Vip2는 유사하게 Arg177에서의 단량체성 액틴이 중합을 차단하도록 변형시켜, 액틴 세포골격의 손실 및 결국 생체내에서 액틴 필라멘트 내에서의 급속한 서브유닛 교환으로 인한 세포 사멸을 초래한다 (Carlier M.F. (1990) Adv. Biophys. 26:51-73).
Cry 독소와 같이, 활성화된 Vip3A 독소는 막에서 안정한 이온 채널을 만들 수 있는 기공-형성 단백질이다 (Lee et al. (2003) Appl. Environ. Microbiol. 69:4648-4657). Vip3 단백질은 몇몇 주요 인시목 해충에 대해 활성이다 (Rang et al. (2005) Appl. Environ. Microbiol. 71(10):6276-6281; Bhalla et al. (2005) FEMS Microbiol. Lett. 243:467-472; Estruch et al. (1998) WO 9844137; Estruch et al. (1996) Proc Natl Acad Sci USA 93:5389-5394; Selvapandiyan et al. (2001) Appl. Environ Microbiol. 67:5855-5858; Yu et al. (1997) Appl. Environ Microbiol. 63:532-536). Vip3A는 아그로티스 입실론 (Agrotis ipsilon), 스포돕테라 프루기페르다 (Spodoptera frugiperda), 스포돕테라 엑시구아 (Spodoptera exigua), 헬리오티스 비레센스 (Heliothis virescens), 및 헬리코베르파 제아 (Helicoverpa zea)에 대해 활성이다 (Warren et al. (1996) WO 96/10083; Estruch et al. (1996) Proc Natl Acad Sci USA 93:5389-5394). Cry 독소와 같이, Vip3A 단백질은 Cry 독소에 의해 인식되는 것들과는 상이한 특이적 막 단백질의 중장 상피의 표면에서 인식되기 전에 프로테아제에 의해 활성화되어야 한다.
독소 단백질의 MTX 패밀리는 보존된 도메인, ETX_MTX2 (pfam 03318)의 존재를 특징으로 한다. 이 패밀리의 구성원은 바실루스 스파에리쿠스 (Bacillus sphaericus)로부터의 살모기 독소 Mtx2 및 Mtx3과, 뿐만 아니라 클로스트리디움 페르프린겐스 (Clostridium perfringens)로부터의 엡실론 독소 ETX와 서열 상동성을 공유한다 (Cole et al. (2004) Nat. Struct. Mol. Biol. 11: 797-8; Thanabalu et al. (1996) Gene 170:85-9). MTX-유사 단백질은 이들이 신장되고 우세하게 β-쉬트-기재 구조를 갖기 때문에 3-도메인 Cry 독소와는 구조적으로 별개이다. 그러나, 3-도메인 독소와 유사하게, MTX-유사 단백질은 표적 세포의 막에서 기공을 형성하는 것으로 생각된다 (Adang et al. (2014) 상기 문헌). 3-도메인 Cry 단백질과는 다르게, MTX-유사 단백질은 길이가 훨씬 더 작고, 267 아미노산 (Cry23) 내지 340 아미노산 (Cry15A) 범위이다.
지금까지, 단지 MTX-유사 독소의 패밀리에 속하는 15개의 단백질만이 Cry 명칭을 할당받았으며, 이는 이들을 3-도메인 Cry 패밀리에 비해 상대적으로 작은 부류로 만든다 (Crickmore et al. (2014) 상기 문헌; Adang et al. (2014) 상기 문헌). MTX-유사 독소 패밀리의 구성원으로는 Cry15, Cry23, Cry33, Cry38, Cry45, Cry46, Cry51, Cry60A, Cry60B, 및 Cry64를 들 수 있다. 이 패밀리는 인시목 및 딱정벌레목 목의 곤충 해충에 대한 활성을 비롯한, 다양한 살곤충 활성을 나타낸다. 이 패밀리의 일부 구성원은 살곤충 활성을 위해 요구될 수 있거나 그렇지 않을 수 있는 다른 단백질과 바이너리 동반관계를 형성할 수 있다.
Cry15는 비. 투린기엔시스 혈청형 톰프소니 HD542에서 확인된 34 kDA 단백질이다. Cry15는 대략 40 kDa의 비관련된 단백질과 함께 결정에서 천연 발생한다. Cry15 및 그의 상대 단백질을 코딩하는 유전자는 오페론에서 함께 배열된다. Cry15 단독은 만두카 섹타 (Manduca sexta), 시디아 포모넬라 (Cydia pomonella), 및 피에리스 라파에 (Pieris rapae)를 비롯한 인시목 곤충 해충에 대해 활성을 갖는 것으로 나타났으며, 40 kDA 단백질의 존재는 단지 씨. 포모넬라에 대한 Cry15의 활성을 증가시키는 것으로 나타났다 (Brown K. and Whiteley H. (1992) J. Bacteriol. 174:549-557; Naimov et al. (2008) Appl. Environ. Microbiol. 74:7145-7151). Cry15의 상대 단백질의 기능을 해명하기 위해 추가의 연구가 필요하다. 유사하게, Cry23은 그의 상대 단백질 Cry37과 함께 딱정벌레목 해충 트리볼리움 카스타네움 (Tribolium castaneum) 및 포필리아 자포니카 (Popillia japonica)에 대해 활성을 갖는 것으로 나타난 29 kDA 단백질이다 (Donovan et al. (2000) 미국 특허 제6,063,756호).
MTX-유사 패밀리의 새로운 구성원이 계속 확인되고 있다. ETX_MTX 독소 유전자는 최근 바실루스 투린기엔시스 혈청형 톨오르티 균주 Na205-3의 게놈에서 확인되었다. 이 균주는 인시목 해충 헬리코베르파 아르미게라 (Helicoverpa armigera)에 대해 독성인 것으로 밝혀졌으며, 이는 또한 Cry1, Cry11, Vip1, Vip2, 및 Vip3의 상동체를 함유하였다 (Palma et al. (2014) Genome Announc. 2(2): e00187-14. Published online Mar 13, 2014 at doi: 10.1128/genomeA.00187-14; PMCID: PMC3953196). MTX-유사 단백질은 3-도메인 Cry 단백질에 비해 고유한 도메인 구조를 갖기 때문에, 이들은 고유한 작용 방식을 소유함으로써, 이들을 곤충 방제 및 곤충 저항성에 대한 싸움에서 가치있는 도구로 만드는 것으로 믿어진다.
박테리아 세포는 숙주 및 비-숙주 유기체에 대해 다양한 특이성을 갖는 다수의 독소를 생산한다. 곤충 해충에 대해 활성을 갖는 독소를 비롯한, 바이너리 독소의 큰 패밀리는 다수의 박테리아 패밀리에서 확인되었다. (Poopathi and Abidha (2010) J. Physiol. Path. 1(3): 22-38). 예전에 바실루스 스파에리쿠스였던 리시니바실루스 스파에리쿠스 (Lysinibacillus sphaericus) (Ls) (Ahmed et al. (2007) Int. J. Syst. Evol. Microbiol. 57:1117-1125)는 곤충 생물방제 균주로서 널리 공지되어 있다. Ls는 매우 강력한 바이너리 복합체 BinA/BinB를 비롯한, 몇몇 살곤충 단백질을 생산한다. 이 바이너리 복합체는 Ls 세포에서 부아포 결정을 형성하며, 쌍시목 곤충, 구체적으로 모기에 대해 강하고 특이적인 활성을 갖는다. 일부 지역에서, 기존의 Ls 살모기 균주에 대한 곤충 저항성이 보고되었다. 상이한 표적 특이성 또는 곤충 저항성을 극복하는 능력을 갖는 새로운 바이너리 독소의 발견은 상당히 흥미롭다.
Ls 바이너리 살곤충 단백질 복합체는 2가지 주요 폴리펩티드, 즉, 각각 BinA 및 BinB로 지정된 42 kDa 폴리펩티드 및 51 kDa 폴리펩티드를 함유한다 (Ahmed et al. (2007), 상기 문헌). 2가지 폴리펩티드는 상승작용적으로 작용하여 그들의 표적에 독성을 부여한다. 작용 방식은 유충 중장 중의 수용체에의 단백질의 결합을 포함한다. 일부의 경우, 단백질은 유충 장에서의 프로테아제 소화에 의해 변형되어 활성화된 형태를 생산한다. BinB 성분은 결합에 관여하는 것으로 생각되는 반면, BinA 성분은 독성을 부여한다 (Nielsen-LeRoux et al. (2001) Appl. Environ. Microbiol. 67(11):5049-5054). 별개로 클로닝되고 발현되는 경우, BinA 성분은 모기 유충에 대해 독성인 반면, BinB 성분은 그렇지 않다. 그러나, 단백질의 공동-투여는 독성을 현저하게 증가시킨다 (Nielsen-LeRoux et al. (2001) 상기 문헌).
소수의 Bin 단백질 상동체는 박테리아 공급원으로부터 기재되었다. 문헌 [Priest et al. (1997) Appl. Environ. Microbiol. 63(4):1195-1198]에는 새로운 Ls 균주를 확인하기 위한 혼성화 노력이 기재되어 있지만, 대부분의 유전자는 이들은 공지된 BinA/BinB 단백질과 동일한 코딩된 단백질을 확인하였다. BinA 단백질은 독소 10 수퍼패밀리 도메인으로 공지된 한정된 보존된 도메인을 함유한다. 이 독소 도메인은 원래 BinA 및 BinB에서의 그의 존재에 의해 정의되었다. 2가지 단백질 둘 다는 도메인을 갖지만, BinA과 BinB 사이의 서열 유사성은 이 영역에서 제한된다 (<40%). 또한 살곤충 활성을 갖는 Cry49Aa 단백질은 또한 이 도메인을 갖는다 (하기 설명됨).
Ls의 Cry48Aa/Cry49Aa 바이너리 독소는 쿨렉스 퀸퀘파시아투스 (Culex quinquefasciatus) 모기 유충을 죽이는 능력을 갖는다. 이들 단백질은 비. 투린기엔시스 (Bt) Cry 단백질 복합체에 대해 일부 유사성을 갖는 단백질 구조적 부류이다. Bt의 Cry34/Cry35 바이너리 독소는 또한 옥수수의 상당한 해충인 서부 옥수수 뿌리벌레를 비롯한 곤충을 죽이는 것으로 공지되어 있다. 그의 몇몇 변이체가 확인된 Cry34는 작은 (14 kDa) 폴리펩티드인 반면, Cry35 (또한 몇몇 변이체에 의해 코딩됨)은 44 kDa 폴리펩티드이다. 이들 단백질은 BinA/BinB 단백질 군과 일부 서열 상동성을 가지며, 진화적으로 관련되는 것으로 생각된다 (Ellis et al. (2002) Appl. Environ. Microbiol. 68(3):1137-1145).
이들 부류의 독소로부터의 살충 단백질이 본원에 제공된다. 살충 단백질은 그들의 구조, 공지된 독소에 대한 상동성 및/또는 그들의 살충 특이성에 의해 분류된다.
ii.
살충 단백질의 변이체 및 단편 및 이를 코딩하는 폴리뉴클레오티드
본 발명의 살충 단백질 또는 폴리펩티드는 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159에 제시된 것들 및 그의 단편 및 변이체를 포함한다. "살충 독소" 또는 "살충 단백질" 또는 "살충 폴리펩티드"는 해충이 죽거나 방제되도록 곤충, 진균, 선충 등을 비롯한, 1종 이상의 해충에 대한 활성을 갖는 독소 또는 단백질 또는 폴리펩티드로 의도된다.
"단리된" 또는 "정제된" 폴리펩티드 또는 단백질, 또는 그의 생물학적 활성 부분은 그의 천연 발생 환경에서 발견되는 바와 같은 폴리펩티드 또는 단백질을 통상적으로 수반하거나 그와 상호작용하는 성분이 실질적으로 또는 본질적으로 없다. 따라서, 단리된 또는 정제된 폴리펩티드 또는 단백질은 다른 세포성 물질, 또는 재조합 기술에 의해 제조되는 경우 배양 배지가 실질적으로 없거나, 화학적으로 합성되는 경우 화학적 전구체 또는 다른 화학물질이 실질적으로 없다. 세포성 물질이 실질적으로 없는 단백질은 약 30%, 20%, 10%, 5%, 또는 1% (건조 중량) 미만의 오염시키는 단백질을 갖는 단백질의 제제를 포함한다. 본 발명의 단백질 또는 그의 생물학적 활성 부분이 재조합적으로 제조되는 경우, 최적 배양 배지는 약 30%, 20%, 10%, 5%, 또는 1% (건조 중량)의 화학적 전구체 또는 관심의 비-단백질 화학물질을 나타낸다.
용어 "단편"은 본 발명의 폴리펩티드 서열의 부분을 지칭한다. "단편" 또는 "생물학적 활성 부분"은 생물학적 활성을 보유하는 (살충 활성을 갖는) 충분한 수의 인접한 아미노산 잔기를 포함하는 폴리펩티드를 포함한다. 살충 단백질의 단편은 교대 하류 출발 부위의 사용으로 인해, 또는 살충 활성을 갖는 보다 짧은 단백질을 제조하는 프로세싱으로 인해, 전장 서열보다 짧은 것들을 포함한다. 프로세싱은 단백질이 발현되는 유기체에서, 또는 단백질의 섭취 후 해충에서 일어날 수 있다. 단백질의 단편의 예는 표 1에서 발견될 수 있다. 살충 단백질의 생물학적 활성 부분은 예를 들어, 길이가 10, 25, 50, 100, 150, 200, 250 이상의 아미노산인 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159 중 어느 하나의 폴리펩티드일 수 있다. 이러한 생물학적 활성 부분은 재조합 기술에 의해 제조되며, 살충 활성에 대해 평가될 수 있다. 본원에 사용된 바와 같이, 단편은 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159의 적어도 8개의 인접한 아미노산을 포함한다.
본원에 개시된 살충 단백질을 코딩하는 것들을 비롯한 박테리아 유전자는 매우 종종 오픈 리딩 프레임의 출발에 근접하게 다중 메티오닌 개시 코돈을 소유한다. 종종, 이들 출발 코돈 중 1개 이상에서의 번역 개시는 기능적 단백질의 생성을 초래할 것이다. 이들 출발 코돈은 ATG 코돈을 포함한다. 그러나, 박테리아, 예컨대 바실루스 종은 또한 출발 코돈으로서 코돈 GTG를 인식하며, GTG 코돈에서 번역을 개시하는 단백질은 최초 아미노산에서 메티오닌을 함유한다. 드문 경우, 박테리아 시스템에서의 번역은 TTG 코돈에서 개시될 수 있지만, 이 사건에서 TTG는 메티오닌을 코딩한다. 게다가, 이들 코돈이 박테리아에서 천연적으로 사용되는 선험은 종종 측정되지 않는다. 따라서, 교대 메티오닌 코돈 중 하나의 사용은 또한 살충 단백질의 생성을 초래할 수 있음이 이해된다. 이들 살충 단백질은 본 발명에 포함되며, 본원에 개시된 방법에 사용될 수 있다. 식물에서 발현되는 경우, 완전한 번역을 위해 교대 출발 코돈을 ATG로 변경하는 것이 필요할 것임이 이해될 것이다.
다양한 실시양태에서, 본원에 제공된 살충 단백질은 전장 뉴클레오티드 서열 및 교대 하류 출발 부위의 사용으로 인해 전장 서열보다 더 짧은 아미노산 서열로부터 추론된 아미노산 서열을 포함한다. 따라서, 본 발명의 뉴클레오티드 서열 및/또는 본 발명의 뉴클레오티드 서열을 포함하는 벡터, 숙주 세포, 및 식물 (및 본 발명의 뉴클레오티드 서열의 제조 및 사용 방법)은 교대 출발 부위를 코딩하는 뉴클레오티드 서열을 포함할 수 있다.
변이체 단백질을 생성하는 본원에 제공된 살충 폴리펩티드에 대해 변형이 이루어질 수 있음이 인식된다. 사람에 의해 설계된 변화는 부위-지정 돌연변이유발 기술의 적용을 통해 도입될 수 있다. 대안적으로, 본원에 개시된 서열과 구조적으로 및/또는 기능적으로-관련된 천연의, 아직-공지되지 않은 것으로서 또는 아직 확인되지 않은 것으로서의 폴리뉴클레오티드 및/또는 폴리펩티드는 또한 본 발명의 범위 내에 해당하는 것으로 확인될 수 있다. 보존적 아미노산 치환은 살충 단백질의 기능을 변경시키지 않는 비-보존된 영역에서 이루어질 수 있다. 대안적으로, 독소의 활성을 개선시키는 변형이 이루어질 수 있다. 예를 들어, 다양한 Cry 단백질 변이체가 고려된다. 도메인 III 스와핑에 의한 Cry 독소의 변형은 일부의 경우 특정 곤충 종에 대해 개선된 독성을 갖는 혼성체 독소를 초래하였다. 따라서, 도메인 III 스와핑은 Cry 독소의 독성을 개선시키거나, 모 Cry 독소에 대해 감수성을 나타내지 않는 해충에 대해 독성을 갖는 신규 혼성체 독소를 생성하기 위한 유효한 전략일 수 있다. 도메인 II 루프 서열의 부위-지정 돌연변이유발은 증가된 살곤충 활성을 갖는 새로운 독소를 초래할 수 있다. 도메인 II 루프 영역은 개선된 살곤충 특성을 갖는 Cry 독소의 돌연변이유발 및 선별을 위한 적합한 표적인 초기 Cry 독소의 핵심적 결합 영역이다. Cry 독소의 도메인 I은 특정 해충에 대한 활성을 개선시키기 위해 프로테아제 절단 부위을 도입하도록 변형될 수 있다. 다수의 cry 유전자 중에서 3개의 상이한 도메인을 셔플링하기 위한 전략 및 고 처리량 생물검정 스크리닝 방법은 개선된 또는 신규 독성을 갖는 신규 Cry 독소를 제공할 수 있다.
지시된 바와 같이, 본원에 개시된 폴리펩티드의 단편 및 변이체는 살충 활성을 보유할 것이다. 살충 활성은 예를 들어, 적어도 1종의 해충의 사멸, 또는 해충 성장, 섭식, 또는 정상적인 생리학적 발달의 현저한 감소를 비롯한, 표적 해충의 발생 또는 활성을 감소시키는 관찰가능한 효과를 달성하는 조성물의 능력을 포함한다. 이러한 수, 해충 성장, 섭식 또는 정상적인 발달의 감소는 예를 들어 약 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 85%, 90%, 95% 이상의 감소를 비롯한, 임의의 통계적으로 유의한 감소를 포함할 수 있다. 예를 들어, 딱정벌레목, 쌍시목, 벌목, 인시목, 털이목, 동시아목, 반시목 (Hemiptera), 메뚜기목 (Orthoptera), 선충, 총채벌레목 (Thysanoptera), 집게벌레목 (Dermaptera), 흰개미목 (Isoptera), 이아목 (Anoplura), 벼룩목 (Siphonaptera), 모시목 (Trichoptera) 등, 또는 본원에 기재된 임의의 다른 해충에 대한 살충 활성을 비롯한, 다양한 해충 중 1종 이상에 대한 살충 활성이 본원에 제공되었다. 살충 활성은 천연 단백질의 활성에 비해 상이하거나 개선될 수 있거나, 이는 살충 활성이 보유되는 한 변화되지 않을 수 있음이 인식된다. 살충 활성을 측정하는 방법은 본원의 다른 곳에 제공된다. 또한, 문헌 [Czapla and Lang (1990) J. Econ. Entomol. 83:2480-2485]; [Andrews et al. (1988) Biochem. J. 252:199-206]; [Marrone et al. (1985) J. of Economic Entomology 78:290-293]; 및 미국 특허 제5,743,477호를 참조하며, 이들 모두는 그 전문이 본원에 참조로 포함된다.
이 개시내용의 변이체는 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159 중 임의의 것의 아미노산 서열과 적어도 약 60%, 약 65%, 약 70%, 약 75%, 약 80%, 약 85%, 약 86%, 약 87%, 약 88%, 약 89%, 약 90%, 약 91%, 약 92%, 약 93%, 약 94%, 약 95%, 약 96%, 약 97%, 약 98% 또는 약 99% 동일한 아미노산 서열을 가지며, 살충 활성을 보유하는 폴리펩티드를 포함한다. 표 1은 서열식별번호: 1 내지 159 각각에 대한 변이체 폴리펩티드 (및 이를 코딩하는 폴리뉴클레오티드)의 비-제한적 예를 제공한다. 본원에 제공된 살충 폴리펩티드의 생물학적 활성 변이체는 약 1 내지 15 아미노산 잔기만큼 적게, 약 1 내지 10, 예컨대 약 6 내지 10만큼 적게, 5만큼 적게, 4만큼 적게, 3만큼 적게, 2만큼 적게, 또는 1 아미노산 잔기만큼 적게 상이할 수 있다. 구체적인 실시양태에서, 폴리펩티드는 폴리펩티드의 N' 또는 C' 말단 중 어느 하나로부터 적어도 10, 15, 20, 25, 30, 35, 40, 45, 50 아미노산 이상의 결실을 포함할 수 있는 N'-말단 또는 C'-말단 말단절단을 포함할 수 있다.
표 2는 PFAM 데이터에 기초하여 서열식별번호: 1 내지 159에서 발견되는 단백질 도메인을 제공한다. 도메인 설명 및 주어진 서열식별번호 내에서의 위치 둘 다가 표 2에 제공된다. 구체적인 실시양태에서, 서열식별번호: 1 내지 159 중 어느 하나를 포함하는 활성 변이체는 서열식별번호: 1 내지 159 중 어느 하나와 적어도 70%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 서열 동일성을 포함하며, 추가로 표 2에 제시된 보존된 도메인 중 적어도 하나를 포함한다. 예를 들어, 한 실시양태에서, 활성 변이체는 서열식별번호: 15와 적어도 70%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 서열 동일성을 포함할 것이고, 추가로 위치 78 내지 329에서 천연 아미노산, 위치 554 내지 708에서 천연 아미노산, 또는 위치 78 내지 329 및 위치 554 내지 708에서 천연 아미노산을 포함한다.
<표 2>
서열식별번호: 1 내지 159 각각에서 PFAM 도메인의 요약
서열식별번호: 8의 변이체는 서열식별번호: 9, 123 및 124를 포함한다. 도 1은 서열식별번호: 8, 9, 123 및 124의 아미노산 서열 정렬을 제공하며, 표 3은 변형의 유형 및 서열식별번호: 9, 123 및 124가 서열식별번호: 8과 공유하는 퍼센트 서열 동일성의 요약을 제공한다.
서열식별번호: 25의 변이체는 서열식별번호: 26, 125 및 126을 포함한다. 도 2는 서열식별번호: 25, 26, 125 및 126의 아미노산 서열 정렬을 제공하며, 표 3은 변형의 유형 및 서열식별번호: 26, 125 및 126이 서열식별번호: 25와 공유하는 퍼센트 서열 동일성의 요약을 제공한다.
서열식별번호: 72의 변이체는 서열식별번호: 94, 127 및 128을 포함한다. 도 3은 서열식별번호: 72, 94, 127 및 128의 아미노산 서열 정렬을 제공하며, 표 3은 변형의 유형 및 서열식별번호: 94, 127 및 128이 서열식별번호: 72와 공유하는 퍼센트 서열 동일성의 요약을 제공한다.
서열식별번호: 70의 변이체는 서열식별번호: 71, 129, 130, 및 131을 포함한다. 도 4는 서열식별번호: 70, 71, 129, 130, 및 131의 아미노산 서열 정렬을 제공한다. 표 3은 변형의 유형 및 서열식별번호: 71, 129, 130, 및 131이 서열식별번호: 70과 공유하는 퍼센트 서열 동일성의 요약을 제공한다.
<표 3>
서열식별번호: 8, 25, 72, 및 70에 대한 변이체의 요약
도 5a 내지 5h에서 주목된 바와 같이, 서열식별번호: 6, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및 159 각각은 하기 표 4에 나타낸 바와 같은 공통적인 모티프를 공유한다. 도 5a 내지 5h는 어디에서 보존된 서열 각각이 서열식별번호: 6 및 132 내지 159 각각에서 발견되는 지를 지시한다.
<표 4>
서열식별번호: 6 및 132 내지 159에서의 보존된 영역
구체적인 실시양태에서, 서열식별번호: 6 또는 132 내지 159를 포함하는 활성 변이체는 서열식별번호: 6 및 132 내지 159 중 어느 하나와 적어도 70%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 서열 동일성을 포함할 수 있고, 추가로 표 4에 제시된 보존된 도메인 중 적어도 하나를 포함한다. 예를 들어, 한 실시양태에서, 활성 변이체는 서열식별번호: 6 또는 132 내지 159와 적어도 70%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 서열 동일성을 포함하고, 추가로 표 4에 제시된 1종 이상의 아미노산 도메인 (즉, 서열식별번호: 161, 162, 163, 164, 165, 166, 및/또는 167 중 적어도 하나)을 포함할 것이다. 비-제한적 실시양태에서, 활성 변이체는 서열식별번호: 6 또는 132 내지 159와 적어도 70%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 서열 동일성을 포함하고, 추가로 서열식별번호: 166을 포함한다. 또다른 실시양태에서, 서열식별번호: 6 또는 132 내지 159의 활성 변이체는 서열식별번호: 6 또는 132 내지 159와 적어도 70%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 서열 동일성을 갖는 서열을 포함하며, 여기서, 활성 변이체는 서열식별번호: 160이 아니다.
도 6은 서열식별번호: 6, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및 159 사이의 퍼센트 서열 동일성 관계를 제공한다.
본원에 개시된 살충 폴리펩티드를 코딩하는 재조합 또는 합성 핵산이 또한 제공된다. 특히 흥미로운 것은 관심의 식물에서의 발현용으로 설계된 핵산 서열이다. 예를 들어, 핵산 서열은 숙주 식물에서의 증가된 발현을 위해 최적화될 수 있다. 본원에 제공된 바와 같은 살충 단백질은 특정 숙주, 예를 들어, 작물 식물에서의 발현을 위해 최적화된 코돈을 포함하는 핵산을 생성하기 위해 역-번역될 수 있다. 또다른 실시양태에서, 본원에 제공된 폴리펩티드를 코딩하는 폴리뉴클레오티드는 형질전환된 식물에서의 증가된 발현을 위해 최적화될 수 있다. 예를 들어, 폴리뉴클레오티드는 개선된 발현을 위해 식물-바람직한 코돈을 사용하여 합성될 수 있다. 예를 들어, 숙주-바람직한 코돈 사용법의 논의에 대해서는 문헌 [Campbell and Gowri (1990) Plant Physiol. 92:1-11]을 참조한다. 식물-바람직한 유전자를 합성하기 위한 방법이 관련 기술분야에서 이용가능하다. 예를 들어, 미국 특허 제5,380,831호 및 제5,436,391호, 및 문헌 [Murray et al. (1989) Nucleic Acids Res. 17:477-498]을 참조하며, 이들 각각은 본원에 참조로 포함된다. 형질전환된 식물 (예를 들어, 쌍자엽식물 또는 단자엽식물)에 의한 이러한 코딩 서열의 발현은 살충 폴리펩티드의 제조를 초래하고, 해충에 대해 식물에서의 증가된 저항성을 부여할 것이다. 본 발명의 살충 단백질을 코딩하는 재조합 및 합성 핵산 분자는 단백질을 코딩하는 천연 발생 박테리아 서열을 포함하지 않는다.
"재조합 폴리뉴클레오티드" 또는 "재조합 핵산"은 자연에서 직접 결합된 것으로 발견되지 않는 2개 이상의 화학적으로 연결된 핵산 절편의 조합을 포함한다. "직접 결합된"은 2개의 핵산 절편이 바로 인접하고, 화학적 연결에 의해 서로 결합되는 것으로 의도된다. 구체적인 실시양태에서, 재조합 폴리뉴클레오티드는 추가의 화학적으로 연결된 핵산 절편이 관심의 폴리뉴클레오티드에 대해 5', 3' 또는 내부 중 어느 하나에 위치하도록, 관심의 폴리뉴클레오티드 또는 그의 변이체 또는 단편을 포함한다. 대안적으로, 재조합 폴리뉴클레오티드의 화학적으로-연결된 핵산 절편은 서열의 결실에 의해 형성될 수 있다. 추가의 화학적으로 연결된 핵산 절편, 또는 연결된 핵산 절편을 결합시키기 위해 결실된 서열은 예를 들어, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20 뉴클레오티드 이상을 비롯한, 임의의 길이의 것일 수 있다. 이러한 재조합 폴리뉴클레오티드를 제조하는 다양한 방법으로는 화학적 합성, 및 유전자 조작 기술에 의한 폴리뉴클레오티드의 단리된 절편의 조작을 들 수 있다. 구체적인 실시양태에서, 재조합 폴리뉴클레오티드는 재조합 DNA 서열 또는 재조합 RNA 서열을 포함할 수 있다. "재조합 폴리뉴클레오티드 또는 핵산의 단편"은 자연에서 직접 결합된 것으로 발견되지 않는 2개 이상의 화학적으로 연결된 아미노산 절편의 조합 중 적어도 하나를 포함한다.
폴리뉴클레오티드의 단편 (RNA 또는 DNA)은 활성을 보유하는 단백질 단편을 코딩할 수 있다. 구체적인 실시양태에서, 재조합 폴리뉴클레오티드 또는 재조합 폴리뉴클레오티드 구축물의 단편은 자연에서 직접 결합된 것으로 발견되지 않는 2개 이상의 화학적으로 연결된 또는 작동적으로 연결된 핵산 절편의 적어도 하나의 연접을 포함한다. 살충 활성을 보유하는 폴리펩티드의 생물학적 활성 부분을 코딩하는 폴리뉴클레오티드의 단편은 적어도 25, 30, 40, 50, 60, 70, 75, 80, 90, 100, 110, 120, 125, 130, 140, 150, 160, 170, 175, 또는 180개의 인접한 아미노산, 또는 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159에 제시된 바와 같은 전장 폴리펩티드에 존재하는 아미노산의 총 수 이하를 코딩할 것이다. 구체적인 실시양태에서, 이러한 폴리펩티드 단편은 활성 단편이다. 일부 실시양태에서, 폴리펩티드 단편은 재조합 폴리펩티드 단편을 포함한다. 본원에 사용된 바와 같이, 재조합 폴리펩티드의 단편은 자연에서 직접 결합된 것으로 발견되지 않는 2개 이상의 화학적으로 연결된 아미노산 절편의 조합 중 적어도 하나를 포함한다.
본원에 사용된 용어 "변이체"는 실질적으로 유사한 서열을 의미하는 것으로 의도된다. 폴리뉴클레오티드에 대해, 변이체는 천연 폴리뉴클레오티드 내의 1개 이상의 내부 부위에 1개 이상의 뉴클레오티드의 결실 및/또는 부가, 및/또는 천연 폴리뉴클레오티드 중의 1개 이상의 부위에 1개 이상의 뉴클레오티드의 치환을 포함한다. 본원에 사용된 바와 같은 "천연" 폴리뉴클레오티드 또는 폴리펩티드는 각각 천연 발생 뉴클레오티드 서열 또는 아미노산 서열을 포함한다.
본 발명의 특정 폴리뉴클레오티드의 변이체 (즉, 참조 폴리뉴클레오티드)는 또한 2개 사이의 퍼센트 서열 동일성을 측정하기 위한 변이체 폴리뉴클레오티드에 의해 코딩되는 폴리펩티드 및 참조 폴리뉴클레오티드에 의해 코딩되는 폴리펩티드의 비교에 의해 평가될 수 있다. 따라서, 예를 들어, 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159의 폴리펩티드와 주어진 퍼센트 서열 동일성을 갖는 폴리펩티드를 코딩하는 단리된 폴리뉴클레오티드가 개시된다. 임의의 2개의 폴리펩티드 사이의 퍼센트 서열 동일성은 본원의 다른 곳에 기재된 서열 정렬 프로그램 및 파라미터를 사용하여 계산될 수 있다. 본 발명의 폴리뉴클레오티드의 임의의 주어진 쌍이 그들이 코딩하는 2개의 폴리펩티드에 의해 공유되는 퍼센트 서열 동일성의 비교에 의해 평가되는 경우, 2개의 코딩되는 폴리펩티드 사이의 퍼센트 서열 동일성은 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159와 적어도 약 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% 이상 서열 동일성이다. 다른 실시양태에서, 본원에 제공된 폴리뉴클레오티드의 변이체는 적어도 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 이상의 뉴클레오티드만큼 천연 서열과 상이하다.
iii.
서열 비교
정렬된 아미노산 서열의 특정 쌍에 관하여 사용되는 경우, 본원에 사용된 용어 "동일성" 또는 "퍼센트 동일성"은 정렬에서 동일한 매치의 수를 카운팅하고, 이러한 동일한 매치의 수를 정렬된 서열의 길이로 나눔으로써 얻어진 퍼센트 아미노산 서열 동일성을 지칭한다. 정렬된 아미노산 서열의 특정 쌍에 관하여 사용되는 경우, 본원에 사용된 용어 "유사성" 또는 "퍼센트 유사성"은 정렬에서 각각의 아미노산 쌍에 대한 점수화 행렬을 정렬된 서열의 길이로 나눈 것으로부터 얻어진 점수의 합을 지칭한다.
달리 언급되지 않는다면, 동일성 및 유사성은 디폴트 갭 패널티 및 점수화 행렬을 사용하는 (단백질에 대해 EBLOSUM62 및 DNA에 대해 EDNAFULL) EMBOSS 소프트웨어 패키지 (문헌 [Rice, P. Longden, I. and Bleasby, A., EMBOSS: The European Molecular Biology Open Software Suite, 2000, Trends in Genetics 16, (6) pp. 276-277], 다른 소스 중에서도 EMBnet으로부터 embnet.org/resource/emboss 및 emboss.sourceforge.net에서 이용가능한 버전 6.3.1)의 부품으로서 배포된 "니들" 프로그램에 의해 실행되는 바와 같은 니들맨-운쉬 (Needleman-Wunsch) 글로벌 정렬 및 점수화 알고리듬 (Needleman and Wunsch (1970) J. Mol. Biol. 48(3):443-453)에 의해 계산된다. 등가의 프로그램이 또한 사용될 수 있다. "등가의 프로그램"은 EMBOSS 버전 6.3.1로부터의 니들에 의해 생성된 상응하는 정렬과 비교할 경우, 해당 임의의 2개의 서열에 대해, 동일한 뉴클레오티드 잔기 매치 및 동일한 퍼센트 서열 동일성을 갖는 정렬을 생성하는 임의의 서열 비교 프로그램으로 의도된다.
추가의 수학적 알고리듬은 관련 기술분야에 공지되어 있으며, 2개의 서열의 비교를 위해 이용될 수 있다. 예를 들어, 문헌 [Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877]에서와 같이 변형된 문헌 [Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264]의 알고리듬을 참조한다. 이러한 알고리듬은 문헌 [Altschul et al. (1990) J. Mol. Biol. 215:403]의 BLAST 프로그램에 포함된다. BLAST 뉴클레오티드 검색은 본 발명의 살충-유사 핵산 분자와 상동성인 뉴클레오티드 서열을 얻기 위해 BLASTN 프로그램으로 수행될 수 있다. BLAST 단백질 검색은 본 발명의 살충 단백질 분자와 상동성인 아미노산 서열을 얻기 위해 BLASTP 프로그램으로 수행될 수 있다. 비교 목적을 위한 간극화된 정렬을 얻기 위해, 갭드 (Gapped) BLAST (BLAST 2.0에서)가 문헌 [Altschul et al. (1997) Nucleic Acids Res. 25:3389]에 기재된 바와 같이 이용될 수 있다. 대안적으로, PSI-Blast를 사용하여 분자 사이의 떨어진 관계를 검출하는 반복된 검색을 수행할 수 있다. Altschul et al. (1997) 상기 문헌 참조. BLAST, 갭드 BLAST, 및 PSI-Blast 프로그램을 이용하는 경우, 각각의 프로그램의 디폴트 파라미터 (예를 들어, BLASTX 및 BLASTN)가 사용될 수 있다. 정렬은 또한 검사에 의해 수동으로 수행될 수 있다.
2개의 서열은 이들이 그 서열의 쌍에 대해 가능한 최고 점수에 도달하도록 위해 정의된 아미노산 치환 행렬 (예를 들어, BLOSUM62), 갭 존재 패널티 및 갭 연장 패널티를 사용하여 유사성 점수화에 대해 정렬되는 경우 "최적으로 정렬된다". 아미노산 치환 행렬 및 2개의 서열 사이의 유사성을 정량화하는 데 있어서의 그들의 사용은 관련 기술분야에 널리 공지되어 있으며, 예를 들어, 문헌 [Dayhoff et al. (1978) "A model of evolutionary change in proteins." In "Atlas of Protein Sequence and Structure," Vol. 5, Suppl. 3 (ed. M. O. Dayhoff), pp. 345-352. Natl. Biomed. Res. Found., Washington, D.C. and Henikoff et al. (1992) Proc. Natl. Acad. Sci. USA 89:10915-10919]에 기재되어 있다. BLOSUM62 행렬은 종종 서열 정렬 프로토콜에서 디폴트 점수화 치환 행렬로서 사용된다. 갭 존재 패널티는 정렬된 서열 중 하나에서 단일 아미노산 갭의 도입을 위해 시행되며, 갭 연장 패널티는 이미 오픈된 갭 내로 삽입된 각각의 추가의 빈 아미노산 위치를 위해 시행된다. 정렬은, 최고 가능한 점수에 도달하도록, 정렬이 시작되고 종료되는 각각의 서열의 아미노산 위치에 의해, 및 임의로 하나 또는 둘 다의 서열에서의 갭 또는 다중 갭의 삽입에 의해 정의된다. 최적 정렬 및 점수화는 수동으로 수행될 수 있는 반면, 프로세스는 컴퓨터-실행된 정렬 알고리듬, 예컨대, 예를 들어, 문헌 [Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402]에 기재된 갭드 BLAST 2.0의 사용에 의해 용이화되며, 미국 국립 생물공학 정보 센터 (National Center for Biotechnology Information) 웹사이트 (www.ncbi.nlm.nih.gov)에서 공중에게 이용가능하게 된다. 다중 정렬을 비롯한 최적 정렬은 예를 들어, www.ncbi.nlm.nih.gov를 통해 이용가능하고, 문헌 [Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402]에 의해 기재된 PSI-BLAST를 사용하여 제작될 수 있다.
참조 서열과 최적으로 정렬된 아미노산 서열에 관하여, 아미노산 잔기는 잔기가 정렬에서 그와 쌍형성되는 참조 서열에서의 위치"에 상응한다". "위치"는 N-말단에 비한 그의 위치에 기초하여 참조 서열에서 각각의 아미노산을 순차적으로 확인하는 번호에 의해 나타내어진다. 예를 들어, 서열식별번호: 1에서, 위치 1은 M이고, 위치 2는 A이고, 위치 3은 N인 등이다. 시험 서열이 서열식별번호: 1과 최적으로 정렬되는 경우, 위치 3에서의 N과 정렬하는 시험 서열에서의 잔기는 서열식별번호: 1의 "위치 3에 상응하는" 것으로 이야기된다. 최적 정렬을 결정하는 경우 고려되어야 하는 결실, 삽입, 말단절단, 융합 등 때문에, 일반적으로, N-말단으로부터 단순하게 카운팅함으로써 측정되는 바와 같은 시험 서열에서의 아미노산 잔기 번호는 참조 서열에서의 그의 상응하는 위치의 번호와 반드시 동일하지는 않을 것이다. 예를 들어, 정렬된 시험 서열에서 결실이 있는 경우, 결실의 부위에서 참조 서열에서의 위치에 상응하는 아미노산은 없을 것이다. 정렬된 참조 서열에서 삽입이 있는 경우, 그 삽입은 참조 서열에서의 임의의 아미노산 위치에 상응하지 않을 것이다. 말단절단 또는 융합의 경우, 상응하는 서열에서의 임의의 아미노산에 상응하지 않는 참조 또는 정렬된 서열 중 어느 하나에서 아미노산의 스트레치가 있을 수 있다.
iv.
항체
본 발명의 폴리펩티드에 대한, 또는 그의 변이체 또는 단편에 대한 항체가 또한 포함된다. 항체를 제조하는 방법은 관련 기술분야에 널리 공지되어 있다 (예를 들어, 문헌 [Harlow and Lane (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.]; 및 미국 특허 제4,196,265호 참조). 이들 항체는 독소 폴리펩티드의 검출 및 단리용 키트에서 사용될 수 있다. 따라서, 이 개시내용은 예를 들어, 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159의 서열을 갖는 폴리펩티드를 비롯한, 본원에 기재된 폴리펩티드에 특이적으로 결합하는 항체를 포함하는 키트를 제공한다.
II.
해충
본원에 제공된 조성물 및 방법은 다양한 해충에 대해 유용하다. "해충"으로는 곤충, 진균, 박테리아, 선충, 진드기, 원생동물 병원체, 동물-기생 간 흡충 등을 들 수 있으나, 이에 제한되지는 않는다. 특정 관심의 해충은 곤충 해충, 특히 농업 식물에 상당한 손상을 유발하는 곤충 해충이다. 곤충 해충으로는 딱정벌레목, 쌍시목, 벌목, 인시목, 털이목, 동시아목, 반시목, 메뚜기목, 총채벌레목, 집게벌레목, 흰개미목, 이아목, 벼룩목, 모시목, 또는 선충으로부터 선택되는 곤충을 들 수 있다. 비-제한적 실시양태에서, 곤충 해충은 서부 옥수수 뿌리벌레, 디아브로티카 비르기페라 비르기페라 (Diabrotica virgifera virgifera); 가을 거염벌레, 스포돕테라 프루기페르다; 콜로라도 감자 딱정벌레, 렙티노타르사 데셈리네아타 (Leptinotarsa decemlineata); 옥수수 이삭벌레, 헬리코베르파 제아 (북아메리카에서 동일 종은 목화를 공격하며, 목화 다래벌레로 지칭됨); 유럽 옥수수 천곤충, 오스트리니아 누빌랄리스 (Ostrinia nubilalis); 검거세미나방, 아그로티스 입실론; 배추좀나방, 플루텔라 크실로스텔라 (Plutella xylostella); 벨벳빈 카테필러, 안티카르시아 겜마탈리스 (Anticarsia gemmatalis); 남서부 옥수수 천곤충, 디아트라에아 그란디오셀라 (Diatraea grandiosella); 목화 다래벌레, 헬리코베르파 아르미게라 (미국 외에 나머지 권역에서 발견됨); 남부 녹색 노린재, 네자라 비리둘라 (Nezara viridula); 녹색 노린재, 키나비아 할라리스 (Chinavia halaris); 썩덩나무 노린재, 할리오모르파 할리스 (Halyomorpha halys); 및 갈색 노린재, 유스키스투스 세르부스 (Euschistus servus)를 포함한다. 다른 실시양태에서, 해충은 멜로이도기네 하플라 (Meloidogyne hapla) (북부 뿌리-혹 선충); 멜로이도기네 엔테롤로비이 (Meloidogyne enterolobii), 멜로이도기네 아레나리아 (Meloidogyne arenaria) (땅콩 뿌리-혹 선충); 및 멜로이도기네 자바니카 (Meloidogyne javanica)를 포함하나 이에 제한되지 않는 선충을 포함한다.
본원에 사용된 용어 "곤충 해충"은 곤충 및 다른 유사한 해충, 예컨대, 예를 들어, 응애 및 진드기를 포함하나 이에 제한되지 않는 진드기목의 것들을 지칭한다. 본 발명의 곤충 해충으로는 인시목의 곤충, 예를 들어 아코로이아 그리셀라 (Achoroia grisella), 아클레리스 글로베라나 (Acleris gloverana), 아클레리스 바리아나 (Acleris variana), 아독소피에스 오라나 (Adoxophyes orana), 아그로티스 입실론, 알라바마 아르길라세아 (Alabama argillacea), 알소필라 포메타리아 (Alsophila pometaria), 아미엘로이스 트란시텔라 (Amyelois transitella), 아나가스타 쿠에니엘라 (Anagasta kuehniella), 아나르시아 리네아텔라 (Anarsia lineatella), 아니소타 세나토리아 (Anisota senatoria), 안테라에아 페르니이 (Antheraea pernyi), 안티카르시아 겜마탈리스, 아르킵스 (Archips) 종, 아르기로타에니아 (Argyrotaenia) 종, 아테티스 민다라 (Athetis mindara), 봄빅스 모리 (Bombyx mori), 부쿨라트릭스 투르베리엘라 (Bucculatrix thurberiella), 카드라 카우텔라 (Cadra cautella), 코리스토뉴라 (Choristoneura) 종, 코킬스 호스페스 (Cochylls hospes), 콜리아스 유리테메 (Colias eurytheme), 코르키라 세팔로니카 (Corcyra cephalonica), 시디아 라티페레아누스 (Cydia latiferreanus), 시디아 포모넬라, 다타나 인테게리마 (Datana integerrima), 덴드롤리무스 시베리쿠스 (Dendrolimus sibericus), 데스미아페네랄리스 (Desmiafeneralis), 디아파니아 히알리나타 (Diaphania hyalinata), 디아파니아 니티달리스 (Diaphania nitidalis), 디아트라에아 그란디오셀라, 디아트라에아 사카랄리스 (Diatraea saccharalis), 엔노모스 수브시그나리아 (Ennomos subsignaria), 에오류마 로프티니 (Eoreuma loftini), 에스페스티아 엘루텔라 (Esphestia elutella), 에란니스 틸라리아 (Erannis tilaria), 에스티그메네 아크레아 (Estigmene acrea), 율리아 살루브리콜라 (Eulia salubricola), 유포코엘리아 암비구엘라 (Eupocoellia ambiguella), 유포에실리아 암비구엘라 (Eupoecilia ambiguella), 유프로크티스 크리소르호에아 (Euproctis chrysorrhoea), 유크소아 메소리아 (Euxoa messoria), 갈레리아 멜로넬라 (Galleria mellonella), 그라폴리타 몰레스타 (Grapholita molesta), 하리시나 아메리카나 (Harrisina americana), 헬리코베르파 수브플렉사 (Helicoverpa subflexa), 헬리코베르파 제아, 헬리오티스 비레센스 (Heliothis virescens), 헤밀류카 올리비아에 (Hemileuca oliviae), 호모에오소마 엘렉텔룸 (Homoeosoma electellum), 히판티아 쿠네아 (Hyphantia cunea), 케이페리아 리코페르시켈라 (Keiferia lycopersicella), 람디나 피셀라리아 피셀라리아 (Lambdina fiscellaria fiscellaria), 람디나 피셀라리아 루구브로사 (Lambdina fiscellaria lugubrosa), 류코마 살리시스 (Leucoma salicis), 로베시아 보트라나 (Lobesia botrana), 록소스테게 스틱티칼리스 (Loxostege sticticalis), 리만트리아 디스파르 (Lymantria dispar), 마칼라 티리살리스 (Macalla thyrisalis), 말라코소마 (Malacosoma) 종, 마메스트라 브라시카에 (Mamestra brassicae), 마메스트라 콘피구라타 (Mamestra configurata), 만두카 퀸퀘마쿨라타 (Manduca quinquemaculata), 만두카 섹타, 마루카 테스툴라리스 (Maruca testulalis), 멜란크라 픽타 (Melanchra picta), 오페로프테라 브루마타 (Operophtera brumata), 오르기아 (Orgyia) 종, 오스트리니아 누빌랄리스, 팔레아크리타 베르나타 (Paleacrita vernata), 파필리오 크레스폰테스 (Papilio cresphontes), 펙티노포라 고시피엘라 (Pectinophora gossypiella), 프리가니디아 칼리포르니카 (Phryganidia californica), 필로노릭터 블란카르델라 (Phyllonorycter blancardella), 피에리스 나피 (Pieris napi), 피에리스 라파에, 플라티페나 스카브라 (Plathypena scabra), 플라티노타 플루오엔다나 (Platynota flouendana), 플라티노타 스툴타나 (Platynota stultana), 플라팁틸라 카르두이닥틸라 (Platyptilia carduidactyla), 플로디아 인터푼크텔라 (Plodia interpunctella), 플루텔라 크실로스텔라, 폰티아 프로토디세 (Pontia protodice), 슈달레티아 우니푼크타 (Pseudaletia unipuncta), 슈도플라시아 인클루덴스 (Pseudoplasia includens), 사불로데스 아에그로타타 (Sabulodes aegrotata), 쉬주라 콘킨나 (Schizura concinna), 시토트로가 세레알렐라 (Sitotroga cerealella), 스필론타 오셀라나 (Spilonta ocellana), 스포돕테라 (Spodoptera) 종, 타우른스토포에아 피티오캄파 (Thaurnstopoea pityocampa), 틴솔라 비셀리엘라 (Tinsola bisselliella), 트리코클루시아 히 (Trichoplusia hi), 우데아 루비갈리스 (Udea rubigalis), 크실로미게스 쿠리알리스 (Xylomyges curiails), 및 이포노뮤타 파델라 (Yponomeuta padella)를 들 수 있으나, 이에 제한되지는 않는다.
곤충 해충은 또한 쌍시목, 벌목, 인시목, 털이목, 동시아목, 반시목, 메뚜기목, 총채벌레목, 집게벌레목, 흰개미목, 이아목, 벼룩목, 모시목, 딱정벌레목으로부터 선택되는 곤충을 포함한다.
주요 작물에 대한 곤충 해충으로는 메이즈: 오스트리니아 누빌랄리스, 유럽 옥수수 천곤충; 아그로티스 입실론, 검거세미나방; 헬리코베르파 제아, 옥수수 이삭벌레; 스포돕테라 프루기페르다, 가을 거염벌레; 디아트라에아 그란디오셀라, 남서부 옥수수 천곤충; 엘라스모팔푸스 리그노셀루스 (Elasmopalpus lignosellus), 옥수숫대 천곤충 유충; 디아트라에아 사카랄리스, 사탕수수 천곤충; 서부 옥수수 뿌리벌레, 예를 들어, 디아브로티카 비르기페라 비르기페라; 북부 옥수수 뿌리벌레, 예를 들어, 디아브로티카 론기코르니스 바르베리 (Diabrotica longicornis barberi); 남부 옥수수 뿌리벌레, 예를 들어, 디아브로티카 운데킴푼크타타 호와르디 (Diabrotica undecimpunctata howardi); 멜라노투스 (Melanotus) 종, 방아벌레 유충; 시클로세팔라 보레알리스 (Cyclocephala borealis), 북부 마스크 딱정벌레 (굼벵이); 시클로세팔라 임마쿨라타 (Cyclocephala immaculata), 남부 마스크 딱정벌레 (굼벵이); 포필리아 자포니카, 일본 딱정벌레; 카에토크네마 풀리카리아 (Chaetocnema pulicaria), 옥수수 벼룩잎벌레; 스페노포루스 마이디스 (Sphenophorus maidis), 메이즈 바구미; 로팔로시품 마이디스 (Rhopalosiphum maidis), 옥수수 잎 진딧물; 아누라피스 마이디라디시스 (Anuraphis maidiradicis), 옥수수 뿌리 진딧물; 블리수스 류콥테루스 류콥테루스 (Blissus leucopterus leucopterus), 긴노린재; 멜라노플루스 페무루브룸 (Melanoplus femurrubrum), 붉은다리 메뚜기; 멜라노플루스 산구이니페스 (Melanoplus sanguinipes), 이동성 메뚜기; 힐레미아 플라투라 (Hylemya platura), 옥수수씨 구더기; 아그로미자 파르비코르니스 (Agromyza parvicornis), 옥수수 얼룩 잎나방; 아나포트립스 옵스크루루스 (Anaphothrips obscrurus), 잔디 삽주벌레; 솔레놉시스 밀레스타 (Solenopsis milesta), 도둑 개미; 테트라니쿠스 우르티카에 (Tetranychus urticae), 두반점 잎응애; 수수: 킬로 파르텔루스 (Chilo partellus), 수수 천곤충; 스포돕테라 프루기페르다, 가을 거염벌레; 헬리코베르파 제아, 옥수수 이삭벌레; 엘라스모팔푸스 리그노셀루스, 옥수숫대 천곤충 유충; 펠티아 수브테라네아 (Feltia subterranea), 곡물 거세미나방; 필로파가 크리니타 (Phyllophaga crinita), 굼벵이; 엘레오데스 (Eleodes), 코노데루스 (Conoderus), 및 아에올루스 (Aeolus) 종, 방아벌레 유충; 오울레마 멜라노푸스 (Oulema melanopus), 긴가슴 잎벌레; 카에토크네마 풀리카리아, 옥수수 벼룩잎벌레; 스페노포루스 마이디스, 메이즈 바구미; 로팔로시품 마이디스; 옥수수 잎 진딧물; 시파 플라바 (Sipha flava), 노란 사탕수수 진딧물; 긴노린재, 예를 들어, 블리수스 류콥테루스 류콥테루스; 콘타리니아 소르그히콜라 (Contarinia sorghicola), 수수 각다귀; 테트라니쿠스 신나바리누스 (Tetranychus cinnabarinus), 카르민 잎응애; 테트라니쿠스 우르티카에, 두반점 잎응애; 밀: 슈달레티아 우니푼크타, 거염벌레; 스포돕테라 프루기페르다, 가을 거염벌레; 엘라스모팔푸스 리그노셀루스, 옥수숫대 천곤충 유충; 아그로티스 오르토고니아 (Agrotis orthogonia), 배추밤나방 유충; 엘라스모팔푸스 리그노셀루스, 옥수숫대 천곤충 유충; 오울레마 멜라노푸스, 긴가슴 잎벌레; 히페라 푼크타타 (Hypera punctata), 클로버 잎 바구미; 남부 옥수수 뿌리벌레, 예를 들어, 디아브로티카 운데킴푼크타타 호와르디; 러시아 밀 진딧물; 쉬자피스 그라미눔 (Schizaphis graminum), 녹색벌레; 마크로시품 아베나에 (Macrosiphum avenae), 영국 곡물 진딧물; 멜라노플루스 페무루브룸, 붉은다리 메뚜기; 멜라노플루스 디페렌티알리스 (Melanoplus differentialis), 디퍼렌셜 메뚜기; 멜라노플루스 산구이니페스, 이동성 메뚜기; 마이에티올라 데스트룩토르 (Mayetiola destructor), 헤시안 플라이 (Hessian fly); 시토디플로시스 모셀라나 (Sitodiplosis mosellana), 밀 각다귀; 메로미자 아메리카나 (Meromyza americana), 밀 줄기 구더기; 힐레미아 코아르크타타 (Hylemya coarctata), 밀 구근 플라이 (wheat bulb fly); 프란클리니엘라 푸스카 (Frankliniella fusca), 담배 삽주벌레; 세푸스 신크투스 (Cephus cinctus), 밀 줄기 잎벌; 아세리아 툴리파에 (Aceria tulipae), 밀 오갈 응애; 해바라기: 실린드로쿱투루스 아드스페르수스 (Cylindrocupturus adspersus), 해바라기 줄기 바구미; 스미크로닉스 풀루스 (Smicronyx fulus), 붉은 해바라기 씨 바구미; 스미크로닉스 소르디두스 (Smicronyx sordidus), 회색 해바라기 씨 바구미; 술레이마 헬리안타나 (Suleima helianthana), 해바라기 순 나방; 호모에오소마 엘렉텔룸, 해바라기 나방; 지고그람마 엑스칼라마티오니스 (Zygogramma exclamationis), 해바라기 딱정벌레; 보티루스 깁보수스 (Bothyrus gibbosus), 당근 딱정벌레; 네올라시옵테라 무르트펠드티아나 (Neolasioptera murtfeldtiana), 해바라기 씨 각다귀; 목화: 헬리오티스 비레센스, 담배 눈벌레; 헬리코베르파 제아, 목화 다래벌레; 스포돕테라 엑시구아, 비트 거염벌레; 펙티노포라 고시피엘라, 분홍 다래벌레; 목화 바구미, 예를 들어, 안토노무스 그란디스 (Anthonomus grandis); 아피스 고시피이 (Aphis gossypii), 목화 진딧물; 슈다토모셀리스 세리아투스 (Pseudatomoscelis seriatus), 목화 플리호퍼 (cotton fleahopper); 트리알류로데스 아부틸로네아 (Trialeurodes abutilonea), 구부러진 날개 가루이; 리구스 리네올라리스 (Lygus lineolaris), 흐린 장님노린재; 멜라노플루스 페무루브룸, 붉은다리 메뚜기; 멜라노플루스 디페렌티알리스, 디퍼렌셜 메뚜기; 트립스 타바시 (Thrips tabaci), 양파 삽주벌레; 프란클린키엘라 푸스카 (Franklinkiella fusca), 담배 삽주벌레; 테트라니쿠스 신나바리누스, 카르민 잎응애; 테트라니쿠스 우르티카에, 두반점 잎응애; 벼: 디아트라에아 사카랄리스, 사탕수수 천곤충; 스포돕테라 프루기페르다, 가을 거염벌레; 헬리코베르파 제아, 옥수수 이삭벌레; 콜라스피스 브룬네아 (Colaspis brunnea), 포도 콜라스피스 (grape colaspis); 리소르홉트루스 오리조필루스 (Lissorhoptrus oryzophilus), 벼 물 바구미; 시토필루스 오리자에 (Sitophilus oryzae), 벼 바구미; 네포테틱스 니그로픽투스 (Nephotettix nigropictus), 벼 매미충; 긴노린재, 예를 들어, 블리수스 류콥테루스 류콥테루스; 아크로스테르눔 힐라레 (Acrosternum hilare), 녹색 노린재; 대두: 슈도플루시아 인클루덴스 (Pseudoplusia includens), 대두 루퍼; 안티카르시아 겜마탈리스, 벨벳빈 카테필러; 플라티페나 스카브라, 녹색 클로버벌레; 오스트리니아 누빌랄리스, 유럽 옥수수 천곤충; 아그로티스 입실론, 검거세미나방; 스포돕테라 엑시구아, 비트 거염벌레; 헬리오티스 비레센스, 담배 눈벌레; 헬리코베르파 제아, 목화 다래벌레; 에필라크나 바리베스티스 (Epilachna varivestis), 멕시코 콩 딱정벌레; 미주스 페르시카에 (Myzus persicae), 녹색 복숭아 진딧물; 엠포아스카 파바에 (Empoasca fabae), 감자 매미충; 아크로스테르눔 힐라레, 녹색 노린재; 멜라노플루스 페무루브룸, 붉은다리 메뚜기; 멜라노플루스 디페렌티알리스, 디퍼렌셜 메뚜기; 힐레미아 플라투라, 옥수수씨 구더기; 세리코트립스 바리아빌리스 (Sericothrips variabilis), 대두 삽주벌레; 트립스 타바시, 양파 삽주벌레; 테트라니쿠스 투르케스타니 (Tetranychus turkestani), 딸기 잎응애; 테트라니쿠스 우르티카에, 두반점 잎응애; 보리: 오스트리니아 누빌랄리스, 유럽 옥수수 천곤충; 아그로티스 입실론, 검거세미나방; 쉬자피스 그라미눔, 녹색벌레; 긴노린재, 예를 들어, 블리수스 류콥테루스 류콥테루스; 아크로스테르눔 힐라레, 녹색 노린재; 유스키스투스 세르부스, 갈색 노린재; 질레미아 플라투라 (Jylemya platura), 옥수수씨 구더기; 마이에티올라 데스트룩토르, 헤시안 플라이; 페트로비아 라텐스 (Petrobia latens), 갈색 밀 응애; 유채: 브레비코리네 브라시카에 (Vrevicoryne brassicae), 양배추 진딧물; 필로트레타 크루시페라에 (Phyllotreta cruciferae), 배추벼룩 딱정벌레; 필로트레타 스트리올라타 (Phyllotreta striolata), 줄무늬 벼룩 딱정벌레; 필로트레타 네모룸 (Phyllotreta nemorum), 줄무늬 순무 벼룩 딱정벌레; 멜리게테스 아에뉴스 (Meligethes aeneus), 유채씨 딱정벌레; 및 화분 딱정벌레 멜리게테스 루피마누스 (Meligethes rufimanus), 멜리게테스 니그레센스 (Meligethes nigrescens), 멜리게테스 카나디아누스 (Meligethes canadianus), 및 멜리게테스 비리데센스 (Meligethes viridescens); 감자: 렙티노타르사 데켐리네아타, 콜로라도 감자 딱정벌레를 들 수 있으나, 이에 제한되지는 않는다.
본원에 제공된 방법 및 조성물은 반시목, 예컨대 리구스 헤스페루스 (Lygus hesperus), 리구스 리네올라리스, 리구스 프라텐시스 (Lygus pratensis), 리구스 루굴리펜니스 포프 (Lygus rugulipennis Popp), 리구스 파불리누스 (Lygus pabulinus), 칼로코리스 노르베기쿠스 (Calocoris norvegicus), 오르톱스 콤페스트리스 (Orthops compestris), 플레시오코리스 루기콜리스 (Plesiocoris rugicollis), 시르토펠티스 모데스투스 (Cyrtopeltis modestus), 시르토펠티스 노타투스 (Cyrtopeltis notatus), 스파나고니쿠스 알보파시아투스 (Spanagonicus albofasciatus), 디아프노코리스 클로리노니스 (Diaphnocoris chlorinonis), 라보피디콜라 알리이 (Labopidicola allii), 슈다토모셀리스 세리아투스, 아델포코리스 라피두스 (Adelphocoris rapidus), 포에킬로캅수스 리네아투스 (Poecilocapsus lineatus), 블리수스 류콥테루스 (Blissus leucopterus), 니시우스 에리카에 (Nysius ericae), 니시우스 라파누스 (Nysius raphanus), 유스키스투스 세르부스, 네자라 비리둘라, 유리가스터 (Eurygaster), 허리노린재과, 별노린재과, 티디다에 (Tinidae), 물장군과, 침노린재과, 및 빈대과에 대해 유효할 수 있다. 관심의 해충은 또한 아라에세루스 파시쿨라투스 (Araecerus fasciculatus), 커피콩 바구미; 아칸토셀리데스 옵텍투스 (Acanthoscelides obtectus), 콩 바구미; 브루쿠스 루프마누스 (Bruchus rufmanus), 잠두 바구미; 브루쿠스 피소룸 (Bruchus pisorum), 완두콩 바구미; 자브로테스 수브파시아투스 (Zabrotes subfasciatus), 멕시코 콩 바구미; 디아브로티카 발테아타 (Diabrotica balteata), 구부러진 오이 딱정벌레; 세로토마 트리푸르카타 (Cerotoma trifurcata), 콩잎 딱정벌레; 디아브로티카 비르기페라, 멕시코 옥수수 뿌리벌레; 에피트릭스 쿠쿠메리스 (Epitrix cucumeris), 감자 벼룩 딱정벌레; 카에토그네마 콘피니스 (Chaetocnema confinis), 고구마 벼룩 딱정벌레; 히페라 포스티카 (Hypera postica), 알팔파 바구미; 안토노무스 쿠아드리깁부스 (Anthonomus quadrigibbus), 사과 꿀꿀이바구미; 스테르네쿠스 팔루다투스 (Sternechus paludatus), 콩 줄기 바구미; 히페라 브룬니펜니스 (Hypera brunnipennis), 이집트 알팔파 바구미; 시토필루스 그라나리에스 (Sitophilus granaries), 그라나리 바구미; 크라포니우스 이나에쿠알리스 (Craponius inaequalis), 포도 꿀꿀이바구미; 시토필루스 제아마이스 (Sitophilus zeamais), 메이즈 바구미; 코노트라켈루스 네누파르 (Conotrachelus nenuphar), 자두 꿀꿀이바구미; 유스케페스 포스트파키아투스 (Euscepes postfaciatus), 서인도 고구마 바구미; 말라데라 카스타네아 (Maladera castanea), 아시아 정원 딱정벌레; 리조트로구스 마잘리스 (Rhizotrogus majalis), 유럽 풍뎅이; 마크로닥틸루스 수브스피노수스 (Macrodactylus subspinosus), 장미 풍뎅이; 트리볼리움 콘푸숨 (Tribolium confusum), 어리쌀도둑 거저리; 테네브리오 옵스쿠루스 (Tenebrio obscurus), 어두운 거저리; 트리볼리움 카스타네움, 거짓쌀도둑 거저리; 테네브리오 몰리토르 (Tenebrio molitor), 노란 거저리를 포함한다.
선충은 기생성 선충, 예컨대 뿌리-혹, 낭포, 및 병변 선충, 예를 들어, 헤테로데라 (Heterodera) 종, 멜로이도기네 (Meloidogyne) 종, 및 글로보데라 (Globodera) 종; 특히 헤테로데라 글리시네스 (Heterodera glycines) (대두 낭포 선충); 헤테로데라 쉬아크티이 (Heterodera schachtii) (비트 낭포 선충); 헤테로데라 아베나에 (Heterodera avenae) (곡물 낭포 선충); 및 글로보데라 로스토키엔시스 (Globodera rostochiensis) 및 글로보데라 파일리다 (Globodera pailida) (감자 낭포 선충)를 포함하나 이에 제한되지 않는 낭포 선충의 구성원을 포함한다. 병변 선충은 프라틸렌쿠스 (Pratylenchus) 종을 포함한다.
곤충 해충은 유충 또는 다른 미성숙 형태와 같은 초기 발달 단계에서 본 발명의 조성물의 살충 활성에 대해 시험될 수 있다. 곤충은 전체적인 어둠에서 약 20℃ 내지 약 30℃에서 및 약 30% 내지 약 70% 상대 습도에서 사육될 수 있다. 생물검정은 문헌 [Czapla and Lang (1990) J. Econ. Entomol. 83 (6): 2480-2485]에 기재된 바와 같이 수행될 수 있다. 또한, 본원의 실험 섹션을 참조한다.
III.
발현 카세트
본원에 제공된 살충 단백질을 코딩하는 폴리뉴클레오티드는 관심의 유기체에서의 발현을 위한 발현 카세트에서 제공될 수 있다. 카세트는 폴리뉴클레오티드의 발현을 허용하는 본원에 제공된 살충 폴리펩티드를 코딩하는 폴리뉴클레오티드에 작동적으로 연결된 5' 및 3' 조절 서열을 포함할 것이다. 카세트는 유기체 내로 공동형질전환되는 적어도 1개의 추가의 유전자 또는 유전 요소를 추가적으로 함유할 수 있다. 추가의 유전자 또는 요소가 포함되는 경우, 성분은 작동적으로 연결된다. 대안적으로, 추가의 유전자(들) 또는 요소(들)는 다중 발현 카세트 상에 제공될 수 있다. 이러한 발현 카세트는 조절 영역의 전사적 조절 하에 있는 폴리뉴클레오티드의 삽입을 위한 복수의 제한 부위 및/또는 재조합 부위가 제공된다. 발현 카세트는 선택적 마커 유전자를 추가적으로 함유할 수 있다.
발현 카세트는 전사의 5' 내지 3' 방향으로 전사 및 번역 개시 영역 (즉, 프로모터), 본 발명의 살충 폴리뉴클레오티드, 및 관심의 유기체, 즉, 식물 또는 박테리아에서 기능적인 전사 및 번역 종결 영역 (즉, 종결 영역)을 포함할 것이다. 본 발명의 프로모터는 숙주 세포에서 코딩 서열의 발현을 지정하거나 유도할 수 있다. 조절 영역 (즉, 프로모터, 전사 조절 영역, 및 번역 종결 영역)은 숙주 세포에 대해 또는 서로에 대해 내인성이거나 이종일 수 있다. 서열에 대한 언급에서 본원에 사용된 "이종"은 외래 종으로부터 기원하거나, 동일한 종으로부터인 겅우, 의도적인 인간 개입에 의해 조성물 및/또는 게놈 좌위에서 그의 천연 형태로부터 실질적으로 변형된 서열이다. 본원에 사용된 바와 같은 키메라 유전자는 코딩 서열에 대해 이종인 전사 개시 영역에 작동적으로 연결된 코딩 서열을 포함한다.
편리한 종결 영역은 에이. 투메파시엔스 (A. tumefaciens)의 Ti-플라스미드, 예컨대 옥토핀 신타제 및 노팔린 신타제 종결 영역으로부터 이용가능하다. 또한, 문헌 [Guerineau et al. (1991) Mol. Gen. Genet. 262:141-144]; [Proudfoot (1991) Cell 64:671-674]; [Sanfacon et al. (1991) Genes Dev. 5:141-149]; [Mogen et al. (1990) Plant Cell 2:1261-1272]; [Munroe et al. (1990) Gene 91:151-158]; [Ballas et al. (1989) Nucleic Acids Res. 17:7891-7903]; 및 [Joshi et al. (1987) Nucleic Acids Res. 15:9627-9639]을 참조한다.
추가의 조절 신호로는 전사 개시 출발 부위, 오퍼레이터, 액티베이터, 인핸서, 다른 조절 요소, 리보솜 결합 부위, 개시 코돈, 종결 신호 등을 들 수 있으나, 이에 제한되지는 않는다. 예를 들어, 미국 특허 제5,039,523호 및 제4,853,331호; EPO 0480762A2; 문헌 [Sambrook et al. (1992) Molecular Cloning: A Laboratory Manual, ed. Maniatis et al. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.) (이하 "Sambrook 11")]; [Davis et al., eds. (1980) Advanced Bacterial Genetics (Cold Spring Harbor Laboratory Press), Cold Spring Harbor, N.Y.], 및 그에 인용된 참고문헌을 참조한다.
발현 카세트의 제조에 있어서, 다양한 DNA 단편은 DNA 서열이 적절한 배향으로, 및 적절하게는, 적절한 리딩 프레임으로 제공되도록 조작될 수 있다. 이 목적을 위해, DNA 단편을 결합시키기 위해 어댑터 또는 링커가 채용될 수 있거나, 편리한 제한 부위, 불필요한 DNA의 제거, 제한 부위의 제거 등을 제공하기 위해 다른 조작이 포함될 수 있다. 이 목적을 위해, 시험관내 돌연변이유발, 프라이머 복구, 제한, 어닐링, 재치환, 예를 들어, 전이 및 전환이 포함될 수 있다.
다수의 프로모터는 본 발명의 실시에 사용될 수 있다. 프로모터는 바람직한 결과에 기초하여 선택될 수 있다. 핵산은 관심의 유기체에서의 발현을 위해 구조성, 유도성, 조직-바람직한, 또는 다른 프로모터와 조합될 수 있다. 예를 들어, 본원에 참조로 포함되는 WO 99/43838 및 미국 특허 제8,575,425호; 제7,790,846호; 제8,147,856호; 제8,586832호; 제7,772,369호; 제7,534,939호; 제6,072,050호; 제5,659,026호; 제5,608,149호; 제5,608,144호; 제5,604,121호; 제5,569,597호; 제5,466,785호; 제5,399,680호; 제5,268,463호; 제5,608,142호; 및 제6,177,611호에 제시된 프로모터를 참조한다.
식물에서의 발현용으로, 구조성 프로모터는 또한 CaMV 35S 프로모터 (Odell et al. (1985) Nature 313:810-812); 벼 액틴 (McElroy et al. (1990) Plant Cell 2:163-171); 유비퀴틴 (문헌 [Christensen et al. (1989) Plant Mol. Biol. 12:619-632] 및 [Christensen et al. (1992) Plant Mol. Biol. 18:675-689]); pEMU (Last et al. (1991) Theor. Appl. Genet. 81:581-588); MAS (Velten et al. (1984) EMBO J. 3:2723-2730)를 포함한다. 유도성 프로모터는 병원체에 의한 감염 후 유도되는 발병기전-관련된 단백질 (PR 단백질)의 발현을 유도하는 것들을 포함한다. 예를 들어, 각각 본원에 참조로 포함되는 문헌 [Redolfi et al. (1983) Neth. J. Plant Pathol. 89:245-254]; [Uknes et al. (1992) Plant Cell 4:645-656]; 및 [Van Loon (1985) Plant Mol. Virol. 4:111-116]; 및 WO 99/43819를 참조한다. 병원체 감염의 부위에서 또는 부근에서 국소적으로 발현되는 프로모터가 또한 사용될 수 있다 (Marineau et al. (1987) Plant Mol. Biol. 9:335-342; Matton et al. (1989) Molecular Plant-Microbe Interactions 2:325-331; Somsisch et al. (1986) Proc. Natl. Acad. Sci. USA 83:2427-2430; Somsisch et al. (1988) Mol. Gen. Genet. 2:93-98; 및 Yang (1996) Proc. Natl. Acad. Sci. USA 93:14972-14977; Chen et al. (1996) Plant J. 10:955-966; Zhang et al. (1994) Proc. Natl. Acad. Sci. USA 91:2507-2511; Warner et al. (1993) Plant J. 3:191-201; Siebertz et al. (1989) Plant Cell 1:961-968; Cordero et al. (1992) Physiol. Mol. Plant Path. 41:189-200; 미국 특허 제5,750,386호 (선충-유도성); 및 그에 인용된 참고문헌).
상처-유도성 프로모터는 본 발명의 구성에 사용될 수 있다. 이러한 상처-유도성 프로모터로는 pin II 프로모터 (Ryan (1990) Ann. Rev. Phytopath. 28:425-449; Duan et al. (1996) Nature Biotechnology 14:494-498); wun1 및 wun2 (미국 특허 제5,428,148호); win1 및 win2 (Stanford et al. (1989) Mol. Gen. Genet. 215:200-208); 시스테민 (McGurl et al. (1992) Science 225:1570-1573); WIP1 (Rohmeier et al. (1993) Plant Mol. Biol. 22:783-792; Eckelkamp et al. (1993) FEBS Letters 323:73-76); MPI 유전자 (Corderok et al. (1994) Plant J. 6(2):141-150); 등을 들 수 있으며, 상기 문헌은 본원에 참조로 포함된다.
본 발명에 사용하기 위한 조직-바람직한 프로모터로는 문헌 [Yamamoto et al. (1997) Plant J. 12(2):255-265]; [Kawamata et al. (1997) Plant Cell Physiol. 38(7):792-803]; [Hansen et al. (1997) Mol. Gen Genet. 254(3):337-343]; [Russell et al. (1997) Transgenic Res. 6(2):157-168]; [Rinehart et al. (1996) Plant Physiol. 112(3):1331-1341]; [Van Camp et al. (1996) Plant Physiol. 112(2):525-535]; [Canevascini et al. (1996) Plant Physiol. 112(2):513-524]; [Yamamoto et al. (1994) Plant Cell Physiol. 35(5):773-778]; [Lam (1994) Results Probl. Cell Differ. 20:181-196]; [Orozco et al. (1993) Plant Mol Biol. 23(6):1129-1138]; [Matsuoka et al. (1993) Proc Natl. Acad. Sci. USA 90(20):9586-9590]; 및 [Guevara-Garcia et al. (1993) Plant J. 4(3):495-505]에 제시된 것들을 들 수 있다.
잎-바람직한 프로모터로는 문헌 [Yamamoto et al. (1997) Plant J. 12(2):255-265]; [Kwon et al. (1994) Plant Physiol. 105:357-67]; [Yamamoto et al. (1994) Plant Cell Physiol. 35(5):773-778]; [Gotor et al. (1993) Plant J. 3:509-18]; [Orozco et al. (1993) Plant Mol. Biol. 23(6):1129-1138]; 및 [Matsuoka et al. (1993) Proc. Natl. Acad. Sci. USA 90(20):9586-9590]에 제시된 것들을 들 수 있다.
뿌리-바람직한 프로모터가 공지되어 있으며, 이로는 문헌 [Hire et al. (1992) Plant Mol. Biol. 20(2):207-218] (대두 뿌리-특이적 글루타민 신테타제 유전자); [Keller and Baumgartner (1991) Plant Cell 3(10):1051-1061] (뿌리-특이적 제어 요소); [Sanger et al. (1990) Plant Mol. Biol. 14(3):433-443] (아그로박테리움 투메파시엔스 (Agrobacterium tumefaciens)의 만노핀 신타제 (MAS) 유전자); 및 [Miao et al. (1991) Plant Cell 3(1):11-22] (세포질 글루타민 신테타제 (GS)); [Bogusz et al. (1990) Plant Cell 2(7):633-641]; [Leach and Aoyagi (1991) Plant Science (Limerick) 79(1):69-76] (rolC 및 rolD); [Teeri et al. (1989) EMBO J. 8(2):343-350]; [Kuster et al. (1995) Plant Mol. Biol. 29(4):759-772] (VfENOD-GRP3 유전자 프로모터); 및 [Capana et al. (1994) Plant Mol. Biol. 25(4):681-691 (rolB 프로모터)]에서의 것들을 들 수 있다. 또한, 미국 특허 제5,837,876호; 제5,750,386호; 제5,633,363호; 제5,459,252호; 제5,401,836호; 제5,110,732호; 및 제5,023,179호를 참조한다.
"종자-바람직한" 프로모터는 "종자-특이적" 프로모터 (종자 발달 동안 활성인 프로모터, 예컨대 종자 저장 단백질의 프로모터) 뿐만 아니라 "종자-발아" 프로모터 (종자 발아 동안 활성인 프로모터) 둘 다를 포함한다. 문헌 [Thompson et al. (1989) BioEssays 10:108]을 참조한다. 종자-바람직한 프로모터로는 Cim1 (시토키닌-유도된 메시지); cZ19B1 (메이즈 19 kDa 제인); milps (미오-이노시톨-1-포스페이트 신타제) (WO 00/11177 및 미국 특허 제6,225,529 참조)를 들 수 있으나, 이에 제한되지는 않는다. 감마-제인은 배젖-특이적 프로모터이다. 글로불린 1 (Glb-1)은 대표적인 배아-특이적 프로모터이다. 쌍자엽식물에 대해, 종자-특이적 프로모터로는 콩 β-파세올린, 나핀, β-콘글리시닌, 대두 렉틴, 크루시페린 등을 들 수 있으나, 이에 제한되지는 않는다. 단자엽식물에 대해, 종자-특이적 프로모터로는 메이즈 15 kDa 제인, 22 kDa 제인, 27 kDa 제인, 감마-제인, 왁시, 쉬런켄 1, 쉬런켄 2, 글로불린 1 등을 들 수 있으나, 이에 제한되지는 않는다. 또한 end1 및 end2 유전자로부터의 종자-바람직한 프로모터가 개시된 WO 00/12733을 참조한다.
박테리아 숙주에서의 발현용으로, 박테리아에서 기능하는 프로모터가 관련 기술분야에 널리 공지되어 있다. 이러한 프로모터로는 본 발명의 살충 단백질 중 임의의 것의 프로모터, 및 비. 투린기엔시스 시그마 인자에 대해 특이적인 프로모터를 비롯한, 공지된 결정 단백질 유전자 프로모터 중 임의의 것을 들 수 있다. 대안적으로, 돌연변이화된 또는 재조합 결정 단백질-코딩 유전자 프로모터는 재조합적으로 조작되고, 본원에 개시된 신규 유전자 절편의 발현을 촉진시키는 데 사용될 수 있다.
발현 카세트는 또한 형질전환된 세포의 선별을 위한 선택적 마커 유전자를 포함할 수 있다. 선택적 마커 유전자는 형질전환된 세포 또는 조직의 선별을 위해 이용된다. 마커 유전자로는 항생제 저항성을 코딩하는 유전자, 예컨대 네오마이신 포스포트랜스퍼라제 II (NEO) 및 히그로마이신 포스포트랜스퍼라제 (HPT)를 코딩하는 것들, 뿐만 아니라 제초제 화합물에 대해 저항성을 부여하는 유전자, 예컨대 글루포시네이트 암모늄, 브로목시닐, 이미다졸리논, 및 2,4-디클로로페녹시아세테이트 (2,4-D)를 들 수 있다. 추가의 선택적 마커가 공지되어 있으며, 임의의 것이 사용될 수 있다. 예를 들어, 선택적 마커로서 채용될 수 있는 글루포시네이트 저항성 서열이 개시되어 있는, 그 전문이 본원에 참조로 포함되는 2014년 12월 19일에 출원된 미국 가출원 62/094,697을 참조한다.
IV.
방법, 숙주 세포 및 식물 세포
지시된 바와 같이, 살충 단백질 또는 그의 활성 변이체 또는 단편을 코딩하는 뉴클레오티드 서열을 포함하는 DNA 구축물을 사용하여 관심의 식물 또는 관심의 다른 유기체를 형질전환시킬 수 있다. 형질전환을 위한 방법은 뉴클레오티드 구축물을 식물 내로 도입하는 것을 포함한다. "도입하는"은 구축물이 식물의 세포 또는 숙주 세포의 내부에의 접근을 달성하도록 하는 방식으로 뉴클레오티드 구축물을 식물 또는 다른 숙주 세포에 도입하는 것으로 의도된다. 본 발명의 방법은 뉴클레오티드 구축물을 식물 또는 숙주 세포에 도입하기 위한 특정 방법을 요구하지 않으며, 단지 뉴클레오티드 구축물은 식물 또는 숙주 유기체의 적어도 하나의 세포의 내부에의 접근을 달성한다. 안정한 형질전환 방법, 일시적 형질전환 방법, 및 바이러스-매개 방법을 포함하나 이에 제한되지 않는 뉴클레오티드 구축물을 식물 및 다른 숙주 세포 내로 도입하는 방법이 관련 기술분야에 공지되어 있다.
방법은 형질전환된 유기체, 예컨대 전체 식물, 뿐만 아니라 식물 기관 (예를 들어, 잎, 줄기, 뿌리 등), 종자, 식물 세포, 번식체, 배아 및 그의 자손을 비롯한 식물을 초래한다. 식물 세포는 분화되거나 비분화될 수 있다 (예를 들어 캘러스, 현탁 배양 세포, 원형질체, 잎 세포, 뿌리 세포, 체관 세포, 화분).
"트랜스제닉 식물" 또는 "형질전환된 식물" 또는 "안정하게 형질전환된" 식물 또는 세포 또는 조직은 본 발명의 적어도 하나의 살충 폴리펩티드를 코딩하는 폴리뉴클레오티드가 혼입되거나 통합된 식물을 지칭한다. 다른 외인성 또는 내인성 핵산 서열 또는 DNA 단편은 또한 식물 세포 내로 혼입될 수 있음이 인식된다. 아그로박테리움 (Agrobacterium)-및 바이오리스틱 (biolistic)-매개 형질전환은 2개의 우세하게 채용되는 접근법으로 남아 있다. 그러나, 형질전환은 감염, 형질감염, 미세주사, 전기천공, 미세투사, 바이오리스틱 또는 입자 포격, 전기천공, 실리카/탄소 섬유, 초음파 매개, PEG 매개, 인산칼슘 공-침전, 다가양이온 DMSO 기술, DEAE 덱스트란 절차, 아그로 (Agro) 및 바이러스 매개 (카울리모리바이러스 (Caulimorivirus), 게미니바이러스 (Geminivirus), RNA 식물 바이러스), 리포좀 매개 등에 의해 수행될 수 있다.
형질전환 프로토콜 뿐만 아니라 폴리펩티드 또는 폴리뉴클레오티드 서열을 식물 내로 도입하는 프로토콜은 형질전환을 위해 표적화되는 식물 또는 식물 세포의 유형, 즉, 단자엽식물 또는 쌍자엽식물에 따라 다양할 수 있다. 형질전환을 위한 방법은 관련 기술분야에 공지되어 있으며, 이로는 미국 특허 제8,575,425호; 제7,692,068호; 제8,802,934호; 및 제7,541,517호에 제시된 것들을 들 수 있고, 이들 각각은 본원에 참조로 포함된다. 또한, 문헌 [Rakoczy-Trojanowska, M. (2002) Cell Mol Biol Lett. 7:849-858]; [Jones et al. (2005) Plant Methods 1:5]; [Rivera et al. (2012) Physics of Life Reviews 9:308-345]; [Bartlett et al. (2008) Plant Methods 4:1-12]; [Bates, G.W. (1999) Methods in Molecular Biology 111:359-366]; [Binns and Thomashow (1988) Annual Reviews in Microbiology 42:575-606]; [Christou, P. (1992) The Plant Journal 2:275-281]; [Christou, P. (1995) Euphytica 85:13-27]; [Tzfira et al. (2004) TRENDS in Genetics 20:375-383]; [Yao et al. (2006) Journal of Experimental Botany 57:3737-3746]; [Zupan and Zambryski (1995) Plant Physiology 107:1041-1047]; 및 [Jones et al. (2005) Plant Methods 1:5]을 참조한다.
형질전환은 세포 내로의 핵산의 안정한 또는 일시적 혼입을 초래할 수 있다. "안정한 형질전환"은 숙주 세포 내로 도입된 뉴클레오티드 구축물이 숙주 세포의 게놈 내로 통합되고, 그의 자손에 의해 유전될 수 있음을 의미하는 것으로 의도된다. "일시적 형질전환"은 폴리뉴클레오티드가 숙주 세포 내로 도입되고, 숙주 세포의 게놈 내로 통합되지 않음을 의미하는 것으로 의도된다.
엽록체의 형질전환을 위한 방법은 관련 기술분야에 공지되어 있다. 예를 들어, 문헌 [Svab et al. (1990) Proc. Natl. Acad. Sci. USA 87:8526-8530]; [Svab and Maliga (1993) Proc. Natl. Acad. Sci. USA 90:913-917]; [Svab and Maliga (1993) EMBO J. 12:601-606]을 참조한다. 방법은 선택적 마커를 함유하고, 상동성 재조합을 통해 플라스티드 게놈에 DNA를 표적화하는 DNA의 입자 총 전달에 의존한다. 추가적으로, 플라스티드 형질전환은 핵-코딩된 및 플라스티드-지정된 RNA 폴리머라제의 조직-바람직한 발현에 의한 침묵 플라스티드-보유 트랜스진의 전사촉진에 의해 달성될 수 있다. 이러한 시스템은 문헌 [McBride et al. (1994) Proc. Natl. Acad. Sci. USA 91:7301-7305]에 보고되었다.
형질전환된 세포는 통상적인 방식에 따라 식물로 성장할 수 있다. 예를 들어, 문헌 [McCormick et al. (1986) Plant Cell Reports 5:81-84]을 참조한다. 이들 식물은 그 후 성장되고, 동일한 형질전환된 균주 또는 상이한 균주로 수분되고, 바람직한 표현형적 특징의 구조적 발현을 갖는 생성된 혼성체가 확인될 수 있다. 바람직한 표현형적 특징의 발현이 안정하게 유지되고 유전되는 것을 보장하기 위해 2 이상의 세대를 성장시키고, 그 후 바람직한 표현형적 특징의 발현이 달성된 것을 보장하기 위해 종자를 수확할 수 있다. 이 방식으로, 본 발명은 그들의 게놈 내로 안정하게 혼입된 본 발명의 뉴클레오티드 구축물을 갖는 형질전환된 종자 ("트랜스제닉 종자"로도 지칭됨), 예를 들어, 본 발명의 발현 카세트를 제공한다.
구체적인 실시양태에서, 본원에 제공된 서열은 숙주 세포 또는 식물 세포의 게놈 내에서 특이적 부위에 표적화될 수 있다. 이러한 방법으로는 관심의 식물 게놈 서열에 대해 설계된 메가뉴클레아제 (D'Halluin et al. 2013 Plant Biotechnol J); CRISPR-Cas9, TALEN, 및 게놈의 정확한 편집을 위한 다른 기술 (Feng, et al. Cell Research 23:1229-1232, 2013, Podevin, et al. Trends Biotechnology, online publication, 2013, Wei et al., J Gen Genomics, 2013, Zhang et al (2013) WO 2013/026740); Cre-lox 부위-특이적 재조합 (Dale et al. (1995) Plant J 7:649-659; Lyznik, et al. (2007) Transgenic Plant J 1:1-9; FLP-FRT 재조합 (Li et al. (2009) Plant Physiol 151:1087-1095); Bxb1-매개 통합 (Yau et al. Plant J (2011) 701:147-166); 아연-손가락 매개 통합 (Wright et al. (2005) Plant J 44:693-705); Cai et al. (2009) Plant Mol Biol 69:699-709); 및 상동성 재조합 (Lieberman-Lazarovich and Levy (2011) Methods Mol Biol 701: 51-65); Puchta (2002) Plant Mol Biol 48:173-182)을 들 수 있으나, 이에 제한되지는 않는다.
본원에 제공된 서열은 단자엽식물 및 쌍자엽식물을 포함하나 이에 제한되지 않는 임의의 식물 종의 형질전환에 사용될 수 있다. 관심의 식물의 예로는 옥수수 (메이즈), 수수, 밀, 해바라기, 토마토, 십자화과, 후추, 감자, 목화, 벼, 대두, 사탕무, 사탕수수, 담배, 보리, 및 유채, 브라시카 (Brassica) 종, 알팔파, 호밀, 기장, 잇꽃, 땅콩, 고구마, 카사바, 커피, 코코넛, 파인애플, 감귤 나무, 코코아, 차, 바나나, 아보카도, 무화과, 구아바, 망고, 올리브, 파파야, 캐슈, 마카다미아, 아몬드, 귀리, 채소, 관상식물, 및 침엽수를 들 수 있으나, 이에 제한되지는 않는다.
채소로는 토마토, 상추, 껍질콩, 리마콩, 완두콩, 및 쿠르쿠미스 (Curcumis) 속의 구성원, 예컨대 오이, 칸탈루프, 및 머스크 멜론을 들 수 있으나, 이에 제한되지는 않는다. 관상식물로는 진달래, 수국, 히비스쿠스, 장미, 튤립, 수선화, 페튜니아, 카네이션, 포인세티아, 및 국화를 들 수 있으나, 이에 제한되지는 않는다. 바람직하게는, 본 발명의 식물은 작물 식물 (예를 들어, 메이즈, 수수, 밀, 해바라기, 토마토, 십자화과, 후추, 감자, 목화, 벼, 대두, 사탕무, 사탕수수, 담배, 보리, 유채 등)이다.
본원에 사용된 용어 식물은 식물 세포, 식물 원형질체, 식물이 그로부터 재생될 수 있는 식물 세포 조직 배양물, 식물 캘러스, 식물 클럼프, 및 식물 또는 식물의 부분, 예컨대 배아, 화분, 배주, 종자, 잎, 꽃, 가지, 열매, 인, 이삭, 콥, 겉껍질, 줄기, 뿌리, 근단, 꽃밥 등에서 무손상인 식물 세포를 포함한다. 곡물은 종을 성장시키거나 번식시키는 것 외의 목적을 위해 상업적인 성장자에 의해 생산되는 성숙한 종자를 의미하는 것으로 의도된다. 재생된 식물의 자손, 변이체, 및 돌연변이체는 또한 이들 부분이 도입된 폴리뉴클레오티드를 포함하는 한, 본 발명의 범위 내에 포함된다. 예를 들어, 소이밀을 비롯한, 본원에 개시된 서열을 보유하는 가공된 식물 생성물 또는 부산물이 추가로 제공된다.
또다른 실시양태에서, 살충 단백질을 코딩하는 유전자는 곤충 병원체성 유기체를 형질전환시키는 데 사용될 수 있다. 이러한 유기체로는 바쿨로바이러스, 진균, 원생동물, 박테리아, 및 선충을 들 수 있다. 관심의 1종 이상의 작물의 "식물권" (엽면, 엽권, 근권, 및/또는 근면)을 점유하는 것으로 공지된 미생물 숙주가 선택될 수 있다. 이들 미생물은 야생형 미생물과 특정 환경에서 성공적으로 경쟁할 수 있도록 선택되고, 살충 단백질을 발현하는 유전자의 안정한 유지 및 발현을 제공하며, 바람직하게는 환경적 열화 및 불활성화로부터 살충제의 개선된 보호를 제공한다.
이러한 미생물로는 고세균, 박테리아, 조류, 및 진균을 들 수 있다. 특정 관심의 것은 미생물, 예컨대 박테리아, 예를 들어, 바실루스, 슈도모나스 (Pseudomonas), 에르위니아 (Erwinia), 세라티아 (Serratia), 클레브시엘라 (Klebsiella), 크산토모나스 (Xanthomonas), 스트렙토미세스 (Streptomyces), 리조비움 (Rhizobium), 로도슈도모나스 (Rhodopseudomonas), 메틸리우스 (Methylius), 아그로박테리움 (Agrobacterium), 아세토박터 (Acetobacter), 락토바실루스 (Lactobacillus), 아르트로박터 (Arthrobacter), 아조토박터 (Azotobacter), 류코노스토크 (Leuconostoc), 및 알칼리게네스 (Alcaligenes)이다. 진균으로는 효모, 예를 들어, 사카로미세스 (Saccharomyces), 크립토코쿠스 (Cryptococcus), 클루이베로미세스 (Kluyveromyces), 스포로볼로미세스 (Sporobolomyces), 로도토룰라 (Rhodotorula), 및 아우레오바시디움 (Aureobasidium)을 들 수 있다. 특정 관심의 것은 이러한 식물권 박테리아 종, 예컨대 슈도모나스 시린가에 (Pseudomonas syringae), 슈도모나스 아에루기노사 (Pseudomonas aeruginosa), 슈도모나스 플루오레센스 (Pseudomonas fluorescens), 세라티아 마르케센스 (Serratia marcescens), 아세토박터 크실리눔 (Acetobacter xylinum), 아그로박테리아, 로도슈도모나스 스페로이데스 (Rhodopseudomonas spheroides), 크산토모나스 캄페스트리스 (Xanthomonas campestris), 리조비움 멜리오티 (Rhizobium melioti), 알칼리게네스 엔트로푸스 (Alcaligenes entrophus), 클라비박터 크실리 (Clavibacter xyli) 및 아조토박터 빈란디르 (Azotobacter vinlandir) 및 식물권 효모 종, 예컨대 로도토룰라 루브라 (Rhodotorula rubra), 알. 글루티니스 (R. glutinis), 알. 마리나 (R. marina), 알. 아우란티아카 (R. aurantiaca), 크립토코쿠스 알비두스 (Cryptococcus albidus), 씨. 디플루엔스 (C. diffluens), 씨. 라우렌티이 (C. laurentii), 사카로미세스 로세이 (Saccharomyces rosei), 에스. 프레토리엔시스 (S. pretoriensis), 에스. 세레비시아에 (S. cerevisiae), 스포로볼로미세스 로수에스 (Sporobolomyces rosues), 에스. 오도루스 (S. odorus), 클루이베로미세스 베로나에 (Kluyveromyces veronae), 아우레오바시디움 폴룰란스 (Aureobasidium pollulans), 바실루스 투린기엔시스, 에스케리키아 콜라이 (Escherichia coli), 바실루스 수브틸리스 (Bacillus subtilis) 등이다.
예시적인 원핵생물, 즉 그람-음성 및 그람-양성 둘 다로는 엔테로박테리아세아에 (Enterobacteriaceae), 예컨대 에스케리키아 (Escherichia), 에르위니아, 시겔라 (Shigella), 살모넬라 (Salmonella), 및 프로테우스 (Proteus); 바실라세아에 (Bacillaceae); 리조비세아에 (Rhizobiceae), 예컨대 리조비움 (Rhizobium); 스피릴라세아에 (Spirillaceae), 예컨대 포토박테리움, 지모모나스 (Zymomonas), 세라티아, 아에로모나스 (Aeromonas), 비브리오 (Vibrio), 데술포비브리오 (Desulfovibrio), 스피릴룸 (Spirillum); 락토바실라세아에 (Lactobacillaceae); 슈도모나다세아에 (Pseudomonadaceae), 예컨대 슈도모나스 및 아세토박터; 아조토박테라세아에 (Azotobacteraceae) 및 니트로박테라세아에 (Nitrobacteraceae)를 들 수 있다. 진균으로는 피코미세테스 (Phycomycetes) 및 아스코미세테스 (Ascomycetes), 예를 들어, 효모, 예컨대 사카로미세스 및 쉬조사카로미세스 (Schizosaccharomyces); 및 바시디오미세테스 (Basidiomycetes) 효모, 예컨대 로도토룰라 (Rhodotorula), 아우레오바시디움, 스포로볼로미세스 (Sporobolomyces) 등을 들 수 있다.
살충 단백질을 코딩하는 유전자는 전기형질전환, PEG 유도 형질전환, 히트 쇼크, 형질도입, 접합 등에 의해 도입될 수 있다. 구체적으로, 살충 단백질을 코딩하는 유전자는 셔틀 벡터 내로 클로닝될 수 있다. 예시적인 셔틀 벡터는 pHT3101 (Lerecius et al. (1989) FEMS Microbiol. Letts. 60: 211-218)이다. 특정 살충 단백질 유전자에 대한 코딩 서열을 함유하는 셔틀 벡터 pHT3101은 예를 들어, 전기천공에 의해 뿌리-콜로니화 바실루스 내로 형질전환될 수 있다 (Lerecius et al. (1989) FEMS Microbiol. Letts. 60: 211-218).
발현 시스템은 살충 단백질이 적절한 신호 펩티드를 살충 단백질의 아미노-말단에 융합시킴으로써 그람-음성 박테리아의 세포질 외부로 분비되도록 설계될 수 있다. 이. 콜라이에 의해 인식되는 신호 펩티드로는 OmpA 단백질을 들 수 있다 (Ghrayeb et al. (1984) EMBO J, 3: 2437-2442).
살충 단백질 및 그의 활성 변이체는 박테리아 숙주에서 발효되고, 생성된 박테리아는 프로세싱되고 바실루스 투린기엔시스 균주가 살곤충 분무제로서 사용되었던 것과 동일한 방식으로 미생물 분무제로서 사용될 수 있다. 바실루스로부터 분비되는 살충 단백질(들)의 경우, 분비 신호는 관련 기술분야에 공지된 절차를 사용하여 제거되거나 돌연변이된다. 이러한 돌연변이 및/또는 결실은 발효 공정 동안 성장 배지 내로의 살충 단백질(들)의 분비를 방지한다. 살충 단백질은 세포 내에 보유되고, 그 후 세포는 프로세싱되어 캡슐화된 살충 단백질을 생성한다.
대안적으로, 살충 단백질은 이종 유전자를 세포 숙주 내로 도입함으로써 생성된다. 이종 유전자의 발현은 직접적으로 또는 간접적으로 살충제의 세포내 생성 및 유지를 초래한다. 그 후, 이들 세포는 세포가 표적 해충(들)의 환경에 적용되는 경우 세포에서 생산되는 독소의 활성을 연장시키는 조건 하에서 처리된다. 생성된 생성물은 독소의 독성을 보유한다. 그 후, 이들 천연 캡슐화된 살충 단백질은 표적 해충을 숙주화하는 환경, 예를 들어, 토양, 물, 및 식물의 나뭇잎에의 시용을 위한 통상적인 기술에 따라 제제화될 수 있다. 예를 들어 미국 특허 제6,468,523호 및 미국 공개 제20050138685호, 및 그에 인용된 참고문헌을 참조한다. 본 발명에서, 형질전환된 미생물 (전체 유기체, 세포, 포자(들), 살충 단백질(들), 살충 성분(들), 해충-영향 성분(들), 돌연변이체(들), 살아 있거나 죽은 세포 및 세포 성분의 혼합물을 비롯한, 및 파쇄된 세포 및 세포 성분을 비롯한 살아 있거나 죽은 세포 및 세포 성분) 또는 단리된 살충 단백질은 허용되는 담체와 함께 살충 또는 농업 조성물(들), 즉, 예를 들어, 현탁액, 용액, 에멀젼, 더스팅 분말, 분산성 과립, 습윤가능한 분말, 및 유화가능한 농축물, 에어로졸, 함침된 과립, 아주번트, 코팅가능한 페이스트, 및 또한 예를 들어, 중합체 물질 중의 캡슐제로 제제화될 수 있다.
농업 조성물은 본원에 개시된 바와 같은 폴리펩티드, 재조합유전성 폴리펩티드 또는 그의 변이체 또는 단편을 포함할 수 있다. 본원에 개시된 농업 조성물은 식물의 환경 또는 재배의 영역에 시용되거나, 식물, 식물 부분, 식물 세포, 또는 종자에 시용될 수 있다.
상기 개시된 이러한 조성물은 표면-활성제, 불활성 담체, 보존제, 보습제, 섭식 촉진제, 유인제, 캡슐화제, 결합제, 유화제, 염료, UV 보호제, 완충제, 유동제 또는 비료, 미세영양 공여제, 또는 식물 성장에 영향을 미치는 다른 제제의 첨가에 의해 얻어질 수 있다. 제초제, 살곤충제, 살진균제, 살박테리아제, 살선충제, 살연체동물제, 살응애제, 식물 성장 조절제, 수확 보조제, 및 비료를 포함하나 이에 제한되지 않는 1종 이상의 농약은 제제화의 분야에 통상적으로 채용되는 담체, 계면활성제 또는 아주번트 또는 특정 표적 해충에 대해 생성물 취급 및 시용을 용이하게 하는 다른 성분과 조합될 수 있다. 적합한 담체 및 아주번트는 고체 또는 액체일 수 있으며, 제제화 기술에 통상적으로 채용되는 물질, 예를 들어, 천연 또는 재생된 광물 물질, 용매, 분산제, 습윤화제, 증점제, 결합제, 또는 비료에 상응한다. 본 발명의 활성 성분은 통상적으로 조성물의 형태로 시용되며, 처리되어야 하는 작물 영역, 식물, 또는 종자에 시용될 수 있다. 예를 들어, 본 발명의 조성물은 곡물저장 빈 또는 곡물저장 사일로 등에서의 저장 동안 제제 중의 곡물에 시용될 수 있다. 본 발명의 조성물은 다른 화합물과 동시에 또는 연속적으로 시용될 수 있다. 본 발명의 박테리아 균주에 의해 생산되는 살충 단백질 중 적어도 1종을 함유하는 본 발명의 활성 성분 또는 본 발명의 농약 조성물을 시용하는 방법으로는 엽면 시용, 종자 코팅, 및 토양 시용을 들 수 있으나, 이에 제한되지는 않는다. 시용의 횟수 및 시용의 속도는 상응하는 해충에 의한 감염의 강도에 의존한다.
적합한 표면-활성제로는 음이온성 화합물, 예컨대 예를 들어, 금속의 카르복실레이트; 장쇄 지방산의 카르복실레이트; N-아실사르코시네이트; 지방 알콜 에톡실레이트를 갖는 인산의 모노 또는 디-에스테르 또는 이러한 에스테르의 염; 지방 알콜 술페이트, 예컨대 나트륨 도데실 술페이트, 나트륨 옥타데실 술페이트 또는 나트륨 세틸 술페이트; 에톡실화 지방 알콜 술페이트; 에톡실화 알킬페놀 술페이트; 리그닌 술포네이트; 석유 술포네이트; 알킬 아릴 술포네이트, 예컨대 알킬-벤젠 술포네이트 또는 저급 알킬나프탈렌 술포네이트, 예를 들어, 부틸-나프탈렌 술포네이트; 술포네이트화 나프탈렌-포름알데히드 축합물의 염; 술포네이트화 페놀-포름알데히드 축합물의 염; 보다 복합 술포네이트, 예컨대 아미드 술포네이트, 예를 들어, 올레산 및 N-메틸 타우린의 술포네이트화 축합 생성물; 또는 디알킬 술포숙시네이트, 예를 들어, 디옥틸 숙시네이트의 나트륨 술포네이트를 들 수 있으나, 이에 제한되지는 않는다. 비-이온성 제제로는 지방산 에스테르, 지방 알콜, 지방산 아미드 또는 지방-알킬- 또는 알케닐-치환된 페놀과 에틸렌 옥시드의 축합 생성물, 다가 알콜 에테르의 지방 에스테르, 예를 들어, 소르비탄 지방산 에스테르, 이러한 에스테르와 에틸렌 옥시드의 축합 생성물, 예를 들어, 폴리옥시에틸렌 소르비탄 지방산 에스테르, 에틸렌 옥시드 및 프로필렌 옥시드의 블록 공중합체, 아세틸렌 글리콜, 예컨대 2,4,7,9-테트라에틸-5-데신-4,7-디올, 또는 에톡실화 아세틸렌 글리콜을 들 수 있다. 양이온성 표면-활성제의 예로는 예를 들어, 지방족 모노-, 디-, 또는 폴리아민, 예컨대 아세테이트, 나프테네이트 또는 올레에이트; 또는 산소-함유 아민, 예컨대 폴리옥시에틸렌 알킬아민의 아민 옥시드; 카르복실산과 디- 또는 폴리아민의 축합에 의해 제조된 아미드-연결된 아민; 또는 4급 암모늄 염을 들 수 있다.
불활성 물질의 예로는 무기 광물, 예컨대 카올린, 필로실리케이트, 카르보네이트, 술페이트, 포스페이트, 또는 식물성 물질, 예컨대 코르크, 분말화된 옥수숫대, 땅콩 겉껍질, 벼 겉껍질, 및 호두 껍질을 들 수 있으나, 이에 제한되지는 않는다.
본 발명의 조성물은 직접 시용에 적합한 형태로 또는 시용 전에 적합한 양의 물 또는 다른 희적제로의 희석을 요구하는 1차 조성물의 농축물로서일 수 있다. 살충 농도는 특정 제제의 성질, 구체적으로 그것이 농축물인지 또는 집적 사용되는지 여부에 따라 다양할 것이다. 조성물은 1 내지 98%의 고체 또는 액체 불활성 담체, 및 0 내지 50% 또는 0.1 내지 50%의 계면활성제 (w/w, v/v, 또는 w/v, 적절하거나 바람직할 경우)를 함유한다. 이들 조성물은 시판 제품에 대해 표지된 속도로, 예를 들어, 건조 형태일 경우 아크 당 약 0.01 lb 내지 5.0 lb 및 액체 형태일 경우 약 0.01 pts 내지 10 pts로 투여될 것이다.
추가의 실시양태에서, 본원에 제공된 조성물, 뿐만 아니라 형질전환된 미생물 및 살충 단백질은 전처리가 살충 활성에 대해 유해하지 않는 한, 표적 해충의 환경에 시용되는 경우 살충 활성을 연장하도록 제제화 전에 처리될 수 있다. 이러한 처리는 조성물(들)의 특성에 유해하게 영향을 미치지 않는 한, 화학적 수단, 물리적 수단, 또는 둘 다에 의할 수 있다. 화학 시약의 예로는 할로겐화제; 알데히드, 예컨대 포름알데히드 및 글루타르알데히드; 항-감염제, 예컨대 제피란 클로라이드; 알콜, 예컨대 이소프로판올 및 에탄올; 및 조직학적 고정제, 예컨대 보우인 (Bouin) 고정제 및 헬리 (Helly) 고정제 (예를 들어, 문헌 [Humason (1967) Animal Tissue Techniques (W.H. Freeman and Co.)] 참조)를 들 수 있으나, 이에 제한되지는 않는다.
한 측면에서, 해충은 본원에 제공된 살충 단백질의 영역에의 시용에 의해 주어진 영역에서 죽거나 수가 감소될 수 있다. 대안적으로, 살충 단백질은 감수성 해충에 의한 감염을 예방하기 위해 환경 영역에 예방적으로 시용될 수 있다. 바람직하게는 해충은 살충-유효량의 폴리펩티드를 섭취하거나, 그와 접촉된다. "살충-유효량"은 적어도 1종의 해충에 대한 사멸을 야기하거나, 해충 성장, 섭식, 또는 정상적인 생리학적 발달을 현저하게 감소시킬 수 있는 살충제의 양으로 의도된다. 이 양은 예를 들어, 방제되는 특정 표적 해충, 처리되는 특정 환경, 위치, 식물, 작물, 또는 농업 지역, 환경 조건, 및 살충-유효 폴리펩티드 조성물의 시용의 방법, 속도, 농도, 안정성, 및 양과 같은 인자에 따라 다양할 것이다. 제제 또는 조성물은 또한 기후 조건, 환경적 고려사항, 및/또는 시용의 빈도 및/또는 해충 감염의 심각도에 관하여 다양할 수 있다.
활성 성분은 통상적으로 조성물의 형태로 시용되며, 처리되는 작물 영역, 식물, 또는 종자에 시용될 수 있다. 따라서, 식물, 식물 세포, 종자, 식물 부분 또는 재배의 영역에 폴리펩티드, 재조합유전성 폴리펩티드 또는 그의 활성 변이체 또는 단편을 포함하는 농업 조성물의 유효량을 제공하는 방법이 제공된다. "유효량"은 단백질 또는 조성물의 양이 해충을 죽이거나 방제하거나, 해충 성장, 섭식, 또는 정상적인 생리학적 발달의 현저한 감소를 초래하는 데 충분한 살충 활성을 갖는 양으로 의도된다. 이러한 해충 수, 해충 성장, 해충 섭식 또는 해충 정상적인 발달의 감소는 예를 들어 약 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 85%, 90%, 95% 이상의 감소를 비롯한, 임의의 통계적으로 유의한 감소를 포함할 수 있다. 예를 들어, 조성물은 곡물저장 빈 또는 사일로 등에서의 저장 동안 제제 중의 곡물에 시용될 수 있다. 조성물은 다른 화합물과 동시에 또는 연속적으로 시용될 수 있다. 본원에 개시된 바와 같은 폴리펩티드, 재조합유전성 폴리펩티드, 또는 그의 변이체 또는 단편 중 적어도 1종을 포함하는 활성 성분 또는 농약 조성물을 시용하는 방법으로는 엽면 시용, 종자 코팅, 및 토양 시용을 들 수 있으나, 이에 제한되지는 않는다.
식물 수율을 증가시키는 방법이 제공된다. 방법은 본원에 개시된 살충 폴리펩티드 서열을 코딩하는 폴리뉴클레오티드를 발현하는 식물 또는 식물 세포를 제공하고, 식물 또는 그의 종자를 상기 폴리펩티드가 그에 대해 살충 활성을 갖는 해충으로 감염된 (또는 그에 의한 감염에 감수성인) 재배지에서 성장시키는 것을 포함한다. 일부 실시양태에서, 폴리펩티드는 인시목, 딱정벌레목, 쌍시목, 반시목, 또는 선충 해충에 대해 살충 활성을 가지며, 상기 밭은 인시목, 반시목, 딱정벌레목, 쌍시목, 또는 선충 해충으로 감염된다. 본원에 정의된 바와 같은 식물의 "수율"은 식물에 의해 생산되는 바이오매스의 질 및/또는 양을 지칭한다. "바이오매스"는 임의의 측정되는 식물 생성물로 의도된다. 바이오매스 생산의 증가는 측정되는 식물 생성물의 수율의 임의의 개선이다. 식물 수율을 증가시키는 것은 몇몇 상업적 적용을 갖는다. 예를 들어, 식물 잎 바이오매스를 증가시키는 것은 인간 또는 동물 소비를 위해 잎 채소의 수율을 증가시킬 수 있다. 추가적으로, 잎 바이오매스를 증가시키는 것은 식물-유래된 제약 또는 공업 제품의 생산을 증가시키는 데 사용될 수 있다. 수율의 증가는 살충 서열을 발현하지 않는 식물에 비해 수율의 적어도 1% 증가, 적어도 3% 증가, 적어도 5% 증가, 적어도 10% 증가, 적어도 20% 증가, 적어도 30%, 적어도 50%, 적어도 70%, 적어도 100% 이상 증가를 포함하나 이에 제한되지 않는 임의의 통계적으로 유의한 증가를 포함할 수 있다. 구체적인 방법에서, 식물 수율은 본원에 개시된 살충 단백질을 발현하는 식물의 개선된 해충 저항성의 결과로서 증가된다. 살충 단백질의 발현은 감염시키거나 섭식하는 해충의 감소된 능력을 초래한다.
식물은 또한 1종 이상의 제초제, 살곤충제, 또는 살진균제, 또는 이들의 2종 이상의 조합을 비롯한, 1종 이상의 화학 조성물로 처리될 수 있다.
비-제한적 실시양태는 하기를 포함한다:
1. (a) 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열을 포함하는 폴리펩티드; 또는
(b) 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 표 1에 제시된 퍼센트 서열 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드
를 포함하는, 살충 활성을 갖는 단리된 폴리펩티드.
2. 상기 폴리펩티드가 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159에 제시된 아미노산 서열을 포함하는 것인 실시양태 1의 폴리펩티드.
3. 실시양태 1 또는 2의 폴리펩티드를 포함하는 조성물.
4. 이종 아미노산 서열을 더 포함하는 실시양태 2의 폴리펩티드.
5. 재조합 핵산 분자가 상기 폴리펩티드를 코딩하는 천연 발생 서열이 아닌, 실시양태 1의 폴리펩티드를 코딩하는 재조합 핵산 분자.
6. 상기 핵산 분자가 식물에서의 발현용으로 설계된 합성 서열인 실시양태 5의 재조합 핵산.
7. 상기 핵산 분자가 식물 세포에서의 발현을 지정할 수 있는 프로모터에 작동적으로 연결된 것인 실시양태 6의 재조합 핵산 분자.
8. 상기 핵산 분자가 박테리아에서의 발현을 지정할 수 있는 프로모터에 작동적으로 연결된 것인 실시양태 5의 재조합 핵산 분자.
9. 실시양태 8의 재조합 핵산 분자를 함유하는 숙주 세포.
10. 상기 숙주 세포가 박테리아 숙주 세포인 실시양태 9의 숙주 세포.
11. (a) 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159 중 어느 하나의 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열; 또는
(b) 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 표 1에 제시된 퍼센트 서열 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열
을 포함하는 재조합 핵산 분자에 작동적으로 연결된 식물 세포에서의 발현을 유도하는 프로모터를 포함하는 DNA 구축물.
12. 상기 뉴클레오티드 서열이 식물에서의 발현용으로 설계된 합성 DNA 서열인 실시양태 11의 DNA 구축물.
13. 실시양태 11의 DNA 구축물을 포함하는 벡터.
14. 실시양태 11 내지 13 중 어느 하나의 DNA 구축물을 함유하는 숙주 세포.
15. 숙주 세포가 식물 세포인 실시양태 14의 숙주 세포.
16. 실시양태 15의 숙주 세포를 포함하는 트랜스제닉 식물.
17. 실시양태 10의 숙주 세포를 포함하는 조성물.
18. 상기 조성물이 분말, 더스트, 펠릿, 과립, 분무제, 에멀젼, 콜로이드, 및 용액으로 이루어진 군으로부터 선택되는 것인 실시양태 17의 조성물.
19. 상기 조성물이 약 1% 내지 약 99 중량%의 상기 폴리펩티드를 포함하는 것인 실시양태 17의 조성물.
20. 해충 집단을 살충-유효량의 실시양태 3 또는 17의 조성물과 접촉시키는 것을 포함하는 해충 집단을 방제하는 방법.
21. 해충 집단을 살충-유효량의 실시양태 3 또는 17의 조성물과 접촉시키는 것을 포함하는 해충 집단을 죽이는 방법.
22. 실시양태 9의 숙주 세포를 폴리펩티드를 코딩하는 핵산 분자가 발현되는 조건 하에서 배양하는 것을 포함하는, 살충 활성을 갖는 폴리펩티드를 제조하는 방법.
23. 뉴클레오티드가
(a) 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159 중 어느 하나의 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열; 또는
(b) 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 표 1에 제시된 퍼센트 서열 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열
을 포함하는 것인, 살충 활성을 갖는 단백질을 코딩하는 뉴클레오티드 서열을 포함하는 DNA 구축물이 그의 게놈 내로 안정하게 혼입된 식물.
24. 실시양태 23의 식물의 트랜스제닉 종자.
25. 뉴클레오티드 서열이
(a) 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159 중 어느 하나의 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열; 또는
(b) 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 표 1에 제시된 퍼센트 서열 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열
을 포함하는 것인, 살충 폴리펩티드를 코딩하는 뉴클레오티드 서열을 식물 또는 그의 세포에서 발현시키는 것을 포함하는, 곤충 해충으로부터 식물을 보호하는 방법.
26. 상기 식물이 인시목 또는 딱정벌레목 해충 또는 반시목 해충에 대한 살충성을 갖는 살충 폴리펩티드를 생산하는 것인 실시양태 25의 방법.
27. 뉴클레오티드 서열이
(a) 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159 중 어느 하나의 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열; 또는
(b) 서열식별번호: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 및/또는 159에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 표 1에 제시된 퍼센트 서열 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열
을 포함하는 것인, 살충 폴리펩티드를 코딩하는 뉴클레오티드 서열에 작동적으로 연결된 식물에서의 발현을 유도하는 프로모터를 포함하는 DNA 구축물이 그의 게놈 내로 안정하게 혼입된 식물 또는 그의 종자를 재배지에서 성장시키는 것을 포함하는, 식물의 수율을 증가시키는 방법.
하기 실시예는 예시의 방식으로 제공되며, 제한의 방식으로가 아니다.
실시예
실시예 1: 시퀀싱 및 DNA 분석에 의한 신규 유전자의 발견
미생물 배양물을 액체 배양물에서 표준 실험실 배지에서 성장시켰다. 배양물을 DNA 제조 전에 포화까지 (16 내지 24시간) 성장시켰다. DNA를 세정제 용해에 의해 박테리아 세포로부터 추출한 후, 실리카 매트릭스에 결합시키고, 에탄올 완충제로 세척하였다. 정제된 DNA를 실리카 매트릭스로 온건한 알칼리성 수성 완충제로 용리시켰다.
시퀀싱용 DNA를 분광광도법에 의해 순도 및 농도에 대해 시험하였다. 시퀀싱 라이브러리를 넥스테라 (Nextera) XT 라이브러리 제조 키트를 제조자의 프로토콜에 따라 사용하여 제조하였다. 서열 데이터를 HiSeq 2000 상에서 일루미나 (Illumina) HiSeq 2000 시스템 유저 가이드 (System User Guide) 프로토콜에 따라 생성하였다.
시퀀싱 판독물을 CLC 바이오 어셈블리 셀 (Bio Assembly Cell) 소프트웨어 패키지를 사용하여 드래프트 게놈 내로 어셈블리하였다. 어셈블리 후, 유전자 콜 (call)을 몇몇 방법에 의해 제조하고, 생성된 유전자 서열을 조사하여 살충 유전자의 신규 상동체를 확인하였다. 신규 유전자를 BLAST에 의해, 도메인 조성물에 의해, 및 살충 유전자의 표적 세트에 대한 쌍별 정렬에 의해 확인하였다. 이러한 서열의 요약을 표 1에 제시한다.
상동성 검색에서 확인된 유전자를 PCR에 의해 박테리아 DNA로부터 증폭시키고, 융합된 인-프레임 정제 태그를 함유하는 박테리아 발현 벡터 내로 클로닝하였다. 클론된 유전자를 이. 콜라이에서 발현시키고, 컬럼 크로마토그래피에 의해 정제하였다. 정제된 단백질을 곤충 식사 생물검정 연구에서 평가하여 활성 단백질을 확인하였다.
곤충 식사 생물검정을 정제된 단백질이 표면 처리로서 적용되는 밀 싹 및 한천 인공 식사를 사용하여 수행하였다. 곤충 유충을 처리된 식사에 적용하고, 사멸률에 대해 모니터링하였다.
곤충 식사 생물검정을 정제된 단백질이 첨가되는 막 사세에 함유된 수크로스 액체 식사를 사용하여 수행하였다. 곤충 약충을 식사 사세 상에서 섭식하게 하고, 사멸률에 대해 모니터링하였다. 생물검정에서 시험된 곤충은 갈색 노린재 (BSB), 유스키스투스 세르부스, 및 남부 녹색 노린재 (SGSB), 네자라 비리둘라를 포함하였다. 데이터를 하기에서 표 5에 열거한다.
<표 5>
생물검정 결과
BSB = 갈색 노린재, SGSB = 남부 녹색 노린재
실시예 2. 이. 콜라이에서의 이종 발현
표 6 및 7에 제시된 각각의 오픈 리딩 프레임을 말토스 결합 단백질 (pMBP)을 함유하는 이. 콜라이 발현 벡터 내로 클로닝하였다. 발현 벡터를 BL21*RIPL 내로 형질전환시켰다. 카르베니실린으로 보충된 LB 배양물을 단일 콜로니로 접종시키고, 37℃에서 0.5%의 밤샘 배양물을 사용하여 밤새 성장시키고, 신선한 배양물을 접종하고, 37℃에서 대수 기로 성장시켰다. 배양을 250 mM IPTG를 사용하여 18시간 동안 16℃에서 유도하였다. 세포를 펠릿화하고, 프로테아제 억제제로 보충된 10 mM 트리스 (Tris) pH 7.4 및 150 mM NaCl에 재현탁시켰다. 단백질 발현을 SDS-PAGE에 의해 평가하였다.
실시예 3. 딱정벌레목 및 인시목에 대한 살충 활성
단백질 발현 : 표 6에 제시된 각각의 서열을 실시예 2에 기재된 바와 같이 이. 콜라이에서 발현시켰다. LB 400 mL를 접종하고, 0.6의 OD600으로 성장시켰다. 배양을 0.25 mM IPTG로 16℃에서 밤새 유도하였다. 세포를 스핀 다운시키고, 세포 펠릿을 완충제 5 mL에 재현탁시켰다. 재현탁액을 얼음 상에서 2분 동안 초음파처리하였다.
생물검정 : 가을 거염벌레 (FAW), 옥수수 이삭 벌레 (CEW), 유럽 옥수수 천곤충 (ECB) 남서부 옥수수 천곤충 (SWCB) 및 배추좀나방 (DBM) 알을 상업적 곤충사육실 (벤존 리서치 인코포레이티드 (Benzon Research Inc.), 미국 펜실베니아주 칼리슬)로부터 구입하였다. FAW, CEW, ECB 및 BCW 알을 부화가 검정 설정의 12시간 내에 일어날 시점으로 인큐베이션하였다. SWCB 및 DBM을 신생아 유충으로서 검정에 도입하였다. 검정을 다중종 인시목 식사 (사우쓰랜드 프로덕츠 인코포레이티드 (Southland Products Inc.), 미국 알칸사스주 레이크 빌리지)를 함유하는 24-웰 트레이에서 수행하였다. 초음파처리된 용해물의 샘플을 식사의 표면 (식사 오버레이)에 적용하고, 증발시키고, 식사 내로 흡인하였다. CEW, FAW, BCW, ECB 및 SWCB에 대해, 초음파처리된 용해물 125 μl를 식사 표면에 첨가하고, 건조시켰다. DBM에 대해, 초음파처리된 용해물의 1:2 희석액 50 μl를 식사 표면에 첨가하였다. 생물검정 플레이트를 핀 홀로 배기된 플레이트 밀봉 필름으로 밀봉하였다. 플레이트를 26℃에서 65% 상대 습도 (RH)에서 16:8 낮:밤 주기로 퍼시벌 (Percival)에서 5일 동안 인큐베이션하였다. 검정을 사멸률, 성장 억제 및 섭식 억제의 수준에 대해 평가하였다.
서부 옥수수 뿌리벌레 생물검정을 위해, 단백질 구축물/용해물을 웰/s 24-웰 플레이트 (셀스타 (Cellstar), 24-웰, 그레이너 바이오 원 (Greiner Bio One))에서 식사의 상부 표면 상에 60 μl 부피를 분산시킴으로써 곤충 생물검정에서 평가하고, 건조시켰다. 각각의 웰은 500 μl 식사를 함유하였다 (Marrone et al., 1985). 15 내지 20마리의 신생아 유충을 정밀 팁 페인트 브러쉬를 사용하여 각각의 웰에서 도입하고, 플레이트를 막 (뷰실 (Viewseal), 그레이너 바이오 원)으로 덮었다. 생물검정을 주위 온도에서 저장하고, 제4일에 사멸률, 및/또는 성장/섭식 억제에 대해 점수화하였다. 도 7은 옥수수 뿌리 벌레 생물검정에 대한 검정 점수화 지침을 제공한다.
콜로라도 감자 딱정벌레 (CPB)에 대해, 코르크 천공 크기 번호 8 잎 원반을 감자 잎으로부터 잘라내고, 철저하게 습윤될 때까지 단백질 구축물/용해물에 침지시키고, 필터 원반 (밀리포어 (Millipore), 유리 섬유 필터, 13 mm)의 상부 상에 정치하였다. 60 μl dH2O를 각각의 필터 원반에 첨가하고, 24-웰 플레이트 (셀스타, 24-웰, 그레이너 바이오 원)의 각각의 웰에 정치하였다. 잎 원반을 건조시키고, 5 내지 7마리의 제1령 유충을 정밀 팁 페인트 브러쉬를 사용하여 각각의 웰에서 도입하였다. 플레이트를 막 (뷰실, 그레이너 바이오 원)으로 덮고, 작은 구멍을 막의 각각의 웰에서 천공하였다. 구축물을 4개의 사본으로 평가하고, 제3일에 사멸률 및 잎 손상에 대해 점수화하였다.
표 6은 다양한 서열의 딱정벌레목 및 인시목에 대한 살충 활성의 요약을 제공한다. 표 부호: "-"는 나타나는 활성이 없음을 지시하고; "NT"는 시험되지 않음을 지시하고; "S"는 성장방해를 지시하고; "SS"는 약간 성장방해를 지시하고; "LF"는 낮은 섭식을 지시한다.
<표 6>
딱정벌레목 및 인시목에 대한 살충 활성의 요약.
실시예 4. 반시목에 대한 살충 활성
단백질 발현: 표 7에 제시된 서열 각각을 실시예 2에 기재된 바와 같이 이. 콜라이에서 발현시켰다. LB 400 mL를 접종하고, 0.6의 OD600으로 성장시켰다. 배양을 0.25 mM IPTG로 16℃에서 밤새 유도하였다. 세포를 스핀 다운시키고, 세포 펠릿을 완충제 5 mL에 재현탁시켰다. 재현탁액을 얼음 상에서 2분 동안 초음파처리하였다.
제2령 SGSB를 상업적 곤충사육실 (벤존 리서치 인코포레이티드, 미국 펜실베니아주 칼리슬)로부터 얻었다. 50% v/v 비의 초음파처리된 용해물 샘플 대 20% 수크로스를 생물검정에서 채용하였다. 신장된 파라필름을 섭식 막으로서 사용하여 SGSB를 식사/샘플 혼합물에 노출시켰다. 플레이트를 25℃:21℃에서, 16:8 낮:밤 주기로 65% RH에서 5일 동안 인큐베이션하였다.
사멸률을 각각의 샘플에 대해 점수화하였다. 결과를 표 7에 제시한다. 점선은 사멸률이 검출되지 않았음을 지시한다. 단백질 (APG00034)은 남부 녹색 노린재에 대해 25% 사멸률을 나타내었다 (4마리 중 1마리의 노린재가 죽었음). 음성 대조군 (결합 도메인 및 완충제만이 발현된 빈 벡터)은 둘 다 사멸률을 나타내지 않았다 (4마리 중 0마리의 노린재).
<표 7>
반시목에 대한 살충 활성의 요약
실시예 5. 대두의 형질전환
식물에서 활성인 프로모터에 작동적으로 연결된 서열식별번호: 1 내지 159 각각 또는 그의 활성 변이체 또는 단편을 포함하는 DNA 구축물을 형질전환 벡터 내로 클로닝하고, 그 전문이 본원에 참조로 포함되는 2015년 12월 19일에 출원된 미국 가출원 제62/094,782호에 기재된 바와 같이 아그로박테리움 내로 도입하였다.
접종 4일 전에, 아그로박테리움의 몇몇 루프를 적절한 항생제** (스펙티노마이신, 클로람페니콜 및 카나마이신)로 보충된 YEP* 배지의 신선한 플레이트에 스트리킹하였다. 박테리아를 28℃에서 어둠에서 2일 동안 성장시켰다. 2일 후, 박테리아의 몇몇 루프를 125 ml 엘른마이어 (Erlenmeyer) 플라스크 중에서 항생제를 갖는 YEP 액체 배지 3 ml에 옮겼다. 플라스크를 250 RPM에서 28℃에서 밤새 회전 진탕기 상에 정치하였다. 접종 1일 전에, 밤샘 배양물 2 내지 3 ml를 500 ml 엘른마이어 플라스크 중에서 항생제를 갖는 YEP 125 ml에 옮겼다. 플라스크를 250 RPM에서 28℃에서 밤새 회전 진탕기 상에 정치하였다.
접종 전에, 박테리아 배양물의 OD를 OD 620에서 점검하였다. 0.8 내지 1.0의 OD는 배양물이 대수 기에 있음을 지시한다. 배양물을 4000 RPM에서 10분 동안 오크리지 (Oakridge) 튜브에서 원심분리하였다. 상청액을 버리고, 펠릿을 대두 감염 배지 (Soybean Infection Medium) (SI)의 부피에 재-현탁시켜 바람직한 OD를 달성하였다. 배양물을 접종에 필요할 때까지 주기적 혼합으로 유지하였다.
접종 2 또는 3일 전에, 대두 종자를 염소 기체를 사용하여 표면 살균하였다. 흄 후드에서, 종자를 갖는 페트리 디쉬를 뚜껑이 벗겨진 벨 자에 정치하였다. 12 N HCl 1.75 ml를 벨 자 내부의 250 ml 엘른마이어 플라스크에서 표백제 100 ml에 서서히 첨가하였다. 뚜껑을 벨 자의 상부에 즉시 정치하였다. 종자를 14 내지 16시간 (밤새) 동안 멸균하였다. 상부를 벨 자로부터 제거하고, 페트리 디쉬의 뚜껑을 교체하였다. 그 후, 표면이 멸균된 페트리 디쉬를 대략 30분 동안 층류에서 개방하여 임의의 잔류하는 염소 기체를 분산시켰다.
종자를 멸균 탈이온수 또는 대두 감염 배지 (SI) 중 어느 하나로 1 내지 2일 동안 흡수시켰다. 20 내지 30개의 종자를 100x25 mm 페트리 디쉬에서 액체로 덮고, 24℃에서 어둠에서 인큐베이션하였다. 흡수 후, 비-발아 종자를 버렸다.
자엽 외식편을 멸균 페이퍼 플레이트 상에서 프로세싱하였으며, 멸균 필터 페이퍼는 본원에 참조로 포함되는 미국 특허 제7,473,822호의 방법을 채용하여 SI 배지를 사용하여 적셨다.
전형적으로, 처리 당 16 내지 20개의 자엽을 접종하였다. 외식편을 유지하는 데 사용된 SI 배지를 버리고, 아그로박테리움 배양물 (OD 620=0.8 내지 20) 25 ml로 대체하였다. 모든 외식편을 침지시키고, 접종을 30분 동안 디쉬를 주기적으로 소용돌이치면서 수행하였다. 30분 후, 아그로박테리움 배양물을 제거하였다.
공동-재배 플레이트를 한 조각의 멸균 페이퍼를 대두 공동-재배 배지 (SCC) 상으로 중첩시킴으로써 제조하였다. 블롯팅하지 않고, 접종된 자엽을 필터 페이퍼 상에서 향축 측을 아래로 하여 배양하였다. 대략 20개의 외식편을 각각의 플레이트 상에서 배양할 수 있다. 플레이트를 파라필름으로 밀봉하고, 24℃ 및 대략 120 μmole m-2s-1 (퍼시벌 인큐베이터)에서 4 내지 5일 동안 배양하였다.
공동-재배 후에, 자엽을 세포탁심 및 티멘틴 200 mg/l를 갖는 대두 세척 배지 (Soybean Wash Medium) 25 ml에서 3회 세척하였다. 자엽을 멸균 필터 페이퍼 상에 플롯팅한 후, 대두 싹 유도 배지 (Soybean Shoot Induction Medium) (SSI)로 옮겼다. 외식편의 마디 말단을 표면 위로 약 45도에서 유지된 원위 말단을 갖는 배지 내로 약간 밀었다. 10개 이하의 외식편을 각각의 플레이트 상에서 배양하였다. 플레이트를 마이크로포어 (Micropore) 테이프로 감고, 퍼시벌에서 24℃ 및 대략 120 μmoles m-2s-1에서 배양하였다.
외식편을 14일 후에 신선한 SSI 배지로 옮겼다. 생장점 및 자엽 마디로부터 나타나는 싹을 버렸다. 싹 유도를 동일한 조건 하에서 추가의 14일 동안 계속하였다.
싹 유도 4주 후에, 자엽을 마디 말단으로부터 분리하고, 평행 커팅을 싹 유도의 영역 (싹 패드) 아래에서 수행하였다. 평행 커팅의 영역을 대두 싹 신장 배지 (Soybean Shoot Elongation Medium) (SSE) 상에 정치하고, 외식편을 퍼시벌에서 24℃ 및 대략 120 μmoles m-2s-1에서 배양하였다. 이 단계를 싹이 신장하기를 계속하는 한, 8주까지 동안 2주마다 반복하였다.
싹이 2 내지 3 cm의 길이에 도달한 경우, 이들을 플랜트콘 (Plantcon) 용기 중의 대두 발근 배지 (Soybean Rooting Medium) (SR)로 옮기고, 동일한 조건 하에서 2주 동안 또는 뿌리가 대략 3 내지 4 cm의 길이에 도달할 때까지 인큐베이션하였다. 그 후, 식물을 토양으로 옮겼다.
대두 형질전환에 대해 모니터링된 모든 배지는 그 전문이 본원에 참조로 포함되는 문헌 [Paz et al. (2010) Agrobacterium-mediated transformation of soybean and recovery of transgenic soybean plants; Plant Transformation Facility of Iowa State University]에서 발견됨을 유념한다. (agron-www.agron.iastate.edu/ptf/protocol/Soybean.pdf 참조)
실시예 6. 메이즈의 형질전환
메이즈 이삭은 수분 후 8 내지 12일에 가장 양호하게 수집된다. 배아를 이삭으로부터 단리하며, 크기가 0.8 내지 1.5 mm인 배아가 형질전환에 사용하기에 바람직하다. 배아를 적합한 인큐베이션 배지, 예컨대 DN62A5S 배지 (3.98 g/L N6 염; 1 mL/L (of 1000X 스톡) N6 비타민; 800 mg/L L-아스파라긴; 100 mg/L 미오-이노시톨; 1.4 g/L L-프롤린; 100 mg/L 카스아미노산; 50 g/L 수크로스; 1 mL/L (1 mg/mL 스톡의) 2,4-D) 상에 반상체 측이 위로 가도록 플레이팅하였다. 그러나, DN62A5S 이외의 배지 및 염은 적합하며, 관련 기술분야에 공지되어 있다. 배아를 25℃에서 어둠에서 밤새 인큐베이션하였다. 그러나, 배아를 밤새 인큐베이션하는 것 그 자체가 반드시 필요하지는 않다.
생성된 외식편을 메쉬 스퀘어 (플레이트 당 30 내지 40)로 옮기고, 삼투 배지 상으로 약 30 내지 45분 동안 옮긴 후, 비밍 (beaming) 플레이트로 옮겼다 (예를 들어, PCT 공개 제WO/0138514호 및 미국 특허 제5,240,842호 참조). 식물 세포에서 본 발명의 GRG 단백질을 발현하도록 설계된 DNA 구축물을 본질적으로 PCT 공개 제WO/0138514호에 기재된 바와 같은 조건을 사용하여, 에어로졸 빔 가속기를 사용하여 식물 조직 내로 가속화하였다. 비밍 후, 배아를 삼투 배지 상에서 약 30분 동안 인큐베이션하고, 25℃에서 어둠에서 밤새 인큐베이션 배지 상으로 정치하였다. 비밍된 외식편이 과도하게 손상되는 것을 회피하기 위해, 이들을 회수 배지로 옮기기 전에 적어도 24시간 동안 인큐베이션하였다. 그 후, 배아를 약 5일 동안, 25℃에서 어둠에서 회수 기간 배지 상으로 펼친 후, 선별 배지로 옮겼다. 외식편을 이용되는 특정 선별의 성질 및 특징에 따라 8주까지 동안 선별 배지에서 인큐베이션하였다. 선별 기간 후에, 생성된 캘러스를 성숙한 체세포 배아의 형성이 관찰될 때까지 배아 성숙 배지로 옮겼다. 그 후, 생성된 성숙한 체세포 배아를 낮은 빛 하에 정치하고, 재생의 과정을 관련 기술분야에 공지된 방법에 의해 개시하였다. 생성된 싹을 발근 배지 상에서 발근하게 하고, 생성된 식물을 육모 화분으로 옮기고, 트랜스제닉 식물로서 번식시켰다.
실시예 7. 선충에 대한 살충 활성
헤테로데라
글리신 (
Heterodera
glycine
)의 (대두 낭포
선충
)
시험관내
검정
대두 낭포 선충을 웰 당 100 uls 및 100 J2의 총 부피를 갖는 96 웰 검정 플레이트 내로 분배시켰다. 서열식별번호: 1 내지 159 중 어느 하나에 제시된 바와 같은 관심의 단백질을 웰 내로 분배시키고, 평가를 위해 실온에서 유지하였다. 마지막으로, SCN J2를 함유하는 96 웰 플레이트를 운동성에 대해 분석하였다. 데이터를 대조군과 비교한 % 억제로서 보고하였다. 히트는 70% 억제 이상인 것으로서 정의된다.
헤테로데라 글리신의 (대두 낭포 선충) 식물-상 검정
서열식별번호: 1 내지 159 중 하나 이상을 발현하는 대두 식물을 본원의 다른 곳에 기재된 바와 같이 생성하였다. 3-주령 대두 커팅물을 식물 당 5000 SCN 알로 접종하였다. 이 감염을 70일 동안 유지한 후, 식물 상에서 발달된 SCN 낭포의 카운팅을 위해 수확하였다. 데이터를 대조군과 비교한 % 억제로서 보고하였다. 히트는 90% 억제 이상인 것으로서 정의된다.
멜로이도기네 인코그니타 (Meloidogyne incognita) (뿌리-혹 선충) 시험관내 검정
뿌리-혹 선충을 웰 당 100 uls 및 100 J2의 총 부피를 갖는 96 웰 검정 플레이트 내로 분배시켰다. 서열식별번호: 1 내지 159 중 어느 하나를 포함하는 관심의 단백질을 웰 내로 분배시키고, 평가를 위해 실온에서 유지하였다. 마지막으로, RKN J2를 함유하는 96 웰 플레이트를 운동성에 대해 분석하였다. 데이터를 대조군과 비교한 % 억제로서 보고하였다. 히트는 70% 억제 이상인 것으로서 정의된다.
멜로이도기네
인코그니타 (뿌리-혹 선충) 식물-상 검정
서열식별번호: 1 내지 159 중 하나 이상을 발현하는 대두 식물을 본원의 다른 곳에 기재된 바와 같이 생성하였다. 3-주령 대두 커팅물을 식물 당 5000 RKN 알로 접종하였다. 이 감염을 70일 동안 유지한 후, 식물에서 발달된 RKN 알의 카운팅을 위해 수확하였다. 데이터를 대조군과 비교한 % 억제로서 보고하였다. 히트는 90% 억제 이상인 것으로서 정의된다.
실시예 8. 살충 활성에 대한 추가의 검정
서열식별번호: 1 내지 159에 제시된 다양한 폴리펩티드를 다수의 방식으로 해충에 대해 살충제로서 작용하는 것을 시험할 수 있다. 이러한 방법 중 하나는 섭식 검정을 수행하는 것이다. 이러한 섭식 검정에서, 해충을 시험되는 화합물을 함유하는 샘플 또는 대조군 샘플 중 어느 하나에 노출시킨다. 종종 이는 시험되는 물질, 또는 이러한 물질의 적합한 희석액을 해충이 섭취할 물질, 예컨대 인공 식사 상으로 정치함으로써 수행된다. 시험되는 물질은 액체, 고체, 또는 슬러리로 구성될 수 있다. 시험되는 물질을 표면 상에 정치한 후, 건조시킬 수 있다. 대안적으로, 시험되는 물질을 용융된 인공 식사와 혼합한 후, 검정 챔버 내로 분배시킬 수 있다. 검정 챔버는 예를 들어, 컵, 디쉬, 또는 미세역가 플레이트의 웰일 수 있다.
해충 (예를 들어 진딧물)을 흡인하기 위한 검정은 시험 물질을 곤충으로부터 칸막이, 이상적으로 곤충을 흡인하는 입 부분을 흡인함으로써 뚫어질 수 있는 부분에 의해 분리하여, 시험 물질을 섭취하게 하는 것을 포함할 수 있다. 종종 시험 물질을 섭식 자극제, 예컨대 수크로스와 혼합하여 시험 물질의 섭취를 촉진시킨다.
다른 유형의 검정은 해충의 입, 또는 장 내로의 시험 물질의 미세주사, 뿐만 아니라 트랜스제닉 식물의 발달 후, 트랜스제닉 식물에 대해 섭식하는 해충의 능력의 시험을 포함할 수 있다. 식물 시험은 정상적으로 소비되는 식물 부분, 예를 들어 잎에 부착된 작은 케이지의 단리, 또는 곤충을 함유하는 케이지에서의 전체 식물의 단리를 포함할 수 있다.
해충을 검정하는 다른 방법 및 접근법은 관련 기술분야에 공지되어 있으며, 예를 들어 문헌 [Robertson and Preisler, eds. (1992) Pesticide bioassays with arthropods, CRC, Boca Raton, Fla]에서 발견될 수 있다. 대안적으로, 검정은 통상적으로 저널 [Arthropod Management Tests and Journal of Economic Entomology]에 또는 미국 곤충 학회 (Entomological Society of America) (ESA)의 구성원으로의 논의에 의해 기재되어 있다. 서열식별번호: 1 내지 159 중 어느 하나는 본원의 실시예 3 및 4에 제시된 바와 같은 검정에에서 발현되고, 채용될 수 있다.
본 명세서에 언급된 모든 간행물 및 특허 출원은 이 발명이 속하는 관련 기술분야의 통상의 기술자의 수준을 지시한다. 모든 간행물 및 특허 출원은 각각의 개별 간행물 또는 특허 출원이 구체적으로 및 개별적으로 참조로 포함되는 것으로 지시된 것과 동일한 정도로 본원에 참조로 포함된다.
상기 발명은 이해의 명확성을 목적으로 예시 및 예의 방식에 의해 일부 상세하게 설명되었지만, 첨부된 청구범위의 범위 내에서 특정 변화 및 변형이 실시될 수 있음이 자명할 것이다.
SEQUENCE LISTING
<110> Parks, Jessica
Roberts, Kira Bulazel
Thayer, Rebecca E
<120> PESTICIDAL GENES AND METHODS OF USE
<130> 098699-0965873
<150> US 60/095,524
<151> 2014-12-22
<160> 167
<170> PatentIn version 3.5
<210> 1
<211> 885
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 1
Met Ala Asn Gly Thr Lys Lys His Val Ile Lys Lys Tyr Leu Glu Leu
1 5 10 15
Ile Lys Pro Ile Tyr Tyr Lys Gln Asn Ser Leu Leu Phe Ser Pro Phe
20 25 30
Ile Leu Ser Ile Pro Gly Cys Asp Ile Ser Gln His Gly Ser Ile Pro
35 40 45
Ala Met Lys Arg Asn Cys Phe Thr Leu Leu Leu Tyr Tyr Arg Arg Tyr
50 55 60
Val Met Ala Gln Leu Pro Glu Leu Gln Ala Gln Ser Phe Ile Pro Tyr
65 70 75 80
Asn Val Leu Ala Asn Pro Val Thr Gly Asn Ser Ala Gly Thr Asp Trp
85 90 95
Gly Glu Trp Leu Ser Ser Leu Lys Glu Ala Trp Thr Glu Phe Gln Lys
100 105 110
Thr Gly Ser Asp Lys Phe Leu Gln Ser Val Leu Ser Thr Gly Tyr Asp
115 120 125
Ala Ala Gln Gly Gly Ser Phe Asn Tyr Ile Ala Val Leu Gln Ala Gly
130 135 140
Val Ala Ile Leu Gly Thr Val Leu Gly Ala Glu Phe Pro Gly Ile Thr
145 150 155 160
Ala Ile Ala Val Pro Ala Leu Thr Phe Leu Ile Lys Trp Val Trp Pro
165 170 175
Lys Leu Thr Gly Gly Ser Ala Gly Glu Ser Asp Thr Gln Lys Met Leu
180 185 190
Asp Leu Ile Asp Lys Glu Ile Gln Gln Lys Leu Asp Gln Ala Leu Ser
195 200 205
Gln Gln Asp Ile Asn Asn Trp Arg Ala Tyr Leu Thr Ser Ile Phe Glu
210 215 220
Ala Ala Thr Asn Leu Ser Gln Ser Ile Ile Asp Ala Gln Phe Thr Gly
225 230 235 240
Thr Glu Arg Ser Ser Asp Arg Gln Pro Arg Val Pro Thr Ser Ser Asp
245 250 255
Tyr Glu Asp Val Tyr Thr Lys Leu Asn Ile Ala Asp Ala Val Leu Ser
260 265 270
Pro Ser Thr Asn Asn Ile Leu Thr Gly Asn Phe Asp Val Leu Ala Ile
275 280 285
Pro Tyr Tyr Thr Ile Gly Ala Thr Val Arg Ile Thr Ala Tyr Gln Ser
290 295 300
Phe Val Lys Phe Ala Asn Gln Trp Ile Asp Val Val Tyr Pro Asp Gln
305 310 315 320
Thr Ser Asn Thr Trp Leu Lys Ala His Gln Asn Leu Asp Asn Ala Lys
325 330 335
Asp Lys Met Ile Ala Met Ile Gln Gln Tyr Thr Ala Asn Val Leu Arg
340 345 350
Val Phe Thr Asp Pro Lys Asn Met Pro Pro Leu Gly Lys Asp Lys Lys
355 360 365
Ser Ile Asn Ala Tyr Asn Lys Phe Val Arg Gly Met Val Ile Ser Gly
370 375 380
Leu Asp Leu Val Ala Val Trp Pro Ser Leu Tyr Pro Asp Asp Tyr Gln
385 390 395 400
Val Gln Thr Asp Leu Glu Lys Thr Arg Val Ile Phe Ser Asp Leu Ile
405 410 415
Gly Lys Asp Gln Thr Ile Ser Asn Asp Val Thr Leu Ile Asn Met Ala
420 425 430
Asp Gly Lys Asp Gly Ser Gly Ala Phe Asn Gln His Ala Ala Ile Asp
435 440 445
Leu Asn Ser Ile Phe Tyr Tyr His Glu Glu Leu Ser Ser Met Gln Leu
450 455 460
Gly Ile His Asp Gly Ser Arg Gly Asn Thr Thr Asn Cys Tyr Asp Tyr
465 470 475 480
Gly Ile Ile Ala Asn Tyr Pro Asn His Ser Tyr Phe Tyr Gly Asp Leu
485 490 495
Tyr Ser Pro Ser Gly Asp Ala Thr Phe Ser Ala Pro Leu Asn Val Leu
500 505 510
Asn Ala Gln Thr Gln Tyr Ser Ser Gly Tyr Val Asp Ser Glu Phe Ile
515 520 525
Tyr Asn Ser Asp Pro Ser Ala Tyr Gly Leu Gly Cys His Ile Leu Gly
530 535 540
Gly Cys Thr Lys Gly Asp Cys Tyr Asn Gln Ser Ser Cys Ser Glu Gly
545 550 555 560
Phe Gly Tyr Ser Cys Asn Thr Ser Leu Pro Ser Gln Lys Ile Asn Ala
565 570 575
Phe Tyr Pro Phe Lys Met Asn Leu Asn Arg Ser Gly Thr Ser Asp Lys
580 585 590
Leu Gly Leu Met Tyr Ser Leu Val Pro Leu Asp Leu Ser Pro Asn Asn
595 600 605
Val Phe Gly Glu Leu Asp Ser Asp Thr Arg Asn Val Ile Gly Lys Gly
610 615 620
Phe Pro Ala Glu Lys Gly Thr Leu Ser Tyr Gly Arg Gln Pro Met Leu
625 630 635 640
Val Arg Glu Trp Val Asn Gly Ala Asn Ala Val Lys Leu Ser Asp Val
645 650 655
Pro Leu Thr Leu Gln Met Thr Asn Met Thr Ser Gly Lys Tyr Tyr Ile
660 665 670
Arg Val Arg Tyr Ala Asn Thr Ser Asn Ala Glu Ile Ser Ile Asn Gln
675 680 685
Gln Val Tyr Ala Gly Asn Gln Asn Val Gln Asn Gly Gly Trp Ser Leu
690 695 700
Pro Ser Thr Thr Ala Ser Asn Val Ala Thr Asn Phe Pro Asn Gln Leu
705 710 715 720
Tyr Ile Thr Gly Ile Asn Gly Asn Tyr Val Leu Glu Asp Ile Thr Trp
725 730 735
Gly Val Asp Gly Gln Gly Lys Pro Ile Thr Ile Pro Ser Gly Asp Val
740 745 750
Thr Val Thr Leu Ser Arg Thr Asn Met Ser Gln Ser Gly Asp Leu Phe
755 760 765
Ile Asp Arg Val Glu Leu Val Pro Ala Ala Pro Gln Ser Asp Tyr Ile
770 775 780
Asn Tyr Ser Ile Thr Ser Gln Ile Pro Asn Cys Asn Asn Phe Asp Thr
785 790 795 800
Ile Ile Trp Asp Gly Asn Gly Ala Gln Arg Ser Thr Ala Thr Val Thr
805 810 815
Val Ser Gly Leu Pro Asp Asn Asn Gly Ile Leu Ile Ser Gly Leu Ser
820 825 830
Ser Tyr Gly Trp Gln Pro Leu Gln Asp Val Asn Gly His Gly Asn Asn
835 840 845
Trp Val Thr Asn Gly Gly Pro Phe Thr Leu Ser Asn Asp His Thr Phe
850 855 860
Thr Ala Ile Ser Ile Cys Gly Phe Gln Asn Val Ser Ser Gly Lys Ile
865 870 875 880
Thr Gly Lys Leu Ser
885
<210> 2
<211> 777
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 2
Met Ala Asn Gly Thr Lys Lys His Val Ile Lys Lys Tyr Leu Glu Leu
1 5 10 15
Ile Lys Pro Ile Tyr Tyr Lys Gln Asn Ser Leu Leu Phe Ser Pro Phe
20 25 30
Ile Leu Ser Ile Pro Gly Cys Asp Ile Ser Gln His Gly Ser Ile Pro
35 40 45
Ala Met Lys Arg Asn Cys Phe Thr Leu Leu Leu Tyr Tyr Arg Arg Tyr
50 55 60
Val Met Ala Gln Leu Pro Glu Leu Gln Ala Gln Ser Phe Ile Pro Tyr
65 70 75 80
Asn Val Leu Ala Asn Pro Val Thr Gly Asn Ser Ala Gly Thr Asp Trp
85 90 95
Gly Glu Trp Leu Ser Ser Leu Lys Glu Ala Trp Thr Glu Phe Gln Lys
100 105 110
Thr Gly Ser Asp Lys Phe Leu Gln Ser Val Leu Ser Thr Gly Tyr Asp
115 120 125
Ala Ala Gln Gly Gly Ser Phe Asn Tyr Ile Ala Val Leu Gln Ala Gly
130 135 140
Val Ala Ile Leu Gly Thr Val Leu Gly Ala Glu Phe Pro Gly Ile Thr
145 150 155 160
Ala Ile Ala Val Pro Ala Leu Thr Phe Leu Ile Lys Trp Val Trp Pro
165 170 175
Lys Leu Thr Gly Gly Ser Ala Gly Glu Ser Asp Thr Gln Lys Met Leu
180 185 190
Asp Leu Ile Asp Lys Glu Ile Gln Gln Lys Leu Asp Gln Ala Leu Ser
195 200 205
Gln Gln Asp Ile Asn Asn Trp Arg Ala Tyr Leu Thr Ser Ile Phe Glu
210 215 220
Ala Ala Thr Asn Leu Ser Gln Ser Ile Ile Asp Ala Gln Phe Thr Gly
225 230 235 240
Thr Glu Arg Ser Ser Asp Arg Gln Pro Arg Val Pro Thr Ser Ser Asp
245 250 255
Tyr Glu Asp Val Tyr Thr Lys Leu Asn Ile Ala Asp Ala Val Leu Ser
260 265 270
Pro Ser Thr Asn Asn Ile Leu Thr Gly Asn Phe Asp Val Leu Ala Ile
275 280 285
Pro Tyr Tyr Thr Ile Gly Ala Thr Val Arg Ile Thr Ala Tyr Gln Ser
290 295 300
Phe Val Lys Phe Ala Asn Gln Trp Ile Asp Val Val Tyr Pro Asp Gln
305 310 315 320
Thr Ser Asn Thr Trp Leu Lys Ala His Gln Asn Leu Asp Asn Ala Lys
325 330 335
Asp Lys Met Ile Ala Met Ile Gln Gln Tyr Thr Ala Asn Val Leu Arg
340 345 350
Val Phe Thr Asp Pro Lys Asn Met Pro Pro Leu Gly Lys Asp Lys Lys
355 360 365
Ser Ile Asn Ala Tyr Asn Lys Phe Val Arg Gly Met Val Ile Ser Gly
370 375 380
Leu Asp Leu Val Ala Val Trp Pro Ser Leu Tyr Pro Asp Asp Tyr Gln
385 390 395 400
Val Gln Thr Asp Leu Glu Lys Thr Arg Val Ile Phe Ser Asp Leu Ile
405 410 415
Gly Lys Asp Gln Thr Ile Ser Asn Asp Val Thr Leu Ile Asn Met Ala
420 425 430
Asp Gly Lys Asp Gly Ser Gly Ala Phe Asn Gln His Ala Ala Ile Asp
435 440 445
Leu Asn Ser Ile Phe Tyr Tyr His Glu Glu Leu Ser Ser Met Gln Leu
450 455 460
Gly Ile His Asp Gly Ser Arg Gly Asn Thr Thr Asn Cys Tyr Asp Tyr
465 470 475 480
Gly Ile Ile Ala Asn Tyr Pro Asn His Ser Tyr Phe Tyr Gly Asp Leu
485 490 495
Tyr Ser Pro Ser Gly Asp Ala Thr Phe Ser Ala Pro Leu Asn Val Leu
500 505 510
Asn Ala Gln Thr Gln Tyr Ser Ser Gly Tyr Val Asp Ser Glu Phe Ile
515 520 525
Tyr Asn Ser Asp Pro Ser Ala Tyr Gly Leu Gly Cys His Ile Leu Gly
530 535 540
Gly Cys Thr Lys Gly Asp Cys Tyr Asn Gln Ser Ser Cys Ser Glu Gly
545 550 555 560
Phe Gly Tyr Ser Cys Asn Thr Ser Leu Pro Ser Gln Lys Ile Asn Ala
565 570 575
Phe Tyr Pro Phe Lys Met Asn Leu Asn Arg Ser Gly Thr Ser Asp Lys
580 585 590
Leu Gly Leu Met Tyr Ser Leu Val Pro Leu Asp Leu Ser Pro Asn Asn
595 600 605
Val Phe Gly Glu Leu Asp Ser Asp Thr Arg Asn Val Ile Gly Lys Gly
610 615 620
Phe Pro Ala Glu Lys Gly Thr Leu Ser Tyr Gly Arg Gln Pro Met Leu
625 630 635 640
Val Arg Glu Trp Val Asn Gly Ala Asn Ala Val Lys Leu Ser Asp Val
645 650 655
Pro Leu Thr Leu Gln Met Thr Asn Met Thr Ser Gly Lys Tyr Tyr Ile
660 665 670
Arg Val Arg Tyr Ala Asn Thr Ser Asn Ala Glu Ile Ser Ile Asn Gln
675 680 685
Gln Val Tyr Ala Gly Asn Gln Asn Val Gln Asn Gly Gly Trp Ser Leu
690 695 700
Pro Ser Thr Thr Ala Ser Asn Val Ala Thr Asn Phe Pro Asn Gln Leu
705 710 715 720
Tyr Ile Thr Gly Ile Asn Gly Asn Tyr Val Leu Glu Asp Ile Thr Trp
725 730 735
Gly Val Asp Gly Gln Gly Lys Pro Ile Thr Ile Pro Ser Gly Asp Val
740 745 750
Thr Val Thr Leu Ser Arg Thr Asn Met Ser Gln Ser Gly Asp Leu Phe
755 760 765
Ile Asp Arg Val Glu Leu Val Pro Ala
770 775
<210> 3
<211> 527
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. from an environmental sample
<400> 3
Met Ala Val Asn Gly Thr Gln Leu Ala Ser Thr Val Thr Thr Val Ser
1 5 10 15
Asn Asp Ile Val Gln Thr Arg Thr Met Val Gln Gly Leu Phe Ser Ala
20 25 30
Asp Asn Pro Thr Gln Leu Ala Pro Gly Val Thr Asp Tyr Trp Ile Asp
35 40 45
Tyr Val His Gln Gln Val Gln Ser Leu Pro Asn Ala Leu Ala Ser Glu
50 55 60
Lys Arg Glu Leu Asn Thr Arg Val Asn Gln Ala Lys Arg Leu Ser Lys
65 70 75 80
Ser Gln Asn Val Leu Gln Ser Asp Ser Met Gly Gly Ala Gly Trp Lys
85 90 95
Leu Gly Arg Lys Val Phe Ile Ser Pro Gly Asp Leu Gln Tyr Lys Gly
100 105 110
Pro Gln Leu Val Leu Pro Pro Ser Ser Leu Val Tyr Pro Ser Tyr Ala
115 120 125
Tyr Gln Thr Val Pro Glu Ser Ala Leu Lys Pro Tyr Thr Arg Tyr Tyr
130 135 140
Leu Ser Val Phe Val Leu Gln Ala Asp Ser Leu His Ile Ile Ser Ser
145 150 155 160
Arg Tyr Gly Asn Ala Ile Ser His Val Val Asn Val Gly Tyr Ser Thr
165 170 175
Ser Met Pro Thr Ser Pro Asp Gly Val Leu Asn Ser Pro His Phe Phe
180 185 190
Gln Tyr Pro Ile Asp Val Gly Ala Val His Pro Asp Ala Asn Leu Gly
195 200 205
Ile Thr Val Gly Phe Thr Val Pro Thr Asn Gly Phe Ala Lys Ile Ser
210 215 220
Gln Val Ser Leu Thr Glu Ala Arg Pro Leu Thr Ala Ala Glu Val Arg
225 230 235 240
Arg Val Gln Asn Gln Asp Glu Gln Trp Phe Gln Pro Tyr Leu Arg Gln
245 250 255
Gln Ala Glu Ile Gln Thr Ser Leu Gln Val Ala Thr Asn Gln Leu Asn
260 265 270
Ala Ile Tyr Asn Ser Thr Asn Trp Ser Gly Ala Ile Gly Pro Asn Val
275 280 285
Thr Tyr Thr Asp Leu Ala Asn Val Thr Val Leu Glu Pro Thr Pro Ser
290 295 300
Gln Pro Arg Leu Ser Thr Asp Leu Pro Ile Gln Asp Pro Asp Ala Gln
305 310 315 320
Gln Lys Ala Ala Ile Phe Gln Ala Thr Gln Arg Ala Trp Met Gln Leu
325 330 335
Gln Ala Arg Asn Ile Val Pro Asn Ser Leu Phe Leu Gln Gly Gly Gln
340 345 350
Asn Trp Asn Val Glu Gly Thr Val Ser Phe Pro Gln Glu Asn Asn Lys
355 360 365
Pro Val Leu Thr Leu Ser Asp Trp Asp Ala Thr Val Ser Gln Met Ile
370 375 380
Ser Leu Pro Val His Asn Gly Tyr Thr Glu Tyr Arg Val Arg Val Arg
385 390 395 400
Ala Lys Gly Lys Gly Thr Ile Ser Ile Val Pro Pro Gly Asp Ala Lys
405 410 415
Arg Met Leu Phe Phe Asn Ser Pro Ser Phe Phe Ala Ile Gln Glu Phe
420 425 430
Tyr Phe Tyr Pro Asp Thr Ser Met Ser Gln Val Asn Leu Ile Ile Gln
435 440 445
Ser Glu Gly Ser Glu Phe Thr Val Asp Trp Val Glu Met Gln Glu Met
450 455 460
Lys Phe Thr Glu Glu Pro Leu Gly Val Asn Gly Gly Asn Ala Ala His
465 470 475 480
Thr Val Gly Ile Thr Lys Pro Val Gly His Leu Glu Ala Ser Asn Pro
485 490 495
Pro Met Glu Ser Gln Ser Thr Asn Ser Ser Ser Asn Cys Lys Cys Ser
500 505 510
Gln Ser Ser Ser Gln Ser Asn Ser Gly Cys Arg Cys Ser Gln Ser
515 520 525
<210> 4
<211> 660
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. from an environmental sample
<400> 4
Met Asp Pro Asn Ala Lys Tyr Glu Ile Thr Asn Thr Lys Tyr Asn Leu
1 5 10 15
Ser His Thr Gly Asn Arg Tyr Ser Arg Phe Pro Leu Asp Asn Thr Gln
20 25 30
Tyr Lys Lys Leu Gln Asn Met Lys Tyr Glu Asn Met Ser Thr Ala Tyr
35 40 45
Gln Lys Asn Gly Asp Tyr Ala Lys Leu Pro Ile Ile Thr Ala Val Asp
50 55 60
Trp Ser Thr Ala Ser Ser Ala Leu Ile Leu Val Ala Gly Thr Leu Ile
65 70 75 80
Ser Ile Gly Ser Val Gly Val Gly Ala Ala Val Thr Thr Ile Gly Thr
85 90 95
Met Leu Pro Leu Phe Trp Ser Asp Leu Phe Gly Thr Pro Gly Glu Asn
100 105 110
Thr Trp Gln Asn Leu Ile Met Gln Gly Asn Ala Val Tyr Pro Val Gln
115 120 125
Asp Phe Thr Pro Tyr Phe Leu Gln Ser Leu Asn Asn Glu Ile Gly Arg
130 135 140
Ser Lys Ala Val Leu Glu Leu Phe Asp Asp Ile Phe Asn Tyr Trp Gln
145 150 155 160
Glu Tyr Asp Thr Pro Arg Ala Arg Glu Glu Val Ala Gln Gln Phe Arg
165 170 175
Asn Thr Asn Thr Thr Ile Val Gly Ser Leu Gly Val Ile Ser Asn Ala
180 185 190
Arg Glu Phe Asp Ile Ile Ser Leu Ala Thr Tyr Ala Ser Leu Ala Leu
195 200 205
Leu His Leu Thr Leu Leu Arg Gln Gly Val Lys Tyr Ala Asp Lys Trp
210 215 220
Gly Leu Ala Arg Glu Glu Gly Ala Tyr Tyr Lys Asn Leu Leu Lys Asp
225 230 235 240
Arg Ile Ala Thr Tyr Thr Asn Tyr Cys Gln Ser Arg Tyr Arg Ala Gly
245 250 255
Leu Asp Arg Leu Arg Thr Gly Thr Phe Tyr Gly Ala Trp Val Ser Phe
260 265 270
Asn Thr Tyr Arg Arg Glu Tyr Thr Ile Ser Val Leu Asn Ile Ile Ala
275 280 285
Met Phe Ser Thr Phe Asp Pro Asp Arg Tyr Pro Ser Asn Tyr Thr Thr
290 295 300
Asn Thr Gln Leu Thr Glu Lys Ile His Val Pro Thr Arg Asn Ser Ser
305 310 315 320
Glu Thr Phe Tyr Asn Gly Phe Pro Asp Leu Ser Asn Thr Glu Ala Lys
325 330 335
Leu Val Pro Thr Leu Ser Leu Phe Lys Ser Leu Ser Arg Leu Asp Phe
340 345 350
Tyr Ser Ser Tyr Thr Asn Gly Ile Tyr Phe Leu Asn Gly Ile Lys Asn
355 360 365
His Tyr Thr Tyr Ile Asp Gly Thr Asn Ala Gly Ser Leu Asn Ile Ile
370 375 380
Gly Asp Glu Arg His Ile Thr Tyr Leu Asp Gly Ser Ile Leu Ser Pro
385 390 395 400
Leu Val His Gln Cys Glu Tyr Asp Gln Ile Leu Thr Ala Val Ile Asn
405 410 415
Thr Pro Asn Tyr Gly Ile Ser Arg Leu Ala Phe Thr Ser Arg Asn Leu
420 425 430
Pro Pro Tyr Asp Val Tyr Ser Ala Asp Asn Gly Ile Gln Pro Val Arg
435 440 445
Thr Ala Lys Val Leu Pro Ile Gln Pro Ser Ser Leu Pro Pro Ser Ile
450 455 460
Pro Pro Ser Leu Arg Pro Tyr Met Val Pro Ile Asn Tyr Gln Leu Ser
465 470 475 480
Asp Ile Thr Ala Ile Asn Thr Gly Pro Phe Asn Pro Asp Ile Leu Tyr
485 490 495
Asn Phe Ser Trp Met Val Ala Ser Asp Asn Asn Val Ile Gln Gly Gly
500 505 510
Asn His Ile Thr Gln Ile Pro Ala Val Lys Gly Asn Ile Leu Gln Asn
515 520 525
Gly Ala Lys Val Arg Glu Gly Pro Gly His Thr Gly Gly Pro Ile Val
530 535 540
Glu Phe Gly Pro His Thr Pro Ala Asn Pro Pro Ala Leu Lys Ile Arg
545 550 555 560
Cys Gln Val Ser Pro Ala Ala Ala Asn Lys Tyr Leu Asn Met Arg Ile
565 570 575
Arg Tyr Ala Ser Asn Ser Thr Ile Ala Thr Asn Leu Asp Phe Ile Ile
580 585 590
Gly Gly Glu Val Arg Asn Ser Met Ala Phe Pro Gly Thr Asn Ala Asp
595 600 605
Ile Thr Lys Ile Asp Ile Pro Tyr Asn Asn Phe Asn Thr Val Glu Phe
610 615 620
Gln Ile Ile Asn Pro Lys Asn Val Gly Phe Met Asp Val Ser Ile Thr
625 630 635 640
Lys Ala Ser Asn Ile Asn Gly Thr Phe Leu Ile Asp Lys Ile Glu Phe
645 650 655
Tyr Val Arg Ser
660
<210> 5
<211> 659
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 5
Met Asp Pro Asn Ala Lys Tyr Glu Ile Thr Asn Thr Lys Tyr Asn Leu
1 5 10 15
Ser His Thr Gly Asn Arg Tyr Ser Arg Phe Pro Leu Asp Asn Thr Gln
20 25 30
Tyr Lys Lys Leu Gln Asn Met Lys Tyr Glu Asn Met Ser Thr Ala Tyr
35 40 45
Gln Lys Asn Gly Asp Tyr Ala Lys Leu Pro Ile Ile Thr Ala Val Asp
50 55 60
Trp Ser Thr Ala Ser Ser Ala Leu Ile Leu Val Ala Gly Thr Leu Ile
65 70 75 80
Ser Ile Gly Ser Val Gly Val Gly Ala Ala Val Thr Thr Ile Gly Thr
85 90 95
Met Leu Pro Leu Phe Trp Ser Asp Leu Phe Gly Thr Pro Gly Glu Asn
100 105 110
Thr Trp Gln Asn Leu Ile Met Gln Gly Asn Ala Val Tyr Pro Val Gln
115 120 125
Asp Phe Thr Pro Tyr Phe Leu Gln Ser Leu Asn Asn Glu Ile Gly Arg
130 135 140
Ser Lys Ala Val Leu Glu Leu Phe Asp Asp Ile Phe Asn Tyr Trp Gln
145 150 155 160
Glu Tyr Asp Thr Pro Arg Ala Arg Glu Glu Val Ala Gln Gln Phe Arg
165 170 175
Asn Thr Asn Thr Thr Ile Val Gly Ser Leu Gly Val Ile Ser Asn Ala
180 185 190
Arg Glu Phe Asp Ile Ile Ser Leu Ala Thr Tyr Ala Ser Leu Ala Leu
195 200 205
Leu His Leu Thr Leu Leu Arg Gln Gly Val Lys Tyr Ala Asp Lys Trp
210 215 220
Gly Leu Ala Arg Glu Glu Gly Ala Tyr Tyr Lys Asn Leu Leu Lys Asp
225 230 235 240
Arg Ile Ala Thr Tyr Thr Asn Tyr Cys Gln Ser Arg Tyr Arg Ala Gly
245 250 255
Leu Asp Arg Leu Arg Thr Gly Thr Phe Tyr Gly Ala Trp Val Ser Phe
260 265 270
Asn Thr Tyr Arg Arg Glu Tyr Thr Ile Ser Val Leu Asn Ile Ile Ala
275 280 285
Met Phe Ser Thr Phe Asp Pro Asp Arg Tyr Pro Ser Asn Tyr Thr Thr
290 295 300
Asn Thr Gln Leu Thr Glu Lys Ile His Val Pro Thr Arg Asn Ser Ser
305 310 315 320
Glu Thr Phe Tyr Asn Gly Phe Pro Asp Leu Ser Asn Thr Glu Ala Lys
325 330 335
Leu Val Pro Thr Leu Ser Leu Phe Lys Ser Leu Ser Arg Leu Asp Phe
340 345 350
Tyr Ser Ser Tyr Thr Asn Gly Ile Tyr Phe Leu Asn Gly Ile Lys Asn
355 360 365
His Tyr Thr Tyr Ile Asp Gly Thr Asn Ala Gly Ser Leu Asn Ile Ile
370 375 380
Gly Asp Glu Arg His Ile Thr Tyr Leu Asp Gly Ser Ile Leu Ser Pro
385 390 395 400
Leu Val His Gln Cys Glu Tyr Asp Gln Ile Leu Thr Ala Val Ile Asn
405 410 415
Thr Pro Asn Tyr Gly Ile Ser Arg Leu Ala Phe Thr Ser Arg Asn Leu
420 425 430
Pro Pro Tyr Asp Val Tyr Ser Ala Asp Asn Gly Ile Gln Pro Val Arg
435 440 445
Thr Ala Lys Val Leu Pro Ile Gln Pro Ser Ser Leu Pro Pro Ser Ile
450 455 460
Pro Pro Ser Leu Arg Pro Tyr Met Val Pro Ile Asn Tyr Gln Leu Ser
465 470 475 480
Asp Ile Thr Ala Ile Asn Thr Gly Pro Phe Asn Pro Asp Ile Leu Tyr
485 490 495
Asn Phe Ser Trp Met Val Ala Ser Asp Asn Asn Val Ile Gln Gly Gly
500 505 510
Asn His Ile Thr Gln Ile Pro Ala Val Lys Gly Asn Ile Leu Gln Asn
515 520 525
Gly Ala Lys Val Arg Glu Gly Pro Gly His Thr Gly Gly Pro Ile Val
530 535 540
Glu Phe Gly Pro His Thr Pro Ala Asn Pro Pro Ala Leu Lys Ile Arg
545 550 555 560
Cys Gln Val Ser Pro Ala Ala Ala Asn Lys Tyr Leu Asn Met Arg Ile
565 570 575
Arg Tyr Ala Ser Asn Ser Thr Ile Ala Thr Asn Leu Asp Phe Ile Ile
580 585 590
Gly Gly Glu Val Arg Asn Ser Met Ala Phe Pro Gly Thr Asn Ala Asp
595 600 605
Ile Thr Lys Ile Asp Ile Pro Tyr Asn Asn Phe Asn Thr Val Glu Phe
610 615 620
Gln Ile Ile Asn Pro Lys Asn Val Gly Phe Met Asp Val Ser Ile Thr
625 630 635 640
Lys Ala Ser Asn Ile Asn Gly Thr Phe Leu Ile Asp Lys Ile Glu Phe
645 650 655
Tyr Val Arg
<210> 6
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Unknown organism from environmental sample
<400> 6
Leu Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Pro
145 150 155 160
Ala Pro Asn Asp Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Ala Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Lys Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 7
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 7
Met Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Pro
145 150 155 160
Ala Pro Asn Asp Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Ala Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Lys Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 8
<211> 319
<212> PRT
<213> Unknown
<220>
<223> Lysinibacillus sp. obtained from an environmental sample
<400> 8
Met Lys Asn Lys Ile Ile Leu Tyr Gly Val Val Thr Ser Thr Leu Leu
1 5 10 15
Ala Gly Gly Pro Phe Ser Asn Asn Ala Ser Ala Ser Glu Asn Gln Gly
20 25 30
Leu Thr Gly Asn Asn Ile Gly Val Val Ser Ser Asn Ile Lys Ala Ala
35 40 45
Glu Gln Ser Gln Asn Lys Asn Ser Ala Tyr Gln Asp Ile Asp Glu Arg
50 55 60
Val Lys Lys Met Leu Ile Ser Ala Ala Phe Asp Gly Thr Glu Tyr Arg
65 70 75 80
Tyr His His Ile Lys Asp Thr Glu Leu Asn Gly Asn Ile Ile Glu Gly
85 90 95
Ser Met Ile Glu Asn Ala Gln Ile Leu Thr Ile Gly Glu Asn Ile Leu
100 105 110
Glu Asn Asn Leu Gly His Glu Val Thr Leu Phe Thr Ser Gly Tyr Glu
115 120 125
His Glu Phe Lys Glu Met Thr Ser Thr Thr Asn Thr Ser Gly Trp Thr
130 135 140
Phe Gly Tyr Asn Tyr Asn Ala Thr Leu Ser Val Met Met Val Ser Ala
145 150 155 160
Ser Gln Ser Phe Ser Val Asp Tyr Asn Met Ser Thr Ser Asn Thr Thr
165 170 175
Glu Lys Ser Gln Val Arg Lys Phe Thr Val Pro Pro Gln Gly Phe Pro
180 185 190
Val Pro Ala Gly Lys Lys Tyr Lys Val Glu Tyr Val Phe Glu Lys Ile
195 200 205
Thr Ile Ser Gly Arg Asn Arg Leu Asn Ala Asp Leu Tyr Gly Asp Ala
210 215 220
Thr Tyr Tyr Tyr Asn Asn Leu Pro Met Ser Pro Gln Leu Leu Tyr Ser
225 230 235 240
Ala Met Asn Phe Ala Ser Asp Thr Gln Gly Phe Glu Arg Val Ile Arg
245 250 255
Asp Asn Ala Glu Gly Asn Asp Arg Phe Gly Ile Arg Ala Thr Gly Glu
260 265 270
Gly Arg Phe Ser Thr Glu Phe Gly Thr Arg Leu Tyr Ala Lys Ile Thr
275 280 285
Asp Ile Thr Asp Ser Lys Lys Pro Val Thr Ile Glu Thr Lys Lys Ile
290 295 300
Pro Val Asn Phe Glu Thr Leu Ser Val Ala Thr Lys Ile Ile Glu
305 310 315
<210> 9
<211> 293
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 9
Met Ser Glu Asn Gln Gly Leu Thr Gly Asn Asn Ile Gly Val Val Ser
1 5 10 15
Ser Asn Ile Lys Ala Ala Glu Gln Ser Gln Asn Lys Asn Ser Ala Tyr
20 25 30
Gln Asp Ile Asp Glu Arg Val Lys Lys Met Leu Ile Ser Ala Ala Phe
35 40 45
Asp Gly Thr Glu Tyr Arg Tyr His His Ile Lys Asp Thr Glu Leu Asn
50 55 60
Gly Asn Ile Ile Glu Gly Ser Met Ile Glu Asn Ala Gln Ile Leu Thr
65 70 75 80
Ile Gly Glu Asn Ile Leu Glu Asn Asn Leu Gly His Glu Val Thr Leu
85 90 95
Phe Thr Ser Gly Tyr Glu His Glu Phe Lys Glu Met Thr Ser Thr Thr
100 105 110
Asn Thr Ser Gly Trp Thr Phe Gly Tyr Asn Tyr Asn Ala Thr Leu Ser
115 120 125
Val Met Met Val Ser Ala Ser Gln Ser Phe Ser Val Asp Tyr Asn Met
130 135 140
Ser Thr Ser Asn Thr Thr Glu Lys Ser Gln Val Arg Lys Phe Thr Val
145 150 155 160
Pro Pro Gln Gly Phe Pro Val Pro Ala Gly Lys Lys Tyr Lys Val Glu
165 170 175
Tyr Val Phe Glu Lys Ile Thr Ile Ser Gly Arg Asn Arg Leu Asn Ala
180 185 190
Asp Leu Tyr Gly Asp Ala Thr Tyr Tyr Tyr Asn Asn Leu Pro Met Ser
195 200 205
Pro Gln Leu Leu Tyr Ser Ala Met Asn Phe Ala Ser Asp Thr Gln Gly
210 215 220
Phe Glu Arg Val Ile Arg Asp Asn Ala Glu Gly Asn Asp Arg Phe Gly
225 230 235 240
Ile Arg Ala Thr Gly Glu Gly Arg Phe Ser Thr Glu Phe Gly Thr Arg
245 250 255
Leu Tyr Ala Lys Ile Thr Asp Ile Thr Asp Ser Lys Lys Pro Val Thr
260 265 270
Ile Glu Thr Lys Lys Ile Pro Val Asn Phe Glu Thr Leu Ser Val Ala
275 280 285
Thr Lys Ile Ile Glu
290
<210> 10
<211> 803
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. from an environmental sample
<400> 10
Met Asp Lys Lys Gln Lys Lys Ile Leu Ser Met Thr Leu Ala Leu Gly
1 5 10 15
Val Leu Thr Gly Thr Tyr Ile Pro Ile Thr Ser Thr Ala Leu Ala Glu
20 25 30
Thr Glu Lys Glu Ser Ile Lys Lys Ser Ser Ala Lys Thr Ser Pro Gln
35 40 45
Met Asp Ser Arg Ala Trp Ile Asn Asn Pro Tyr Gln Gly Val Ser Met
50 55 60
Gly Gln Phe Leu Asn Ala Phe Asn Gly Asn Gln Trp Lys Pro Leu Leu
65 70 75 80
Glu Gln Ile His Asn Lys Gly Asp Ala Gly Ser Gly Thr Ile Ala Phe
85 90 95
Leu Lys Gly Met Met Thr Thr Gly Leu Ser Leu Leu Pro Pro Pro Gly
100 105 110
Ser Leu Leu Ala Ser Val Trp Asp Ile Phe Ile Pro Gly Thr Gly Val
115 120 125
Lys Asp Asp Met Trp Leu Gln Ile Glu Lys Tyr Val Asp Glu Lys Ile
130 135 140
Asp Ser Lys Ile Asn Asp Tyr His Lys Tyr Leu Met Gly Ala Gln Phe
145 150 155 160
Asn Gly Ala Met Glu Lys Val Lys Glu Tyr Gln Arg Val Leu Gln Ile
165 170 175
Tyr Asn Asn Ser Lys Val Ser Val Lys Val Glu Glu Pro Glu Ala Pro
180 185 190
Val Arg Asp Ala Val Arg Lys Ala Asp Ser Glu Leu Ala Ser Tyr Ile
195 200 205
Ala Val Ile Gln Thr Pro Glu Lys Glu Thr Asp Ser Val Tyr Gln Gln
210 215 220
Ile Thr Ala Pro Ile Phe Val Gln Ala Ala Asn Leu His Leu Leu Val
225 230 235 240
Leu Arg Asp Ile Ile Gln Phe Gly Glu Glu Trp Gly Ile Asp Lys Asp
245 250 255
Gln Leu Lys Gly Tyr Lys Asp Lys Gln Glu Lys Leu Ile Glu Asn Tyr
260 265 270
Thr Asp Tyr Ser Ile Lys Val Tyr Asn Asp Gly Leu Glu Lys Arg Ile
275 280 285
Asn Glu Ala Asp Lys Ile Phe Gly Pro Glu Asp Gly Tyr Gly Asn Pro
290 295 300
Ala Lys Pro Gly Pro Glu Glu Ala Tyr Thr Asn Thr His Arg Trp Asn
305 310 315 320
Tyr Ile Asn Glu Tyr Ile Arg Glu Tyr Thr Leu Ser Val Leu Asp Phe
325 330 335
Val Ala Leu Phe Pro Ser Thr Asn Pro Asn Ile Tyr Ser Arg Gly Ala
340 345 350
Met Gln Lys Asn Ser Arg Gln Ile Tyr Ser Ser Ile Ser Gly Thr Val
355 360 365
Thr Pro Asn Thr Gln Thr Trp Lys Asp Ile Asn Asn Arg Leu Ala Ser
370 375 380
Gln Glu Tyr Lys Gly Glu Leu Val Ala Ala Glu Val Gln Ser Tyr Ala
385 390 395 400
Arg Ile Asp Ala Leu Ala Pro Trp Phe Asp Arg Asn Ser Gly Thr Thr
405 410 415
Tyr Asn Thr Gly Trp Val Gly Asn Thr Ser Gly Gly Ile Leu Arg Lys
420 425 430
Ile Thr Ile Asp Tyr Ser Asn Pro Leu Thr Lys Val Asn Ile Gly Asp
435 440 445
Gln Arg Thr Pro Phe Tyr Phe Gln Leu Thr His Glu Asn Gly Ser Ala
450 455 460
Ile Pro Gln Phe Gly Gln Asp Asp Pro Arg Leu Lys Lys Arg Thr Phe
465 470 475 480
Glu Phe Pro Asn Gln Lys Ile Ser Asp Ile Gln Ala Phe Gly Lys Ser
485 490 495
Thr Ala Pro Gly Leu Asp Gly Ile Asp Ala Val Val Phe Gly Phe Thr
500 505 510
Asp Arg Asn Leu Asp Ser Lys His Asp Leu Met Arg Asn Met Ile Ser
515 520 525
Ser Ile Pro Ala Glu Met Tyr Gln Gly Lys Ser Asn Phe Lys Pro Gln
530 535 540
Met Glu Pro Ile Asn Ala Gln Gln Lys Ala Met Lys Thr Asp Thr Thr
545 550 555 560
Asn Ser Tyr Leu Thr Tyr Lys Val Asn Ser Tyr Lys Ala Gln Glu Tyr
565 570 575
Lys Ile Arg Tyr Arg Val Ala Ala Asn Glu Asn Ser Lys Ile Ser Ile
580 585 590
Ser Gln Lys Ser Gly Thr Asn Phe Glu Lys Ile Gly Asp Thr Asn Ile
595 600 605
Ser Lys Thr Val Asn Ala Glu Asp Thr Val Lys Gly Glu Tyr Gly Tyr
610 615 620
Tyr Lys Ile Ile Glu Gly Pro Thr Val Lys Leu Asn Tyr Gly Gln Asn
625 630 635 640
Glu Leu Lys Leu Glu Asn Thr Gln Gly Lys Phe Ala Ile Asp Arg Ile
645 650 655
Glu Leu Glu Pro Ile Gln Lys Asp Glu Ile Val Phe Glu Asp Asn Phe
660 665 670
Asp Gly Thr Asn Lys Gly Trp Ile Phe Tyr Ser Gly Gly Gly Ile Val
675 680 685
Gly Gly Gly Val Thr Gly Arg Ala Gly Met Ile Pro Ser Gly Thr Asn
690 695 700
Ser Ile Val Tyr Pro Ser Gly Ile Asp Tyr Tyr Ser Lys Tyr Thr Leu
705 710 715 720
Asn Met Lys Val Lys Leu Gln Ser Thr Asp Pro Asn Ile Lys Gln Lys
725 730 735
Val Lys Ile Tyr His Thr Thr Thr Lys Gly Gln Met Val Ser Lys Glu
740 745 750
Val Glu Leu Lys Gly Gly Glu Gly Tyr Lys Glu Leu Lys Leu Glu Phe
755 760 765
Ile Thr Gly Tyr Thr Pro Ser Glu Ser Asn Gln Ile Gly Ile Gln Ser
770 775 780
Ala Val Gly Gly Ser Asn Val Leu Phe Asp Asp Val Lys Leu Thr Gly
785 790 795 800
Ala Lys Arg
<210> 11
<211> 773
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 11
Met Glu Thr Glu Lys Glu Ser Ile Lys Lys Ser Ser Ala Lys Thr Ser
1 5 10 15
Pro Gln Met Asp Ser Arg Ala Trp Ile Asn Asn Pro Tyr Gln Gly Val
20 25 30
Ser Met Gly Gln Phe Leu Asn Ala Phe Asn Gly Asn Gln Trp Lys Pro
35 40 45
Leu Leu Glu Gln Ile His Asn Lys Gly Asp Ala Gly Ser Gly Thr Ile
50 55 60
Ala Phe Leu Lys Gly Met Met Thr Thr Gly Leu Ser Leu Leu Pro Pro
65 70 75 80
Pro Gly Ser Leu Leu Ala Ser Val Trp Asp Ile Phe Ile Pro Gly Thr
85 90 95
Gly Val Lys Asp Asp Met Trp Leu Gln Ile Glu Lys Tyr Val Asp Glu
100 105 110
Lys Ile Asp Ser Lys Ile Asn Asp Tyr His Lys Tyr Leu Met Gly Ala
115 120 125
Gln Phe Asn Gly Ala Met Glu Lys Val Lys Glu Tyr Gln Arg Val Leu
130 135 140
Gln Ile Tyr Asn Asn Ser Lys Val Ser Val Lys Val Glu Glu Pro Glu
145 150 155 160
Ala Pro Val Arg Asp Ala Val Arg Lys Ala Asp Ser Glu Leu Ala Ser
165 170 175
Tyr Ile Ala Val Ile Gln Thr Pro Glu Lys Glu Thr Asp Ser Val Tyr
180 185 190
Gln Gln Ile Thr Ala Pro Ile Phe Val Gln Ala Ala Asn Leu His Leu
195 200 205
Leu Val Leu Arg Asp Ile Ile Gln Phe Gly Glu Glu Trp Gly Ile Asp
210 215 220
Lys Asp Gln Leu Lys Gly Tyr Lys Asp Lys Gln Glu Lys Leu Ile Glu
225 230 235 240
Asn Tyr Thr Asp Tyr Ser Ile Lys Val Tyr Asn Asp Gly Leu Glu Lys
245 250 255
Arg Ile Asn Glu Ala Asp Lys Ile Phe Gly Pro Glu Asp Gly Tyr Gly
260 265 270
Asn Pro Ala Lys Pro Gly Pro Glu Glu Ala Tyr Thr Asn Thr His Arg
275 280 285
Trp Asn Tyr Ile Asn Glu Tyr Ile Arg Glu Tyr Thr Leu Ser Val Leu
290 295 300
Asp Phe Val Ala Leu Phe Pro Ser Thr Asn Pro Asn Ile Tyr Ser Arg
305 310 315 320
Gly Ala Met Gln Lys Asn Ser Arg Gln Ile Tyr Ser Ser Ile Ser Gly
325 330 335
Thr Val Thr Pro Asn Thr Gln Thr Trp Lys Asp Ile Asn Asn Arg Leu
340 345 350
Ala Ser Gln Glu Tyr Lys Gly Glu Leu Val Ala Ala Glu Val Gln Ser
355 360 365
Tyr Ala Arg Ile Asp Ala Leu Ala Pro Trp Phe Asp Arg Asn Ser Gly
370 375 380
Thr Thr Tyr Asn Thr Gly Trp Val Gly Asn Thr Ser Gly Gly Ile Leu
385 390 395 400
Arg Lys Ile Thr Ile Asp Tyr Ser Asn Pro Leu Thr Lys Val Asn Ile
405 410 415
Gly Asp Gln Arg Thr Pro Phe Tyr Phe Gln Leu Thr His Glu Asn Gly
420 425 430
Ser Ala Ile Pro Gln Phe Gly Gln Asp Asp Pro Arg Leu Lys Lys Arg
435 440 445
Thr Phe Glu Phe Pro Asn Gln Lys Ile Ser Asp Ile Gln Ala Phe Gly
450 455 460
Lys Ser Thr Ala Pro Gly Leu Asp Gly Ile Asp Ala Val Val Phe Gly
465 470 475 480
Phe Thr Asp Arg Asn Leu Asp Ser Lys His Asp Leu Met Arg Asn Met
485 490 495
Ile Ser Ser Ile Pro Ala Glu Met Tyr Gln Gly Lys Ser Asn Phe Lys
500 505 510
Pro Gln Met Glu Pro Ile Asn Ala Gln Gln Lys Ala Met Lys Thr Asp
515 520 525
Thr Thr Asn Ser Tyr Leu Thr Tyr Lys Val Asn Ser Tyr Lys Ala Gln
530 535 540
Glu Tyr Lys Ile Arg Tyr Arg Val Ala Ala Asn Glu Asn Ser Lys Ile
545 550 555 560
Ser Ile Ser Gln Lys Ser Gly Thr Asn Phe Glu Lys Ile Gly Asp Thr
565 570 575
Asn Ile Ser Lys Thr Val Asn Ala Glu Asp Thr Val Lys Gly Glu Tyr
580 585 590
Gly Tyr Tyr Lys Ile Ile Glu Gly Pro Thr Val Lys Leu Asn Tyr Gly
595 600 605
Gln Asn Glu Leu Lys Leu Glu Asn Thr Gln Gly Lys Phe Ala Ile Asp
610 615 620
Arg Ile Glu Leu Glu Pro Ile Gln Lys Asp Glu Ile Val Phe Glu Asp
625 630 635 640
Asn Phe Asp Gly Thr Asn Lys Gly Trp Ile Phe Tyr Ser Gly Gly Gly
645 650 655
Ile Val Gly Gly Gly Val Thr Gly Arg Ala Gly Met Ile Pro Ser Gly
660 665 670
Thr Asn Ser Ile Val Tyr Pro Ser Gly Ile Asp Tyr Tyr Ser Lys Tyr
675 680 685
Thr Leu Asn Met Lys Val Lys Leu Gln Ser Thr Asp Pro Asn Ile Lys
690 695 700
Gln Lys Val Lys Ile Tyr His Thr Thr Thr Lys Gly Gln Met Val Ser
705 710 715 720
Lys Glu Val Glu Leu Lys Gly Gly Glu Gly Tyr Lys Glu Leu Lys Leu
725 730 735
Glu Phe Ile Thr Gly Tyr Thr Pro Ser Glu Ser Asn Gln Ile Gly Ile
740 745 750
Gln Ser Ala Val Gly Gly Ser Asn Val Leu Phe Asp Asp Val Lys Leu
755 760 765
Thr Gly Ala Lys Arg
770
<210> 12
<211> 661
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 12
Met Asp Lys Lys Gln Lys Lys Ile Leu Ser Met Thr Leu Ala Leu Gly
1 5 10 15
Val Leu Thr Gly Thr Tyr Ile Pro Ile Thr Ser Thr Ala Leu Ala Glu
20 25 30
Thr Glu Lys Glu Ser Ile Lys Lys Ser Ser Ala Lys Thr Ser Pro Gln
35 40 45
Met Asp Ser Arg Ala Trp Ile Asn Asn Pro Tyr Gln Gly Val Ser Met
50 55 60
Gly Gln Phe Leu Asn Ala Phe Asn Gly Asn Gln Trp Lys Pro Leu Leu
65 70 75 80
Glu Gln Ile His Asn Lys Gly Asp Ala Gly Ser Gly Thr Ile Ala Phe
85 90 95
Leu Lys Gly Met Met Thr Thr Gly Leu Ser Leu Leu Pro Pro Pro Gly
100 105 110
Ser Leu Leu Ala Ser Val Trp Asp Ile Phe Ile Pro Gly Thr Gly Val
115 120 125
Lys Asp Asp Met Trp Leu Gln Ile Glu Lys Tyr Val Asp Glu Lys Ile
130 135 140
Asp Ser Lys Ile Asn Asp Tyr His Lys Tyr Leu Met Gly Ala Gln Phe
145 150 155 160
Asn Gly Ala Met Glu Lys Val Lys Glu Tyr Gln Arg Val Leu Gln Ile
165 170 175
Tyr Asn Asn Ser Lys Val Ser Val Lys Val Glu Glu Pro Glu Ala Pro
180 185 190
Val Arg Asp Ala Val Arg Lys Ala Asp Ser Glu Leu Ala Ser Tyr Ile
195 200 205
Ala Val Ile Gln Thr Pro Glu Lys Glu Thr Asp Ser Val Tyr Gln Gln
210 215 220
Ile Thr Ala Pro Ile Phe Val Gln Ala Ala Asn Leu His Leu Leu Val
225 230 235 240
Leu Arg Asp Ile Ile Gln Phe Gly Glu Glu Trp Gly Ile Asp Lys Asp
245 250 255
Gln Leu Lys Gly Tyr Lys Asp Lys Gln Glu Lys Leu Ile Glu Asn Tyr
260 265 270
Thr Asp Tyr Ser Ile Lys Val Tyr Asn Asp Gly Leu Glu Lys Arg Ile
275 280 285
Asn Glu Ala Asp Lys Ile Phe Gly Pro Glu Asp Gly Tyr Gly Asn Pro
290 295 300
Ala Lys Pro Gly Pro Glu Glu Ala Tyr Thr Asn Thr His Arg Trp Asn
305 310 315 320
Tyr Ile Asn Glu Tyr Ile Arg Glu Tyr Thr Leu Ser Val Leu Asp Phe
325 330 335
Val Ala Leu Phe Pro Ser Thr Asn Pro Asn Ile Tyr Ser Arg Gly Ala
340 345 350
Met Gln Lys Asn Ser Arg Gln Ile Tyr Ser Ser Ile Ser Gly Thr Val
355 360 365
Thr Pro Asn Thr Gln Thr Trp Lys Asp Ile Asn Asn Arg Leu Ala Ser
370 375 380
Gln Glu Tyr Lys Gly Glu Leu Val Ala Ala Glu Val Gln Ser Tyr Ala
385 390 395 400
Arg Ile Asp Ala Leu Ala Pro Trp Phe Asp Arg Asn Ser Gly Thr Thr
405 410 415
Tyr Asn Thr Gly Trp Val Gly Asn Thr Ser Gly Gly Ile Leu Arg Lys
420 425 430
Ile Thr Ile Asp Tyr Ser Asn Pro Leu Thr Lys Val Asn Ile Gly Asp
435 440 445
Gln Arg Thr Pro Phe Tyr Phe Gln Leu Thr His Glu Asn Gly Ser Ala
450 455 460
Ile Pro Gln Phe Gly Gln Asp Asp Pro Arg Leu Lys Lys Arg Thr Phe
465 470 475 480
Glu Phe Pro Asn Gln Lys Ile Ser Asp Ile Gln Ala Phe Gly Lys Ser
485 490 495
Thr Ala Pro Gly Leu Asp Gly Ile Asp Ala Val Val Phe Gly Phe Thr
500 505 510
Asp Arg Asn Leu Asp Ser Lys His Asp Leu Met Arg Asn Met Ile Ser
515 520 525
Ser Ile Pro Ala Glu Met Tyr Gln Gly Lys Ser Asn Phe Lys Pro Gln
530 535 540
Met Glu Pro Ile Asn Ala Gln Gln Lys Ala Met Lys Thr Asp Thr Thr
545 550 555 560
Asn Ser Tyr Leu Thr Tyr Lys Val Asn Ser Tyr Lys Ala Gln Glu Tyr
565 570 575
Lys Ile Arg Tyr Arg Val Ala Ala Asn Glu Asn Ser Lys Ile Ser Ile
580 585 590
Ser Gln Lys Ser Gly Thr Asn Phe Glu Lys Ile Gly Asp Thr Asn Ile
595 600 605
Ser Lys Thr Val Asn Ala Glu Asp Thr Val Lys Gly Glu Tyr Gly Tyr
610 615 620
Tyr Lys Ile Ile Glu Gly Pro Thr Val Lys Leu Asn Tyr Gly Gln Asn
625 630 635 640
Glu Leu Lys Leu Glu Asn Thr Gln Gly Lys Phe Ala Ile Asp Arg Ile
645 650 655
Glu Leu Glu Pro Ile
660
<210> 13
<211> 631
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 13
Met Glu Thr Glu Lys Glu Ser Ile Lys Lys Ser Ser Ala Lys Thr Ser
1 5 10 15
Pro Gln Met Asp Ser Arg Ala Trp Ile Asn Asn Pro Tyr Gln Gly Val
20 25 30
Ser Met Gly Gln Phe Leu Asn Ala Phe Asn Gly Asn Gln Trp Lys Pro
35 40 45
Leu Leu Glu Gln Ile His Asn Lys Gly Asp Ala Gly Ser Gly Thr Ile
50 55 60
Ala Phe Leu Lys Gly Met Met Thr Thr Gly Leu Ser Leu Leu Pro Pro
65 70 75 80
Pro Gly Ser Leu Leu Ala Ser Val Trp Asp Ile Phe Ile Pro Gly Thr
85 90 95
Gly Val Lys Asp Asp Met Trp Leu Gln Ile Glu Lys Tyr Val Asp Glu
100 105 110
Lys Ile Asp Ser Lys Ile Asn Asp Tyr His Lys Tyr Leu Met Gly Ala
115 120 125
Gln Phe Asn Gly Ala Met Glu Lys Val Lys Glu Tyr Gln Arg Val Leu
130 135 140
Gln Ile Tyr Asn Asn Ser Lys Val Ser Val Lys Val Glu Glu Pro Glu
145 150 155 160
Ala Pro Val Arg Asp Ala Val Arg Lys Ala Asp Ser Glu Leu Ala Ser
165 170 175
Tyr Ile Ala Val Ile Gln Thr Pro Glu Lys Glu Thr Asp Ser Val Tyr
180 185 190
Gln Gln Ile Thr Ala Pro Ile Phe Val Gln Ala Ala Asn Leu His Leu
195 200 205
Leu Val Leu Arg Asp Ile Ile Gln Phe Gly Glu Glu Trp Gly Ile Asp
210 215 220
Lys Asp Gln Leu Lys Gly Tyr Lys Asp Lys Gln Glu Lys Leu Ile Glu
225 230 235 240
Asn Tyr Thr Asp Tyr Ser Ile Lys Val Tyr Asn Asp Gly Leu Glu Lys
245 250 255
Arg Ile Asn Glu Ala Asp Lys Ile Phe Gly Pro Glu Asp Gly Tyr Gly
260 265 270
Asn Pro Ala Lys Pro Gly Pro Glu Glu Ala Tyr Thr Asn Thr His Arg
275 280 285
Trp Asn Tyr Ile Asn Glu Tyr Ile Arg Glu Tyr Thr Leu Ser Val Leu
290 295 300
Asp Phe Val Ala Leu Phe Pro Ser Thr Asn Pro Asn Ile Tyr Ser Arg
305 310 315 320
Gly Ala Met Gln Lys Asn Ser Arg Gln Ile Tyr Ser Ser Ile Ser Gly
325 330 335
Thr Val Thr Pro Asn Thr Gln Thr Trp Lys Asp Ile Asn Asn Arg Leu
340 345 350
Ala Ser Gln Glu Tyr Lys Gly Glu Leu Val Ala Ala Glu Val Gln Ser
355 360 365
Tyr Ala Arg Ile Asp Ala Leu Ala Pro Trp Phe Asp Arg Asn Ser Gly
370 375 380
Thr Thr Tyr Asn Thr Gly Trp Val Gly Asn Thr Ser Gly Gly Ile Leu
385 390 395 400
Arg Lys Ile Thr Ile Asp Tyr Ser Asn Pro Leu Thr Lys Val Asn Ile
405 410 415
Gly Asp Gln Arg Thr Pro Phe Tyr Phe Gln Leu Thr His Glu Asn Gly
420 425 430
Ser Ala Ile Pro Gln Phe Gly Gln Asp Asp Pro Arg Leu Lys Lys Arg
435 440 445
Thr Phe Glu Phe Pro Asn Gln Lys Ile Ser Asp Ile Gln Ala Phe Gly
450 455 460
Lys Ser Thr Ala Pro Gly Leu Asp Gly Ile Asp Ala Val Val Phe Gly
465 470 475 480
Phe Thr Asp Arg Asn Leu Asp Ser Lys His Asp Leu Met Arg Asn Met
485 490 495
Ile Ser Ser Ile Pro Ala Glu Met Tyr Gln Gly Lys Ser Asn Phe Lys
500 505 510
Pro Gln Met Glu Pro Ile Asn Ala Gln Gln Lys Ala Met Lys Thr Asp
515 520 525
Thr Thr Asn Ser Tyr Leu Thr Tyr Lys Val Asn Ser Tyr Lys Ala Gln
530 535 540
Glu Tyr Lys Ile Arg Tyr Arg Val Ala Ala Asn Glu Asn Ser Lys Ile
545 550 555 560
Ser Ile Ser Gln Lys Ser Gly Thr Asn Phe Glu Lys Ile Gly Asp Thr
565 570 575
Asn Ile Ser Lys Thr Val Asn Ala Glu Asp Thr Val Lys Gly Glu Tyr
580 585 590
Gly Tyr Tyr Lys Ile Ile Glu Gly Pro Thr Val Lys Leu Asn Tyr Gly
595 600 605
Gln Asn Glu Leu Lys Leu Glu Asn Thr Gln Gly Lys Phe Ala Ile Asp
610 615 620
Arg Ile Glu Leu Glu Pro Ile
625 630
<210> 14
<211> 831
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 14
Met Ala Gln Leu Arg Glu Leu Gln Thr Gln Ser Ile Ile Pro Tyr Asn
1 5 10 15
Ile Leu Ala Thr Pro Pro Ile Gly Asn Ala Asp Thr Gln Pro Ala Ser
20 25 30
Leu Asp Asp Leu Ile Glu Ser Leu Lys Glu Ala Trp Ile Glu Phe Gln
35 40 45
Lys Thr Gly Ser Asp Ala Leu Leu Gln Ser Val Leu Gln Asn Gly Phe
50 55 60
Ser Ala Ala Gly Gly Gly Ser Phe Asn Tyr Leu Ala Val Leu Gln Ala
65 70 75 80
Gly Ile Gly Ile Leu Gly Thr Met Leu Gly Ala Glu Ile Pro Gly Val
85 90 95
Ala Val Ala Ala Pro Ile Leu Ser Met Val Ile Gly Trp Leu Trp Pro
100 105 110
His Lys Thr Gln Asp Asp Thr Gln Gln Leu Ile Asn Leu Ile Asp Ala
115 120 125
Glu Ile Gln Lys Gln Leu Asn Lys Ala Leu Ser Glu Gln Asp Lys Asn
130 135 140
Asn Trp Thr Gly Tyr Leu Asn Ala Ile Phe Gln Ile Ser Asn Asp Ala
145 150 155 160
Ser Arg Ser Val Val Asp Ala Gln Phe Thr Gly Asn Glu Gly Asp Ala
165 170 175
Thr Arg Gln Pro Arg Thr Pro Gly Val Ser Asp Tyr Glu Asn Val Tyr
180 185 190
Asn His Leu Leu Ala Thr Gln Gly Ser Leu Ile Val Ala Leu Ser Gln
195 200 205
Met Ile Asn Gly Asn Phe Asp Thr Leu Ala Ile Pro Phe Phe Val Ile
210 215 220
Gly Ala Thr Val Gln Leu Ser Thr Tyr Gln Ser Phe Ile Gln Phe Ala
225 230 235 240
Asn Lys Trp Leu Pro Ile Val Tyr Pro Asp Tyr Gln Thr Gln Gly Thr
245 250 255
Ala Gly Tyr Thr Gln Gln Gln Asn Leu Ile Asn Ala Lys Ser Asp Met
260 265 270
Arg Ala Ala Ile Gln Asn His Thr Gln Lys Val Phe Asn Ala Phe Lys
275 280 285
Ala Gly Leu Pro Ser Leu Ser Ser Asp Lys Asn Ser Ile Ser Val Tyr
290 295 300
Asn Gln Tyr Val Arg Gly Leu Val Leu Asn Gly Leu Asp Thr Val Ala
305 310 315 320
Thr Trp Pro Ser Met Tyr Pro Asp Asp Tyr Arg Thr Gln Thr Glu Leu
325 330 335
Glu Gln Thr Arg Val Ile Phe Ser Asp Leu Ile Gly Lys Asp Gln Thr
340 345 350
Ile Ser Asn Asp Val Thr Leu Ile Asn Met Ala Asp Gly Lys Asn Gly
355 360 365
Ser Gly Ala Phe Asn Gln His Ala Thr Ile Asp Leu Asn Ser Ile Phe
370 375 380
Tyr Tyr His Glu Glu Leu Ser Ser Ile Gln Leu Gly Ile His Asp Gly
385 390 395 400
Asn Arg Gly Asn Thr Thr Asn Cys Tyr Asp Tyr Gly Ile Ile Thr Asn
405 410 415
Tyr Pro Asn Tyr Ser Tyr Phe Tyr Gly Asp Leu Tyr Ser Pro Ser Gly
420 425 430
Asp Ala Thr Phe Ser Ala Pro Leu Asn Val Leu Asn Ala Gln Thr Gln
435 440 445
Tyr Ser Ser Gly Tyr Val Asp Ser Glu Phe Ile Tyr Asn Ser Asp Pro
450 455 460
Ser Ala Tyr Gly Ala Gly Cys His Ile Leu Gly Gly Cys Thr Lys Gly
465 470 475 480
Asp Cys Tyr Asn Gln Ser Ser Cys Ser Glu Gly Phe Gly Tyr Ser Cys
485 490 495
Asn Pro Ser Leu Pro Ser Gln Lys Ile Asn Ala Phe Tyr Pro Phe Lys
500 505 510
Met Asn Leu Ser Arg Ser Gly Thr Ser Asp Lys Leu Gly Leu Met Tyr
515 520 525
Ser Leu Val Pro Leu Asp Leu Asn Pro Asn Asn Thr Phe Gly Glu Leu
530 535 540
Asp Ser Asn Thr Gly Asn Val Ile Gly Lys Gly Phe Pro Ala Glu Lys
545 550 555 560
Gly Thr Leu Ala Ser Gly Asn Val Pro Thr Val Val Arg Glu Trp Val
565 570 575
Asn Gly Ala Asn Ala Val Lys Leu Ser Asp Asp Ala Leu Thr Leu Lys
580 585 590
Met Thr Asn Met Thr Gly Gly Lys Tyr Tyr Ile Arg Val Arg Tyr Ala
595 600 605
Asn Thr Ser Asn Ala Glu Ile Ser Ile Asn Gln Ser Val Ser Ala Gly
610 615 620
Asn Gln Asn Ile Gln Asn Gly Pro Trp Gly Leu Pro Ser Thr Ile Thr
625 630 635 640
Ser Val Val Ser Asp Ala Ser Phe Lys Lys Leu Tyr Ile Ala Gly Gln
645 650 655
Thr Gly Asn Tyr Val Leu Glu Asp Ile Thr Trp Gly Val Asp Gly Gln
660 665 670
Gly Lys Pro Ile Thr Ile Pro Ser Gly Asp Val Thr Val Thr Leu Ser
675 680 685
Arg Thr Asn Met Ser Gln Ser Gly Asp Leu Phe Ile Asp Arg Val Glu
690 695 700
Leu Val Pro Met Ser Leu Gln Thr Tyr Thr Val Asn Tyr Thr Ile Thr
705 710 715 720
Ser Ser Leu Pro Ser Trp Asp Asp Phe Asn His Asp Ala Asn Asp Ile
725 730 735
Asn Ser Phe Pro Thr Val Trp Lys Ala Asp Ala Asn Gln Ser Val Ser
740 745 750
Lys Phe Ile Ala Ser Ala Gln Asp Leu Pro Gln Leu His Ser Val Val
755 760 765
Pro Ile Gly Leu Ser Thr Asn Gly Ser Trp Ser Glu Leu Arg Asp Ala
770 775 780
Asn Leu Asp Asp Leu Ile Ser Ser Gly Thr Gln Asn Val Ala Tyr Ser
785 790 795 800
Met Gln Asp Gly Ser Ser Phe Thr Glu Leu Lys Leu Gly Ser Ser Glu
805 810 815
Tyr Gln Met Ala Ala Ser Ser Gly Thr Ile Thr Gly Thr Leu Ile
820 825 830
<210> 15
<211> 708
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 15
Met Ala Gln Leu Arg Glu Leu Gln Thr Gln Ser Ile Ile Pro Tyr Asn
1 5 10 15
Ile Leu Ala Thr Pro Pro Ile Gly Asn Ala Asp Thr Gln Pro Ala Ser
20 25 30
Leu Asp Asp Leu Ile Glu Ser Leu Lys Glu Ala Trp Ile Glu Phe Gln
35 40 45
Lys Thr Gly Ser Asp Ala Leu Leu Gln Ser Val Leu Gln Asn Gly Phe
50 55 60
Ser Ala Ala Gly Gly Gly Ser Phe Asn Tyr Leu Ala Val Leu Gln Ala
65 70 75 80
Gly Ile Gly Ile Leu Gly Thr Met Leu Gly Ala Glu Ile Pro Gly Val
85 90 95
Ala Val Ala Ala Pro Ile Leu Ser Met Val Ile Gly Trp Leu Trp Pro
100 105 110
His Lys Thr Gln Asp Asp Thr Gln Gln Leu Ile Asn Leu Ile Asp Ala
115 120 125
Glu Ile Gln Lys Gln Leu Asn Lys Ala Leu Ser Glu Gln Asp Lys Asn
130 135 140
Asn Trp Thr Gly Tyr Leu Asn Ala Ile Phe Gln Ile Ser Asn Asp Ala
145 150 155 160
Ser Arg Ser Val Val Asp Ala Gln Phe Thr Gly Asn Glu Gly Asp Ala
165 170 175
Thr Arg Gln Pro Arg Thr Pro Gly Val Ser Asp Tyr Glu Asn Val Tyr
180 185 190
Asn His Leu Leu Ala Thr Gln Gly Ser Leu Ile Val Ala Leu Ser Gln
195 200 205
Met Ile Asn Gly Asn Phe Asp Thr Leu Ala Ile Pro Phe Phe Val Ile
210 215 220
Gly Ala Thr Val Gln Leu Ser Thr Tyr Gln Ser Phe Ile Gln Phe Ala
225 230 235 240
Asn Lys Trp Leu Pro Ile Val Tyr Pro Asp Tyr Gln Thr Gln Gly Thr
245 250 255
Ala Gly Tyr Thr Gln Gln Gln Asn Leu Ile Asn Ala Lys Ser Asp Met
260 265 270
Arg Ala Ala Ile Gln Asn His Thr Gln Lys Val Phe Asn Ala Phe Lys
275 280 285
Ala Gly Leu Pro Ser Leu Ser Ser Asp Lys Asn Ser Ile Ser Val Tyr
290 295 300
Asn Gln Tyr Val Arg Gly Leu Val Leu Asn Gly Leu Asp Thr Val Ala
305 310 315 320
Thr Trp Pro Ser Met Tyr Pro Asp Asp Tyr Arg Thr Gln Thr Glu Leu
325 330 335
Glu Gln Thr Arg Val Ile Phe Ser Asp Leu Ile Gly Lys Asp Gln Thr
340 345 350
Ile Ser Asn Asp Val Thr Leu Ile Asn Met Ala Asp Gly Lys Asn Gly
355 360 365
Ser Gly Ala Phe Asn Gln His Ala Thr Ile Asp Leu Asn Ser Ile Phe
370 375 380
Tyr Tyr His Glu Glu Leu Ser Ser Ile Gln Leu Gly Ile His Asp Gly
385 390 395 400
Asn Arg Gly Asn Thr Thr Asn Cys Tyr Asp Tyr Gly Ile Ile Thr Asn
405 410 415
Tyr Pro Asn Tyr Ser Tyr Phe Tyr Gly Asp Leu Tyr Ser Pro Ser Gly
420 425 430
Asp Ala Thr Phe Ser Ala Pro Leu Asn Val Leu Asn Ala Gln Thr Gln
435 440 445
Tyr Ser Ser Gly Tyr Val Asp Ser Glu Phe Ile Tyr Asn Ser Asp Pro
450 455 460
Ser Ala Tyr Gly Ala Gly Cys His Ile Leu Gly Gly Cys Thr Lys Gly
465 470 475 480
Asp Cys Tyr Asn Gln Ser Ser Cys Ser Glu Gly Phe Gly Tyr Ser Cys
485 490 495
Asn Pro Ser Leu Pro Ser Gln Lys Ile Asn Ala Phe Tyr Pro Phe Lys
500 505 510
Met Asn Leu Ser Arg Ser Gly Thr Ser Asp Lys Leu Gly Leu Met Tyr
515 520 525
Ser Leu Val Pro Leu Asp Leu Asn Pro Asn Asn Thr Phe Gly Glu Leu
530 535 540
Asp Ser Asn Thr Gly Asn Val Ile Gly Lys Gly Phe Pro Ala Glu Lys
545 550 555 560
Gly Thr Leu Ala Ser Gly Asn Val Pro Thr Val Val Arg Glu Trp Val
565 570 575
Asn Gly Ala Asn Ala Val Lys Leu Ser Asp Asp Ala Leu Thr Leu Lys
580 585 590
Met Thr Asn Met Thr Gly Gly Lys Tyr Tyr Ile Arg Val Arg Tyr Ala
595 600 605
Asn Thr Ser Asn Ala Glu Ile Ser Ile Asn Gln Ser Val Ser Ala Gly
610 615 620
Asn Gln Asn Ile Gln Asn Gly Pro Trp Gly Leu Pro Ser Thr Ile Thr
625 630 635 640
Ser Val Val Ser Asp Ala Ser Phe Lys Lys Leu Tyr Ile Ala Gly Gln
645 650 655
Thr Gly Asn Tyr Val Leu Glu Asp Ile Thr Trp Gly Val Asp Gly Gln
660 665 670
Gly Lys Pro Ile Thr Ile Pro Ser Gly Asp Val Thr Val Thr Leu Ser
675 680 685
Arg Thr Asn Met Ser Gln Ser Gly Asp Leu Phe Ile Asp Arg Val Glu
690 695 700
Leu Val Pro Met
705
<210> 16
<211> 796
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 16
Val Lys Lys Val Asn Ser Tyr Lys Asn Glu Asn Glu Tyr Glu Tyr Glu
1 5 10 15
Ile Leu Asp Thr Ser Gln Asn Asn Ser Asn Met Ser Asn Arg Tyr Pro
20 25 30
Arg Tyr Pro Leu Ala Asn Asn Pro Gln Thr Pro Met Arg Asn Val Asn
35 40 45
Tyr Lys Glu Trp Leu Asn Met Cys Asp Ser Asn Thr Lys Phe Val Gly
50 55 60
Asp Ile Asn Thr Tyr Ser Ser Pro Lys Ala Ala Leu Ser Val Gln Asn
65 70 75 80
Ala Ile Val Thr Gly Ile Ser Ile Ser Gly Thr Val Phe Leu Val Ile
85 90 95
Gly Ser Pro Val Leu Gly Gln Ser Leu Leu Leu Ile Ser Arg Leu Ile
100 105 110
Asn Leu Leu Trp Ser Val Phe Arg Pro Leu Glu Pro Trp Glu Glu Leu
115 120 125
Met Ala Leu Val Glu Glu Leu Ile Asn Gln Arg Ile Glu Val Val Ile
130 135 140
Lys Asn Asn Ala Leu Ala Ser Leu Asp Gly Leu Gln Gln Ile Met Glu
145 150 155 160
Leu Tyr Arg Ile Arg Leu Ser Ala Trp Val Asn Asn Lys Asn Asp Gln
165 170 175
Thr Lys Met Ala Val Arg Glu Gln His Arg Ile Val Asp Asn Phe Leu
180 185 190
Glu Val Asn Met Ala Arg Phe Arg Ile Ser Gly Ser Glu Val Leu Leu
195 200 205
Leu Pro Val Tyr Ala Gln Ala Ala Asn Leu His Leu Ile Leu Leu Arg
210 215 220
Asp Ile Asp Leu Phe Gly Ala Glu Trp Gly Ile Gly Pro Glu Gln Ile
225 230 235 240
Arg Asp Asp Tyr Ile Arg Leu Gln Arg Tyr Ile Arg Glu Tyr Lys Asp
245 250 255
His Cys Val Ile Phe Tyr Asn Gln Gly Leu Asn Gln Phe Asn Arg Ser
260 265 270
Thr Ala Gln Glu Trp Ile Arg Phe Asn Arg Phe Arg Arg Glu Met Thr
275 280 285
Leu Thr Val Leu Asp Leu Ala Ser Phe Phe Pro Asn Tyr Asp Pro Arg
290 295 300
Ile Tyr Pro Thr Ala Val Lys Thr Glu Leu Thr Arg Glu Ile Tyr Thr
305 310 315 320
Asp Leu Ile Gly Asn Ser Thr Phe Ser Pro Trp Trp Asn Gln Thr Val
325 330 335
Thr Ser Phe Ser Thr Ile Glu Ser Ser Val Ile Arg Arg Pro Ser Phe
340 345 350
Thr Thr Trp Leu Asn Arg Ile Arg Val Phe Thr Gly Ser Leu Asn Pro
355 360 365
Arg Ile Ser Pro Ala Tyr Val Trp Gly Gly His Glu Leu Val Glu Asn
370 375 380
Arg Ser Asn Gly Ser Glu Ile Thr His Val Phe Gly Asn Thr Ala Ser
385 390 395 400
Leu Thr Pro Ile Gln Phe Leu Ser Phe Ala Asn Val Ser Val Phe Arg
405 410 415
Ile Asn Ser Leu Ala Arg Ser Asp Thr Gln Ala His Thr Asn Tyr Gly
420 425 430
Val Ser Arg Ala Ile Phe His Thr Ser Asp Ile Asn Asn Ala Pro Gly
435 440 445
Ser Val Ala Tyr Glu Val Pro Asn Asn Asn Ser Ser Ser Pro Met Val
450 455 460
Ser Glu Leu Pro Arg Glu Asn Glu Gln Arg Pro Asn Glu Arg Glu Phe
465 470 475 480
Ser His Arg Leu Ser Tyr Ile Ser His Val Arg Ala Arg Arg Tyr Asn
485 490 495
Ser Gly Thr Asn Gly Asn Ile Asp Leu Leu Met Tyr Gly Trp Thr His
500 505 510
Thr Ser Met Asn Arg Asn Asn Arg Leu Glu Pro Asp Gly Ile Thr Gln
515 520 525
Ile Asp Ala Val Lys Gly Val Gly Gly Thr Val Ser Gly Val Val Ile
530 535 540
Pro Gly Pro Thr Gly Gly Asn Leu Val Asn Pro Gly Ala Ser Pro Ala
545 550 555 560
Ile Phe Ser Leu Arg Val Gln Ala Pro Gln Ile Pro Thr Ser Tyr Arg
565 570 575
Ile Arg Leu Arg Tyr Ala Cys Leu Ser Arg Ser Pro Ser Arg Ile Trp
580 585 590
Val His His Ser Gly Asn Phe His Phe Val Glu Leu Pro Cys Asn Asn
595 600 605
Val Ser Gly Arg Pro Ser Asn Thr Leu Leu Glu Ser Asp Phe His Tyr
610 615 620
Val Asp Val Pro Gly Val Phe Ser Pro Ser Ile Asn Pro Glu Ile Leu
625 630 635 640
Phe Gln Asp His Ser Phe Gly Thr Val Leu Asp Lys Ile Glu Phe Ile
645 650 655
Pro Ile Pro Leu Leu Pro Glu Gly Phe Arg Ile Val Thr Ala Leu Asn
660 665 670
Asn Asn Phe Val Ile Asp Leu Asn Pro Asn Asn Asn Val Thr Met Trp
675 680 685
Asn Asn Asn Gly Gly Asp Asn Gln Arg Trp Arg Phe Thr Tyr Asp Gln
690 695 700
Gln Arg Asn Ala Tyr Val Ile Arg Asn Leu Arg Asn Pro Asp Leu Ala
705 710 715 720
Leu Thr Trp Asp Thr Thr Ser Ser Ser Val Val Ala Ala Lys Ile Ser
725 730 735
Pro Glu Arg Arg Glu Gln Tyr Trp Val Leu Glu Asn Phe Gln Asn Gly
740 745 750
Tyr Val Leu Leu Asn Leu Arg Asp Thr Gly Arg Val Leu Asp Val Ala
755 760 765
Gly Gly Ser Val Gly Ser Gly Thr Asn Leu Ile Val Tyr Pro Arg His
770 775 780
Asn Gly Asn Gly Gln Ile Phe Phe Ile Arg Lys Pro
785 790 795
<210> 17
<211> 793
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 17
Met Asn Ser Tyr Lys Asn Glu Asn Glu Tyr Glu Tyr Glu Ile Leu Asp
1 5 10 15
Thr Ser Gln Asn Asn Ser Asn Met Ser Asn Arg Tyr Pro Arg Tyr Pro
20 25 30
Leu Ala Asn Asn Pro Gln Thr Pro Met Arg Asn Val Asn Tyr Lys Glu
35 40 45
Trp Leu Asn Met Cys Asp Ser Asn Thr Lys Phe Val Gly Asp Ile Asn
50 55 60
Thr Tyr Ser Ser Pro Lys Ala Ala Leu Ser Val Gln Asn Ala Ile Val
65 70 75 80
Thr Gly Ile Ser Ile Ser Gly Thr Val Phe Leu Val Ile Gly Ser Pro
85 90 95
Val Leu Gly Gln Ser Leu Leu Leu Ile Ser Arg Leu Ile Asn Leu Leu
100 105 110
Trp Ser Val Phe Arg Pro Leu Glu Pro Trp Glu Glu Leu Met Ala Leu
115 120 125
Val Glu Glu Leu Ile Asn Gln Arg Ile Glu Val Val Ile Lys Asn Asn
130 135 140
Ala Leu Ala Ser Leu Asp Gly Leu Gln Gln Ile Met Glu Leu Tyr Arg
145 150 155 160
Ile Arg Leu Ser Ala Trp Val Asn Asn Lys Asn Asp Gln Thr Lys Met
165 170 175
Ala Val Arg Glu Gln His Arg Ile Val Asp Asn Phe Leu Glu Val Asn
180 185 190
Met Ala Arg Phe Arg Ile Ser Gly Ser Glu Val Leu Leu Leu Pro Val
195 200 205
Tyr Ala Gln Ala Ala Asn Leu His Leu Ile Leu Leu Arg Asp Ile Asp
210 215 220
Leu Phe Gly Ala Glu Trp Gly Ile Gly Pro Glu Gln Ile Arg Asp Asp
225 230 235 240
Tyr Ile Arg Leu Gln Arg Tyr Ile Arg Glu Tyr Lys Asp His Cys Val
245 250 255
Ile Phe Tyr Asn Gln Gly Leu Asn Gln Phe Asn Arg Ser Thr Ala Gln
260 265 270
Glu Trp Ile Arg Phe Asn Arg Phe Arg Arg Glu Met Thr Leu Thr Val
275 280 285
Leu Asp Leu Ala Ser Phe Phe Pro Asn Tyr Asp Pro Arg Ile Tyr Pro
290 295 300
Thr Ala Val Lys Thr Glu Leu Thr Arg Glu Ile Tyr Thr Asp Leu Ile
305 310 315 320
Gly Asn Ser Thr Phe Ser Pro Trp Trp Asn Gln Thr Val Thr Ser Phe
325 330 335
Ser Thr Ile Glu Ser Ser Val Ile Arg Arg Pro Ser Phe Thr Thr Trp
340 345 350
Leu Asn Arg Ile Arg Val Phe Thr Gly Ser Leu Asn Pro Arg Ile Ser
355 360 365
Pro Ala Tyr Val Trp Gly Gly His Glu Leu Val Glu Asn Arg Ser Asn
370 375 380
Gly Ser Glu Ile Thr His Val Phe Gly Asn Thr Ala Ser Leu Thr Pro
385 390 395 400
Ile Gln Phe Leu Ser Phe Ala Asn Val Ser Val Phe Arg Ile Asn Ser
405 410 415
Leu Ala Arg Ser Asp Thr Gln Ala His Thr Asn Tyr Gly Val Ser Arg
420 425 430
Ala Ile Phe His Thr Ser Asp Ile Asn Asn Ala Pro Gly Ser Val Ala
435 440 445
Tyr Glu Val Pro Asn Asn Asn Ser Ser Ser Pro Met Val Ser Glu Leu
450 455 460
Pro Arg Glu Asn Glu Gln Arg Pro Asn Glu Arg Glu Phe Ser His Arg
465 470 475 480
Leu Ser Tyr Ile Ser His Val Arg Ala Arg Arg Tyr Asn Ser Gly Thr
485 490 495
Asn Gly Asn Ile Asp Leu Leu Met Tyr Gly Trp Thr His Thr Ser Met
500 505 510
Asn Arg Asn Asn Arg Leu Glu Pro Asp Gly Ile Thr Gln Ile Asp Ala
515 520 525
Val Lys Gly Val Gly Gly Thr Val Ser Gly Val Val Ile Pro Gly Pro
530 535 540
Thr Gly Gly Asn Leu Val Asn Pro Gly Ala Ser Pro Ala Ile Phe Ser
545 550 555 560
Leu Arg Val Gln Ala Pro Gln Ile Pro Thr Ser Tyr Arg Ile Arg Leu
565 570 575
Arg Tyr Ala Cys Leu Ser Arg Ser Pro Ser Arg Ile Trp Val His His
580 585 590
Ser Gly Asn Phe His Phe Val Glu Leu Pro Cys Asn Asn Val Ser Gly
595 600 605
Arg Pro Ser Asn Thr Leu Leu Glu Ser Asp Phe His Tyr Val Asp Val
610 615 620
Pro Gly Val Phe Ser Pro Ser Ile Asn Pro Glu Ile Leu Phe Gln Asp
625 630 635 640
His Ser Phe Gly Thr Val Leu Asp Lys Ile Glu Phe Ile Pro Ile Pro
645 650 655
Leu Leu Pro Glu Gly Phe Arg Ile Val Thr Ala Leu Asn Asn Asn Phe
660 665 670
Val Ile Asp Leu Asn Pro Asn Asn Asn Val Thr Met Trp Asn Asn Asn
675 680 685
Gly Gly Asp Asn Gln Arg Trp Arg Phe Thr Tyr Asp Gln Gln Arg Asn
690 695 700
Ala Tyr Val Ile Arg Asn Leu Arg Asn Pro Asp Leu Ala Leu Thr Trp
705 710 715 720
Asp Thr Thr Ser Ser Ser Val Val Ala Ala Lys Ile Ser Pro Glu Arg
725 730 735
Arg Glu Gln Tyr Trp Val Leu Glu Asn Phe Gln Asn Gly Tyr Val Leu
740 745 750
Leu Asn Leu Arg Asp Thr Gly Arg Val Leu Asp Val Ala Gly Gly Ser
755 760 765
Val Gly Ser Gly Thr Asn Leu Ile Val Tyr Pro Arg His Asn Gly Asn
770 775 780
Gly Gln Ile Phe Phe Ile Arg Lys Pro
785 790
<210> 18
<211> 658
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 18
Val Lys Lys Val Asn Ser Tyr Lys Asn Glu Asn Glu Tyr Glu Tyr Glu
1 5 10 15
Ile Leu Asp Thr Ser Gln Asn Asn Ser Asn Met Ser Asn Arg Tyr Pro
20 25 30
Arg Tyr Pro Leu Ala Asn Asn Pro Gln Thr Pro Met Arg Asn Val Asn
35 40 45
Tyr Lys Glu Trp Leu Asn Met Cys Asp Ser Asn Thr Lys Phe Val Gly
50 55 60
Asp Ile Asn Thr Tyr Ser Ser Pro Lys Ala Ala Leu Ser Val Gln Asn
65 70 75 80
Ala Ile Val Thr Gly Ile Ser Ile Ser Gly Thr Val Phe Leu Val Ile
85 90 95
Gly Ser Pro Val Leu Gly Gln Ser Leu Leu Leu Ile Ser Arg Leu Ile
100 105 110
Asn Leu Leu Trp Ser Val Phe Arg Pro Leu Glu Pro Trp Glu Glu Leu
115 120 125
Met Ala Leu Val Glu Glu Leu Ile Asn Gln Arg Ile Glu Val Val Ile
130 135 140
Lys Asn Asn Ala Leu Ala Ser Leu Asp Gly Leu Gln Gln Ile Met Glu
145 150 155 160
Leu Tyr Arg Ile Arg Leu Ser Ala Trp Val Asn Asn Lys Asn Asp Gln
165 170 175
Thr Lys Met Ala Val Arg Glu Gln His Arg Ile Val Asp Asn Phe Leu
180 185 190
Glu Val Asn Met Ala Arg Phe Arg Ile Ser Gly Ser Glu Val Leu Leu
195 200 205
Leu Pro Val Tyr Ala Gln Ala Ala Asn Leu His Leu Ile Leu Leu Arg
210 215 220
Asp Ile Asp Leu Phe Gly Ala Glu Trp Gly Ile Gly Pro Glu Gln Ile
225 230 235 240
Arg Asp Asp Tyr Ile Arg Leu Gln Arg Tyr Ile Arg Glu Tyr Lys Asp
245 250 255
His Cys Val Ile Phe Tyr Asn Gln Gly Leu Asn Gln Phe Asn Arg Ser
260 265 270
Thr Ala Gln Glu Trp Ile Arg Phe Asn Arg Phe Arg Arg Glu Met Thr
275 280 285
Leu Thr Val Leu Asp Leu Ala Ser Phe Phe Pro Asn Tyr Asp Pro Arg
290 295 300
Ile Tyr Pro Thr Ala Val Lys Thr Glu Leu Thr Arg Glu Ile Tyr Thr
305 310 315 320
Asp Leu Ile Gly Asn Ser Thr Phe Ser Pro Trp Trp Asn Gln Thr Val
325 330 335
Thr Ser Phe Ser Thr Ile Glu Ser Ser Val Ile Arg Arg Pro Ser Phe
340 345 350
Thr Thr Trp Leu Asn Arg Ile Arg Val Phe Thr Gly Ser Leu Asn Pro
355 360 365
Arg Ile Ser Pro Ala Tyr Val Trp Gly Gly His Glu Leu Val Glu Asn
370 375 380
Arg Ser Asn Gly Ser Glu Ile Thr His Val Phe Gly Asn Thr Ala Ser
385 390 395 400
Leu Thr Pro Ile Gln Phe Leu Ser Phe Ala Asn Val Ser Val Phe Arg
405 410 415
Ile Asn Ser Leu Ala Arg Ser Asp Thr Gln Ala His Thr Asn Tyr Gly
420 425 430
Val Ser Arg Ala Ile Phe His Thr Ser Asp Ile Asn Asn Ala Pro Gly
435 440 445
Ser Val Ala Tyr Glu Val Pro Asn Asn Asn Ser Ser Ser Pro Met Val
450 455 460
Ser Glu Leu Pro Arg Glu Asn Glu Gln Arg Pro Asn Glu Arg Glu Phe
465 470 475 480
Ser His Arg Leu Ser Tyr Ile Ser His Val Arg Ala Arg Arg Tyr Asn
485 490 495
Ser Gly Thr Asn Gly Asn Ile Asp Leu Leu Met Tyr Gly Trp Thr His
500 505 510
Thr Ser Met Asn Arg Asn Asn Arg Leu Glu Pro Asp Gly Ile Thr Gln
515 520 525
Ile Asp Ala Val Lys Gly Val Gly Gly Thr Val Ser Gly Val Val Ile
530 535 540
Pro Gly Pro Thr Gly Gly Asn Leu Val Asn Pro Gly Ala Ser Pro Ala
545 550 555 560
Ile Phe Ser Leu Arg Val Gln Ala Pro Gln Ile Pro Thr Ser Tyr Arg
565 570 575
Ile Arg Leu Arg Tyr Ala Cys Leu Ser Arg Ser Pro Ser Arg Ile Trp
580 585 590
Val His His Ser Gly Asn Phe His Phe Val Glu Leu Pro Cys Asn Asn
595 600 605
Val Ser Gly Arg Pro Ser Asn Thr Leu Leu Glu Ser Asp Phe His Tyr
610 615 620
Val Asp Val Pro Gly Val Phe Ser Pro Ser Ile Asn Pro Glu Ile Leu
625 630 635 640
Phe Gln Asp His Ser Phe Gly Thr Val Leu Asp Lys Ile Glu Phe Ile
645 650 655
Pro Ile
<210> 19
<211> 655
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 19
Met Asn Ser Tyr Lys Asn Glu Asn Glu Tyr Glu Tyr Glu Ile Leu Asp
1 5 10 15
Thr Ser Gln Asn Asn Ser Asn Met Ser Asn Arg Tyr Pro Arg Tyr Pro
20 25 30
Leu Ala Asn Asn Pro Gln Thr Pro Met Arg Asn Val Asn Tyr Lys Glu
35 40 45
Trp Leu Asn Met Cys Asp Ser Asn Thr Lys Phe Val Gly Asp Ile Asn
50 55 60
Thr Tyr Ser Ser Pro Lys Ala Ala Leu Ser Val Gln Asn Ala Ile Val
65 70 75 80
Thr Gly Ile Ser Ile Ser Gly Thr Val Phe Leu Val Ile Gly Ser Pro
85 90 95
Val Leu Gly Gln Ser Leu Leu Leu Ile Ser Arg Leu Ile Asn Leu Leu
100 105 110
Trp Ser Val Phe Arg Pro Leu Glu Pro Trp Glu Glu Leu Met Ala Leu
115 120 125
Val Glu Glu Leu Ile Asn Gln Arg Ile Glu Val Val Ile Lys Asn Asn
130 135 140
Ala Leu Ala Ser Leu Asp Gly Leu Gln Gln Ile Met Glu Leu Tyr Arg
145 150 155 160
Ile Arg Leu Ser Ala Trp Val Asn Asn Lys Asn Asp Gln Thr Lys Met
165 170 175
Ala Val Arg Glu Gln His Arg Ile Val Asp Asn Phe Leu Glu Val Asn
180 185 190
Met Ala Arg Phe Arg Ile Ser Gly Ser Glu Val Leu Leu Leu Pro Val
195 200 205
Tyr Ala Gln Ala Ala Asn Leu His Leu Ile Leu Leu Arg Asp Ile Asp
210 215 220
Leu Phe Gly Ala Glu Trp Gly Ile Gly Pro Glu Gln Ile Arg Asp Asp
225 230 235 240
Tyr Ile Arg Leu Gln Arg Tyr Ile Arg Glu Tyr Lys Asp His Cys Val
245 250 255
Ile Phe Tyr Asn Gln Gly Leu Asn Gln Phe Asn Arg Ser Thr Ala Gln
260 265 270
Glu Trp Ile Arg Phe Asn Arg Phe Arg Arg Glu Met Thr Leu Thr Val
275 280 285
Leu Asp Leu Ala Ser Phe Phe Pro Asn Tyr Asp Pro Arg Ile Tyr Pro
290 295 300
Thr Ala Val Lys Thr Glu Leu Thr Arg Glu Ile Tyr Thr Asp Leu Ile
305 310 315 320
Gly Asn Ser Thr Phe Ser Pro Trp Trp Asn Gln Thr Val Thr Ser Phe
325 330 335
Ser Thr Ile Glu Ser Ser Val Ile Arg Arg Pro Ser Phe Thr Thr Trp
340 345 350
Leu Asn Arg Ile Arg Val Phe Thr Gly Ser Leu Asn Pro Arg Ile Ser
355 360 365
Pro Ala Tyr Val Trp Gly Gly His Glu Leu Val Glu Asn Arg Ser Asn
370 375 380
Gly Ser Glu Ile Thr His Val Phe Gly Asn Thr Ala Ser Leu Thr Pro
385 390 395 400
Ile Gln Phe Leu Ser Phe Ala Asn Val Ser Val Phe Arg Ile Asn Ser
405 410 415
Leu Ala Arg Ser Asp Thr Gln Ala His Thr Asn Tyr Gly Val Ser Arg
420 425 430
Ala Ile Phe His Thr Ser Asp Ile Asn Asn Ala Pro Gly Ser Val Ala
435 440 445
Tyr Glu Val Pro Asn Asn Asn Ser Ser Ser Pro Met Val Ser Glu Leu
450 455 460
Pro Arg Glu Asn Glu Gln Arg Pro Asn Glu Arg Glu Phe Ser His Arg
465 470 475 480
Leu Ser Tyr Ile Ser His Val Arg Ala Arg Arg Tyr Asn Ser Gly Thr
485 490 495
Asn Gly Asn Ile Asp Leu Leu Met Tyr Gly Trp Thr His Thr Ser Met
500 505 510
Asn Arg Asn Asn Arg Leu Glu Pro Asp Gly Ile Thr Gln Ile Asp Ala
515 520 525
Val Lys Gly Val Gly Gly Thr Val Ser Gly Val Val Ile Pro Gly Pro
530 535 540
Thr Gly Gly Asn Leu Val Asn Pro Gly Ala Ser Pro Ala Ile Phe Ser
545 550 555 560
Leu Arg Val Gln Ala Pro Gln Ile Pro Thr Ser Tyr Arg Ile Arg Leu
565 570 575
Arg Tyr Ala Cys Leu Ser Arg Ser Pro Ser Arg Ile Trp Val His His
580 585 590
Ser Gly Asn Phe His Phe Val Glu Leu Pro Cys Asn Asn Val Ser Gly
595 600 605
Arg Pro Ser Asn Thr Leu Leu Glu Ser Asp Phe His Tyr Val Asp Val
610 615 620
Pro Gly Val Phe Ser Pro Ser Ile Asn Pro Glu Ile Leu Phe Gln Asp
625 630 635 640
His Ser Phe Gly Thr Val Leu Asp Lys Ile Glu Phe Ile Pro Ile
645 650 655
<210> 20
<211> 790
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 20
Val Lys Lys Met Asn Ser Tyr Gln Asn Lys Asn Glu Tyr Glu Ile Leu
1 5 10 15
Asp Ala Ser Glu Asn Thr Val Asn Ala Leu Asn Arg Tyr Pro Phe Ala
20 25 30
Asn Asn Pro Tyr Ser Ser Ile Phe Ser Ser Cys Pro Arg Ser Gly Pro
35 40 45
Gly Asn Trp Ile Asn Ile Leu Gly Asn Ala Val Ser Glu Ala Val Ser
50 55 60
Ile Ser Gln Asp Ile Ile Ser Leu Leu Thr Gln Pro Ser Ile Ser Gly
65 70 75 80
Ile Ile Ser Met Ala Phe Ser Leu Leu Ser Arg Met Ile Gly Ser Asn
85 90 95
Gly Arg Ser Ile Ser Glu Leu Ser Met Cys Asp Leu Leu Ala Ile Ile
100 105 110
Asp Leu Arg Val Asn Gln Ser Val Leu Asp Asp Gly Val Ala Asp Phe
115 120 125
Asn Gly Ser Leu Val Ile Tyr Arg Asn Tyr Leu Glu Ala Leu Gln Arg
130 135 140
Trp Asn Asn Asn Pro Asn Pro Ala Asn Ala Glu Glu Val Arg Thr Arg
145 150 155 160
Phe Arg Glu Ser Asp Thr Ile Phe Asp Leu Ile Leu Thr Gln Gly Ser
165 170 175
Leu Thr Asn Gly Gly Ser Leu Ala Arg Asn Asn Ala Gln Ile Leu Leu
180 185 190
Leu Pro Ser Phe Ala Asn Ala Ala Tyr Phe His Leu Leu Leu Leu Arg
195 200 205
Asp Ala Asn Val Tyr Gly Asn Asn Trp Gly Leu Phe Gly Val Thr Pro
210 215 220
Asn Ile Asn Tyr Glu Ser Lys Leu Leu Asn Leu Ile Arg Leu Tyr Thr
225 230 235 240
Asn Tyr Cys Thr His Trp Tyr Asn Gln Gly Leu Asn Glu Leu Arg Asn
245 250 255
Arg Gly Ser Asn Ala Thr Ala Trp Leu Glu Phe His Arg Phe Arg Arg
260 265 270
Asp Met Thr Leu Met Val Leu Asp Ile Val Ser Ser Phe Ser Ser Leu
275 280 285
Asp Ile Thr Arg Tyr Pro Arg Ala Thr Asp Phe Gln Leu Ser Arg Ile
290 295 300
Ile Tyr Thr Asp Pro Ile Gly Phe Val Asn Arg Ser Asp Pro Ser Ala
305 310 315 320
Pro Arg Thr Trp Phe Ser Phe His Asn Gln Ala Asn Phe Ser Ala Leu
325 330 335
Glu Ser Gly Ile Pro Ser Pro Ser Phe Ser Gln Phe Leu Asp Ser Met
340 345 350
Arg Ile Ser Thr Gly Pro Leu Ser Leu Pro Ala Ser Pro Asn Ile His
355 360 365
Arg Ala Arg Val Trp Tyr Gly Asn Gln Asn Asn Phe Asn Gly Ser Ser
370 375 380
Ser Gln Thr Phe Gly Glu Ile Thr Asn Asp Asn Gln Thr Ile Ser Gly
385 390 395 400
Leu Asn Ile Phe Arg Ile Asp Ser Gln Ala Val Asn Leu Asn Asn Thr
405 410 415
Thr Phe Gly Val Ser Arg Ala Glu Phe Tyr His Asp Ala Ser Gln Gly
420 425 430
Ser Gln Arg Ser Ile Tyr Gln Gly Phe Val Asp Thr Gly Gly Ala Ser
435 440 445
Thr Ala Val Ala Gln Asn Ile Gln Thr Phe Phe Pro Gly Glu Asn Ser
450 455 460
Ser Ile Pro Thr Pro Gln Asp Tyr Thr His Ile Leu Ser Arg Ser Thr
465 470 475 480
Asn Leu Thr Gly Gly Leu Arg Gln Val Ala Ser Gly Arg Arg Ser Ser
485 490 495
Leu Val Leu His Gly Trp Thr His Lys Ser Leu Ser Arg Gln Asn Arg
500 505 510
Val Glu Pro Asn Arg Ile Thr Gln Val Pro Ala Val Lys Ala Ser Ser
515 520 525
Pro Ser Asn Cys Thr Val Ile Ala Gly Pro Gly Phe Thr Gly Gly Asp
530 535 540
Leu Val Arg Met Ser Ser Asn Cys Ser Val Ser Tyr Asn Phe Thr Pro
545 550 555 560
Ala Asp Gln Gln Val Val Ile Arg Leu Arg Tyr Ala Cys Gln Gly Thr
565 570 575
Ala Ser Leu Arg Ile Thr Phe Gly Asn Gly Ser Ser Gln Ile Ile Pro
580 585 590
Leu Val Ser Thr Thr Ser Ser Ile Asn Asn Leu Gln Tyr Glu Asn Phe
595 600 605
Ser Phe Ala Ser Gly Pro Asn Ser Val Asn Phe Leu Ser Ala Gly Thr
610 615 620
Ser Ile Thr Ile Gln Asn Ile Ser Thr Asn Ser Asn Val Val Leu Asp
625 630 635 640
Arg Ile Glu Ile Val Pro Glu Gln Pro Ile Pro Ile Ile Pro Gly Asp
645 650 655
Tyr Gln Ile Val Thr Ala Leu Asn Asn Ser Ser Val Phe Asp Leu Asn
660 665 670
Ser Gly Thr Arg Val Thr Leu Trp Ser Asn Asn Arg Gly Ala His Gln
675 680 685
Ile Trp Asn Phe Met Tyr Asp Gln Gln Arg Asn Ala Tyr Val Ile Arg
690 695 700
Asn Val Ser Asn Pro Ser Leu Val Leu Thr Trp Asp Phe Thr Ser Pro
705 710 715 720
Asn Ser Ile Val Phe Ala Ala Pro Phe Ser Pro Gly Arg Gln Glu Gln
725 730 735
Tyr Trp Ile Ala Glu Ser Phe Gln Asn Ser Tyr Val Phe Glu Asn Leu
740 745 750
Arg Asn Thr Asn Met Val Leu Asp Val Ala Gly Gly Ser Thr Ala Ile
755 760 765
Gly Thr Asn Ile Ile Ala Phe Pro Arg His Asn Gly Asn Ala Gln Arg
770 775 780
Phe Phe Ile Arg Arg Pro
785 790
<210> 21
<211> 647
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 21
Val Lys Lys Met Asn Ser Tyr Gln Asn Lys Asn Glu Tyr Glu Ile Leu
1 5 10 15
Asp Ala Ser Glu Asn Thr Val Asn Ala Leu Asn Arg Tyr Pro Phe Ala
20 25 30
Asn Asn Pro Tyr Ser Ser Ile Phe Ser Ser Cys Pro Arg Ser Gly Pro
35 40 45
Gly Asn Trp Ile Asn Ile Leu Gly Asn Ala Val Ser Glu Ala Val Ser
50 55 60
Ile Ser Gln Asp Ile Ile Ser Leu Leu Thr Gln Pro Ser Ile Ser Gly
65 70 75 80
Ile Ile Ser Met Ala Phe Ser Leu Leu Ser Arg Met Ile Gly Ser Asn
85 90 95
Gly Arg Ser Ile Ser Glu Leu Ser Met Cys Asp Leu Leu Ala Ile Ile
100 105 110
Asp Leu Arg Val Asn Gln Ser Val Leu Asp Asp Gly Val Ala Asp Phe
115 120 125
Asn Gly Ser Leu Val Ile Tyr Arg Asn Tyr Leu Glu Ala Leu Gln Arg
130 135 140
Trp Asn Asn Asn Pro Asn Pro Ala Asn Ala Glu Glu Val Arg Thr Arg
145 150 155 160
Phe Arg Glu Ser Asp Thr Ile Phe Asp Leu Ile Leu Thr Gln Gly Ser
165 170 175
Leu Thr Asn Gly Gly Ser Leu Ala Arg Asn Asn Ala Gln Ile Leu Leu
180 185 190
Leu Pro Ser Phe Ala Asn Ala Ala Tyr Phe His Leu Leu Leu Leu Arg
195 200 205
Asp Ala Asn Val Tyr Gly Asn Asn Trp Gly Leu Phe Gly Val Thr Pro
210 215 220
Asn Ile Asn Tyr Glu Ser Lys Leu Leu Asn Leu Ile Arg Leu Tyr Thr
225 230 235 240
Asn Tyr Cys Thr His Trp Tyr Asn Gln Gly Leu Asn Glu Leu Arg Asn
245 250 255
Arg Gly Ser Asn Ala Thr Ala Trp Leu Glu Phe His Arg Phe Arg Arg
260 265 270
Asp Met Thr Leu Met Val Leu Asp Ile Val Ser Ser Phe Ser Ser Leu
275 280 285
Asp Ile Thr Arg Tyr Pro Arg Ala Thr Asp Phe Gln Leu Ser Arg Ile
290 295 300
Ile Tyr Thr Asp Pro Ile Gly Phe Val Asn Arg Ser Asp Pro Ser Ala
305 310 315 320
Pro Arg Thr Trp Phe Ser Phe His Asn Gln Ala Asn Phe Ser Ala Leu
325 330 335
Glu Ser Gly Ile Pro Ser Pro Ser Phe Ser Gln Phe Leu Asp Ser Met
340 345 350
Arg Ile Ser Thr Gly Pro Leu Ser Leu Pro Ala Ser Pro Asn Ile His
355 360 365
Arg Ala Arg Val Trp Tyr Gly Asn Gln Asn Asn Phe Asn Gly Ser Ser
370 375 380
Ser Gln Thr Phe Gly Glu Ile Thr Asn Asp Asn Gln Thr Ile Ser Gly
385 390 395 400
Leu Asn Ile Phe Arg Ile Asp Ser Gln Ala Val Asn Leu Asn Asn Thr
405 410 415
Thr Phe Gly Val Ser Arg Ala Glu Phe Tyr His Asp Ala Ser Gln Gly
420 425 430
Ser Gln Arg Ser Ile Tyr Gln Gly Phe Val Asp Thr Gly Gly Ala Ser
435 440 445
Thr Ala Val Ala Gln Asn Ile Gln Thr Phe Phe Pro Gly Glu Asn Ser
450 455 460
Ser Ile Pro Thr Pro Gln Asp Tyr Thr His Ile Leu Ser Arg Ser Thr
465 470 475 480
Asn Leu Thr Gly Gly Leu Arg Gln Val Ala Ser Gly Arg Arg Ser Ser
485 490 495
Leu Val Leu His Gly Trp Thr His Lys Ser Leu Ser Arg Gln Asn Arg
500 505 510
Val Glu Pro Asn Arg Ile Thr Gln Val Pro Ala Val Lys Ala Ser Ser
515 520 525
Pro Ser Asn Cys Thr Val Ile Ala Gly Pro Gly Phe Thr Gly Gly Asp
530 535 540
Leu Val Arg Met Ser Ser Asn Cys Ser Val Ser Tyr Asn Phe Thr Pro
545 550 555 560
Ala Asp Gln Gln Val Val Ile Arg Leu Arg Tyr Ala Cys Gln Gly Thr
565 570 575
Ala Ser Leu Arg Ile Thr Phe Gly Asn Gly Ser Ser Gln Ile Ile Pro
580 585 590
Leu Val Ser Thr Thr Ser Ser Ile Asn Asn Leu Gln Tyr Glu Asn Phe
595 600 605
Ser Phe Ala Ser Gly Pro Asn Ser Val Asn Phe Leu Ser Ala Gly Thr
610 615 620
Ser Ile Thr Ile Gln Asn Ile Ser Thr Asn Ser Asn Val Val Leu Asp
625 630 635 640
Arg Ile Glu Ile Val Pro Glu
645
<210> 22
<211> 861
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 22
Met Gly Phe Leu Pro Ile Ile Ile Cys Ser Ser Arg Arg Asn Phe Met
1 5 10 15
Ala Gln Leu Gln Glu Leu Gln Ile Gln Pro Val Ile Pro Tyr Asn Val
20 25 30
Leu Ala Ser Pro Gln Ile Gly Asn Leu Asp Thr Gln Pro Gly Ser Leu
35 40 45
Asp Asp Leu Ile Glu Ser Leu Lys Glu Ala Trp Thr Glu Phe Gln Lys
50 55 60
Thr Gly Ser Asp Lys Leu Leu Gln Thr Thr Leu Gln Asn Gly Phe Ser
65 70 75 80
Ala Ala Ser Gly Gly Ser Phe Asn Tyr Leu Ala Val Leu Gln Ala Gly
85 90 95
Ile Gly Leu Phe Gly Thr Leu Gly Ala Glu Ile Pro Gly Val Ala Val
100 105 110
Ala Ala Pro Ile Leu Ser Met Val Ile Gly Trp Leu Trp Pro His Lys
115 120 125
Ser Gly Asp Asp Gly Gln Gln Leu Val Asp Leu Ile Asp Ala Glu Ile
130 135 140
Gln Lys Gln Leu Asn Gln Ala Leu Ser Thr Gln Asp Lys His Asn Trp
145 150 155 160
Val Gly Tyr Leu Asp Glu Ser Phe Leu Asn Ala Ser His Asn Val Ala
165 170 175
Arg Ala Val Thr Asn Ala Gln Phe Val Gly Thr Glu Gly Asn Tyr Thr
180 185 190
Ala Pro Arg Thr Pro Thr Ala Ser Asp Tyr Thr Thr Val Glu Thr Thr
195 200 205
Leu Phe Ser Thr Glu Gly Ser Tyr His Thr Gly Leu Ser Gln Met Leu
210 215 220
Asn Gly Asn Phe Asp Thr Leu Ala Leu Pro Phe Tyr Val Ile Gly Val
225 230 235 240
Thr Met Gln Leu Ala Thr Tyr Gln Thr Phe Ile Gln Phe Gly Glu Lys
245 250 255
Trp Ile Asp Val Val Trp Pro Asp Asn Gln Thr Pro Gly Thr Thr Gly
260 265 270
Tyr Thr His Tyr Gln Thr Leu Gln Asp Lys Lys Thr Asp Met Arg Thr
275 280 285
Phe Ile Gln Gln His Thr Thr Thr Val Tyr Asn Ala Phe Lys Lys Gly
290 295 300
Met Pro Pro Leu Glu Ser Asn Lys Asn Ser Ile Asn Ala Tyr Asn Val
305 310 315 320
Tyr Val Arg Gly Met Val Leu Asn Gly Leu Asp Met Val Ala Thr Trp
325 330 335
Pro Ser Met Tyr Pro Asp Asp Tyr Pro Thr Gln Met Ala Leu Glu Gln
340 345 350
Thr Arg Val Ile Phe Ser Asp Met Val Gly Gln Asp Gln Thr Arg Asp
355 360 365
Gly Asn Thr Thr Ile Tyr Asn Ile Phe Asp Ser Thr Ser Asn Phe Gln
370 375 380
His Gly Thr Thr Asp Ile Asn Ser Ile His Tyr Phe Pro Asp Glu Leu
385 390 395 400
Gln Gln Ala Gln Ile Ala Glu Tyr Ser Pro Ser Arg Gly Asn His Cys
405 410 415
Tyr Pro Tyr Gly Ile Ile Leu Asn Tyr Gln His Asn Thr Tyr Arg Tyr
420 425 430
Gly Asp Asn Asp Pro Gly Thr Ser Gly Thr Val Phe Pro Ala Pro Met
435 440 445
Arg Ser Ile Asn Ala Lys Thr Gln Asn Thr Ser Tyr Leu Asn Ser Glu
450 455 460
Asn Leu Phe Thr Asp Leu Gly Ser Ser Thr Asn Cys Tyr Ile Pro Gly
465 470 475 480
Thr Cys Thr Asn Asn Cys Ser Pro Gly Ser Ser Cys Thr Asn Gly Ser
485 490 495
Gly Tyr Gly Glu Ser Cys Asn Val Pro Leu Pro Asn Gln Lys Ile Asn
500 505 510
Ala Leu Tyr Pro Phe Thr Gln Glu His Val Ser Gly Asp Ser Gly Lys
515 520 525
Leu Gly Leu Met Ala Ser Leu Val Pro Tyr Gly Leu Ser Pro Asn Asn
530 535 540
Val Phe Gly Glu Phe Asp Ser Asp Thr Gln Ala Ile Ile Gly Lys Gly
545 550 555 560
Phe Pro Ala Glu Lys Gly Tyr Ile Gly Asn Gly Ser Pro Val Ser Val
565 570 575
Val Arg Glu Trp Val Asn Gly Ala Asn Ala Val Gln Leu Pro Ala Trp
580 585 590
Asp Thr Phe Lys Ile Thr Ala Thr Asn Leu Ala Ala Asp Gln Tyr His
595 600 605
Ile Arg Val Arg Tyr Ala Ser Thr Ala Ala Tyr Pro Thr Asn Ala Lys
610 615 620
Tyr Pro Leu Ser Tyr Tyr Val Leu Ala Gly Asp Thr Ile Leu Gln Gln
625 630 635 640
Gly Lys Trp Glu Ala Ile Asp Thr Ser Gln Ala Asp Ile Thr Lys Gln
645 650 655
Val Tyr Val Thr Gly Gln Gln Gly Asn Tyr Ile Leu Thr Asp Ile Leu
660 665 670
Gly Glu Asn Pro Ile Thr Leu Pro Lys Asp Glu Ile Thr Val Gln Leu
675 680 685
Leu Gln Gly Asn Pro Ala Gly Gly Met Thr Lys Val Leu Ile Asp Arg
690 695 700
Ile Glu Phe Val Pro Ala Val Ile Asn Val Leu Asn Pro Gln Leu Asn
705 710 715 720
Tyr Gln Ile Ser Ser Gln Leu Pro Ala Trp Asn Asn Asn Gln Ser Met
725 730 735
Asp Pro Phe Pro Thr Ile Trp Arg Ala Asn Pro Gly Ile Val Gly Thr
740 745 750
Ser Ala Asn Ile Thr Val Ala Asp Leu Leu Asp Met Ser Gly Phe Lys
755 760 765
Tyr Thr Asn Ile Thr Ala Leu Asp Leu Gly Gly Gln Val Gly Leu Phe
770 775 780
Ala Lys Ile Gly Ile Gly Thr Asp Ser Asp Pro Asn Asn Trp Lys Trp
785 790 795 800
Met Cys Pro Asp Gly Gln Gly Glu Asn Pro Trp Ile Gly Asp Gly Thr
805 810 815
Tyr Asn Phe Asn Pro Lys Asn Ser Gly Ser Cys Phe Asp Lys Ser Phe
820 825 830
Thr Ala Ile Lys Leu Val Gly Gly Thr Ser Phe Asp Thr Phe Ile Ala
835 840 845
Ala Gly Ile Leu Thr Gly Thr Val Thr Thr Gln Lys Ser
850 855 860
<210> 23
<211> 710
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 23
Met Gly Phe Leu Pro Ile Ile Ile Cys Ser Ser Arg Arg Asn Phe Met
1 5 10 15
Ala Gln Leu Gln Glu Leu Gln Ile Gln Pro Val Ile Pro Tyr Asn Val
20 25 30
Leu Ala Ser Pro Gln Ile Gly Asn Leu Asp Thr Gln Pro Gly Ser Leu
35 40 45
Asp Asp Leu Ile Glu Ser Leu Lys Glu Ala Trp Thr Glu Phe Gln Lys
50 55 60
Thr Gly Ser Asp Lys Leu Leu Gln Thr Thr Leu Gln Asn Gly Phe Ser
65 70 75 80
Ala Ala Ser Gly Gly Ser Phe Asn Tyr Leu Ala Val Leu Gln Ala Gly
85 90 95
Ile Gly Leu Phe Gly Thr Leu Gly Ala Glu Ile Pro Gly Val Ala Val
100 105 110
Ala Ala Pro Ile Leu Ser Met Val Ile Gly Trp Leu Trp Pro His Lys
115 120 125
Ser Gly Asp Asp Gly Gln Gln Leu Val Asp Leu Ile Asp Ala Glu Ile
130 135 140
Gln Lys Gln Leu Asn Gln Ala Leu Ser Thr Gln Asp Lys His Asn Trp
145 150 155 160
Val Gly Tyr Leu Asp Glu Ser Phe Leu Asn Ala Ser His Asn Val Ala
165 170 175
Arg Ala Val Thr Asn Ala Gln Phe Val Gly Thr Glu Gly Asn Tyr Thr
180 185 190
Ala Pro Arg Thr Pro Thr Ala Ser Asp Tyr Thr Thr Val Glu Thr Thr
195 200 205
Leu Phe Ser Thr Glu Gly Ser Tyr His Thr Gly Leu Ser Gln Met Leu
210 215 220
Asn Gly Asn Phe Asp Thr Leu Ala Leu Pro Phe Tyr Val Ile Gly Val
225 230 235 240
Thr Met Gln Leu Ala Thr Tyr Gln Thr Phe Ile Gln Phe Gly Glu Lys
245 250 255
Trp Ile Asp Val Val Trp Pro Asp Asn Gln Thr Pro Gly Thr Thr Gly
260 265 270
Tyr Thr His Tyr Gln Thr Leu Gln Asp Lys Lys Thr Asp Met Arg Thr
275 280 285
Phe Ile Gln Gln His Thr Thr Thr Val Tyr Asn Ala Phe Lys Lys Gly
290 295 300
Met Pro Pro Leu Glu Ser Asn Lys Asn Ser Ile Asn Ala Tyr Asn Val
305 310 315 320
Tyr Val Arg Gly Met Val Leu Asn Gly Leu Asp Met Val Ala Thr Trp
325 330 335
Pro Ser Met Tyr Pro Asp Asp Tyr Pro Thr Gln Met Ala Leu Glu Gln
340 345 350
Thr Arg Val Ile Phe Ser Asp Met Val Gly Gln Asp Gln Thr Arg Asp
355 360 365
Gly Asn Thr Thr Ile Tyr Asn Ile Phe Asp Ser Thr Ser Asn Phe Gln
370 375 380
His Gly Thr Thr Asp Ile Asn Ser Ile His Tyr Phe Pro Asp Glu Leu
385 390 395 400
Gln Gln Ala Gln Ile Ala Glu Tyr Ser Pro Ser Arg Gly Asn His Cys
405 410 415
Tyr Pro Tyr Gly Ile Ile Leu Asn Tyr Gln His Asn Thr Tyr Arg Tyr
420 425 430
Gly Asp Asn Asp Pro Gly Thr Ser Gly Thr Val Phe Pro Ala Pro Met
435 440 445
Arg Ser Ile Asn Ala Lys Thr Gln Asn Thr Ser Tyr Leu Asn Ser Glu
450 455 460
Asn Leu Phe Thr Asp Leu Gly Ser Ser Thr Asn Cys Tyr Ile Pro Gly
465 470 475 480
Thr Cys Thr Asn Asn Cys Ser Pro Gly Ser Ser Cys Thr Asn Gly Ser
485 490 495
Gly Tyr Gly Glu Ser Cys Asn Val Pro Leu Pro Asn Gln Lys Ile Asn
500 505 510
Ala Leu Tyr Pro Phe Thr Gln Glu His Val Ser Gly Asp Ser Gly Lys
515 520 525
Leu Gly Leu Met Ala Ser Leu Val Pro Tyr Gly Leu Ser Pro Asn Asn
530 535 540
Val Phe Gly Glu Phe Asp Ser Asp Thr Gln Ala Ile Ile Gly Lys Gly
545 550 555 560
Phe Pro Ala Glu Lys Gly Tyr Ile Gly Asn Gly Ser Pro Val Ser Val
565 570 575
Val Arg Glu Trp Val Asn Gly Ala Asn Ala Val Gln Leu Pro Ala Trp
580 585 590
Asp Thr Phe Lys Ile Thr Ala Thr Asn Leu Ala Ala Asp Gln Tyr His
595 600 605
Ile Arg Val Arg Tyr Ala Ser Thr Ala Ala Tyr Pro Thr Asn Ala Lys
610 615 620
Tyr Pro Leu Ser Tyr Tyr Val Leu Ala Gly Asp Thr Ile Leu Gln Gln
625 630 635 640
Gly Lys Trp Glu Ala Ile Asp Thr Ser Gln Ala Asp Ile Thr Lys Gln
645 650 655
Val Tyr Val Thr Gly Gln Gln Gly Asn Tyr Ile Leu Thr Asp Ile Leu
660 665 670
Gly Glu Asn Pro Ile Thr Leu Pro Lys Asp Glu Ile Thr Val Gln Leu
675 680 685
Leu Gln Gly Asn Pro Ala Gly Gly Met Thr Lys Val Leu Ile Asp Arg
690 695 700
Ile Glu Phe Val Pro Ala
705 710
<210> 24
<211> 209
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 24
Leu Thr Ser Tyr Asn Ile Val Leu Ser Pro Lys Lys Asp Gly Met Arg
1 5 10 15
Leu Phe Trp Gln Lys Ser Asn Asp Phe Glu Arg Arg Met Ile Met Gln
20 25 30
Asp Phe Asp Tyr Phe Glu Gln Pro His Ser Asp Glu Ser Ile Pro Arg
35 40 45
Gln Pro Asp Pro Arg Pro Lys Gln Ala Pro Gln Pro Thr Val Ser Phe
50 55 60
Gln Cys Val Val Glu Ile Pro Arg Gly Tyr Arg Val Val Ser His Gly
65 70 75 80
Leu Thr Gln Phe Leu Tyr Asp Met Ser Gly Leu Ser Ile Ser Gln Glu
85 90 95
Thr Gly Lys Lys Thr Ile Asn Val Glu Glu Tyr Gly Gln Ile Glu Val
100 105 110
Asp Leu His Thr Leu Ser Leu Arg Gly Cys Ile Ser Ile Ser Thr Asn
115 120 125
Ile Asp Ile Gln Pro Glu Gln Gln Thr His Ile Gly Phe Val Gly His
130 135 140
Asn Thr Asn Thr Ile Gln Phe Tyr Phe Thr Asp Ile Ile Ser Ile Ser
145 150 155 160
Thr Val Leu Gln His Ser Val Thr Pro Leu Pro His Tyr Thr Val Asp
165 170 175
His Arg His Ile Lys Val Asp Asn Leu Gln Leu Glu Pro Phe Asp Thr
180 185 190
Ala Asn Pro Arg Leu Trp Lys Leu Thr Gly Leu Phe Ala Phe Leu Met
195 200 205
Lys
<210> 25
<211> 1177
<212> PRT
<213> Unknown
<220>
<223> Unknown organism from environmental sample
<400> 25
Met Asp Cys Asn Leu Gln Ser Gln Gln Asn Ile Pro Tyr Asn Val Leu
1 5 10 15
Ala Ile Pro Val Ser Asn Val Asn Ser Leu Thr Asp Thr Ile Gly Asp
20 25 30
Leu Lys Lys Ala Trp Glu Glu Phe Gln Lys Thr Gly Ser Phe Ser Leu
35 40 45
Thr Ala Leu Gln Gln Gly Phe Asn Ala Ala Gln Gly Gly Thr Phe Asn
50 55 60
Tyr Leu Thr Leu Leu Gln Ser Gly Ile Ser Leu Ala Gly Ser Phe Val
65 70 75 80
Pro Gly Gly Thr Phe Val Ala Pro Ile Ile Asn Met Val Ile Gly Trp
85 90 95
Leu Trp Pro His Lys Asn Lys Asn Ala Asp Thr Glu Asn Leu Ile Asn
100 105 110
Leu Ile Asp Ser Glu Ile Gln Lys Gln Leu Asn Lys Ala Leu Leu Asp
115 120 125
Gln Asp Arg Gln Asp Trp Ile Ser Tyr Leu Glu Ser Ile Phe Asp Ser
130 135 140
Ser Asn Asn Leu Asn Gly Ala Ile Ile Asp Ala Gln Trp Ser Gly Thr
145 150 155 160
Val Asn Thr Thr Asn Arg Thr Leu Arg Asn Pro Thr Glu Ser Asp Tyr
165 170 175
Leu Asn Val Val Thr Asn Phe Ile Ala Ala Asp Gly Asp Ile Ser Ser
180 185 190
Asn Glu His His Ile Met Asn Gly Asn Phe Asp Val Ala Ala Ala Pro
195 200 205
Tyr Phe Val Ile Gly Ala Thr Ala Arg Phe Ala Thr Met Gln Ser Tyr
210 215 220
Ile Lys Phe Cys Asn Ala Trp Ile Asp Lys Val Gly Leu Ser Asp Ser
225 230 235 240
Gln Leu Thr Thr Gln Lys Ala Asn Leu Asp Arg Thr Lys Gln Thr Ile
245 250 255
Arg Asn Ala Ile Leu Ser Tyr Thr Gln Gln Val Met Lys Val Phe Lys
260 265 270
Asp Pro Lys Asn Met Pro Thr Ile Asp Thr Asn Lys Phe Ser Val Asp
275 280 285
Thr Tyr Asn Val Tyr Val Lys Gly Met Thr Leu Asn Val Leu Asp Ile
290 295 300
Val Ala Ile Trp Pro Ser Leu Tyr Pro Asp Asp Tyr Thr Ser Gln Thr
305 310 315 320
Ser Leu Glu Gln Thr Arg Val Thr Phe Ser Asn Met Val Gly Gln Glu
325 330 335
Glu Gly Thr Asp Gly Thr Val Thr Ile Tyr Asn Thr Phe Asp Ser Asp
340 345 350
Ser Tyr Arg His Lys Ala Ile Pro Asn Asn Asp Val Asn Leu Leu Ser
355 360 365
Tyr Phe Pro Asp Glu Leu Gln Asn Val Gln Leu Ala Leu Tyr Thr Pro
370 375 380
Ser Lys Lys Asp Ser Gly Tyr Thr Tyr Pro Tyr Ala Phe Ile Ser Asn
385 390 395 400
Tyr Gln Tyr Ser Arg Tyr Gln Tyr Gly Asp Asn Ala Pro Glu Gly Pro
405 410 415
Leu Ser Thr Leu Ser Ala Pro Ile Gln Ser Ile Asn Ala Ala Thr Gln
420 425 430
Asn Thr Lys Tyr Leu Asp Gly Glu Thr Ile Asn Gly Ile Gly Ala Asn
435 440 445
Leu Pro Gly Tyr Cys Thr Ser Asp Cys Ser Pro Thr Gln Gln Pro Phe
450 455 460
Ser Cys Thr Ser Thr Val Asn Gln Phe Lys Gln Ser Cys Asn Pro Thr
465 470 475 480
Asp Thr Asn Gln Lys Ile Asn Ala Leu Tyr Ala Phe Thr Gln Thr Asn
485 490 495
Val Lys Gly Asn Gln Gly Lys Leu Gly Leu Leu Ala Ser His Val Pro
500 505 510
Tyr Asp Leu Asn Pro Lys Asn Ile Phe Gly Glu Leu Asp Pro Asp Thr
515 520 525
Asn Asn Val Ile Leu Lys Gly Ile Pro Ala Glu Lys Gly Tyr Phe Ser
530 535 540
Asn Asn Thr Arg Pro Asn Ile Val Lys Glu Trp Ile Asn Gly Ala Asn
545 550 555 560
Ala Val Pro Leu Asp Ser Gly Asn Thr Leu Phe Met Thr Ala Thr Asn
565 570 575
Val Thr Ala Thr Gln Tyr Lys Ile Arg Ile Arg Tyr Ala Asn Pro Asn
580 585 590
Ser Asn Thr Gln Ile Gly Val Gln Ile Ser Gln Asn Gly Ser Ile Ile
595 600 605
Ser Asn Ser Asn Leu Thr Leu Tyr Ser Thr Thr Asp Met Asn Asn Thr
610 615 620
Leu Pro Leu Asn Val Tyr Val Thr Gly Glu Asn Gly Asn Tyr Thr Leu
625 630 635 640
Leu Asp Leu Tyr Ser Thr Thr Asn Val Leu Ser Thr Gly Asp Ile Thr
645 650 655
Leu Lys Leu Thr Gly Gly Asp Gln Lys Ile Phe Ile Asp Arg Ile Glu
660 665 670
Phe Val Pro Thr Met Pro Val Pro Gly Asn Thr Asn Asn Asn Asn Gly
675 680 685
Asp Asn Gly Asn Asn Asn Pro Val His His Val Cys Ala Ile Ala Gly
690 695 700
Thr Gln Gln Ser Cys Ser Gly Pro Pro Lys Phe Glu Gln Val Ser Asn
705 710 715 720
Leu Glu Lys Ile Thr Thr Gln Val Tyr Met Leu Phe Lys Ser Ser Pro
725 730 735
Tyr Glu Glu Leu Ala Pro Glu Val Ser Ser Tyr Gln Ile Ser Gln Val
740 745 750
Ala Leu Lys Val Met Ala Leu Ser Asp Glu Leu Phe Cys Lys Glu Lys
755 760 765
Ser Leu Leu Arg Lys Leu Val Asn Lys Ala Lys Gln Leu Leu Glu Ala
770 775 780
Ser Asn Leu Leu Val Gly Gly Asn Phe Glu Thr Thr Gln Asn Trp Val
785 790 795 800
Leu Gly Thr Asn Ala Tyr Ile Asn Tyr Asp Ser Phe Leu Phe Asn Gly
805 810 815
Asn Tyr Leu Ser Leu Gln Pro Ala Ser Gly Phe Phe Thr Ser Tyr Ala
820 825 830
Tyr Gln Lys Ile Asp Glu Ser Thr Leu Lys Pro Tyr Thr Arg Tyr Lys
835 840 845
Val Ser Gly Phe Ile Gly Gln Ser Asn Gln Val Glu Leu Ile Ile Ser
850 855 860
Arg Tyr Gly Lys Glu Ile Asp Lys Ile Leu Asn Val Pro Tyr Ala Gly
865 870 875 880
Pro Leu Pro Ile Thr Ala Asn Ala Leu Ile Thr Cys Cys Ala Pro Glu
885 890 895
Ile Gly Gln Cys Asp Gly Lys Gln Ser Asp Ser His Phe Phe Asn Tyr
900 905 910
Ser Ile Asp Val Gly Ala Leu His Leu Glu Leu Asn Pro Gly Ile Glu
915 920 925
Ile Gly Leu Lys Ile Val Gln Ser Asn Gly Tyr Ile Thr Ile Ser Asn
930 935 940
Leu Glu Ile Ile Glu Glu Arg Pro Leu Thr Glu Met Glu Ile Gln Ala
945 950 955 960
Val Asn Arg Lys Ser Gln Lys Trp Glu Arg Glu Lys Leu Leu Glu Cys
965 970 975
Ala Ser Ile Asn Glu Leu Leu Gln Pro Val Ile Asn Gln Ile Asp Ser
980 985 990
Leu Phe Lys Asp Gly Asn Trp Tyr Asn Asp Ile Leu Pro His Ile Thr
995 1000 1005
Tyr Gln Asp Leu Lys Asn Ile Ile Ile Pro Glu Leu Pro Lys Leu
1010 1015 1020
Lys His Trp Phe Ile Glu Asp Leu Pro Gly Glu Tyr His Glu Ile
1025 1030 1035
Glu Gln Lys Met Lys Glu Ala Leu Lys His Ala Phe Thr Gln Leu
1040 1045 1050
Asp Glu Arg Asn Leu Ile His Asn Gly His Phe Thr Ile Asn Leu
1055 1060 1065
Ile Asp Trp Gln Val Glu Gly Asp Ala Gln Met Lys Val Leu Glu
1070 1075 1080
Asn Asp Ala Leu Ala Leu Gln Leu Phe Asn Trp Asp Ser Ser Ala
1085 1090 1095
Ser Gln Ser Ile Lys Ile Leu Glu Phe Asp Glu Asp Lys Ala Tyr
1100 1105 1110
Lys Leu Arg Val Tyr Ala Gln Gly Asn Gly Thr Ile Gln Phe Gly
1115 1120 1125
Asn Cys Glu Asp Glu Ala Ile Gln Phe Asn Thr Asn Ser Phe Ile
1130 1135 1140
Tyr Gln Glu Lys Ile Val Tyr Phe Asp Thr Pro Ser Ile Asn Val
1145 1150 1155
Lys Ile Gln Ser Glu Gly Ala Glu Phe Val Val Ser Ser Val Glu
1160 1165 1170
Leu Ile Glu Leu
1175
<210> 26
<211> 676
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 26
Met Asp Cys Asn Leu Gln Ser Gln Gln Asn Ile Pro Tyr Asn Val Leu
1 5 10 15
Ala Ile Pro Val Ser Asn Val Asn Ser Leu Thr Asp Thr Ile Gly Asp
20 25 30
Leu Lys Lys Ala Trp Glu Glu Phe Gln Lys Thr Gly Ser Phe Ser Leu
35 40 45
Thr Ala Leu Gln Gln Gly Phe Asn Ala Ala Gln Gly Gly Thr Phe Asn
50 55 60
Tyr Leu Thr Leu Leu Gln Ser Gly Ile Ser Leu Ala Gly Ser Phe Val
65 70 75 80
Pro Gly Gly Thr Phe Val Ala Pro Ile Ile Asn Met Val Ile Gly Trp
85 90 95
Leu Trp Pro His Lys Asn Lys Asn Ala Asp Thr Glu Asn Leu Ile Asn
100 105 110
Leu Ile Asp Ser Glu Ile Gln Lys Gln Leu Asn Lys Ala Leu Leu Asp
115 120 125
Gln Asp Arg Gln Asp Trp Ile Ser Tyr Leu Glu Ser Ile Phe Asp Ser
130 135 140
Ser Asn Asn Leu Asn Gly Ala Ile Ile Asp Ala Gln Trp Ser Gly Thr
145 150 155 160
Val Asn Thr Thr Asn Arg Thr Leu Arg Asn Pro Thr Glu Ser Asp Tyr
165 170 175
Leu Asn Val Val Thr Asn Phe Ile Ala Ala Asp Gly Asp Ile Ser Ser
180 185 190
Asn Glu His His Ile Met Asn Gly Asn Phe Asp Val Ala Ala Ala Pro
195 200 205
Tyr Phe Val Ile Gly Ala Thr Ala Arg Phe Ala Thr Met Gln Ser Tyr
210 215 220
Ile Lys Phe Cys Asn Ala Trp Ile Asp Lys Val Gly Leu Ser Asp Ser
225 230 235 240
Gln Leu Thr Thr Gln Lys Ala Asn Leu Asp Arg Thr Lys Gln Thr Ile
245 250 255
Arg Asn Ala Ile Leu Ser Tyr Thr Gln Gln Val Met Lys Val Phe Lys
260 265 270
Asp Pro Lys Asn Met Pro Thr Ile Asp Thr Asn Lys Phe Ser Val Asp
275 280 285
Thr Tyr Asn Val Tyr Val Lys Gly Met Thr Leu Asn Val Leu Asp Ile
290 295 300
Val Ala Ile Trp Pro Ser Leu Tyr Pro Asp Asp Tyr Thr Ser Gln Thr
305 310 315 320
Ser Leu Glu Gln Thr Arg Val Thr Phe Ser Asn Met Val Gly Gln Glu
325 330 335
Glu Gly Thr Asp Gly Thr Val Thr Ile Tyr Asn Thr Phe Asp Ser Asp
340 345 350
Ser Tyr Arg His Lys Ala Ile Pro Asn Asn Asp Val Asn Leu Leu Ser
355 360 365
Tyr Phe Pro Asp Glu Leu Gln Asn Val Gln Leu Ala Leu Tyr Thr Pro
370 375 380
Ser Lys Lys Asp Ser Gly Tyr Thr Tyr Pro Tyr Ala Phe Ile Ser Asn
385 390 395 400
Tyr Gln Tyr Ser Arg Tyr Gln Tyr Gly Asp Asn Ala Pro Glu Gly Pro
405 410 415
Leu Ser Thr Leu Ser Ala Pro Ile Gln Ser Ile Asn Ala Ala Thr Gln
420 425 430
Asn Thr Lys Tyr Leu Asp Gly Glu Thr Ile Asn Gly Ile Gly Ala Asn
435 440 445
Leu Pro Gly Tyr Cys Thr Ser Asp Cys Ser Pro Thr Gln Gln Pro Phe
450 455 460
Ser Cys Thr Ser Thr Val Asn Gln Phe Lys Gln Ser Cys Asn Pro Thr
465 470 475 480
Asp Thr Asn Gln Lys Ile Asn Ala Leu Tyr Ala Phe Thr Gln Thr Asn
485 490 495
Val Lys Gly Asn Gln Gly Lys Leu Gly Leu Leu Ala Ser His Val Pro
500 505 510
Tyr Asp Leu Asn Pro Lys Asn Ile Phe Gly Glu Leu Asp Pro Asp Thr
515 520 525
Asn Asn Val Ile Leu Lys Gly Ile Pro Ala Glu Lys Gly Tyr Phe Ser
530 535 540
Asn Asn Thr Arg Pro Asn Ile Val Lys Glu Trp Ile Asn Gly Ala Asn
545 550 555 560
Ala Val Pro Leu Asp Ser Gly Asn Thr Leu Phe Met Thr Ala Thr Asn
565 570 575
Val Thr Ala Thr Gln Tyr Lys Ile Arg Ile Arg Tyr Ala Asn Pro Asn
580 585 590
Ser Asn Thr Gln Ile Gly Val Gln Ile Ser Gln Asn Gly Ser Ile Ile
595 600 605
Ser Asn Ser Asn Leu Thr Leu Tyr Ser Thr Thr Asp Met Asn Asn Thr
610 615 620
Leu Pro Leu Asn Val Tyr Val Thr Gly Glu Asn Gly Asn Tyr Thr Leu
625 630 635 640
Leu Asp Leu Tyr Ser Thr Thr Asn Val Leu Ser Thr Gly Asp Ile Thr
645 650 655
Leu Lys Leu Thr Gly Gly Asp Gln Lys Ile Phe Ile Asp Arg Ile Glu
660 665 670
Phe Val Pro Thr
675
<210> 27
<211> 1242
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 27
Met Asn Gly Gly Glu Asn Met Asn Gln Asn Asp Gln Asn Glu Ile Gln
1 5 10 15
Ile Ile Asp Ser Ser Ser Asn Asp Phe Asn Gln Ser Asn Arg Tyr Pro
20 25 30
Arg Tyr Pro Leu Ala Glu Glu Ser Asn Tyr Lys Asp Trp Leu Ala Asn
35 40 45
Cys Lys Glu Ser Asn Leu Asp Arg Leu Ser Thr Pro Ser Ser Val Gln
50 55 60
Asp Ala Val Val Thr Ser Leu Asn Ile Phe Ser Tyr Ile Phe Gly Phe
65 70 75 80
Leu Asp Phe Gly Ala Thr Ser Ala Gly Leu Gly Ile Leu Gly Ala Leu
85 90 95
Phe Gly Val Phe Trp Pro Ser Ser Asn Asn Gly Ile Ser Glu Asp Leu
100 105 110
Leu Arg Ser Ile Glu Ile Leu Ile Asp Arg Arg Ile Asn Glu Val Glu
115 120 125
Arg Asn Arg Ile Thr Val Gln Phe Glu Gly Leu Arg Ala Val Met Ser
130 135 140
Asn Tyr Asn Thr Ala Leu Arg Asn Trp Asn Ala Asn Arg Asn Asn Pro
145 150 155 160
Ala Leu Gln Ser Glu Val Arg Ser Arg Phe Asp Asn Ala Asp Asp Ala
165 170 175
Phe Ala Gly Arg Met Pro Glu Phe Arg Ile Pro Gly Phe Glu Ile Gln
180 185 190
Ser Leu Ser Val Tyr Ala Gln Ala Ala Thr Leu His Leu Leu Leu Leu
195 200 205
Arg Asp Ala Val Val Asn Gly Ser Leu Trp Gly Phe Asp Gly Glu Thr
210 215 220
Ile Glu Ser Tyr Tyr Thr Lys Leu Val Cys Leu Ile Asp Ala Tyr Ala
225 230 235 240
Asp His Cys Ala Ser Phe Tyr Arg Gln Gly Leu Gln Glu Leu Arg Asp
245 250 255
Arg Gly Asn Trp Asn Ala Phe Asn Asn Tyr Arg Arg Asp Met Thr Ile
260 265 270
Thr Val Leu Asp Ile Ile Ser Leu Phe Ser Asn Tyr Asp Pro Arg Arg
275 280 285
Tyr Thr Ile Asn Thr Asn Thr Gln Leu Thr Arg Glu Val Tyr Thr Glu
290 295 300
Pro Val Ala Thr Pro Gly Trp Leu Asn Leu Asn Ser Asn Pro Asp Gln
305 310 315 320
Phe Gln Gln Ile Glu Asn Asp Leu Ile Arg Pro Leu Arg Pro Ser Ser
325 330 335
Thr Leu Ala Ala Leu Ser Ala Glu Thr Ala Phe Ala Phe Phe Gln Ser
340 345 350
Gly Gln Thr Pro Leu Arg Ala Ile Leu Arg Thr Arg Thr Ser His Phe
355 360 365
Asn Ile Ser Gly Gly Phe Gly Thr Asp Pro Trp Gln Gly Ser Pro Asn
370 375 380
Pro Thr Gly Pro Ser Glu Ser Gln Leu Gln Phe Glu Asn Phe Asp Val
385 390 395 400
Tyr His Val Ser Ser Glu Val Thr Arg Glu Val Ser Ser Glu Asn Ser
405 410 415
Gly Leu Phe Gly Pro Gln Arg Ile Asn Phe Arg Met Val Ser Ile Ser
420 425 430
Gly Val Val Ala Ser Arg Glu Ile Ser Ser Pro Pro Phe Gly Arg Tyr
435 440 445
Ile Thr Asn Ile Ser Ser Ser Ile Pro Gly Ile Arg Ser Ala Asn Pro
450 455 460
Thr Ala Ser Asp Tyr Thr His Arg Leu Ser Ser Ile Arg Ser Thr Ser
465 470 475 480
Leu Arg Thr Met Tyr Arg Asp Arg Thr Asn Ile Ile Ala Tyr Gly Trp
485 490 495
Thr His Thr Ser Leu Glu Leu Thr Asn Arg Leu Leu Pro Asn Arg Ile
500 505 510
Thr Gln Ile Pro Ala Val Lys Ser Ser Ile Thr Pro Leu Pro Ile Val
515 520 525
Gly Gly Pro Gly His Thr Gly Gly Gly Leu Val Ser Leu Ala Gly Tyr
530 535 540
Gly Arg Tyr Met Leu Ile Asn Phe Ile Ser Pro Tyr Ser Pro Ser Arg
545 550 555 560
Gln Arg Tyr Arg Leu Arg Ile Arg Phe Val Lys Asp Phe Asp Asp Ala
565 570 575
His Leu Arg Val Ser Ile Ser Ala Ala Asn Phe Tyr Gln Asn Val Val
580 585 590
Leu Pro Gly Ser Arg Thr Ser Val Asn Pro Asn Leu Ser Phe Arg Asp
595 600 605
Phe Arg Tyr Val Asp Val Pro Gly Glu Phe Glu Leu Val Ala Asn Arg
610 615 620
Ser Tyr Ser Ile Asn Leu Gly Ala Pro Thr Ser Ser Ile Pro Gly Thr
625 630 635 640
Cys Leu Ile Asp Lys Ile Glu Phe Ile Pro Val Asn Ser Ala Ala Leu
645 650 655
Glu Tyr Glu Gly Glu Gln Ser Leu Lys Lys Ala Lys Lys Ala Val Asp
660 665 670
Asp Leu Phe Thr Asn Asp Leu Lys Asn Asp Leu Lys Ile Glu Thr Thr
675 680 685
Asp Tyr Asp Val Asp Gln Ala Ala Asn Leu Val Glu Cys Val Pro Glu
690 695 700
Glu Leu Tyr Ala Lys Glu Lys Met Ile Leu Leu Asp Glu Val Lys His
705 710 715 720
Ala Lys Gln Leu Ser Glu Ser Arg Asn Leu Ile Gln Asn Gly Asn Phe
725 730 735
Glu Phe Tyr Thr Asp Lys Trp Thr Thr Ser Asn Asn Ala Ser Ile Gln
740 745 750
Ala Asp Asn Pro Ile Phe Lys Gly Asn Tyr Leu Asn Met Pro Gly Ala
755 760 765
Arg Glu Thr Glu Ala Gly Ala Thr Thr Phe Pro Thr Tyr Val Phe Gln
770 775 780
Lys Ile Asp Glu Ser Lys Leu Lys Pro Tyr Thr Arg Tyr Lys Ala Arg
785 790 795 800
Gly Phe Val Gly Gly Ser Lys Glu Val Arg Leu Ile Val Ala Arg Tyr
805 810 815
Ala Glu Glu Val Asp Ala Ile Met Asn Val Arg Asn Asp Leu Thr Pro
820 825 830
Gly Val Pro Val Ala Ser Cys Gly Glu Phe Asp Arg Cys His Pro Gln
835 840 845
Ala Tyr Pro Ile Leu Gln Asp Arg Cys His Asp Asp Arg Ile Glu Gln
850 855 860
His Ala Asp Gly Gly Ser His Ser Cys His Gly Lys Ser Arg Glu Lys
865 870 875 880
His Ala Ile Cys His Asp Ser His Gln Phe Asp Phe His Ile Asp Thr
885 890 895
Gly Glu Val Tyr Leu Thr Glu Asn Pro Gly Ile Trp Val Leu Phe Lys
900 905 910
Ile Ser Ser Pro Glu Gly Tyr Ala Thr Leu Asp Asn Ile Glu Leu Ile
915 920 925
Glu Glu Gly Pro Leu Val Gly Glu Ser Leu Ala Leu Val Lys Lys Arg
930 935 940
Glu Lys Lys Trp Lys Asn Glu Met Glu Thr Lys Trp Ile His Thr Lys
945 950 955 960
Glu Ala Tyr Glu Lys Ala Lys Trp Ala Val Asp Ala Leu Phe Thr Thr
965 970 975
Pro Gln Asp Thr Ala Leu Lys Phe Glu Thr Ile Ile Ser Asn Ile Ile
980 985 990
Ser Ala Glu His Leu Val Gln Ser Ile Pro Tyr Val Tyr Asn Lys Trp
995 1000 1005
Leu Ala Asp Val Pro Gly Met Asn Tyr Asp Ile Tyr Thr Glu Leu
1010 1015 1020
Lys Arg Arg Ile Thr Gln Ala Tyr Ser Leu Tyr Glu Arg Arg Asn
1025 1030 1035
Ile Ile Lys Asn Gly Asp Phe His Gln Gly Leu Ala His Trp His
1040 1045 1050
Ala Thr Pro His Ala Arg Val Gln Gln Ile Asp Gly Thr Ala Val
1055 1060 1065
Leu Val Ile Pro Asn Trp Gly Ser Asn Val Ser Gln Asn Leu Cys
1070 1075 1080
Val Glu His Asn Arg Gly Tyr Val Leu Arg Val Thr Ala Lys Lys
1085 1090 1095
Glu Asp Met Gly Lys Gly Tyr Val Thr Ile Ser Asp Cys Ala Lys
1100 1105 1110
Asn Thr Glu Thr Leu Thr Phe Thr Ser Cys Asp Tyr Val Ser Asn
1115 1120 1125
Thr Ser Thr Ser Asp His Pro Arg Thr Cys Asn Ser Cys Asp Asp
1130 1135 1140
Tyr Val Ser Asn Glu Ser Ile Asn Asp Gln Pro Asp Tyr Leu Leu
1145 1150 1155
Asn Gln Glu Arg Asn Glu Ser Arg Cys Tyr Asn Arg Asn Ala Ser
1160 1165 1170
Thr Ser Glu Glu Ser Asp Tyr Leu Leu Asn Gln Lys Arg Asn Glu
1175 1180 1185
Gln Trp Ser Asn His Ser Asp Ala Thr Val Asn Glu Gln Met Asn
1190 1195 1200
Tyr Val Thr Arg Thr Ile Asp Phe Phe Pro Asp Thr Asp Arg Val
1205 1210 1215
Arg Ile Asp Ile Gly Glu Thr Glu Gly Ile Phe Lys Val Glu Ser
1220 1225 1230
Val Glu Leu Ile Cys Met Glu Gly Gln
1235 1240
<210> 28
<211> 651
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 28
Met Asn Gly Gly Glu Asn Met Asn Gln Asn Asp Gln Asn Glu Ile Gln
1 5 10 15
Ile Ile Asp Ser Ser Ser Asn Asp Phe Asn Gln Ser Asn Arg Tyr Pro
20 25 30
Arg Tyr Pro Leu Ala Glu Glu Ser Asn Tyr Lys Asp Trp Leu Ala Asn
35 40 45
Cys Lys Glu Ser Asn Leu Asp Arg Leu Ser Thr Pro Ser Ser Val Gln
50 55 60
Asp Ala Val Val Thr Ser Leu Asn Ile Phe Ser Tyr Ile Phe Gly Phe
65 70 75 80
Leu Asp Phe Gly Ala Thr Ser Ala Gly Leu Gly Ile Leu Gly Ala Leu
85 90 95
Phe Gly Val Phe Trp Pro Ser Ser Asn Asn Gly Ile Ser Glu Asp Leu
100 105 110
Leu Arg Ser Ile Glu Ile Leu Ile Asp Arg Arg Ile Asn Glu Val Glu
115 120 125
Arg Asn Arg Ile Thr Val Gln Phe Glu Gly Leu Arg Ala Val Met Ser
130 135 140
Asn Tyr Asn Thr Ala Leu Arg Asn Trp Asn Ala Asn Arg Asn Asn Pro
145 150 155 160
Ala Leu Gln Ser Glu Val Arg Ser Arg Phe Asp Asn Ala Asp Asp Ala
165 170 175
Phe Ala Gly Arg Met Pro Glu Phe Arg Ile Pro Gly Phe Glu Ile Gln
180 185 190
Ser Leu Ser Val Tyr Ala Gln Ala Ala Thr Leu His Leu Leu Leu Leu
195 200 205
Arg Asp Ala Val Val Asn Gly Ser Leu Trp Gly Phe Asp Gly Glu Thr
210 215 220
Ile Glu Ser Tyr Tyr Thr Lys Leu Val Cys Leu Ile Asp Ala Tyr Ala
225 230 235 240
Asp His Cys Ala Ser Phe Tyr Arg Gln Gly Leu Gln Glu Leu Arg Asp
245 250 255
Arg Gly Asn Trp Asn Ala Phe Asn Asn Tyr Arg Arg Asp Met Thr Ile
260 265 270
Thr Val Leu Asp Ile Ile Ser Leu Phe Ser Asn Tyr Asp Pro Arg Arg
275 280 285
Tyr Thr Ile Asn Thr Asn Thr Gln Leu Thr Arg Glu Val Tyr Thr Glu
290 295 300
Pro Val Ala Thr Pro Gly Trp Leu Asn Leu Asn Ser Asn Pro Asp Gln
305 310 315 320
Phe Gln Gln Ile Glu Asn Asp Leu Ile Arg Pro Leu Arg Pro Ser Ser
325 330 335
Thr Leu Ala Ala Leu Ser Ala Glu Thr Ala Phe Ala Phe Phe Gln Ser
340 345 350
Gly Gln Thr Pro Leu Arg Ala Ile Leu Arg Thr Arg Thr Ser His Phe
355 360 365
Asn Ile Ser Gly Gly Phe Gly Thr Asp Pro Trp Gln Gly Ser Pro Asn
370 375 380
Pro Thr Gly Pro Ser Glu Ser Gln Leu Gln Phe Glu Asn Phe Asp Val
385 390 395 400
Tyr His Val Ser Ser Glu Val Thr Arg Glu Val Ser Ser Glu Asn Ser
405 410 415
Gly Leu Phe Gly Pro Gln Arg Ile Asn Phe Arg Met Val Ser Ile Ser
420 425 430
Gly Val Val Ala Ser Arg Glu Ile Ser Ser Pro Pro Phe Gly Arg Tyr
435 440 445
Ile Thr Asn Ile Ser Ser Ser Ile Pro Gly Ile Arg Ser Ala Asn Pro
450 455 460
Thr Ala Ser Asp Tyr Thr His Arg Leu Ser Ser Ile Arg Ser Thr Ser
465 470 475 480
Leu Arg Thr Met Tyr Arg Asp Arg Thr Asn Ile Ile Ala Tyr Gly Trp
485 490 495
Thr His Thr Ser Leu Glu Leu Thr Asn Arg Leu Leu Pro Asn Arg Ile
500 505 510
Thr Gln Ile Pro Ala Val Lys Ser Ser Ile Thr Pro Leu Pro Ile Val
515 520 525
Gly Gly Pro Gly His Thr Gly Gly Gly Leu Val Ser Leu Ala Gly Tyr
530 535 540
Gly Arg Tyr Met Leu Ile Asn Phe Ile Ser Pro Tyr Ser Pro Ser Arg
545 550 555 560
Gln Arg Tyr Arg Leu Arg Ile Arg Phe Val Lys Asp Phe Asp Asp Ala
565 570 575
His Leu Arg Val Ser Ile Ser Ala Ala Asn Phe Tyr Gln Asn Val Val
580 585 590
Leu Pro Gly Ser Arg Thr Ser Val Asn Pro Asn Leu Ser Phe Arg Asp
595 600 605
Phe Arg Tyr Val Asp Val Pro Gly Glu Phe Glu Leu Val Ala Asn Arg
610 615 620
Ser Tyr Ser Ile Asn Leu Gly Ala Pro Thr Ser Ser Ile Pro Gly Thr
625 630 635 640
Cys Leu Ile Asp Lys Ile Glu Phe Ile Pro Val
645 650
<210> 29
<211> 661
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 29
Met Asn Lys Asn Asn Asn Ser Ser Glu Leu Asp Ile Asn Asp Thr Gly
1 5 10 15
Ala Ser His Pro Ser Asn Ser Ile Val Glu Val Pro Cys Ser Glu Cys
20 25 30
Gly Gly Ile Asp Tyr Arg Asp Val Met Asn Met Ser Thr Gln Asn Ala
35 40 45
Tyr Val Glu Gln Gln Ser Val Val Gly Ala Ala Thr Thr Thr Ser Phe
50 55 60
Gln Val Ile Arg Glu Leu Tyr Arg Ser Asn Lys Asp Ile Leu Ser Thr
65 70 75 80
Ala Gly Gly Ile Ile Lys Ser Ile Ser Val Phe Leu Trp Lys Lys Glu
85 90 95
His Ala Gln Thr Gln Trp Glu Glu Phe Met Lys Ser Val Glu Leu Met
100 105 110
Ile Asp Glu Lys Ile Asp Thr Ala Val Met Ala Ala Ala Ile Ser Arg
115 120 125
Val Ala Ser Leu Glu Ala Leu Leu Ala Val His Gln Asp Tyr Leu Asp
130 135 140
Ile Leu Ala Ala Asp Pro Ser Asn Pro Asp Leu Gln Glu Leu Val Ser
145 150 155 160
Asp His Val Val Asn Leu Gln Asp Phe Phe Val Tyr Thr Met Pro Gln
165 170 175
Leu Ser Gln Ser Pro Tyr Gln Val Gln Gln Leu Val Ala Tyr Ala Glu
180 185 190
Ala Ala Asn Leu His Leu Ala Phe Leu Arg Asp Val Val Met Phe Gly
195 200 205
Ser Gln Trp Gly Met Gln Gln Ala Thr Ile Asp Thr Tyr Tyr Lys Tyr
210 215 220
Leu Glu Asn Asn Ile Lys Val Tyr Thr Asn Tyr Cys Val Asn Thr Tyr
225 230 235 240
Asn Ile Gly Leu Glu Lys Thr Lys Asn Leu Met Gly Ser Ile Asn Pro
245 250 255
Lys Asn Tyr Asp Lys Tyr Pro Tyr Leu Asn Pro His Ala Asn Asn Ser
260 265 270
Thr Asn Asn Tyr Lys Lys Val Leu Gln Trp Asn Val Trp Asn Asp Phe
275 280 285
Arg Arg Asn Met Thr Ile Met Val Leu Asp Leu Val Val Val Phe Pro
290 295 300
Thr Tyr Asp Pro Lys Val Tyr Thr Asn Leu Glu Gly Ile Glu Ile Glu
305 310 315 320
Leu Ser Arg Glu Val Tyr Ser Thr Ser Tyr Asn Arg Tyr Gly Asp Ser
325 330 335
Thr His Gly Gly Asp Trp Lys Gly Trp Glu Ile Leu Glu Thr Ser Val
340 345 350
Val Arg Pro Pro His Leu Val Ser Phe Leu Ser Asn Leu Ala Phe Tyr
355 360 365
Met Ala Asn Tyr Ala Asn Ala Pro Thr Asn Thr Gly Leu Asp Gln Tyr
370 375 380
Asn Gly Val Ile Asn Thr Leu Arg Asn Ile Asp Ser Thr Ser Thr Trp
385 390 395 400
Glu Asp Gly Phe Ala Pro Ile Thr Lys Lys Glu Tyr Thr Thr Val Lys
405 410 415
Asp Ile Pro Gly Asp Phe Ile Lys Glu Ile Asn Met Asn Ser Gly Val
420 425 430
Asp Pro Cys Phe Leu Asn Phe Ile Thr Asn Ser Ser Gly Thr His Ser
435 440 445
Val Gly Thr Cys Tyr Arg Ser Pro Ile Gln Gly Pro Ile Tyr Asn Leu
450 455 460
Ser Ala Thr Ile Pro Tyr His Arg Leu Ser Tyr Val Asn Gly Met Asp
465 470 475 480
Asp Asn Phe Ser Glu Ser Phe Ser Gly Trp Gly Ile Ser Ala Trp Gly
485 490 495
Phe Gly Trp Leu His Glu Ser Leu Lys Thr Ser Asn Val Ile His Ala
500 505 510
Thr Arg Ser Ser Phe Ile Pro Ala Val Lys Ser Tyr Gly Leu Trp Gly
515 520 525
Gly Ala Thr Val Ile Lys Gly Pro Gly Ser Thr Gly Gly Asp Leu Val
530 535 540
Arg Leu Pro Ala Ala Pro Gly Val Ile Gln Val Arg Leu Pro Val Thr
545 550 555 560
Arg Gln Ala Val Lys Gln Tyr Lys Ile Arg Ile Arg Tyr Ala Ala Gln
565 570 575
Glu Asn Gly Glu Leu Phe Val Gly Lys Trp Ala Asn Arg Trp Trp Glu
580 585 590
Thr Ala Asn Leu Ser Leu Ile Lys Thr Thr Ser Ser Asn Thr Pro Ser
595 600 605
Tyr Ser Glu Phe Lys Leu Ile Asp Ala Phe Ile Met Thr Thr Asn Glu
610 615 620
Ser Ala Phe Gln Ile Glu Leu Arg Asn Asp Ser Gly Gly Pro Ile Leu
625 630 635 640
Ile Asp Arg Ile Glu Phe Ile Pro Ile Gln Gly Thr Leu Glu Gly Tyr
645 650 655
Glu Ala Thr Gln Asn
660
<210> 30
<211> 649
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 30
Met Asn Lys Asn Asn Asn Ser Ser Glu Leu Asp Ile Asn Asp Thr Gly
1 5 10 15
Ala Ser His Pro Ser Asn Ser Ile Val Glu Val Pro Cys Ser Glu Cys
20 25 30
Gly Gly Ile Asp Tyr Arg Asp Val Met Asn Met Ser Thr Gln Asn Ala
35 40 45
Tyr Val Glu Gln Gln Ser Val Val Gly Ala Ala Thr Thr Thr Ser Phe
50 55 60
Gln Val Ile Arg Glu Leu Tyr Arg Ser Asn Lys Asp Ile Leu Ser Thr
65 70 75 80
Ala Gly Gly Ile Ile Lys Ser Ile Ser Val Phe Leu Trp Lys Lys Glu
85 90 95
His Ala Gln Thr Gln Trp Glu Glu Phe Met Lys Ser Val Glu Leu Met
100 105 110
Ile Asp Glu Lys Ile Asp Thr Ala Val Met Ala Ala Ala Ile Ser Arg
115 120 125
Val Ala Ser Leu Glu Ala Leu Leu Ala Val His Gln Asp Tyr Leu Asp
130 135 140
Ile Leu Ala Ala Asp Pro Ser Asn Pro Asp Leu Gln Glu Leu Val Ser
145 150 155 160
Asp His Val Val Asn Leu Gln Asp Phe Phe Val Tyr Thr Met Pro Gln
165 170 175
Leu Ser Gln Ser Pro Tyr Gln Val Gln Gln Leu Val Ala Tyr Ala Glu
180 185 190
Ala Ala Asn Leu His Leu Ala Phe Leu Arg Asp Val Val Met Phe Gly
195 200 205
Ser Gln Trp Gly Met Gln Gln Ala Thr Ile Asp Thr Tyr Tyr Lys Tyr
210 215 220
Leu Glu Asn Asn Ile Lys Val Tyr Thr Asn Tyr Cys Val Asn Thr Tyr
225 230 235 240
Asn Ile Gly Leu Glu Lys Thr Lys Asn Leu Met Gly Ser Ile Asn Pro
245 250 255
Lys Asn Tyr Asp Lys Tyr Pro Tyr Leu Asn Pro His Ala Asn Asn Ser
260 265 270
Thr Asn Asn Tyr Lys Lys Val Leu Gln Trp Asn Val Trp Asn Asp Phe
275 280 285
Arg Arg Asn Met Thr Ile Met Val Leu Asp Leu Val Val Val Phe Pro
290 295 300
Thr Tyr Asp Pro Lys Val Tyr Thr Asn Leu Glu Gly Ile Glu Ile Glu
305 310 315 320
Leu Ser Arg Glu Val Tyr Ser Thr Ser Tyr Asn Arg Tyr Gly Asp Ser
325 330 335
Thr His Gly Gly Asp Trp Lys Gly Trp Glu Ile Leu Glu Thr Ser Val
340 345 350
Val Arg Pro Pro His Leu Val Ser Phe Leu Ser Asn Leu Ala Phe Tyr
355 360 365
Met Ala Asn Tyr Ala Asn Ala Pro Thr Asn Thr Gly Leu Asp Gln Tyr
370 375 380
Asn Gly Val Ile Asn Thr Leu Arg Asn Ile Asp Ser Thr Ser Thr Trp
385 390 395 400
Glu Asp Gly Phe Ala Pro Ile Thr Lys Lys Glu Tyr Thr Thr Val Lys
405 410 415
Asp Ile Pro Gly Asp Phe Ile Lys Glu Ile Asn Met Asn Ser Gly Val
420 425 430
Asp Pro Cys Phe Leu Asn Phe Ile Thr Asn Ser Ser Gly Thr His Ser
435 440 445
Val Gly Thr Cys Tyr Arg Ser Pro Ile Gln Gly Pro Ile Tyr Asn Leu
450 455 460
Ser Ala Thr Ile Pro Tyr His Arg Leu Ser Tyr Val Asn Gly Met Asp
465 470 475 480
Asp Asn Phe Ser Glu Ser Phe Ser Gly Trp Gly Ile Ser Ala Trp Gly
485 490 495
Phe Gly Trp Leu His Glu Ser Leu Lys Thr Ser Asn Val Ile His Ala
500 505 510
Thr Arg Ser Ser Phe Ile Pro Ala Val Lys Ser Tyr Gly Leu Trp Gly
515 520 525
Gly Ala Thr Val Ile Lys Gly Pro Gly Ser Thr Gly Gly Asp Leu Val
530 535 540
Arg Leu Pro Ala Ala Pro Gly Val Ile Gln Val Arg Leu Pro Val Thr
545 550 555 560
Arg Gln Ala Val Lys Gln Tyr Lys Ile Arg Ile Arg Tyr Ala Ala Gln
565 570 575
Glu Asn Gly Glu Leu Phe Val Gly Lys Trp Ala Asn Arg Trp Trp Glu
580 585 590
Thr Ala Asn Leu Ser Leu Ile Lys Thr Thr Ser Ser Asn Thr Pro Ser
595 600 605
Tyr Ser Glu Phe Lys Leu Ile Asp Ala Phe Ile Met Thr Thr Asn Glu
610 615 620
Ser Ala Phe Gln Ile Glu Leu Arg Asn Asp Ser Gly Gly Pro Ile Leu
625 630 635 640
Ile Asp Arg Ile Glu Phe Ile Pro Ile
645
<210> 31
<211> 404
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 31
Leu Gln Thr Lys Leu Glu Gly Asn Ile Met Lys Ser Ile Ser Lys Lys
1 5 10 15
Val Met Ala Gly Leu Leu Val Gly Ala Thr Ser Leu Ser Ile Trp Ala
20 25 30
Pro Ala Ser Gln Ala Ala Ala Pro Glu Asn Asn Arg Tyr Tyr Ser Val
35 40 45
His Leu Ala Ala Asp Thr Ala Leu Lys Trp Asp Val Tyr Gly Asn Gly
50 55 60
Thr Tyr Asp Asp Ala Gly Ile Leu Leu Tyr Asn Gly Ser Leu Gly Asp
65 70 75 80
Asn Glu Gln Phe Val Phe Phe Pro Leu Asp Gly Gly Leu Tyr Ala Ile
85 90 95
Val Asn Lys Asn Ser Gly Lys Ala Val Thr Ile Gly Gly Gly Trp Val
100 105 110
Gly Asp Asn Ala Phe Ser Tyr Asn Arg Glu Gly Leu Ile Gln Ser Lys
115 120 125
Trp Thr Gly Ala Ala Ala Asn Lys Trp Tyr Leu Arg Asp Lys Gly Asn
130 135 140
Thr Asn Tyr Glu Ile Val Asn Gln Gly Tyr Gly Lys Val Ala Ser Tyr
145 150 155 160
Ala Arg Leu Gly Ile Thr Gly Ala Lys Tyr Val Asp Leu Asp Asp Ser
165 170 175
Asn Pro Ser Asp Asn Asp Arg Leu Phe Arg Ile Asn Lys Ser Pro Leu
180 185 190
Gly Val Asp Ile Pro Gly Phe Asn Phe Asp Tyr Gly Thr Phe Ser Leu
195 200 205
Pro Gln Leu Pro Ala Thr Gly Asn Arg Pro Asp Ala Pro Asn Tyr Thr
210 215 220
Gly Gly Ile Asp Gln Gln Leu Pro Gln Thr Ser Asn Ser Val Ile Val
225 230 235 240
Gly Ala Ser Leu Val Pro Cys Ile Met Val Asn Asp Ser Gln Ala Ser
245 250 255
Asp Tyr Thr Lys Ile His Asn Ser Pro Tyr Tyr Thr Leu Val Lys Glu
260 265 270
Glu Tyr Trp Asp Lys Thr Phe Ser Ala Val Leu Pro Ala Gly Ile Thr
275 280 285
Arg Asn Tyr Ser Phe Lys Thr Gly Met Thr Ser Val Asp Gln Gln Lys
290 295 300
Met Thr Asp Thr Leu Ser Met Lys Ile Gly Ala Asp Phe Gly Leu Lys
305 310 315 320
Phe Gly Asp Ala Ser Ala Ser Leu Lys Thr Glu Ile Ser Arg Thr Ile
325 330 335
Gln Thr Glu Ile Thr Ser Thr Asn Thr Glu Ala Ser Glu Glu Thr Val
340 345 350
Thr Ser Thr Val Ala Ser Glu Ala Gly Lys Thr Thr Gly Phe Thr Glu
355 360 365
Tyr Gln Leu Ala Thr Lys Tyr Thr Leu Lys Arg Ala Asp Gly Ser Ile
370 375 380
Val Ser Asp Pro Trp Val Val Lys Asn Asn Lys Ile Thr Ile Ala Arg
385 390 395 400
Lys Asn Ala Gln
<210> 32
<211> 366
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 32
Met Pro Glu Asn Asn Arg Tyr Tyr Ser Val His Leu Ala Ala Asp Thr
1 5 10 15
Ala Leu Lys Trp Asp Val Tyr Gly Asn Gly Thr Tyr Asp Asp Ala Gly
20 25 30
Ile Leu Leu Tyr Asn Gly Ser Leu Gly Asp Asn Glu Gln Phe Val Phe
35 40 45
Phe Pro Leu Asp Gly Gly Leu Tyr Ala Ile Val Asn Lys Asn Ser Gly
50 55 60
Lys Ala Val Thr Ile Gly Gly Gly Trp Val Gly Asp Asn Ala Phe Ser
65 70 75 80
Tyr Asn Arg Glu Gly Leu Ile Gln Ser Lys Trp Thr Gly Ala Ala Ala
85 90 95
Asn Lys Trp Tyr Leu Arg Asp Lys Gly Asn Thr Asn Tyr Glu Ile Val
100 105 110
Asn Gln Gly Tyr Gly Lys Val Ala Ser Tyr Ala Arg Leu Gly Ile Thr
115 120 125
Gly Ala Lys Tyr Val Asp Leu Asp Asp Ser Asn Pro Ser Asp Asn Asp
130 135 140
Arg Leu Phe Arg Ile Asn Lys Ser Pro Leu Gly Val Asp Ile Pro Gly
145 150 155 160
Phe Asn Phe Asp Tyr Gly Thr Phe Ser Leu Pro Gln Leu Pro Ala Thr
165 170 175
Gly Asn Arg Pro Asp Ala Pro Asn Tyr Thr Gly Gly Ile Asp Gln Gln
180 185 190
Leu Pro Gln Thr Ser Asn Ser Val Ile Val Gly Ala Ser Leu Val Pro
195 200 205
Cys Ile Met Val Asn Asp Ser Gln Ala Ser Asp Tyr Thr Lys Ile His
210 215 220
Asn Ser Pro Tyr Tyr Thr Leu Val Lys Glu Glu Tyr Trp Asp Lys Thr
225 230 235 240
Phe Ser Ala Val Leu Pro Ala Gly Ile Thr Arg Asn Tyr Ser Phe Lys
245 250 255
Thr Gly Met Thr Ser Val Asp Gln Gln Lys Met Thr Asp Thr Leu Ser
260 265 270
Met Lys Ile Gly Ala Asp Phe Gly Leu Lys Phe Gly Asp Ala Ser Ala
275 280 285
Ser Leu Lys Thr Glu Ile Ser Arg Thr Ile Gln Thr Glu Ile Thr Ser
290 295 300
Thr Asn Thr Glu Ala Ser Glu Glu Thr Val Thr Ser Thr Val Ala Ser
305 310 315 320
Glu Ala Gly Lys Thr Thr Gly Phe Thr Glu Tyr Gln Leu Ala Thr Lys
325 330 335
Tyr Thr Leu Lys Arg Ala Asp Gly Ser Ile Val Ser Asp Pro Trp Val
340 345 350
Val Lys Asn Asn Lys Ile Thr Ile Ala Arg Lys Asn Ala Gln
355 360 365
<210> 33
<211> 717
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 33
Val Trp Ser Val Asp Leu Lys Glu Asn Ile Tyr Trp Lys Gly Val Ile
1 5 10 15
Lys Met Ser Ser Tyr Asn Lys Asn Pro Phe Glu Thr Ile Lys Glu Leu
20 25 30
Leu Ser Asn Ala Asn Val Ser Ile Thr Cys Pro Leu His Pro Leu Ala
35 40 45
Lys Glu Pro Thr Ala Val Leu Gln Gly Met Asn Tyr Lys Asp Trp Leu
50 55 60
Val Met Ser Glu Glu Glu Asn Ser Leu Gly Lys Pro Glu Asn Phe Asp
65 70 75 80
Ser Glu Thr Ile Gln Asn Ala Leu Ile Gly Thr Leu Ala Val Ile Gly
85 90 95
Phe Ile Ala Ser Val Leu Ile Thr Pro Ile Thr Gly Gly Ala Ser Val
100 105 110
Ala Ile Gly Ser Ala Leu Leu Leu Glu Thr Ile Pro Tyr Tyr Trp Gly
115 120 125
Leu Ala Glu Ser Glu Asp Pro Thr Ile Gln Glu Lys Ala Met Phe Asp
130 135 140
Asp Ile Met Lys Ala Thr Glu Lys Thr Ile Lys Ser Glu Ile Glu Lys
145 150 155 160
Val Val Lys Thr Lys Ala Glu Ser Glu Leu Thr Ser Leu Arg Asp Val
165 170 175
Phe Lys Tyr Tyr Thr Arg Ala Leu Asn Leu Trp Lys Glu Asn Lys Asp
180 185 190
Asn Gln Asp Ala Arg Lys Glu Val Ser Ser Arg Phe Arg Ala Ala His
195 200 205
Val Ala Phe Ile Gly Ala Met Ser Gln Phe Lys Met Pro Gly Tyr Glu
210 215 220
Leu Val Met Leu Pro Ile Tyr Ala Ala Ala Ala Asn Phe His Leu Met
225 230 235 240
Leu Leu Gln Asp Gly Met Ile Tyr Glu Lys Glu Trp Asn Glu Ser Asn
245 250 255
Asn Leu Thr Glu Asp Phe Tyr Tyr Lys Glu Tyr Gln Asn Trp Thr Ser
260 265 270
Gln Tyr Ile Asp His Cys Thr Ser Trp Tyr Gln Lys Gly Leu Lys Asp
275 280 285
Ile Lys Asp Asn Lys Glu Lys Asn Trp Val Asp Tyr Asn Ser Tyr Arg
290 295 300
Asn Asn Met Thr Ile Leu Val Leu Asp Leu Ile Ala Leu Phe Pro Ser
305 310 315 320
Tyr Asp Val Arg Gln Tyr Asn Asn Met Gly Val Lys Leu Glu Thr His
325 330 335
Thr Arg Thr Leu Tyr Ser Asn Pro Leu Lys Phe Ser Lys Gly Ser Arg
340 345 350
Ile Asp Asp Asp Glu Arg Val Tyr Thr Arg Glu Pro Asn Leu Phe Ser
355 360 365
Ser Leu Ser His Leu Glu Phe His Thr Arg Thr Thr Asn Leu Gly Ser
370 375 380
Asp Gly Ile Glu Phe Leu Ser Gly Ile Glu Lys Lys Tyr Gln Leu Thr
385 390 395 400
Glu Lys Glu Ser Tyr Arg Thr Lys Phe Glu Gly Lys Pro Thr Asp Ile
405 410 415
Gly Lys Lys His Glu Ile Pro Phe Lys Lys Asp Glu Ser Glu Phe Tyr
420 425 430
Ser Leu Tyr Glu Leu Ser Thr Glu Gly Lys Gly Phe Lys Asp Ser Thr
435 440 445
Lys Leu Pro Leu Ile Gln Lys Met Thr Phe Asn Asn Glu Glu Gly Lys
450 455 460
Lys Phe Glu Tyr Ser Val Asp Lys Phe Gly Asn Gly Ala Leu Glu Lys
465 470 475 480
Glu Ser Phe Ser Ile Thr Asn Pro Asp Asp Leu Leu Ala Asp Gln Lys
485 490 495
Ala Arg His Val Leu Ser Asp Ile Ile Phe Val Asp Asp Lys Asn Ile
500 505 510
Ser Pro Glu Gln Ile Gln Thr Arg Asp Tyr Ser Phe Ala Trp Thr Ser
515 520 525
Glu Gln Ile Ser Pro Tyr Asn Arg Ile Thr Asp Gly Thr Lys Asn Lys
530 535 540
Asp Gly Ser Ile Ser Tyr Gln Ile Thr Gln Val Pro Ala Val Lys Ala
545 550 555 560
Tyr Ser Leu Lys Ser Ser Lys Asp Thr Ser His Ile Asp Val Ile Ser
565 570 575
Gly Pro Gly Phe Thr Gly Gly Asp Leu Val Arg Cys Ala Ile Lys Lys
580 585 590
Thr Ala Ser Ser Thr Gly Val Thr Gln Leu Ser Ile Val Phe Pro Ile
595 600 605
Thr Phe Asn His Val Lys Thr Asp Lys Tyr Arg Val Arg Met Asn Tyr
610 615 620
Ala Ala Asp His Thr Lys Leu Ile Thr Leu Val Gly Gly Ser Asn Pro
625 630 635 640
Ile Ser Gly Thr Ile Glu Asn Thr Phe Asp Ala Lys Asp Thr Gly Lys
645 650 655
Ser Leu Lys Tyr Asn Asn Phe Lys Met Met Glu Phe Asn Ile Ala Thr
660 665 670
Asn Phe Lys Asp Glu Glu Lys Gln Lys Ile Ile Thr Leu Lys Leu Thr
675 680 685
Tyr Ala Asp Asp Asp Gln Ala Ile Ala Asp Gly Lys Val Ala Gln Leu
690 695 700
Trp Ile Asp Lys Leu Glu Phe Ile Pro Val Pro Thr Lys
705 710 715
<210> 34
<211> 714
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 34
Val Trp Ser Val Asp Leu Lys Glu Asn Ile Tyr Trp Lys Gly Val Ile
1 5 10 15
Lys Met Ser Ser Tyr Asn Lys Asn Pro Phe Glu Thr Ile Lys Glu Leu
20 25 30
Leu Ser Asn Ala Asn Val Ser Ile Thr Cys Pro Leu His Pro Leu Ala
35 40 45
Lys Glu Pro Thr Ala Val Leu Gln Gly Met Asn Tyr Lys Asp Trp Leu
50 55 60
Val Met Ser Glu Glu Glu Asn Ser Leu Gly Lys Pro Glu Asn Phe Asp
65 70 75 80
Ser Glu Thr Ile Gln Asn Ala Leu Ile Gly Thr Leu Ala Val Ile Gly
85 90 95
Phe Ile Ala Ser Val Leu Ile Thr Pro Ile Thr Gly Gly Ala Ser Val
100 105 110
Ala Ile Gly Ser Ala Leu Leu Leu Glu Thr Ile Pro Tyr Tyr Trp Gly
115 120 125
Leu Ala Glu Ser Glu Asp Pro Thr Ile Gln Glu Lys Ala Met Phe Asp
130 135 140
Asp Ile Met Lys Ala Thr Glu Lys Thr Ile Lys Ser Glu Ile Glu Lys
145 150 155 160
Val Val Lys Thr Lys Ala Glu Ser Glu Leu Thr Ser Leu Arg Asp Val
165 170 175
Phe Lys Tyr Tyr Thr Arg Ala Leu Asn Leu Trp Lys Glu Asn Lys Asp
180 185 190
Asn Gln Asp Ala Arg Lys Glu Val Ser Ser Arg Phe Arg Ala Ala His
195 200 205
Val Ala Phe Ile Gly Ala Met Ser Gln Phe Lys Met Pro Gly Tyr Glu
210 215 220
Leu Val Met Leu Pro Ile Tyr Ala Ala Ala Ala Asn Phe His Leu Met
225 230 235 240
Leu Leu Gln Asp Gly Met Ile Tyr Glu Lys Glu Trp Asn Glu Ser Asn
245 250 255
Asn Leu Thr Glu Asp Phe Tyr Tyr Lys Glu Tyr Gln Asn Trp Thr Ser
260 265 270
Gln Tyr Ile Asp His Cys Thr Ser Trp Tyr Gln Lys Gly Leu Lys Asp
275 280 285
Ile Lys Asp Asn Lys Glu Lys Asn Trp Val Asp Tyr Asn Ser Tyr Arg
290 295 300
Asn Asn Met Thr Ile Leu Val Leu Asp Leu Ile Ala Leu Phe Pro Ser
305 310 315 320
Tyr Asp Val Arg Gln Tyr Asn Asn Met Gly Val Lys Leu Glu Thr His
325 330 335
Thr Arg Thr Leu Tyr Ser Asn Pro Leu Lys Phe Ser Lys Gly Ser Arg
340 345 350
Ile Asp Asp Asp Glu Arg Val Tyr Thr Arg Glu Pro Asn Leu Phe Ser
355 360 365
Ser Leu Ser His Leu Glu Phe His Thr Arg Thr Thr Asn Leu Gly Ser
370 375 380
Asp Gly Ile Glu Phe Leu Ser Gly Ile Glu Lys Lys Tyr Gln Leu Thr
385 390 395 400
Glu Lys Glu Ser Tyr Arg Thr Lys Phe Glu Gly Lys Pro Thr Asp Ile
405 410 415
Gly Lys Lys His Glu Ile Pro Phe Lys Lys Asp Glu Ser Glu Phe Tyr
420 425 430
Ser Leu Tyr Glu Leu Ser Thr Glu Gly Lys Gly Phe Lys Asp Ser Thr
435 440 445
Lys Leu Pro Leu Ile Gln Lys Met Thr Phe Asn Asn Glu Glu Gly Lys
450 455 460
Lys Phe Glu Tyr Ser Val Asp Lys Phe Gly Asn Gly Ala Leu Glu Lys
465 470 475 480
Glu Ser Phe Ser Ile Thr Asn Pro Asp Asp Leu Leu Ala Asp Gln Lys
485 490 495
Ala Arg His Val Leu Ser Asp Ile Ile Phe Val Asp Asp Lys Asn Ile
500 505 510
Ser Pro Glu Gln Ile Gln Thr Arg Asp Tyr Ser Phe Ala Trp Thr Ser
515 520 525
Glu Gln Ile Ser Pro Tyr Asn Arg Ile Thr Asp Gly Thr Lys Asn Lys
530 535 540
Asp Gly Ser Ile Ser Tyr Gln Ile Thr Gln Val Pro Ala Val Lys Ala
545 550 555 560
Tyr Ser Leu Lys Ser Ser Lys Asp Thr Ser His Ile Asp Val Ile Ser
565 570 575
Gly Pro Gly Phe Thr Gly Gly Asp Leu Val Arg Cys Ala Ile Lys Lys
580 585 590
Thr Ala Ser Ser Thr Gly Val Thr Gln Leu Ser Ile Val Phe Pro Ile
595 600 605
Thr Phe Asn His Val Lys Thr Asp Lys Tyr Arg Val Arg Met Asn Tyr
610 615 620
Ala Ala Asp His Thr Lys Leu Ile Thr Leu Val Gly Gly Ser Asn Pro
625 630 635 640
Ile Ser Gly Thr Ile Glu Asn Thr Phe Asp Ala Lys Asp Thr Gly Lys
645 650 655
Ser Leu Lys Tyr Asn Asn Phe Lys Met Met Glu Phe Asn Ile Ala Thr
660 665 670
Asn Phe Lys Asp Glu Glu Lys Gln Lys Ile Ile Thr Leu Lys Leu Thr
675 680 685
Tyr Ala Asp Asp Asp Gln Ala Ile Ala Asp Gly Lys Val Ala Gln Leu
690 695 700
Trp Ile Asp Lys Leu Glu Phe Ile Pro Val
705 710
<210> 35
<211> 1261
<212> PRT
<213> Unknown
<220>
<223> Unknown organism from environmental sample
<400> 35
Met Ser Gln Asn Tyr Asn Asp Glu Tyr Glu Val Ile Gly Thr Asp Glu
1 5 10 15
Arg Lys Asn His Pro Lys Tyr Pro Leu Ala Asn Thr Pro Gly Leu Glu
20 25 30
Leu Gln Gln Met Ser Tyr Lys Asp Trp Val Asp Ile Cys Thr Thr Glu
35 40 45
Glu Ser Glu Ser Leu Phe Thr Asp Ile Thr Val Lys Asp Ala Val Ser
50 55 60
Ile Ala Thr Ser Ile Thr Ala Ala Val Leu Gly Ile Ser Phe Pro Val
65 70 75 80
Ala Ser Ala Val Ser Ser Ile Ile Ser Val Leu Val Pro Tyr Trp Trp
85 90 95
Pro Ala Ser Ala Gly Thr Pro Gly Thr Pro Ser Ala Gln Ala Thr Trp
100 105 110
Glu Lys Phe Met Asn Ala Ala Glu Ala Leu Ser Asn Lys Thr Ile Ile
115 120 125
Ala Ser Lys Arg Ala Asp Ala Leu Ala Arg Trp Gln Gly Ile Gln Thr
130 135 140
Leu Gly Arg Asp Tyr Phe Gln Ala Gln Cys Asp Trp Leu Gln Asp Gln
145 150 155 160
Asn Asn Glu Leu Lys Lys Ser Asn Leu Arg Asp Ala Phe Asp Asp Val
165 170 175
Glu Asp Tyr Leu Lys Val Ser Met Pro Phe Phe Arg Ala Gln Gly Phe
180 185 190
Glu Ile Pro Met Leu Ala Met Tyr Ala Gln Ala Ala Asn Met His Leu
195 200 205
Leu Leu Leu Arg Glu Val Val Gln Asn Gly Val Ser Trp Gly Phe Gln
210 215 220
Gln Tyr Glu Ile Asp Arg Tyr Tyr Ser Asn Thr Asp Pro Phe Leu Gly
225 230 235 240
Asn Pro Gly Leu Ile Gln Leu Leu Glu Ser Tyr Thr Asp Tyr Cys Val
245 250 255
Gln Trp Tyr Asn Asn Gly Leu Gln Gln Gln Tyr Glu Asn Asn Ser Tyr
260 265 270
Asn Trp Asp Ala Phe Asn Asp Phe Arg Arg Asp Met Thr Ile Met Val
275 280 285
Leu Asp Val Val Ser Leu Trp Pro Thr Tyr Asp Gln Lys Arg Tyr Pro
290 295 300
Lys Pro Thr Lys Ser Gln Leu Thr Arg Thr Val Tyr Thr Asp Leu Val
305 310 315 320
Gly Phe Ser Gly Asp Phe Asp Tyr Thr Gln Ile Ala Ile Asp Arg Ala
325 330 335
Glu Arg Glu Leu Val Gln Thr Pro Asp Leu Phe Thr Trp Ile Arg Lys
340 345 350
Ile Val Phe Asn Leu Leu Pro Gly Gly Thr Asn Phe Val Arg Gly Arg
355 360 365
Ala Met Thr Phe Thr Tyr Thr Asp Ser Ile Asn Gly Tyr Glu Glu Asn
370 375 380
Ala Gly Asn Pro Gly Glu Ile Gln Lys Ile Val Asn Ile Pro Leu Pro
385 390 395 400
Asp Thr Gly Asp Asp Val Trp Arg Ile Thr Asn Gln Val Ser Pro Thr
405 410 415
Ser Gly Ala Leu Ser Gly Trp Ile Phe Ser Phe Ile Lys Ser Leu Asp
420 425 430
Gln Arg Ile Ser Trp Arg Glu Pro Ile Ser Ser Asp Ile Met Leu Met
435 440 445
Gln Gly Leu Ser Cys Asn Gly Pro Ser Val Asp Ser Cys Asp Leu Cys
450 455 460
Asp Arg Asn Ser Pro Cys Arg Asn Ile Thr Pro Asn Asp Ser Phe Pro
465 470 475 480
Cys Asn Asn Lys Ser Leu Tyr Ser His Arg Phe Ser Tyr Leu Gly Ala
485 490 495
Gly Leu Lys Ser Asp Leu Thr Thr Leu Thr Phe Phe Ser Tyr Gly Trp
500 505 510
Thr His Val Ser Ala Asp Ala Asn Asn Leu Ile Asp Thr Lys Lys Ile
515 520 525
Thr Gln Ile Pro Ala Val Lys Gly Ser His Leu Ile Gly Asn Ala His
530 535 540
Val Ile Lys Gly Thr Gly Ser Thr Gly Gly Asp Leu Val Val Leu Pro
545 550 555 560
Ile Asp Ser Gln Val Phe Val Asn Val Thr Thr Leu Glu Ser Asp Thr
565 570 575
Thr Ile Gly Tyr Glu Ile Arg Ile Arg Tyr Ala Ser Ala Thr Glu Val
580 585 590
Phe Leu Asn Val Arg Ile Glu Gln Pro Asp Ser Pro Pro Ile Tyr Arg
595 600 605
Asp Val Thr Leu Pro Ala Thr Thr Ser His Ser Asn Leu Thr Tyr Ala
610 615 620
Ala Phe Asn Tyr Tyr Thr Ile Ala Lys Ile Gly Thr Val Gly Ala Thr
625 630 635 640
Pro Gly Leu Thr Val Leu Arg Leu Ser Asn Ser Ile Asn Glu Leu Gly
645 650 655
Ala Ile Ile Asp Lys Ile Glu Phe Ile Pro Ser Glu Trp Ser Leu Glu
660 665 670
Glu Tyr Gln Thr Thr Gln Asp Leu Glu Glu Ala Arg Lys Ala Val Asn
675 680 685
Ala Leu Phe Met Asn Asp Ala Lys Asn Ala Leu Gln Leu Asp Val Thr
690 695 700
Asp Tyr Ala Val Asp Gln Ala Ala Asn Leu Val Glu Cys Val Ser Glu
705 710 715 720
Asp Phe His Ala Gln Glu Lys Met Ile Leu Leu Asp Gln Val Lys Phe
725 730 735
Ala Lys Arg Leu Ser Gln Thr Arg Asn Leu Leu Asn Tyr Gly Asp Phe
740 745 750
Glu Ser Ser Asp Trp Ser Gly Glu Asn Gly Trp Arg Thr Ser Asn His
755 760 765
Val His Val Ala Ser Asp Asn Pro Ile Phe Lys Gly Arg Tyr Leu His
770 775 780
Met Pro Gly Ala Met Ser Pro Gln Phe Ser Asn Asn Ile Tyr Pro Thr
785 790 795 800
Tyr Ala Tyr Gln Lys Val Asp Glu Ser Lys Leu Lys Ser Tyr Thr Arg
805 810 815
Tyr Leu Val Arg Gly Phe Val Gly Asn Ser Lys Asp Leu Glu Leu Leu
820 825 830
Val Glu Arg Tyr Gly Lys Glu Val His Val Glu Met Asp Val Pro Asn
835 840 845
Asp Ile Gln Tyr Ser Leu Pro Met Asn Glu Cys Gly Gly Phe Asn Arg
850 855 860
Cys Lys Pro Ala Ser Tyr Gln Thr Arg Pro Pro His Thr Cys Thr Cys
865 870 875 880
Lys Asp Thr Ala Val Ala His Thr Asp Cys Gln Cys Lys Asp Lys Val
885 890 895
Asn Arg Thr Ser Ala Asp Val Tyr Thr Asn Glu Ser Thr Ser Arg Gly
900 905 910
Met Tyr Ala Asp Glu Phe His Ser His Lys Ser Cys Gly Cys Gly Asp
915 920 925
Lys His Met Asp Lys Asn Gly Thr His Pro His Lys Ser Cys Gly Cys
930 935 940
Lys Asn Ser His Val Phe Thr Tyr His Ile Asp Thr Gly Cys Val Asp
945 950 955 960
Met Val Glu Asn Val Gly Leu Phe Phe Ala Leu Lys Ile Ala Ser Glu
965 970 975
Asn Gly Ile Ala Asn Ile Asp Asn Leu Glu Ile Ile Glu Ala Gln Pro
980 985 990
Leu Thr Gly Glu Ala Leu Ala Arg Val Lys Lys Arg Glu Gln Arg Trp
995 1000 1005
Lys Gln Glu Arg Asp Gln Lys Arg Val Glu Thr Glu Lys Ala Val
1010 1015 1020
Gln Ala Ala Gln Thr Ala Val Gln Asn Leu Phe Thr Asn Ala Gln
1025 1030 1035
Gln Asn Arg Leu Lys Phe Glu Thr Leu Phe Pro Gln Ile Val His
1040 1045 1050
Ala Glu Ala Leu Ile Gln Gln Ile Arg Tyr Val Tyr His Pro Phe
1055 1060 1065
Leu Gln Gly Ala Leu Pro Ala Ile Pro Gly Met Asn Phe Asp Leu
1070 1075 1080
Val Gln Glu Leu Ser Met Arg Ile Glu Asn Ala Arg Met Leu Phe
1085 1090 1095
Glu Thr Arg Asn Leu Ile Gln Asn Gly Ser Phe Ser Ala Gly Thr
1100 1105 1110
Gly Ser Trp His Val Thr Asp Gly Val Glu Val Gln Pro Leu Gln
1115 1120 1125
Asn Thr Ser Val Leu Val Leu Ser Glu Trp Ser His Glu Ala Ser
1130 1135 1140
Gln Gln Val Arg Ile Asp Ser Asp Arg Gly Tyr Val Leu Arg Val
1145 1150 1155
Thr Ala Arg Lys Glu Gly Ala Gly Lys Gly Thr Val Thr Leu Ser
1160 1165 1170
Asp Cys Ala Asp Tyr Met Glu Thr Leu Thr Phe Thr Ser Cys Asp
1175 1180 1185
Tyr Asn Lys Val Gly Ser Gln Thr Met Thr Glu Gly Thr Leu Thr
1190 1195 1200
Gly Phe Val Thr Lys Thr Leu Glu Ile Phe Pro Asp Thr Asp Gln
1205 1210 1215
Ile Arg Ile Asp Ile Gly Glu Thr Glu Gly Thr Phe Lys Val Glu
1220 1225 1230
Ser Val Glu Leu Ile Cys Met Glu Gln Met Glu Asp His Leu Tyr
1235 1240 1245
Asp Arg Ala Gly Ser Val Ala Lys Phe Ser Lys Arg Ser
1250 1255 1260
<210> 36
<211> 667
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 36
Met Ser Gln Asn Tyr Asn Asp Glu Tyr Glu Val Ile Gly Thr Asp Glu
1 5 10 15
Arg Lys Asn His Pro Lys Tyr Pro Leu Ala Asn Thr Pro Gly Leu Glu
20 25 30
Leu Gln Gln Met Ser Tyr Lys Asp Trp Val Asp Ile Cys Thr Thr Glu
35 40 45
Glu Ser Glu Ser Leu Phe Thr Asp Ile Thr Val Lys Asp Ala Val Ser
50 55 60
Ile Ala Thr Ser Ile Thr Ala Ala Val Leu Gly Ile Ser Phe Pro Val
65 70 75 80
Ala Ser Ala Val Ser Ser Ile Ile Ser Val Leu Val Pro Tyr Trp Trp
85 90 95
Pro Ala Ser Ala Gly Thr Pro Gly Thr Pro Ser Ala Gln Ala Thr Trp
100 105 110
Glu Lys Phe Met Asn Ala Ala Glu Ala Leu Ser Asn Lys Thr Ile Ile
115 120 125
Ala Ser Lys Arg Ala Asp Ala Leu Ala Arg Trp Gln Gly Ile Gln Thr
130 135 140
Leu Gly Arg Asp Tyr Phe Gln Ala Gln Cys Asp Trp Leu Gln Asp Gln
145 150 155 160
Asn Asn Glu Leu Lys Lys Ser Asn Leu Arg Asp Ala Phe Asp Asp Val
165 170 175
Glu Asp Tyr Leu Lys Val Ser Met Pro Phe Phe Arg Ala Gln Gly Phe
180 185 190
Glu Ile Pro Met Leu Ala Met Tyr Ala Gln Ala Ala Asn Met His Leu
195 200 205
Leu Leu Leu Arg Glu Val Val Gln Asn Gly Val Ser Trp Gly Phe Gln
210 215 220
Gln Tyr Glu Ile Asp Arg Tyr Tyr Ser Asn Thr Asp Pro Phe Leu Gly
225 230 235 240
Asn Pro Gly Leu Ile Gln Leu Leu Glu Ser Tyr Thr Asp Tyr Cys Val
245 250 255
Gln Trp Tyr Asn Asn Gly Leu Gln Gln Gln Tyr Glu Asn Asn Ser Tyr
260 265 270
Asn Trp Asp Ala Phe Asn Asp Phe Arg Arg Asp Met Thr Ile Met Val
275 280 285
Leu Asp Val Val Ser Leu Trp Pro Thr Tyr Asp Gln Lys Arg Tyr Pro
290 295 300
Lys Pro Thr Lys Ser Gln Leu Thr Arg Thr Val Tyr Thr Asp Leu Val
305 310 315 320
Gly Phe Ser Gly Asp Phe Asp Tyr Thr Gln Ile Ala Ile Asp Arg Ala
325 330 335
Glu Arg Glu Leu Val Gln Thr Pro Asp Leu Phe Thr Trp Ile Arg Lys
340 345 350
Ile Val Phe Asn Leu Leu Pro Gly Gly Thr Asn Phe Val Arg Gly Arg
355 360 365
Ala Met Thr Phe Thr Tyr Thr Asp Ser Ile Asn Gly Tyr Glu Glu Asn
370 375 380
Ala Gly Asn Pro Gly Glu Ile Gln Lys Ile Val Asn Ile Pro Leu Pro
385 390 395 400
Asp Thr Gly Asp Asp Val Trp Arg Ile Thr Asn Gln Val Ser Pro Thr
405 410 415
Ser Gly Ala Leu Ser Gly Trp Ile Phe Ser Phe Ile Lys Ser Leu Asp
420 425 430
Gln Arg Ile Ser Trp Arg Glu Pro Ile Ser Ser Asp Ile Met Leu Met
435 440 445
Gln Gly Leu Ser Cys Asn Gly Pro Ser Val Asp Ser Cys Asp Leu Cys
450 455 460
Asp Arg Asn Ser Pro Cys Arg Asn Ile Thr Pro Asn Asp Ser Phe Pro
465 470 475 480
Cys Asn Asn Lys Ser Leu Tyr Ser His Arg Phe Ser Tyr Leu Gly Ala
485 490 495
Gly Leu Lys Ser Asp Leu Thr Thr Leu Thr Phe Phe Ser Tyr Gly Trp
500 505 510
Thr His Val Ser Ala Asp Ala Asn Asn Leu Ile Asp Thr Lys Lys Ile
515 520 525
Thr Gln Ile Pro Ala Val Lys Gly Ser His Leu Ile Gly Asn Ala His
530 535 540
Val Ile Lys Gly Thr Gly Ser Thr Gly Gly Asp Leu Val Val Leu Pro
545 550 555 560
Ile Asp Ser Gln Val Phe Val Asn Val Thr Thr Leu Glu Ser Asp Thr
565 570 575
Thr Ile Gly Tyr Glu Ile Arg Ile Arg Tyr Ala Ser Ala Thr Glu Val
580 585 590
Phe Leu Asn Val Arg Ile Glu Gln Pro Asp Ser Pro Pro Ile Tyr Arg
595 600 605
Asp Val Thr Leu Pro Ala Thr Thr Ser His Ser Asn Leu Thr Tyr Ala
610 615 620
Ala Phe Asn Tyr Tyr Thr Ile Ala Lys Ile Gly Thr Val Gly Ala Thr
625 630 635 640
Pro Gly Leu Thr Val Leu Arg Leu Ser Asn Ser Ile Asn Glu Leu Gly
645 650 655
Ala Ile Ile Asp Lys Ile Glu Phe Ile Pro Ser
660 665
<210> 37
<211> 1164
<212> PRT
<213> Unknown
<220>
<223> Unknown organism from environmental sample
<400> 37
Met Ser Glu Tyr Glu Leu Met Asp Val Ser Pro Asn Pro Tyr Asn Val
1 5 10 15
Leu Met Gln Arg Pro Asp Glu Ala Gly Gly Ile Asp Trp Asp Ala Glu
20 25 30
Ile Ser Lys Met Ile Lys Thr Tyr Tyr Gln Lys Gly Thr Phe Ala Ala
35 40 45
Asp Phe Ala Thr Ile Ile Leu Pro Leu Ile Leu Ala His Asn Val Gln
50 55 60
Trp Asn Val Val Leu Lys Ser Val Leu Ser Phe Ala Ser Ala Leu Pro
65 70 75 80
Gly Ile Gly Thr Ala Val Ser Ile Leu Ser Pro Phe Leu Gln Leu Leu
85 90 95
Phe Pro Thr Asp Pro Ala Ala His Asp Lys Gln Leu Met Lys Leu Ile
100 105 110
Leu Asp Gln Thr Lys Ile Leu Ile Asp Gln Ala Val Ser Gln Glu Val
115 120 125
Met Asn Arg Ile Ser Gln Gln Ile Thr Gly Leu Tyr Ala Asn Leu Ala
130 135 140
Arg Phe Asn Arg Thr Ile Ala Ala Tyr Glu Thr Leu Ala Pro Ser Thr
145 150 155 160
Ile Ile Ala Glu Ile Glu Ala Ile Glu Ala Val Phe Asp Asn Gln Val
165 170 175
Pro Gln Leu Val Asp Pro Gly Tyr Leu Ala Leu Thr Leu Pro Leu Tyr
180 185 190
Val Gln Gly Val Asn Leu Asn Leu Met Phe Tyr Lys Ser Val Leu Asp
195 200 205
His Ala Glu Gln Leu Lys Ile Pro Ala Gly Glu Gln Asn Phe Tyr Lys
210 215 220
Lys Ser Leu Ile Asp Lys Ile Lys Thr Ala Ser Ala Gln Val Tyr Thr
225 230 235 240
Tyr Phe Lys Gln Val Pro Trp Arg Pro Asn Glu Ile Thr Pro Ser Ser
245 250 255
Ala Ser Asn Gln Asp Ile Val Thr Arg Gln Thr Met Tyr Thr His Cys
260 265 270
Leu Asp Tyr Val Ala Met Trp Pro Thr Phe Asn Pro Asp Leu Tyr Pro
275 280 285
Leu Ser Ala Asp Ile Glu Gln Thr Arg Val Leu Ser Ser Pro Ile Ser
290 295 300
Gly Leu Leu Gly Lys Pro Gly Ala Tyr Gly Pro Phe Val Asp Phe Leu
305 310 315 320
Gly Phe Asp Asp Thr Leu Glu Leu Thr Ser Ile Thr Tyr Trp Gln Gly
325 330 335
Asp Arg Val Asp Arg Leu Asp Gln Asn Phe Thr His Gly Arg Val Ile
340 345 350
Arg Gly Gly Ser Gly Gly Gly Gly Ser Arg His Asp Ile Pro Ser Ser
355 360 365
Arg Glu Asn Pro Ile Ala Ser Leu Trp Met Asp Tyr Asn Ser Asn Asn
370 375 380
Tyr Gln Phe Val Ser Tyr Gln Met Tyr Asn Thr Leu Ile Asp Tyr Thr
385 390 395 400
Lys Pro Gln Thr Asn Leu Leu Leu Ala Pro Pro Asn His Lys Leu His
405 410 415
Arg Leu Tyr Thr Thr Asp Asp Pro Ser Phe Gly Gly Arg Val Gly Thr
420 425 430
Ile Gly Asn Gln Phe Ile Gln Asn Thr Ile Phe Pro Glu Asn Ile Met
435 440 445
Gly Thr Phe Asp Gln Asp Leu Gly Val Arg Arg Ile Lys Gly Ile Pro
450 455 460
Phe Glu Lys Ser Pro Ser Thr Thr Ala Phe Thr Tyr Ala Lys Glu Pro
465 470 475 480
Leu Asn Gly Ala Glu Ala Val Lys Leu Gly Ile Arg Gln Thr Leu Asp
485 490 495
Leu Pro Ile Thr Asn Val Thr Thr Gly Arg Tyr Glu Val Arg Ile Arg
500 505 510
Tyr Ala Ser Thr Asp Thr Ala Thr Ile Phe Val Asn Leu Asp Val Gly
515 520 525
Gly His Gln Pro Ile Tyr Lys Thr Val Val Leu Pro Ala Thr Thr Thr
530 535 540
Gly Asp Pro Thr Gly Ile Glu Gly Ala Asn Gly Thr Tyr Thr Leu Leu
545 550 555 560
Thr Ile Gln Glu Ile Ser Ile Pro Ala Gly Asn Phe His Val Tyr Val
565 570 575
Thr Asn Asn Thr Thr Asn Gly Ser Gln Leu Leu Leu Asp Arg Ile Glu
580 585 590
Phe Val Pro Ile Leu Val Pro Val Val Phe Ile Asp Glu Asp Phe Asn
595 600 605
Gly Glu Met Lys Thr Leu Trp Lys Ser Asp Thr Glu Gln Gly Ile His
610 615 620
Ile Asn Gly Met Ile Thr Ser Asn Glu Asn Phe Ala Leu Ala Thr Asp
625 630 635 640
Val Ser Leu Pro Thr Thr Thr Asp Thr Asp Lys Gly Gly Pro Ile Asn
645 650 655
Glu Asp Tyr Gln Pro Phe His Glu Leu Tyr Thr Trp Pro Leu Ser Pro
660 665 670
Asn Leu Arg Gly His Ile Thr Ala Ala Ile Ser Leu Val Ser Ser Gln
675 680 685
Asn Lys Phe Thr Thr Glu Glu Glu Leu Ser Ala Val Thr Gln Val Val
690 695 700
Asn Ala Leu Phe Ile Thr Asp Thr Gln Leu Ala Pro Thr Val Thr Asp
705 710 715 720
Tyr Trp Ile Asp Gln Val Tyr Leu Lys Val Lys Ala Leu Ser Asp Asp
725 730 735
Leu Phe Gly Glu Glu Lys Glu Arg Leu Arg Gln Arg Val Ala Arg Ala
740 745 750
Lys Gln Ile Asn Thr Met Lys Asn Lys Leu Ile Gly Ser Ser Phe Gln
755 760 765
Thr Leu Thr Asn Trp Gln Leu Ser Ser Asp Val Ala Leu Leu Ala Asp
770 775 780
Asn Pro Leu Phe Ala Gly Thr Tyr Val Leu Leu Pro Pro Ser Thr Tyr
785 790 795 800
Pro Asp Thr Arg Pro Ser Tyr Ala Tyr Gln Lys Val Asp Glu Ser Lys
805 810 815
Leu Lys Pro Tyr Thr Arg Tyr Ile Val Arg Gly Phe Ile Gly Gln Ala
820 825 830
Gln Asp Leu Ala Leu Leu Val Ser Arg Tyr Gly Lys Glu Val Asp Thr
835 840 845
Ala Leu Thr Val Pro Tyr Gln Glu Ala Leu Pro Leu Ser Pro Asp Ser
850 855 860
Thr Ser Asn Cys Cys Gly Pro Val Ala Cys Leu Pro Cys Glu Gly His
865 870 875 880
Glu Tyr Asp Thr His Leu Phe Ser Tyr Thr Ile Asp Val Gly Ala Leu
885 890 895
Gln Ser Glu Ile Asn Leu Gly Ile Glu Ile Gly Phe Lys Ile Thr Ser
900 905 910
Pro Thr Gly Phe Ala His Ile Ser Asn Leu Glu Ile Val Glu Asp Arg
915 920 925
Pro Leu Thr Glu Ala Glu Thr Ile Lys Val Lys Gln Arg Glu Lys Lys
930 935 940
Trp Met Arg Leu Ser Gln Lys Gln Gln Ser Gln Met Gln Ala Gln Tyr
945 950 955 960
Asp Gln Thr Met Gln Tyr Phe Ala Asn Leu Tyr Thr Thr Pro Asp Gln
965 970 975
Thr Glu Leu Lys Asn Thr Val Gln Tyr Thr Asp Ile Ala Lys Val Gln
980 985 990
Val Val Thr Phe Pro Ser Pro Met Gln Trp Phe Ile Pro Gln Leu Pro
995 1000 1005
Arg Ser Ser Ser Ala Met Val Gln Glu Leu Ile Asn Ala Lys Glu
1010 1015 1020
Lys Ala Leu Gln Leu Tyr Pro Thr Asn Leu Ile Gln Asn Gly Asn
1025 1030 1035
Phe Ala Ala Asp Leu Ser His Trp Asp Val Arg Gln Asp Thr Asn
1040 1045 1050
Val Ser Ile Asp Thr Val Asp Gly Thr Ser Val Leu His Val Pro
1055 1060 1065
Ser Trp Asp Glu Thr Val Ser Gln Thr Ile Thr Leu Pro Pro His
1070 1075 1080
Gln Glu Asn Ile Leu Tyr Gln Leu Arg Val Thr Ala Lys Gly Asn
1085 1090 1095
Gly Ser Val Ile Leu Gln His Asn Gly Glu Gln Glu Arg Leu Tyr
1100 1105 1110
Phe Asn Gln Asn Asn Pro Asn Gly Asn Thr Phe Val Thr Lys Thr
1115 1120 1125
Ile Ser Phe Tyr Pro Thr Ala Ser Asn Leu Ser Ile Gln Ile Gln
1130 1135 1140
Ser Glu Gly Thr Asp Phe Tyr Val Lys Thr Ile Glu Val Phe Val
1145 1150 1155
Gln Pro Val Pro Leu Thr
1160
<210> 38
<211> 596
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 38
Met Ser Glu Tyr Glu Leu Met Asp Val Ser Pro Asn Pro Tyr Asn Val
1 5 10 15
Leu Met Gln Arg Pro Asp Glu Ala Gly Gly Ile Asp Trp Asp Ala Glu
20 25 30
Ile Ser Lys Met Ile Lys Thr Tyr Tyr Gln Lys Gly Thr Phe Ala Ala
35 40 45
Asp Phe Ala Thr Ile Ile Leu Pro Leu Ile Leu Ala His Asn Val Gln
50 55 60
Trp Asn Val Val Leu Lys Ser Val Leu Ser Phe Ala Ser Ala Leu Pro
65 70 75 80
Gly Ile Gly Thr Ala Val Ser Ile Leu Ser Pro Phe Leu Gln Leu Leu
85 90 95
Phe Pro Thr Asp Pro Ala Ala His Asp Lys Gln Leu Met Lys Leu Ile
100 105 110
Leu Asp Gln Thr Lys Ile Leu Ile Asp Gln Ala Val Ser Gln Glu Val
115 120 125
Met Asn Arg Ile Ser Gln Gln Ile Thr Gly Leu Tyr Ala Asn Leu Ala
130 135 140
Arg Phe Asn Arg Thr Ile Ala Ala Tyr Glu Thr Leu Ala Pro Ser Thr
145 150 155 160
Ile Ile Ala Glu Ile Glu Ala Ile Glu Ala Val Phe Asp Asn Gln Val
165 170 175
Pro Gln Leu Val Asp Pro Gly Tyr Leu Ala Leu Thr Leu Pro Leu Tyr
180 185 190
Val Gln Gly Val Asn Leu Asn Leu Met Phe Tyr Lys Ser Val Leu Asp
195 200 205
His Ala Glu Gln Leu Lys Ile Pro Ala Gly Glu Gln Asn Phe Tyr Lys
210 215 220
Lys Ser Leu Ile Asp Lys Ile Lys Thr Ala Ser Ala Gln Val Tyr Thr
225 230 235 240
Tyr Phe Lys Gln Val Pro Trp Arg Pro Asn Glu Ile Thr Pro Ser Ser
245 250 255
Ala Ser Asn Gln Asp Ile Val Thr Arg Gln Thr Met Tyr Thr His Cys
260 265 270
Leu Asp Tyr Val Ala Met Trp Pro Thr Phe Asn Pro Asp Leu Tyr Pro
275 280 285
Leu Ser Ala Asp Ile Glu Gln Thr Arg Val Leu Ser Ser Pro Ile Ser
290 295 300
Gly Leu Leu Gly Lys Pro Gly Ala Tyr Gly Pro Phe Val Asp Phe Leu
305 310 315 320
Gly Phe Asp Asp Thr Leu Glu Leu Thr Ser Ile Thr Tyr Trp Gln Gly
325 330 335
Asp Arg Val Asp Arg Leu Asp Gln Asn Phe Thr His Gly Arg Val Ile
340 345 350
Arg Gly Gly Ser Gly Gly Gly Gly Ser Arg His Asp Ile Pro Ser Ser
355 360 365
Arg Glu Asn Pro Ile Ala Ser Leu Trp Met Asp Tyr Asn Ser Asn Asn
370 375 380
Tyr Gln Phe Val Ser Tyr Gln Met Tyr Asn Thr Leu Ile Asp Tyr Thr
385 390 395 400
Lys Pro Gln Thr Asn Leu Leu Leu Ala Pro Pro Asn His Lys Leu His
405 410 415
Arg Leu Tyr Thr Thr Asp Asp Pro Ser Phe Gly Gly Arg Val Gly Thr
420 425 430
Ile Gly Asn Gln Phe Ile Gln Asn Thr Ile Phe Pro Glu Asn Ile Met
435 440 445
Gly Thr Phe Asp Gln Asp Leu Gly Val Arg Arg Ile Lys Gly Ile Pro
450 455 460
Phe Glu Lys Ser Pro Ser Thr Thr Ala Phe Thr Tyr Ala Lys Glu Pro
465 470 475 480
Leu Asn Gly Ala Glu Ala Val Lys Leu Gly Ile Arg Gln Thr Leu Asp
485 490 495
Leu Pro Ile Thr Asn Val Thr Thr Gly Arg Tyr Glu Val Arg Ile Arg
500 505 510
Tyr Ala Ser Thr Asp Thr Ala Thr Ile Phe Val Asn Leu Asp Val Gly
515 520 525
Gly His Gln Pro Ile Tyr Lys Thr Val Val Leu Pro Ala Thr Thr Thr
530 535 540
Gly Asp Pro Thr Gly Ile Glu Gly Ala Asn Gly Thr Tyr Thr Leu Leu
545 550 555 560
Thr Ile Gln Glu Ile Ser Ile Pro Ala Gly Asn Phe His Val Tyr Val
565 570 575
Thr Asn Asn Thr Thr Asn Gly Ser Gln Leu Leu Leu Asp Arg Ile Glu
580 585 590
Phe Val Pro Ile
595
<210> 39
<211> 456
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 39
Met Thr Gln Arg Leu Pro Gln Phe Glu Ile Asp Gly Tyr Glu Glu Val
1 5 10 15
Ser Leu Pro Leu Phe Ala Glu Ile Cys Thr Leu His Leu Thr Leu Leu
20 25 30
Lys Asp Gly Val Leu Met Gly Pro Arg Trp Gly Leu Thr Thr Asp Asp
35 40 45
Val Lys Tyr Tyr Arg Asp Glu Phe Tyr Arg Leu Thr Ser Asn Tyr Asn
50 55 60
Ser Arg Ala Val Glu Leu Tyr Asn Arg Gly Phe Ser Asn Ala Ser Asn
65 70 75 80
Ser Leu Asp Phe Lys Lys Val Val Val Tyr Lys Thr Thr Met Asn Glu
85 90 95
Tyr Ala Leu Asp Leu Val Tyr Leu Trp Ser Leu Met Arg Tyr Glu Gly
100 105 110
Ile Thr Pro Ser Val Ser Arg Ser Leu Trp Asn Phe Val Gly Asn Arg
115 120 125
Thr Tyr Ile Pro Ile Ser Lys Ser Asp Trp Asn Ala Gln Tyr Lys Leu
130 135 140
Met Asn Gly Val Val Asn Ser Leu Leu Lys Ser Val Thr Phe Arg Ser
145 150 155 160
Ser His Ile Lys Phe Asn Gly Val Trp Pro Lys Tyr Arg Leu Gln Glu
165 170 175
Asn Ile Thr Gly Trp Met Gly Ala Thr Gln Leu Asp Thr Asp Tyr Ile
180 185 190
Tyr Gln Ala Pro Gln Ser Val Pro Lys Asp Ala Val Ile Ala Pro Gly
195 200 205
Gly Gly Val Lys Glu Lys Trp His Ile Lys Ile Gly Asp Thr Val Val
210 215 220
Ile Pro Gln Pro Trp Tyr Leu Pro Ile Lys Pro Ile Ala Tyr Asp Gly
225 230 235 240
Tyr Tyr Ile Arg Thr Ile Ser Ala Tyr Pro Thr Phe Ile Phe Pro Pro
245 250 255
Asn Leu Asp Phe Ser Leu Gly Ser Lys Leu Glu Leu Ser Val Pro Asn
260 265 270
His Thr Met Ala His Gly Gly Asn Ile Ile Leu Gly Phe Ala Pro Asp
275 280 285
Asn Thr Lys Glu Phe Phe Thr Thr Gly Gln His Thr Phe Pro Gln Lys
290 295 300
Gly Ser Tyr Thr Ile Pro Pro Leu Gln Tyr Thr Thr Ile Asn Asn Pro
305 310 315 320
Ser Gln Ala Phe Leu Asn Glu Thr Leu Asp Gln Gly Thr Asp Gly Val
325 330 335
Leu Ser Leu Thr Gly Ala Ala Asn Thr Asn Val Ile Gln Tyr Asn Ile
340 345 350
Asn Met Leu Asn Val Asp Gln Asn Lys Lys Tyr Val Leu Phe Leu Arg
355 360 365
Ile Gly Gly Pro Gly Asn Ile Asn Ala Ile Ile Lys Asn Ala Gly Gln
370 375 380
Gln Ser Asp Val Pro Ser Lys Arg Leu Gly Val Phe Asn Asn Pro Asn
385 390 395 400
Leu Tyr Gly Trp Lys Asp Phe Val Ser Thr Gly Phe Thr Phe Lys Leu
405 410 415
Asp Asp Met Gly Leu Pro Ile Pro Pro Val Gly Leu Val Leu Arg Lys
420 425 430
Thr Asn Asp Ala Gly Met Arg Ile Leu Gln Val Met Ile Ile Pro Glu
435 440 445
Ser Asp Phe Lys Ile Leu Ile Pro
450 455
<210> 40
<211> 505
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 40
Val Lys His Met Asn Ser Glu Gln Arg Lys His Gly Gln Asn Asn Glu
1 5 10 15
Tyr Lys His Gln Asn His Gly Val Ser Cys Asn Cys Gly Cys Gln Gln
20 25 30
Gly Lys Tyr Glu Tyr Glu Gln Lys Gln Asp Arg Glu Glu Tyr Asn Asn
35 40 45
Ser Tyr Met Ser Asn Glu Gln Thr Gln Asn His Glu Gly Met Gly Tyr
50 55 60
Leu Val Ala Lys Glu Gly Thr Arg Asn Asp Met Tyr Asn Ser Val Lys
65 70 75 80
Arg Glu Phe Arg Glu His Asp Met Tyr His Glu Asn Lys Asn Tyr Tyr
85 90 95
Ser Cys Ser Pro Arg Gln Phe Asn Thr Leu Asn Leu Pro Asp Glu Ser
100 105 110
Lys Arg Phe Gln Thr Ile Thr His Val Gln Asn Thr Asn Ser Asn Pro
115 120 125
Ser Leu Asn Ala Thr Ser Thr Thr Arg Gly Gln Asn Ile Asp Gly Ser
130 135 140
Asn Asn Phe Cys Arg Ser Ile Ser Leu Asp Phe Gln Leu Ile Asn Gln
145 150 155 160
Asp Leu Asn Thr Thr Gln Asp Thr Phe Tyr Tyr Leu Phe Tyr Leu Leu
165 170 175
Asp Ser Gly Asp Phe Ile Ile Ala Asn Tyr Gly Thr Gly Arg Val Leu
180 185 190
Glu Asn Arg Thr Val Glu Pro Asp Glu Arg Ala Ile Ile Ser Ser Leu
195 200 205
Tyr Thr Gly Ser Val Ser Gln Phe Phe Arg Lys Arg Ser Ser Val Asn
210 215 220
Asn Gln Phe Ile Leu Ser Pro Gln Gln Glu Leu Asn Arg Ala Val Asn
225 230 235 240
Val Cys Asn Asn Thr Ile Asn Leu Ala Thr Lys Ile Thr Ser Gly Phe
245 250 255
Asn Ile Pro Asp Asn Thr Ser Asn Val Phe Arg His Ile Val Ala Ser
260 265 270
Asn Arg Pro Ile Thr Ile Pro Ser Phe Ser Asn Glu Ser Thr Leu Gly
275 280 285
Pro Ala Pro Val Leu Thr Ser Leu Ser Asp Tyr Gly Pro Ser Pro Ala
290 295 300
Gln Ala Glu Arg Ala Val Ile Gly Ser Ala Leu Ile Pro Cys Ile Phe
305 310 315 320
Val Asn Asp Val Leu Pro Leu Glu Arg Arg Ile Lys Glu Ser Pro Tyr
325 330 335
Tyr Val Leu Glu Tyr Arg Gln Tyr Trp His Arg Leu Trp Thr Asp Glu
340 345 350
Ile Pro Ser Gly Asp Arg Arg Ser Phe Gly Glu Ile Thr Gly Ile Leu
355 360 365
Pro Asn Ala Gln Ala Ser Met Arg Asp Met Ile Asp Met Ser Ile Gly
370 375 380
Ala Asp Trp Gly Leu Arg Phe Gly Asp Ser Ser Thr Pro Phe Lys Arg
385 390 395 400
Gly Ile Leu Asn Ser Leu Asn Thr Gln Gly Ser Tyr Ala Asp Val Asp
405 410 415
Leu Gly Thr Gln Thr Gly Ser Asn Gln Leu Thr Asn Phe Asn Ser Phe
420 425 430
Ser Val Arg Tyr Ala Arg Phe Val Lys Ala His Glu Tyr Arg Leu Thr
435 440 445
Arg Ser Asn Gly Thr Val Val Thr Thr Pro Trp Val Ala Leu Asp Tyr
450 455 460
Arg Val Pro Gln Thr Ala Arg Phe Pro Leu Asn Ala Gln Leu Ser Gln
465 470 475 480
Asn Tyr Lys Ile Ile Ser Ser Tyr Asn Ser Tyr Asp Leu Pro Val Leu
485 490 495
Glu His Thr Asp Asn Lys Lys Lys Trp
500 505
<210> 41
<211> 720
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 41
Leu Leu Ile Glu Arg Gly Glu Ile Phe Met Asn Gln Asn Asn Asn Glu
1 5 10 15
Tyr Glu Ile Ala Asp Val Asn Pro Pro Ser Tyr Pro Ser Asn Arg Ile
20 25 30
Asn Asn Tyr Ser Arg Tyr Pro Phe Ala Asn Asp Pro Asn Gln Leu Leu
35 40 45
Gln Gln Thr Asn Tyr Lys Asp Trp Leu Asn Ile Cys Gln Gly Asn Asn
50 55 60
Gln Asn Ser Gly Asn Leu Ile Thr Asp Ala Gly Ala Ala Val Ser Ala
65 70 75 80
Gly Ile Ile Val Thr Gly Thr Met Ile Ala Ala Phe Ala Ala Leu Ala
85 90 95
Val Ala Pro Phe Pro Ile Val Gly Ala Ile Val Gly Gly Ile Val Ile
100 105 110
Ala Phe Gly Thr Leu Leu Pro Leu Val Trp Pro Gly Gly Lys Glu Asp
115 120 125
Arg Thr Val Trp Thr Gln Phe Phe Ala Leu Gly Glu Thr Leu Leu Gly
130 135 140
Lys Arg Leu Thr Asp Asp Ala Gln Lys Thr Ala Ala Glu Thr Val Asp
145 150 155 160
Gly Phe Gly Arg Val Leu Asp Asn Tyr Lys Val Ala Leu Asn Ser Trp
165 170 175
Met Asp Leu Lys Lys Lys Gln Ser Pro Gly Ser Pro Pro Thr Thr Glu
180 185 190
Leu Lys Glu Ala Ala Asp Asn Val Lys Gln Arg Phe Glu Ile Val His
195 200 205
Asn Glu Phe Asp Ala Asn Met Pro His Leu Gly Asn Pro Lys Ser Asn
210 215 220
Tyr Lys Glu Val Leu Leu Ser Ser Tyr Ala His Ala Ala Asn Leu His
225 230 235 240
Leu Asn Leu Leu Gln Gln Gly Val Arg Phe Ala Asp Gln Trp Asn Gln
245 250 255
Asp Ile Tyr Ser Ser Gln Ile Val Pro Ala Ala Gly Thr Ser Thr Glu
260 265 270
Tyr Leu Lys Phe Leu Gln Ala Arg Ile Gln Glu Tyr Val Asn Tyr Cys
275 280 285
Thr Ala Thr Tyr Arg Ser Gly Leu Asn Ile Leu Lys Asn Ser Asn Asp
290 295 300
Ile Thr Trp Asp Ile Tyr Asn Thr Tyr Arg Arg Asp Met Thr Leu Thr
305 310 315 320
Val Leu Asp Leu Val Ala Ala Phe Pro Asn Tyr Asp Pro Thr Tyr Tyr
325 330 335
Pro Ile Asp Thr Lys Ile Gln Leu Thr Arg Thr Ile Tyr Ser Asn Leu
340 345 350
Leu Gly Ile Thr Gln Ser Gly Arg Gln Tyr Phe Glu Ala Leu Glu Asn
355 360 365
Ala Leu Thr Met Arg Pro Asn Leu Tyr Arg Ile Leu Arg Gly Phe Asn
370 375 380
Phe Asn Thr Thr Phe Gly Ser Gly Val Asn Tyr Leu Ser Gly Ile Ala
385 390 395 400
Asn Arg Arg Gly Leu Ile Lys Thr Val Pro Ser Glu Glu Asp Glu Gln
405 410 415
Pro Phe Tyr Gly Gln Ser Asn Gly Arg His Asp Pro Met Ile Phe Ala
420 425 430
Asn Gly Leu Asn Ile Phe Arg Val Thr Leu Leu Arg Asn Lys His Tyr
435 440 445
Ile Ala Leu Pro Val Gly Pro Pro Leu Phe Ala Ala Asn Gln Ile Thr
450 455 460
Asp Leu Lys Leu Tyr Tyr Gly Asn Ala Thr Gln Tyr His His Tyr Gln
465 470 475 480
Pro Ser Asn Glu Gly Leu Thr Thr Glu Ser Thr Asp Leu Ser Phe Pro
485 490 495
Pro Lys Asn Glu Glu Tyr Leu Asn Ala Ile Leu Met Thr Ser Pro Arg
500 505 510
Thr Ile Tyr Glu Asn Pro Thr Asn Ile Tyr Ser Phe Ala Trp Thr Lys
515 520 525
Lys Asp Ile Asp Gln Gln Asn Asn Ile Ile Ile Asn Ala Ile Thr Gln
530 535 540
Ile Pro Ala Val Lys Gly Ile Ala Leu Gly Asn Gln Ser Arg Val Ile
545 550 555 560
Gln Gly Pro Gly His Thr Ser Gly Asp Leu Val Tyr Leu Lys Asp Ser
565 570 575
Leu Glu Leu Arg Cys Gln Tyr Thr Gly Pro Gln Gln Ser Tyr Asp Val
580 585 590
Arg Ile Arg Tyr Ala Ser Asn Gly Ile Leu Asn Asp Arg Ala Met Ile
595 600 605
Ala Ile Thr Ile Pro Gly Val Thr Gly Gln Ser Ile Ser Leu Ser Glu
610 615 620
Thr Phe Ser Gly Thr Asp Tyr Asn Val Leu Glu Phe Gln Asn Phe Lys
625 630 635 640
Ser Val Gln Phe Thr Ser Ala Ile Arg Leu Asn Pro Ser Gln Asn Ile
645 650 655
Phe Ile Tyr Leu Asn Arg Leu Asp Gln Asn Ala Asn Thr Thr Leu Leu
660 665 670
Ile Asp Lys Ile Glu Phe Ile Pro His Pro Leu Pro Arg Ile Leu Ser
675 680 685
Glu Gln Lys Leu Lys Lys Val Lys Gln Lys Val Asn Asp Leu Phe Ile
690 695 700
Asp Ala Glu Asn Gly Cys Phe Cys Pro Ser His Glu Lys Asp Leu Arg
705 710 715 720
<210> 42
<211> 681
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 42
Leu Leu Ile Glu Arg Gly Glu Ile Phe Met Asn Gln Asn Asn Asn Glu
1 5 10 15
Tyr Glu Ile Ala Asp Val Asn Pro Pro Ser Tyr Pro Ser Asn Arg Ile
20 25 30
Asn Asn Tyr Ser Arg Tyr Pro Phe Ala Asn Asp Pro Asn Gln Leu Leu
35 40 45
Gln Gln Thr Asn Tyr Lys Asp Trp Leu Asn Ile Cys Gln Gly Asn Asn
50 55 60
Gln Asn Ser Gly Asn Leu Ile Thr Asp Ala Gly Ala Ala Val Ser Ala
65 70 75 80
Gly Ile Ile Val Thr Gly Thr Met Ile Ala Ala Phe Ala Ala Leu Ala
85 90 95
Val Ala Pro Phe Pro Ile Val Gly Ala Ile Val Gly Gly Ile Val Ile
100 105 110
Ala Phe Gly Thr Leu Leu Pro Leu Val Trp Pro Gly Gly Lys Glu Asp
115 120 125
Arg Thr Val Trp Thr Gln Phe Phe Ala Leu Gly Glu Thr Leu Leu Gly
130 135 140
Lys Arg Leu Thr Asp Asp Ala Gln Lys Thr Ala Ala Glu Thr Val Asp
145 150 155 160
Gly Phe Gly Arg Val Leu Asp Asn Tyr Lys Val Ala Leu Asn Ser Trp
165 170 175
Met Asp Leu Lys Lys Lys Gln Ser Pro Gly Ser Pro Pro Thr Thr Glu
180 185 190
Leu Lys Glu Ala Ala Asp Asn Val Lys Gln Arg Phe Glu Ile Val His
195 200 205
Asn Glu Phe Asp Ala Asn Met Pro His Leu Gly Asn Pro Lys Ser Asn
210 215 220
Tyr Lys Glu Val Leu Leu Ser Ser Tyr Ala His Ala Ala Asn Leu His
225 230 235 240
Leu Asn Leu Leu Gln Gln Gly Val Arg Phe Ala Asp Gln Trp Asn Gln
245 250 255
Asp Ile Tyr Ser Ser Gln Ile Val Pro Ala Ala Gly Thr Ser Thr Glu
260 265 270
Tyr Leu Lys Phe Leu Gln Ala Arg Ile Gln Glu Tyr Val Asn Tyr Cys
275 280 285
Thr Ala Thr Tyr Arg Ser Gly Leu Asn Ile Leu Lys Asn Ser Asn Asp
290 295 300
Ile Thr Trp Asp Ile Tyr Asn Thr Tyr Arg Arg Asp Met Thr Leu Thr
305 310 315 320
Val Leu Asp Leu Val Ala Ala Phe Pro Asn Tyr Asp Pro Thr Tyr Tyr
325 330 335
Pro Ile Asp Thr Lys Ile Gln Leu Thr Arg Thr Ile Tyr Ser Asn Leu
340 345 350
Leu Gly Ile Thr Gln Ser Gly Arg Gln Tyr Phe Glu Ala Leu Glu Asn
355 360 365
Ala Leu Thr Met Arg Pro Asn Leu Tyr Arg Ile Leu Arg Gly Phe Asn
370 375 380
Phe Asn Thr Thr Phe Gly Ser Gly Val Asn Tyr Leu Ser Gly Ile Ala
385 390 395 400
Asn Arg Arg Gly Leu Ile Lys Thr Val Pro Ser Glu Glu Asp Glu Gln
405 410 415
Pro Phe Tyr Gly Gln Ser Asn Gly Arg His Asp Pro Met Ile Phe Ala
420 425 430
Asn Gly Leu Asn Ile Phe Arg Val Thr Leu Leu Arg Asn Lys His Tyr
435 440 445
Ile Ala Leu Pro Val Gly Pro Pro Leu Phe Ala Ala Asn Gln Ile Thr
450 455 460
Asp Leu Lys Leu Tyr Tyr Gly Asn Ala Thr Gln Tyr His His Tyr Gln
465 470 475 480
Pro Ser Asn Glu Gly Leu Thr Thr Glu Ser Thr Asp Leu Ser Phe Pro
485 490 495
Pro Lys Asn Glu Glu Tyr Leu Asn Ala Ile Leu Met Thr Ser Pro Arg
500 505 510
Thr Ile Tyr Glu Asn Pro Thr Asn Ile Tyr Ser Phe Ala Trp Thr Lys
515 520 525
Lys Asp Ile Asp Gln Gln Asn Asn Ile Ile Ile Asn Ala Ile Thr Gln
530 535 540
Ile Pro Ala Val Lys Gly Ile Ala Leu Gly Asn Gln Ser Arg Val Ile
545 550 555 560
Gln Gly Pro Gly His Thr Ser Gly Asp Leu Val Tyr Leu Lys Asp Ser
565 570 575
Leu Glu Leu Arg Cys Gln Tyr Thr Gly Pro Gln Gln Ser Tyr Asp Val
580 585 590
Arg Ile Arg Tyr Ala Ser Asn Gly Ile Leu Asn Asp Arg Ala Met Ile
595 600 605
Ala Ile Thr Ile Pro Gly Val Thr Gly Gln Ser Ile Ser Leu Ser Glu
610 615 620
Thr Phe Ser Gly Thr Asp Tyr Asn Val Leu Glu Phe Gln Asn Phe Lys
625 630 635 640
Ser Val Gln Phe Thr Ser Ala Ile Arg Leu Asn Pro Ser Gln Asn Ile
645 650 655
Phe Ile Tyr Leu Asn Arg Leu Asp Gln Asn Ala Asn Thr Thr Leu Leu
660 665 670
Ile Asp Lys Ile Glu Phe Ile Pro His
675 680
<210> 43
<211> 1229
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 43
Met Asn Gln Asn Asn Gln Lys Glu Ile Gln Ile Ile Gly Ser Ser Ser
1 5 10 15
Asn Asp Val Ser Gln Ser Asn Gly Tyr Pro Arg Tyr Pro Leu Ala Lys
20 25 30
Glu Ser Asn Tyr Lys Asn Trp Leu Thr Ser Cys Asp Glu Ser Asn Leu
35 40 45
Asp Arg Phe Ser Thr Pro Ser Ser Val Gln Asp Ala Ile Val Thr Ser
50 55 60
Leu Ser Val Thr Ser Tyr Ile Ile Ser Phe Leu Glu Gly Pro Ile Gly
65 70 75 80
Ala Gly Leu Gly Ile Leu Ser Val Leu Phe Gly Gln Phe Trp Pro Phe
85 90 95
Asn Thr Asn Thr Val Trp Asp Asn Leu Leu Arg Val Val Glu Glu Leu
100 105 110
Ile Asn Gln Arg Ile Asn Glu Val Glu Arg Glu Arg Ile Ile Arg Gln
115 120 125
Phe Asp Gly Ile Arg Ser Val Leu Ser Asn Tyr Asn Thr Ala Phe Arg
130 135 140
Asn Trp Asn Glu Asn Glu Thr Thr Arg Asn Asn Pro Arg Leu Gln Glu
145 150 155 160
Glu Leu Arg His Arg Phe Asn Asn Thr Asp Asp Ile Ile Ala Asp Arg
165 170 175
Met Pro Glu Phe Arg Ile Lys Gly Phe Glu Thr Gln Ser Leu Ala Val
180 185 190
Tyr Ala Lys Ala Ala Thr Leu His Leu Leu Leu Leu Arg Asp Ala Val
195 200 205
Val Tyr Gly Arg Ala Trp Gly Phe Asp Ser Gln Thr Ile Glu Ser Leu
210 215 220
Tyr Thr Lys Leu Val Cys Leu Ile Asn Ala Tyr Ala Asp His Cys Ile
225 230 235 240
Leu Val Tyr Ser Gln Gly Leu Gln Glu Leu Arg Asn Arg Gly Asn Trp
245 250 255
Arg Asn Phe Asn Asn Tyr Arg Arg Asp Met Thr Ile Thr Val Leu Asp
260 265 270
Ile Ile Ser Leu Phe Ser Asn Tyr Asp Pro Arg Ile Tyr Phe Tyr Asn
275 280 285
Thr Asn Thr Gln Leu Thr Arg Glu Val Tyr Thr Glu Pro Ile Ala Ser
290 295 300
Gly Asn Trp Leu Asn Ser Tyr Ser Asn Pro Asp Gln Phe Gln Gln Ile
305 310 315 320
Glu Asn Asp Leu Asn Pro Pro Pro Ser Leu Phe Ser Thr Leu Ser Thr
325 330 335
Leu Phe Ala Arg Ile Gly Phe Phe Tyr Tyr Asn Asp Ile Tyr Thr Pro
340 345 350
Arg Asp Ala Ile Leu Asn Thr Ser Met Arg Leu Thr Arg Thr Asp Gly
355 360 365
Thr Thr Ile Ile Thr Thr Pro Trp Gln Gly Ala Leu Ser Pro Glu Ile
370 375 380
Ser Gln Glu Arg Gln Leu Ile Phe Ser Gly Tyr Lys Val Tyr Lys Val
385 390 395 400
Asp Ser Ile Leu Ser Asn Thr Ala Val Gln Phe Thr Gly Val Gln Arg
405 410 415
Ala Asp Phe His Met Val Asn Glu Ala Gly Thr Gly Val Thr Ser Arg
420 425 430
Ser Ile Asp Leu Pro Leu Thr Thr Tyr Tyr Phe Ser Lys Ala Ser Asn
435 440 445
Ile Pro Gly Thr Arg Ser Glu Thr Pro Thr Ser Leu Asp Tyr Thr His
450 455 460
Ile Leu Ser Ser Val Arg Ser Thr Ser Ile Gly Thr Thr Tyr Arg Asn
465 470 475 480
Arg Ser Asn Ile Ile Ala Tyr Gly Trp Thr His Thr Ser Ala Glu Arg
485 490 495
Thr Asn Arg Ile Leu Pro Asn Arg Ile Thr Gln Ile Pro Ala Val Lys
500 505 510
Ala Phe Ala Leu Glu Ser Gly Ala Ser Val Arg Pro Gly Pro Gly His
515 520 525
Thr Gly Gly Asp Val Val Ser Leu Ala Leu Ser Gly Arg Leu Lys Ile
530 535 540
Arg Leu Thr Pro Ala Ser Thr Asn Thr Asn Tyr Arg Val Arg Ile Arg
545 550 555 560
Tyr Ala Thr Ser Tyr Gly Ala Ser Leu Met Val Gln Arg Trp Ser Pro
565 570 575
Ser Gly Ser Glu Ser Gly Tyr Phe Ser Ser Ser His Thr Gly Ala Tyr
580 585 590
Pro Lys Phe Gly Tyr Met Asp Thr Leu Val Thr Asn Phe Asn Gln Thr
595 600 605
Gly Val Glu Ile Ile Ile Glu Asn Arg His Tyr Ser Asn Ile Ile Ile
610 615 620
Asp Arg Ile Glu Leu Ile Pro Val Asn Ser Thr Ala Leu Glu Tyr Glu
625 630 635 640
Gly Lys Gln His Leu Glu Lys Ala Lys Lys Ala Val Gly Asn Leu Phe
645 650 655
Asn Asn Asn Gly Lys Glu Ala Leu Lys Ile Asp Thr Thr Asp Tyr Asp
660 665 670
Val Asp Gln Ala Val Asn Leu Val Glu Cys Val Pro Glu Glu Leu Tyr
675 680 685
Thr Lys Glu Lys Met Ile Leu Leu Asp Glu Val Lys His Ala Lys Gln
690 695 700
Leu Ser Ala Ser Arg Asn Leu Ile Gln Asn Gly Asn Phe Glu Phe Tyr
705 710 715 720
Thr Asp Glu Trp Thr Thr Ser Asn Asn Val Ser Ile Gln Ala Asp Asn
725 730 735
Pro Ile Phe Lys Gly Asn Tyr Leu Lys Met Pro Gly Ala Arg Glu Thr
740 745 750
Glu Gly Gly Thr Thr Arg Phe Pro Thr Tyr Val Leu Gln Lys Ile Asp
755 760 765
Glu Ser Lys Leu Lys Pro Tyr Thr Arg Tyr Lys Val Arg Gly Phe Val
770 775 780
Gly Gly Ser His Asp Val Lys Leu Ile Val Glu Arg Tyr Gly Asn Glu
785 790 795 800
Val Asp Ala Ile Leu Asn Val Arg Asn Asp Leu Ala Leu Asp Thr Ile
805 810 815
Ser Pro Ser Cys Val Glu Val Asn Gln Cys Gln Ser Gln Met Tyr Pro
820 825 830
Leu Met His Asp Gly Tyr Leu Ala Asn Val Ile Asn Thr Asn Ser Tyr
835 840 845
Glu Glu Ser Gln Leu Asp Arg Ala Asn Phe Lys Lys Glu Gln Gly Met
850 855 860
Cys His Gln Ser His Gln Phe Asp Phe His Ile Asp Thr Gly Glu Val
865 870 875 880
His Ile Asn Lys Asn Pro Gly Ile Trp Val Leu Phe Lys Ile Ser Ser
885 890 895
Pro Glu Gly His Ala Thr Leu Asp Asn Ile Glu Leu Ile Glu Glu Gly
900 905 910
Pro Leu Val Gly Glu Ser Leu Ala Leu Val Lys Lys Arg Glu Lys Lys
915 920 925
Trp Lys Asn Lys Met Glu Thr Arg Trp Phe Gln Thr Lys Glu Val Tyr
930 935 940
Glu Lys Ala Lys Gly Ala Ile Asp Ala Leu Phe Thr Asp Ala Gln Asp
945 950 955 960
Gln Ala Leu Lys Phe Asp Ile Asn Ile Ser His Ile Ile Ser Ala Glu
965 970 975
His Phe Val Gln Ser Met Pro Tyr Val Tyr Asn Arg Trp Ile Ser Asp
980 985 990
Val Pro Gly Met Asn Tyr Asp Ile Tyr Thr Glu Leu Asp Leu Arg Ile
995 1000 1005
Thr Gln Ala Tyr Ser Leu Tyr Glu His Arg Asn Ile Ile Lys Asn
1010 1015 1020
Gly Asp Phe Asn His Gly Leu Asn His Trp His Ala Thr Pro His
1025 1030 1035
Ala Lys Val Gln Arg Ile Asp Asp Thr Ala Val Leu Val Ile Pro
1040 1045 1050
Asn Trp Ser Ser Ser Val Ser Gln Asn Leu Cys Val Glu Tyr Asn
1055 1060 1065
Arg Gly Tyr Val Leu Arg Val Thr Ala Lys Lys Glu Asp Ile Gly
1070 1075 1080
Lys Gly Tyr Val Thr Ile Ser Asp Cys Asn Gly Asn Gln Glu Thr
1085 1090 1095
Leu Thr Phe Thr Ser Cys Asp Asn Tyr Val Ser Asn Glu Ile Thr
1100 1105 1110
Asn Asp Gln Ser Glu Tyr Pro Phe Ser Gln Glu Met Asn Glu Gln
1115 1120 1125
Arg Ser Tyr Asn Pro Asn Glu Ala Ile Asn Gly Gln Leu Gly Tyr
1130 1135 1140
Ser Leu Gly Gln Val Arg Asn Glu Gln Ser Cys Tyr Thr Arg Asn
1145 1150 1155
Ala Ile Thr Asn Asp Gln Ser Glu Tyr His Phe Ser Gln Glu Met
1160 1165 1170
Asn Glu Gln Arg Ser Tyr Asn Pro Asn Glu Thr Ile Asn Glu Gln
1175 1180 1185
Arg Asn Tyr Val Thr Arg Thr Ile Asp Phe Phe Pro Asp Thr Asp
1190 1195 1200
Lys Ile Arg Ile Asp Ile Gly Glu Thr Glu Gly Thr Phe Asn Ile
1205 1210 1215
Glu Ser Val Glu Leu Ile Cys Met Lys Ser Gln
1220 1225
<210> 44
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 44
Met Asn Gln Asn Asn Gln Lys Glu Ile Gln Ile Ile Gly Ser Ser Ser
1 5 10 15
Asn Asp Val Ser Gln Ser Asn Gly Tyr Pro Arg Tyr Pro Leu Ala Lys
20 25 30
Glu Ser Asn Tyr Lys Asn Trp Leu Thr Ser Cys Asp Glu Ser Asn Leu
35 40 45
Asp Arg Phe Ser Thr Pro Ser Ser Val Gln Asp Ala Ile Val Thr Ser
50 55 60
Leu Ser Val Thr Ser Tyr Ile Ile Ser Phe Leu Glu Gly Pro Ile Gly
65 70 75 80
Ala Gly Leu Gly Ile Leu Ser Val Leu Phe Gly Gln Phe Trp Pro Phe
85 90 95
Asn Thr Asn Thr Val Trp Asp Asn Leu Leu Arg Val Val Glu Glu Leu
100 105 110
Ile Asn Gln Arg Ile Asn Glu Val Glu Arg Glu Arg Ile Ile Arg Gln
115 120 125
Phe Asp Gly Ile Arg Ser Val Leu Ser Asn Tyr Asn Thr Ala Phe Arg
130 135 140
Asn Trp Asn Glu Asn Glu Thr Thr Arg Asn Asn Pro Arg Leu Gln Glu
145 150 155 160
Glu Leu Arg His Arg Phe Asn Asn Thr Asp Asp Ile Ile Ala Asp Arg
165 170 175
Met Pro Glu Phe Arg Ile Lys Gly Phe Glu Thr Gln Ser Leu Ala Val
180 185 190
Tyr Ala Lys Ala Ala Thr Leu His Leu Leu Leu Leu Arg Asp Ala Val
195 200 205
Val Tyr Gly Arg Ala Trp Gly Phe Asp Ser Gln Thr Ile Glu Ser Leu
210 215 220
Tyr Thr Lys Leu Val Cys Leu Ile Asn Ala Tyr Ala Asp His Cys Ile
225 230 235 240
Leu Val Tyr Ser Gln Gly Leu Gln Glu Leu Arg Asn Arg Gly Asn Trp
245 250 255
Arg Asn Phe Asn Asn Tyr Arg Arg Asp Met Thr Ile Thr Val Leu Asp
260 265 270
Ile Ile Ser Leu Phe Ser Asn Tyr Asp Pro Arg Ile Tyr Phe Tyr Asn
275 280 285
Thr Asn Thr Gln Leu Thr Arg Glu Val Tyr Thr Glu Pro Ile Ala Ser
290 295 300
Gly Asn Trp Leu Asn Ser Tyr Ser Asn Pro Asp Gln Phe Gln Gln Ile
305 310 315 320
Glu Asn Asp Leu Asn Pro Pro Pro Ser Leu Phe Ser Thr Leu Ser Thr
325 330 335
Leu Phe Ala Arg Ile Gly Phe Phe Tyr Tyr Asn Asp Ile Tyr Thr Pro
340 345 350
Arg Asp Ala Ile Leu Asn Thr Ser Met Arg Leu Thr Arg Thr Asp Gly
355 360 365
Thr Thr Ile Ile Thr Thr Pro Trp Gln Gly Ala Leu Ser Pro Glu Ile
370 375 380
Ser Gln Glu Arg Gln Leu Ile Phe Ser Gly Tyr Lys Val Tyr Lys Val
385 390 395 400
Asp Ser Ile Leu Ser Asn Thr Ala Val Gln Phe Thr Gly Val Gln Arg
405 410 415
Ala Asp Phe His Met Val Asn Glu Ala Gly Thr Gly Val Thr Ser Arg
420 425 430
Ser Ile Asp Leu Pro Leu Thr Thr Tyr Tyr Phe Ser Lys Ala Ser Asn
435 440 445
Ile Pro Gly Thr Arg Ser Glu Thr Pro Thr Ser Leu Asp Tyr Thr His
450 455 460
Ile Leu Ser Ser Val Arg Ser Thr Ser Ile Gly Thr Thr Tyr Arg Asn
465 470 475 480
Arg Ser Asn Ile Ile Ala Tyr Gly Trp Thr His Thr Ser Ala Glu Arg
485 490 495
Thr Asn Arg Ile Leu Pro Asn Arg Ile Thr Gln Ile Pro Ala Val Lys
500 505 510
Ala Phe Ala Leu Glu Ser Gly Ala Ser Val Arg Pro Gly Pro Gly His
515 520 525
Thr Gly Gly Asp Val Val Ser Leu Ala Leu Ser Gly Arg Leu Lys Ile
530 535 540
Arg Leu Thr Pro Ala Ser Thr Asn Thr Asn Tyr Arg Val Arg Ile Arg
545 550 555 560
Tyr Ala Thr Ser Tyr Gly Ala Ser Leu Met Val Gln Arg Trp Ser Pro
565 570 575
Ser Gly Ser Glu Ser Gly Tyr Phe Ser Ser Ser His Thr Gly Ala Tyr
580 585 590
Pro Lys Phe Gly Tyr Met Asp Thr Leu Val Thr Asn Phe Asn Gln Thr
595 600 605
Gly Val Glu Ile Ile Ile Glu Asn Arg His Tyr Ser Asn Ile Ile Ile
610 615 620
Asp Arg Ile Glu Leu Ile Pro Val
625 630
<210> 45
<211> 825
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 45
Leu Phe Ser Ile Phe Gly Ile Gln Met Leu Ser Tyr Ile Lys Ile Gly
1 5 10 15
Gly Glu Asn Met His Gln Thr Asn Thr Ser Lys Asp Lys Asn Arg Met
20 25 30
Ala Ser Phe Glu Lys Thr Asn Met Ser Asn Glu Gly Pro Ile Tyr Pro
35 40 45
Leu Ala Asn Ser Ser Asn Thr Gly Leu Gln Gly Met Asn Tyr Asn Asp
50 55 60
Ser Val Asn Met His Ala Thr Lys Asn Glu Asp Cys Leu Leu Ser Ser
65 70 75 80
Arg Val Ser Val Asp Pro Lys Thr Ala Val Ala Thr Gly Ile Lys Ile
85 90 95
Ser Thr Phe Ile Met Ser Ala Leu Gly Ile Pro Leu Ala Ser Gln Ile
100 105 110
Gly Gln Leu Trp Asn Phe Val Leu Asp Gln Leu Trp Pro Asn Gly Asp
115 120 125
Asn Gln Trp Glu Glu Phe Met Lys His Val Glu Glu Leu Ile Asn Gln
130 135 140
Lys Ile Thr Glu Tyr Ala Arg Asp Lys Ala Leu Ala Glu Leu Glu Gly
145 150 155 160
Leu Gly Asn Val Leu Glu Leu Tyr Gln Glu Ala Ile Glu Asp Trp Lys
165 170 175
Lys Asp Pro Thr Asn Val Ala Ser Gln Glu Arg Val Arg Thr Arg Phe
180 185 190
Arg Asn Ala Asp Ala Phe Phe Glu Asn Gly Met Pro Ser Phe Arg Ile
195 200 205
Lys Asn Tyr Glu Val Pro Leu Leu Ser Val Tyr Ala Ser Ala Ala Asn
210 215 220
Leu His Leu Leu Leu Leu Arg Asp Ala Ser Ile Tyr Gly Lys Asp Trp
225 230 235 240
Gly Leu Asn Glu Thr Asn Val Lys Asp Asn Tyr Asn Arg Gln Leu Lys
245 250 255
Arg Thr Glu Ser Tyr Ser Asn His Cys Val Thr Trp Tyr Gln Lys Gly
260 265 270
Leu Gln Arg Leu Lys Gly Ser Asp Ala Asp Ser Trp Leu Lys Phe Asn
275 280 285
Ala Phe Arg Arg Asp Met Thr Phe Thr Val Leu Asp Ile Val Val Leu
290 295 300
Phe Pro Cys Tyr Asp Ile Gln Lys Tyr Pro Met Gly Val Lys Thr Glu
305 310 315 320
Ile Thr Arg Glu Ile Tyr Thr Asp Pro Ile Gly Asn Leu Pro Arg Arg
325 330 335
Pro Gly Phe Gln Val Leu Asp Trp Val Asn His Ala Pro Ser Phe Glu
340 345 350
Glu Ile Glu Lys Glu Thr Thr Gly Asp Pro Ser Met Val Thr Trp Ile
355 360 365
Asn Tyr Leu Glu Ile Ser Arg Asn Lys Leu Gly Gly Trp Asn Ser Ser
370 375 380
Asn Asp Tyr Trp Glu Gly His Ser Val His Tyr Lys Tyr Ser Asn Asp
385 390 395 400
Gly Ile Val Ile Asp Ser Lys Ala Tyr Ala His Gly Asp Phe Ser Gly
405 410 415
Lys Phe Pro Thr Glu Ala Ile His Leu Glu Asn Cys Asp Val Tyr Ala
420 425 430
Ser Ser Ser Leu Pro Val Ala Ala Ser Asn Pro Tyr Gly Asp Lys Ala
435 440 445
Ser Phe Gly Val Thr Lys Val Tyr Phe Asp Met Leu Asn Leu Asn Asp
450 455 460
Asn Thr Leu Gln Arg Gly Ser Trp Asp His Pro Lys Asn Trp Gly Gly
465 470 475 480
Ser Trp Val Asp Leu Lys Phe Leu Asn Glu Thr Pro Asn Ile Lys Glu
485 490 495
Ile Gln Lys Tyr Ser His Lys Leu Thr Tyr Val Ser Ser Phe Ser Val
500 505 510
Asp Tyr Gly Ser Val Leu Cys Tyr Arg Trp Arg His Ser Ser Val Asp
515 520 525
Arg Asn Asn Thr Leu Asp Pro Gln Lys Ile Thr Gln Ile Ser Ala Val
530 535 540
Lys Gly His Thr Pro Ile Asn Cys Glu Val Val Arg Gly Asn Gly Leu
545 550 555 560
Thr Asp Gly Asp Trp Leu Gln Leu Lys Gly Gly Gly Ser Phe Thr Leu
565 570 575
Asn Val Ser Ser Val Ile Pro Lys Thr Tyr Arg Ile Arg Leu Arg Tyr
580 585 590
Ala Cys Gly Pro Lys Asp Val Asn Leu Thr Val Ser Cys Pro Asp Asn
595 600 605
Gly Ile Asp Lys Thr Ile Thr Ile Pro Ser Thr Ser Thr Ala Ser Leu
610 615 620
Ala Asn Asn Leu Met Tyr Lys Asp Phe Lys Tyr Ile Asp Leu Pro Ile
625 630 635 640
Glu Phe Ile Thr Ser Ala Ala Pro Lys Arg Cys Arg Ile Asn Leu Arg
645 650 655
Leu Glu Gly Val Gln Asn Thr Ser Phe Phe Met Asp Lys Phe Glu Phe
660 665 670
Ile Pro Glu Asn Glu Thr Glu Thr Arg Pro Trp Lys Ile Val Thr Ala
675 680 685
Leu Asn Asn Ser Ser Val Leu Ala Phe Ala Lys Asn Ser Leu Tyr Lys
690 695 700
Glu Ala Ile Leu Ser Asn Asp Thr Gly Glu Glu Ser Gln Lys Trp Arg
705 710 715 720
Phe Ile Tyr Asp Glu Glu Lys Lys Ala Tyr Gln Ile Glu Asp Val Lys
725 730 735
Ser Pro Arg Leu Ile Leu Ala Trp Ile Ile Asp Gly Pro Glu Ser Arg
740 745 750
Lys Val Trp Ala Thr Arg Ser Gln Asn Ile Asp Glu His Tyr Trp Ile
755 760 765
Leu Glu Glu Gln Glu Asp Ser Tyr Phe Ile Ile Lys Asn Lys Lys Asn
770 775 780
Pro Ser Leu Val Leu Asp Val Asp Gly Ala Gln Thr Asn Gln Gly Thr
785 790 795 800
Lys Ile Lys Val Asn Glu Arg His Pro Leu Tyr Ser Ser Arg Thr Asn
805 810 815
Ala Gln Lys Phe Lys Leu Glu Arg Val
820 825
<210> 46
<211> 675
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 46
Leu Phe Ser Ile Phe Gly Ile Gln Met Leu Ser Tyr Ile Lys Ile Gly
1 5 10 15
Gly Glu Asn Met His Gln Thr Asn Thr Ser Lys Asp Lys Asn Arg Met
20 25 30
Ala Ser Phe Glu Lys Thr Asn Met Ser Asn Glu Gly Pro Ile Tyr Pro
35 40 45
Leu Ala Asn Ser Ser Asn Thr Gly Leu Gln Gly Met Asn Tyr Asn Asp
50 55 60
Ser Val Asn Met His Ala Thr Lys Asn Glu Asp Cys Leu Leu Ser Ser
65 70 75 80
Arg Val Ser Val Asp Pro Lys Thr Ala Val Ala Thr Gly Ile Lys Ile
85 90 95
Ser Thr Phe Ile Met Ser Ala Leu Gly Ile Pro Leu Ala Ser Gln Ile
100 105 110
Gly Gln Leu Trp Asn Phe Val Leu Asp Gln Leu Trp Pro Asn Gly Asp
115 120 125
Asn Gln Trp Glu Glu Phe Met Lys His Val Glu Glu Leu Ile Asn Gln
130 135 140
Lys Ile Thr Glu Tyr Ala Arg Asp Lys Ala Leu Ala Glu Leu Glu Gly
145 150 155 160
Leu Gly Asn Val Leu Glu Leu Tyr Gln Glu Ala Ile Glu Asp Trp Lys
165 170 175
Lys Asp Pro Thr Asn Val Ala Ser Gln Glu Arg Val Arg Thr Arg Phe
180 185 190
Arg Asn Ala Asp Ala Phe Phe Glu Asn Gly Met Pro Ser Phe Arg Ile
195 200 205
Lys Asn Tyr Glu Val Pro Leu Leu Ser Val Tyr Ala Ser Ala Ala Asn
210 215 220
Leu His Leu Leu Leu Leu Arg Asp Ala Ser Ile Tyr Gly Lys Asp Trp
225 230 235 240
Gly Leu Asn Glu Thr Asn Val Lys Asp Asn Tyr Asn Arg Gln Leu Lys
245 250 255
Arg Thr Glu Ser Tyr Ser Asn His Cys Val Thr Trp Tyr Gln Lys Gly
260 265 270
Leu Gln Arg Leu Lys Gly Ser Asp Ala Asp Ser Trp Leu Lys Phe Asn
275 280 285
Ala Phe Arg Arg Asp Met Thr Phe Thr Val Leu Asp Ile Val Val Leu
290 295 300
Phe Pro Cys Tyr Asp Ile Gln Lys Tyr Pro Met Gly Val Lys Thr Glu
305 310 315 320
Ile Thr Arg Glu Ile Tyr Thr Asp Pro Ile Gly Asn Leu Pro Arg Arg
325 330 335
Pro Gly Phe Gln Val Leu Asp Trp Val Asn His Ala Pro Ser Phe Glu
340 345 350
Glu Ile Glu Lys Glu Thr Thr Gly Asp Pro Ser Met Val Thr Trp Ile
355 360 365
Asn Tyr Leu Glu Ile Ser Arg Asn Lys Leu Gly Gly Trp Asn Ser Ser
370 375 380
Asn Asp Tyr Trp Glu Gly His Ser Val His Tyr Lys Tyr Ser Asn Asp
385 390 395 400
Gly Ile Val Ile Asp Ser Lys Ala Tyr Ala His Gly Asp Phe Ser Gly
405 410 415
Lys Phe Pro Thr Glu Ala Ile His Leu Glu Asn Cys Asp Val Tyr Ala
420 425 430
Ser Ser Ser Leu Pro Val Ala Ala Ser Asn Pro Tyr Gly Asp Lys Ala
435 440 445
Ser Phe Gly Val Thr Lys Val Tyr Phe Asp Met Leu Asn Leu Asn Asp
450 455 460
Asn Thr Leu Gln Arg Gly Ser Trp Asp His Pro Lys Asn Trp Gly Gly
465 470 475 480
Ser Trp Val Asp Leu Lys Phe Leu Asn Glu Thr Pro Asn Ile Lys Glu
485 490 495
Ile Gln Lys Tyr Ser His Lys Leu Thr Tyr Val Ser Ser Phe Ser Val
500 505 510
Asp Tyr Gly Ser Val Leu Cys Tyr Arg Trp Arg His Ser Ser Val Asp
515 520 525
Arg Asn Asn Thr Leu Asp Pro Gln Lys Ile Thr Gln Ile Ser Ala Val
530 535 540
Lys Gly His Thr Pro Ile Asn Cys Glu Val Val Arg Gly Asn Gly Leu
545 550 555 560
Thr Asp Gly Asp Trp Leu Gln Leu Lys Gly Gly Gly Ser Phe Thr Leu
565 570 575
Asn Val Ser Ser Val Ile Pro Lys Thr Tyr Arg Ile Arg Leu Arg Tyr
580 585 590
Ala Cys Gly Pro Lys Asp Val Asn Leu Thr Val Ser Cys Pro Asp Asn
595 600 605
Gly Ile Asp Lys Thr Ile Thr Ile Pro Ser Thr Ser Thr Ala Ser Leu
610 615 620
Ala Asn Asn Leu Met Tyr Lys Asp Phe Lys Tyr Ile Asp Leu Pro Ile
625 630 635 640
Glu Phe Ile Thr Ser Ala Ala Pro Lys Arg Cys Arg Ile Asn Leu Arg
645 650 655
Leu Glu Gly Val Gln Asn Thr Ser Phe Phe Met Asp Lys Phe Glu Phe
660 665 670
Ile Pro Glu
675
<210> 47
<211> 624
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 47
Leu Tyr Ile Gly Lys Thr Ser Gly Glu Val Lys Met Leu Arg Thr Ile
1 5 10 15
Gln Tyr Lys Asn Ala Ser Glu Ile Val Tyr Asp Gln Pro Leu Pro Tyr
20 25 30
Gly Gly Val Val Lys Ile Ala Gly Lys Ile Gly Leu Gly Val Val Lys
35 40 45
Thr Val Ile Thr Ser Ile Ile Arg Arg Gly Arg Asp Asn Glu Ile Ala
50 55 60
Gly Arg Ile Leu Ser Asp Val Tyr Ser Val Leu Trp Ser Asn Arg Lys
65 70 75 80
Gly Tyr Trp Asn Glu Met Ile Glu Ala Val Glu Thr Leu Ile Gln Gln
85 90 95
Asp Ile Ser Glu Asn Ile Lys Asn Asn Ala Leu Ala Ala Leu Thr Asp
100 105 110
Ile Arg Asn Ala Leu Leu Leu Tyr Gln Gln Ala Val Glu Asp Trp Glu
115 120 125
Ser Asn Arg Thr Asp Ser Gln Leu Gln Glu Arg Val Arg Asn Gln Leu
130 135 140
Ile Ser Thr Ile Thr Phe Ile Glu Phe Ala Met Pro Ser Phe Thr Val
145 150 155 160
Gln His Tyr Glu Val Ile Leu Leu Pro Ile Phe Thr Gln Val Ala Asn
165 170 175
Leu His Leu Leu Leu Leu Arg Asp Ala Glu Ile Phe Gly Leu Lys Trp
180 185 190
Arg Met Ser Lys Ala Glu Ile Asp Asp Tyr Tyr Phe Ala Lys Ser Gly
195 200 205
Leu Thr Arg Leu Thr Gln Lys Tyr Thr Asn His Ser Val Lys Trp Tyr
210 215 220
Arg Glu Gly Leu Cys Ile Ala Thr Asn Ile Asp Leu Asp Gln Cys Pro
225 230 235 240
Glu Phe Tyr Gln Leu Asp Lys Trp Asn Ala Met Asn Asp Phe Arg Arg
245 250 255
Glu Met Thr Phe Met Val Leu Asp Ile Ile Ala Leu Trp Pro Thr Tyr
260 265 270
Asp Pro Ile Arg Tyr Pro Leu Gly Ile Lys Thr Glu Leu Thr Arg Glu
275 280 285
Val Phe Thr Pro Leu Leu Gly Ile Asn Pro Asn Ser Ser Trp Leu Ile
290 295 300
Arg Thr Met Glu Glu Ile Glu Ala Lys Leu Thr Val Leu Ser Pro Phe
305 310 315 320
Leu Ser Trp Ile Ser Phe Glu Gln Leu Val Lys Gln Gly Asp Gly Ile
325 330 335
Ser Thr Phe Thr Asp Trp Gly Asn Phe Thr Leu Ser Asn Thr Met Leu
340 345 350
Pro Leu Ser Tyr Ile Leu Gly Gly Ala Gly Pro Gly Thr Gly Asp Ser
355 360 365
Thr Asn Ile Pro Met Gln Ser Glu Asn Tyr Asp Val Tyr Lys Val Leu
370 375 380
Val Gly Thr Asp Tyr Ser His Pro Ser Asn Val Pro Ile Arg Lys Leu
385 390 395 400
Glu Tyr Tyr Cys Thr Asn Gly Met Met Glu Asn Val Ile Thr Val Gly
405 410 415
Thr Gly Thr Thr Asn Ala Leu Phe Glu Leu Pro Asn Asn Gly Cys Val
420 425 430
Asp Tyr Ser His Arg Val Ser Arg Leu Ser Cys Ser Asn Val Glu Val
435 440 445
Tyr Glu Trp Glu Gly Gly Pro Lys Tyr Ala Leu Lys Asn Ile Ala Tyr
450 455 460
Gly Trp Thr His Ile Ser Val Asp Ser Lys Asn Thr Leu Ser His Asn
465 470 475 480
Ala Ile Thr Gln Ile Pro Ala Arg Lys Gly Tyr Ser Ser Ser Glu Ser
485 490 495
Asn Leu Ser Ile Glu Gly Pro Tyr Phe Thr Gly Gly Asp Leu Ile Ala
500 505 510
Leu Pro Pro Asn Gly Ala Gln Leu Gln Met Arg Val Ser Pro Pro Val
515 520 525
Ser Ser Ser Ser Ile Asn Tyr Cys Val Arg Leu Arg Tyr Ala Ser Ser
530 535 540
Gly Asn Thr Asn Ile Tyr Val Glu Arg Val Leu Ser Ser Gly Asp Thr
545 550 555 560
Tyr Gly Glu Thr His Asp Val Pro Ala Thr Tyr Ser Gly Gly Ser Leu
565 570 575
Ser Tyr Ser Ser Phe Ala Tyr Val Val Asn Leu Thr Ala Ile Phe Glu
580 585 590
Gly Phe Asn Val Glu Ile Lys Ile Lys Asn Ile Gly Ser Ser Gln Ile
595 600 605
Ile Leu Asp Lys Ile Glu Phe Leu Pro Ile Lys Glu Ser Leu Lys Glu
610 615 620
<210> 48
<211> 618
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 48
Leu Tyr Ile Gly Lys Thr Ser Gly Glu Val Lys Met Leu Arg Thr Ile
1 5 10 15
Gln Tyr Lys Asn Ala Ser Glu Ile Val Tyr Asp Gln Pro Leu Pro Tyr
20 25 30
Gly Gly Val Val Lys Ile Ala Gly Lys Ile Gly Leu Gly Val Val Lys
35 40 45
Thr Val Ile Thr Ser Ile Ile Arg Arg Gly Arg Asp Asn Glu Ile Ala
50 55 60
Gly Arg Ile Leu Ser Asp Val Tyr Ser Val Leu Trp Ser Asn Arg Lys
65 70 75 80
Gly Tyr Trp Asn Glu Met Ile Glu Ala Val Glu Thr Leu Ile Gln Gln
85 90 95
Asp Ile Ser Glu Asn Ile Lys Asn Asn Ala Leu Ala Ala Leu Thr Asp
100 105 110
Ile Arg Asn Ala Leu Leu Leu Tyr Gln Gln Ala Val Glu Asp Trp Glu
115 120 125
Ser Asn Arg Thr Asp Ser Gln Leu Gln Glu Arg Val Arg Asn Gln Leu
130 135 140
Ile Ser Thr Ile Thr Phe Ile Glu Phe Ala Met Pro Ser Phe Thr Val
145 150 155 160
Gln His Tyr Glu Val Ile Leu Leu Pro Ile Phe Thr Gln Val Ala Asn
165 170 175
Leu His Leu Leu Leu Leu Arg Asp Ala Glu Ile Phe Gly Leu Lys Trp
180 185 190
Arg Met Ser Lys Ala Glu Ile Asp Asp Tyr Tyr Phe Ala Lys Ser Gly
195 200 205
Leu Thr Arg Leu Thr Gln Lys Tyr Thr Asn His Ser Val Lys Trp Tyr
210 215 220
Arg Glu Gly Leu Cys Ile Ala Thr Asn Ile Asp Leu Asp Gln Cys Pro
225 230 235 240
Glu Phe Tyr Gln Leu Asp Lys Trp Asn Ala Met Asn Asp Phe Arg Arg
245 250 255
Glu Met Thr Phe Met Val Leu Asp Ile Ile Ala Leu Trp Pro Thr Tyr
260 265 270
Asp Pro Ile Arg Tyr Pro Leu Gly Ile Lys Thr Glu Leu Thr Arg Glu
275 280 285
Val Phe Thr Pro Leu Leu Gly Ile Asn Pro Asn Ser Ser Trp Leu Ile
290 295 300
Arg Thr Met Glu Glu Ile Glu Ala Lys Leu Thr Val Leu Ser Pro Phe
305 310 315 320
Leu Ser Trp Ile Ser Phe Glu Gln Leu Val Lys Gln Gly Asp Gly Ile
325 330 335
Ser Thr Phe Thr Asp Trp Gly Asn Phe Thr Leu Ser Asn Thr Met Leu
340 345 350
Pro Leu Ser Tyr Ile Leu Gly Gly Ala Gly Pro Gly Thr Gly Asp Ser
355 360 365
Thr Asn Ile Pro Met Gln Ser Glu Asn Tyr Asp Val Tyr Lys Val Leu
370 375 380
Val Gly Thr Asp Tyr Ser His Pro Ser Asn Val Pro Ile Arg Lys Leu
385 390 395 400
Glu Tyr Tyr Cys Thr Asn Gly Met Met Glu Asn Val Ile Thr Val Gly
405 410 415
Thr Gly Thr Thr Asn Ala Leu Phe Glu Leu Pro Asn Asn Gly Cys Val
420 425 430
Asp Tyr Ser His Arg Val Ser Arg Leu Ser Cys Ser Asn Val Glu Val
435 440 445
Tyr Glu Trp Glu Gly Gly Pro Lys Tyr Ala Leu Lys Asn Ile Ala Tyr
450 455 460
Gly Trp Thr His Ile Ser Val Asp Ser Lys Asn Thr Leu Ser His Asn
465 470 475 480
Ala Ile Thr Gln Ile Pro Ala Arg Lys Gly Tyr Ser Ser Ser Glu Ser
485 490 495
Asn Leu Ser Ile Glu Gly Pro Tyr Phe Thr Gly Gly Asp Leu Ile Ala
500 505 510
Leu Pro Pro Asn Gly Ala Gln Leu Gln Met Arg Val Ser Pro Pro Val
515 520 525
Ser Ser Ser Ser Ile Asn Tyr Cys Val Arg Leu Arg Tyr Ala Ser Ser
530 535 540
Gly Asn Thr Asn Ile Tyr Val Glu Arg Val Leu Ser Ser Gly Asp Thr
545 550 555 560
Tyr Gly Glu Thr His Asp Val Pro Ala Thr Tyr Ser Gly Gly Ser Leu
565 570 575
Ser Tyr Ser Ser Phe Ala Tyr Val Val Asn Leu Thr Ala Ile Phe Glu
580 585 590
Gly Phe Asn Val Glu Ile Lys Ile Lys Asn Ile Gly Ser Ser Gln Ile
595 600 605
Ile Leu Asp Lys Ile Glu Phe Leu Pro Ile
610 615
<210> 49
<211> 376
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 49
Met Met Ser Asn Glu Ser Lys Asn Asn Glu Asn Met Asn Leu Pro Ile
1 5 10 15
Ile Asn Asp Glu Asp Ser Ile Pro Ser Glu Ile Arg Asp Ser Arg Ser
20 25 30
Lys Ile Thr Phe Pro Asp Phe Asn Ile Tyr Lys Ile Gly Thr Phe Cys
35 40 45
Gly Lys Tyr Val Asp Ile Pro Gly Gly Ala Asn Glu Asp Ser Ser Ala
50 55 60
Ile Gln Phe Gln Thr Gln Asp Gly Asp Asn Gln Lys Phe Leu Ile Phe
65 70 75 80
Thr Leu Asp Asp Ser Tyr Ser Ile Ile Ala Ala Arg His Ser Gly Lys
85 90 95
Val Met Asp Ala Tyr Tyr Asn Ser Ser Gln Asn Arg Ile Ile Gln Phe
100 105 110
Asp Phe His Asn Thr Asn Asn Gln Lys Phe Phe Leu Ser Asp Asn Gly
115 120 125
Thr Ile Ala Ser Lys Gln Asp Arg Gln Val Trp Asp Val Gln Gly Glu
130 135 140
Ser Thr Gln Asn Gly Ala Arg Ile Val Val Phe Lys Phe Ile Asp Val
145 150 155 160
Arg Asn Gln Lys Phe Thr Leu Asn Lys Leu Gly Ser Val Pro Val His
165 170 175
Gln Pro Thr Leu Ser Pro Leu Pro Ser Ala Pro Asp Phe Arg Thr Asn
180 185 190
Asp Ile Asn Glu Gln Leu Pro Asp Gln Thr Asn Pro Val Asn Thr His
195 200 205
Phe Thr Tyr Leu Pro Tyr Phe Met Val Lys Asp Pro Leu Tyr Asn Ala
210 215 220
Gln Gln Gln Ile Lys Asn Ser Pro Tyr Tyr Thr Leu Val Arg Arg Gln
225 230 235 240
Tyr Trp Glu Lys Lys Thr Gln Arg Ile Leu Ala Pro Ser Asp Thr Tyr
245 250 255
Glu Tyr Ser Glu Thr Ile Gly Val Ser Arg Thr Asp Gln Ser Ser Met
260 265 270
Thr Asp Ile Thr Glu Ile Ser Ile Gly Val Asp Leu Gly Phe Val Phe
275 280 285
Lys Gly Phe Ser Ala Gly Leu Ser Thr Asn Val Thr Arg Thr Leu Ser
290 295 300
Val Thr Lys Ser Thr Ser Asn Thr Glu Ser Ser Glu Ser Thr Arg Lys
305 310 315 320
Leu Ser Tyr Lys Asn Glu Phe Arg His Thr Ile Ala Tyr Ala Lys Tyr
325 330 335
Met Leu Ile Asn Glu Tyr Tyr Val Thr Arg Ala Asn Gly Glu Lys Ile
340 345 350
Thr Thr Gly Asp Ala Thr Tyr Trp Arg Val Pro Asn Pro Glu Gln Thr
355 360 365
Val Ala Arg Ile Ile Pro Arg Ser
370 375
<210> 50
<211> 665
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 50
Val Glu Ala Thr Arg Arg Lys Tyr Gln Ile Cys Cys His Gln Thr Tyr
1 5 10 15
Thr Ile Tyr Lys Gly Gly Leu Cys Met Asn Lys Asn Asn Asn Glu Ile
20 25 30
Thr Glu Val Pro Cys Ser Glu Cys Gly Asp Ile Asp Tyr Arg Asp Ile
35 40 45
Met Asn Met Ser Thr Gln Gly Glu Tyr Val Ala Gln Gln Ser Val Val
50 55 60
Leu Thr Ala Met Asn Thr Ser Phe Gln Val Ile Arg Thr Leu Tyr Gln
65 70 75 80
Ala Asn Lys Gly Thr Leu Ser Lys Ala Gly Gly Val Ile Lys Ser Ile
85 90 95
Phe Thr Tyr Leu Trp Arg Gln Lys Asn Ser Gln Glu Glu Trp Asp Asn
100 105 110
Phe Met Lys Ala Val Glu Ile Leu Ile Ala Glu Glu Leu Ser Thr Tyr
115 120 125
Ala Arg Asn Asp Ala Ile Ala Ser Val Arg Gly Leu Glu Asn Val Leu
130 135 140
Asn Glu Tyr Gln Glu Ala Leu Asp Asp Leu Ala Leu Asp Pro Thr Asn
145 150 155 160
Ser Ala Leu Lys Asp Glu Val Lys Arg Trp Val Thr Thr Val His Gly
165 170 175
Gln Phe Glu Ala Thr Met Pro Thr Leu Ala Val Gly Gly Tyr Glu Val
180 185 190
Gln Gln Leu Val Ala Tyr Ala Glu Ala Ala Asn Leu His Leu Ala Phe
195 200 205
Leu Arg Asp Val Ala Lys Phe Gly Tyr Gln Trp Asp Leu Asp Lys Thr
210 215 220
Thr Val Ala Asn Tyr Tyr Lys Asp Leu Lys Glu Asn Thr Gln Ile Tyr
225 230 235 240
Thr Asp His Cys Val Arg Thr Tyr Asn Thr Gly Leu Glu Lys Thr Lys
245 250 255
Ser Leu Met Gly Ala Ile Asn Pro Lys Asp Tyr Asn Lys Tyr Pro Tyr
260 265 270
Leu Asn Pro Tyr Ala Asn Gly Ser Thr Asn Phe Tyr Lys Lys Val Leu
275 280 285
Gln Trp Asn Val Trp Asn Asp Phe Arg Arg Asp Met Thr Ile Met Val
290 295 300
Leu Asp Leu Val Ala Leu Trp Pro Thr Phe Asp Pro Glu Val Tyr Thr
305 310 315 320
Asn Pro Glu Gly Val Asp Leu Glu Leu Ser Arg Glu Val Tyr Ser Thr
325 330 335
Ala Tyr Gly Arg Tyr Gly Ser Ser Thr His Gly Gly Asp Trp Lys Ser
340 345 350
Trp Glu Val Met Glu Thr Ala Val Asp Arg Ser Pro His Leu Val Ser
355 360 365
Phe Leu Thr Asn Leu Ala Ile Tyr Gln Lys Thr Tyr Ser Ser Ser Ser
370 375 380
Thr Asn Thr Lys Leu Asp Gln Tyr Asp Gly Val Ile Asn Thr Leu Lys
385 390 395 400
Thr Val Gly Ser Thr Ser Thr Phe Glu Asp Gly Phe Ala Pro Ile Ser
405 410 415
Lys Glu Ala Tyr Arg Ser Val Lys Asn Ile Pro Gly Glu Tyr Ile Asn
420 425 430
Gly Val Asp Leu Arg Leu Gly Met Val Pro Cys Ser Leu Thr Phe Lys
435 440 445
Thr Tyr Ser Ser Gly Thr Phe Phe Ala Gly Gln Cys Tyr Ser His Leu
450 455 460
Ala Gln Ser Thr Phe Tyr Asn Leu Ser Ala Thr Ile Ala Tyr His Arg
465 470 475 480
Leu Ser Tyr Val Asn Ala Ile Asp Asp Asn Phe Ser Gly Tyr Tyr Ser
485 490 495
Gly Trp Gly Ile Gly Thr Trp Gly Met Gly Trp Leu His Glu Asn Leu
500 505 510
Lys Thr Leu Asn Ile Ile His Ala Thr Lys Ser Ser Cys Ile Pro Ala
515 520 525
Val Lys Ala Arg Ile Leu Thr Asn Ser Ala Thr Val Ile Lys Gly Pro
530 535 540
Gly Ser Thr Gly Gly Asp Leu Val Lys Leu Pro Gly Val Pro Ser Thr
545 550 555 560
Ser Asp Ile Gln Val Arg Met Asn Met Thr Arg Glu Val Trp Lys Asn
565 570 575
Tyr Arg Ile Arg Ile Arg Tyr Ala Thr Ser Ser Ser Ala Gln Leu Phe
580 585 590
Val Gly Lys Trp Thr Gly Arg Trp Ala Ala Thr Ala Asn Ile Asn Leu
595 600 605
Ala Ala Thr Tyr Ser Gly Glu Leu Thr Tyr Ser Ala Phe Lys Leu Ala
610 615 620
Asp Val Pro Phe Asn Ile Thr Thr Gly Glu Pro Glu Phe Ser Phe Glu
625 630 635 640
Leu Arg Asn Asn Ser Gly Gly Pro Ile Leu Ile Asp Arg Ile Glu Phe
645 650 655
Ile Pro Leu Glu Gly Thr Phe Ala Glu
660 665
<210> 51
<211> 659
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 51
Val Glu Ala Thr Arg Arg Lys Tyr Gln Ile Cys Cys His Gln Thr Tyr
1 5 10 15
Thr Ile Tyr Lys Gly Gly Leu Cys Met Asn Lys Asn Asn Asn Glu Ile
20 25 30
Thr Glu Val Pro Cys Ser Glu Cys Gly Asp Ile Asp Tyr Arg Asp Ile
35 40 45
Met Asn Met Ser Thr Gln Gly Glu Tyr Val Ala Gln Gln Ser Val Val
50 55 60
Leu Thr Ala Met Asn Thr Ser Phe Gln Val Ile Arg Thr Leu Tyr Gln
65 70 75 80
Ala Asn Lys Gly Thr Leu Ser Lys Ala Gly Gly Val Ile Lys Ser Ile
85 90 95
Phe Thr Tyr Leu Trp Arg Gln Lys Asn Ser Gln Glu Glu Trp Asp Asn
100 105 110
Phe Met Lys Ala Val Glu Ile Leu Ile Ala Glu Glu Leu Ser Thr Tyr
115 120 125
Ala Arg Asn Asp Ala Ile Ala Ser Val Arg Gly Leu Glu Asn Val Leu
130 135 140
Asn Glu Tyr Gln Glu Ala Leu Asp Asp Leu Ala Leu Asp Pro Thr Asn
145 150 155 160
Ser Ala Leu Lys Asp Glu Val Lys Arg Trp Val Thr Thr Val His Gly
165 170 175
Gln Phe Glu Ala Thr Met Pro Thr Leu Ala Val Gly Gly Tyr Glu Val
180 185 190
Gln Gln Leu Val Ala Tyr Ala Glu Ala Ala Asn Leu His Leu Ala Phe
195 200 205
Leu Arg Asp Val Ala Lys Phe Gly Tyr Gln Trp Asp Leu Asp Lys Thr
210 215 220
Thr Val Ala Asn Tyr Tyr Lys Asp Leu Lys Glu Asn Thr Gln Ile Tyr
225 230 235 240
Thr Asp His Cys Val Arg Thr Tyr Asn Thr Gly Leu Glu Lys Thr Lys
245 250 255
Ser Leu Met Gly Ala Ile Asn Pro Lys Asp Tyr Asn Lys Tyr Pro Tyr
260 265 270
Leu Asn Pro Tyr Ala Asn Gly Ser Thr Asn Phe Tyr Lys Lys Val Leu
275 280 285
Gln Trp Asn Val Trp Asn Asp Phe Arg Arg Asp Met Thr Ile Met Val
290 295 300
Leu Asp Leu Val Ala Leu Trp Pro Thr Phe Asp Pro Glu Val Tyr Thr
305 310 315 320
Asn Pro Glu Gly Val Asp Leu Glu Leu Ser Arg Glu Val Tyr Ser Thr
325 330 335
Ala Tyr Gly Arg Tyr Gly Ser Ser Thr His Gly Gly Asp Trp Lys Ser
340 345 350
Trp Glu Val Met Glu Thr Ala Val Asp Arg Ser Pro His Leu Val Ser
355 360 365
Phe Leu Thr Asn Leu Ala Ile Tyr Gln Lys Thr Tyr Ser Ser Ser Ser
370 375 380
Thr Asn Thr Lys Leu Asp Gln Tyr Asp Gly Val Ile Asn Thr Leu Lys
385 390 395 400
Thr Val Gly Ser Thr Ser Thr Phe Glu Asp Gly Phe Ala Pro Ile Ser
405 410 415
Lys Glu Ala Tyr Arg Ser Val Lys Asn Ile Pro Gly Glu Tyr Ile Asn
420 425 430
Gly Val Asp Leu Arg Leu Gly Met Val Pro Cys Ser Leu Thr Phe Lys
435 440 445
Thr Tyr Ser Ser Gly Thr Phe Phe Ala Gly Gln Cys Tyr Ser His Leu
450 455 460
Ala Gln Ser Thr Phe Tyr Asn Leu Ser Ala Thr Ile Ala Tyr His Arg
465 470 475 480
Leu Ser Tyr Val Asn Ala Ile Asp Asp Asn Phe Ser Gly Tyr Tyr Ser
485 490 495
Gly Trp Gly Ile Gly Thr Trp Gly Met Gly Trp Leu His Glu Asn Leu
500 505 510
Lys Thr Leu Asn Ile Ile His Ala Thr Lys Ser Ser Cys Ile Pro Ala
515 520 525
Val Lys Ala Arg Ile Leu Thr Asn Ser Ala Thr Val Ile Lys Gly Pro
530 535 540
Gly Ser Thr Gly Gly Asp Leu Val Lys Leu Pro Gly Val Pro Ser Thr
545 550 555 560
Ser Asp Ile Gln Val Arg Met Asn Met Thr Arg Glu Val Trp Lys Asn
565 570 575
Tyr Arg Ile Arg Ile Arg Tyr Ala Thr Ser Ser Ser Ala Gln Leu Phe
580 585 590
Val Gly Lys Trp Thr Gly Arg Trp Ala Ala Thr Ala Asn Ile Asn Leu
595 600 605
Ala Ala Thr Tyr Ser Gly Glu Leu Thr Tyr Ser Ala Phe Lys Leu Ala
610 615 620
Asp Val Pro Phe Asn Ile Thr Thr Gly Glu Pro Glu Phe Ser Phe Glu
625 630 635 640
Leu Arg Asn Asn Ser Gly Gly Pro Ile Leu Ile Asp Arg Ile Glu Phe
645 650 655
Ile Pro Leu
<210> 52
<211> 383
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 52
Met Met Gln Phe Asn Lys Arg Asn Gly Asn Asn Ala Glu Phe Tyr Lys
1 5 10 15
Gly Glu Lys Val Thr Pro Phe Glu Tyr Asp Leu Pro Glu Glu Gly Lys
20 25 30
Ala Phe Gln Thr Ile Gly Ser Ile Leu Ser Thr Asn Asn Arg Ile Leu
35 40 45
Val Ala Thr Arg Thr Arg Ser Gly Gln Glu Leu Gly Gln Asp Val Gly
50 55 60
Asp Phe Cys Arg Arg Val Thr Leu Asn Asn Gln Val Val Asn Asp Thr
65 70 75 80
Ala Ala Ala Thr Asn Tyr Glu Ala Gln Phe Ile Phe Phe Lys Lys Asp
85 90 95
Phe Ser Gly Tyr Ile Ile Ala Asn Arg Gln His Gly Glu Val Leu Glu
100 105 110
Ser Val Phe Ala Asn Asn Asp Ser Leu Ile Thr Ala Pro Tyr Ala Asn
115 120 125
Lys Thr Gln Gln Val Phe Gly Lys Ser Ser Tyr Ser Asp Lys Ser Phe
130 135 140
Arg Leu Tyr Gly Asp His Gly Ala Ile Thr Thr Cys Gly Asp Lys Trp
145 150 155 160
Asn Ser Trp Thr Arg Ile Thr Met Ser Gly Ser Asn Glu Asn Gln Ala
165 170 175
Arg Tyr Val Phe Lys His Gly Asp Thr Ser Phe Pro Ile Thr Ile Pro
180 185 190
Thr Lys Leu Glu Pro Thr Thr Leu Gly Pro Ile Pro Glu Leu Thr Asn
195 200 205
Leu Asp Asp Val Gly Thr Pro Pro Ala Pro Ala Val Val Gly Thr Ala
210 215 220
Leu Leu Pro Cys Ile Phe Val Asn Asp Arg Thr Pro Ser Gln Arg Ile
225 230 235 240
Gln Asp Ser Pro Tyr Tyr Val Leu Glu Cys Tyr Asp Tyr Trp Lys Leu
245 250 255
Ile Lys Ser Ser Ile Ile Gln Pro Gly Gly Phe Asp Asn Trp Trp Glu
260 265 270
Glu Thr Gly Ile His Ser Asn Val Gln Ala Ala Met Lys Asn Met Ile
275 280 285
Asn Ile Ser Ile Gly Asp Asp Lys Arg Leu Arg Phe Phe Asn Leu Ser
290 295 300
Asp Pro Phe His Gln Ser Ile Val Glu Asp Ile Asn Ile Pro Ala Ser
305 310 315 320
Thr Ser Asn Ser Asp Leu Ser Phe Ile Thr Ala Ile Arg Glu His Thr
325 330 335
Asn Pro Tyr Ser Thr Arg Leu Arg Phe Ala Lys Tyr Phe Val Ala Arg
340 345 350
Lys Phe Glu Leu Lys Arg Leu Asp Gly Ser Thr Pro Ile Tyr Thr Trp
355 360 365
Glu Tyr Ile Asp Asp Thr Lys Tyr Val Met Lys Glu Phe Arg Ser
370 375 380
<210> 53
<211> 862
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 53
Met Val Phe Leu Pro Thr Ile Thr Cys Ile Ser Arg Arg Tyr Phe Met
1 5 10 15
Ala Gln Leu Gln Glu Leu Gln Ile Gln Pro Val Ile Pro Tyr Asn Val
20 25 30
Leu Ala Ser Pro Pro Met Gly Asn Val Asp Asp Leu Pro Thr Leu Ala
35 40 45
Glu Glu Leu Lys Gln Ala Trp Glu Ser Phe Gln Lys Thr Gly Asn Leu
50 55 60
Asn Phe Thr Ala Phe Gln Gln Gly Leu Ala Ala Ala Gly Gly Gly Ser
65 70 75 80
Phe Asn Tyr Leu Ala Leu Leu Gln Ala Ala Ile Gly Leu Ala Gly Thr
85 90 95
Val Gly Ala Glu Ile Pro Gly Val Ala Ile Ala Ala Pro Leu Val Ser
100 105 110
Ile Leu Leu Gly Phe Leu Trp Pro Asn Lys Gln Lys Ser Asp Ile Lys
115 120 125
Gln Leu Ile Thr Val Ile Asp Ala Glu Val Lys Arg Ile Leu Asn Gln
130 135 140
Glu Leu Thr Asp Tyr His Ser Gln Glu Val Gln Gly Tyr Leu Glu Gly
145 150 155 160
Phe Phe Asp Thr Met Thr Asn Pro Ala Thr Ser Pro Asp Ile Lys Ile
165 170 175
Asn Glu Ser Ile Phe Thr Gly Met Pro Gly Asp Pro Asn Arg Thr Pro
180 185 190
Lys Asp Val Asp Ala Thr Ala Tyr Asp Ala Val Val Asp Arg Phe Leu
195 200 205
Thr Ala Asn Val Ser Phe Ile Ile Asp Leu Asp Gln Phe Leu Thr Gly
210 215 220
Lys Phe Gly Thr Asn Asn Asp Val Asn Phe Asp Val Ala Ser Leu Pro
225 230 235 240
Tyr Tyr Val Ile Gly Ala Thr Leu Gln Leu Met Ser Tyr His Ser Val
245 250 255
Ile Asn Phe Met Thr Ala Trp Phe Pro Lys Ile Trp Pro Gln Tyr Thr
260 265 270
Thr Glu Asp Ser Arg Tyr Lys Asp Ile Leu Asn Ala Lys Gln Ala Met
275 280 285
Arg Met Ala Ile Arg Asp Asn Thr Asn Lys Val Leu Thr Ile Val Gln
290 295 300
Ala Asn Leu Pro Ser Leu Gly Ser Thr Arg Asn Ser Leu Asn Asp Tyr
305 310 315 320
Asn Arg Tyr Ala Arg Thr Met Gln Leu Thr Cys Leu Asp Thr Val Ala
325 330 335
Gln Trp Pro Thr Leu Tyr Pro Asp Asp Tyr Pro Val Ser Thr Thr Leu
340 345 350
Glu Gln Thr Arg Arg Ile Phe Leu Asp Thr Ala Gly Ile Asp Glu Arg
355 360 365
Lys Asp Ser Asn Gly Lys Tyr Thr Asn Asp Ser Thr Ile Thr Ala Leu
370 375 380
Tyr Asn Ile Leu Asp Gly Asn Ser Tyr Asn Ser His Asn Asn Val Gly
385 390 395 400
Ile Asn Gly Ile Asn Ser Leu Leu Tyr Ser Gln Asp Glu Leu Lys Ser
405 410 415
Ile Thr Tyr Arg Met Thr Tyr Thr Asn Asn Asn Asp Tyr Gly Gln Tyr
420 425 430
Pro Cys Phe Pro Tyr Gly Val Val Leu Gly Tyr Thr Asn Asn Gln Tyr
435 440 445
Pro Ser Asn Pro Ala Tyr Gly Asn Gly Gly Asp Pro Asn Asn Gly Ser
450 455 460
Tyr Ile Asp Ser Pro Asn Thr Ser Ala Pro Phe His Thr Val Asp Ala
465 470 475 480
Ala Thr Gln Asn Thr Met Tyr Ile Asp Ala Gln Asn Met Phe Thr Asp
485 490 495
Ser Asn Pro Thr Gly Cys Asn Ile Leu Asn Tyr Gly Gly Pro Asn Gly
500 505 510
Gly Lys Ser Asn Asn Pro Thr Leu Ala Thr Gln Lys Ile Asp Thr Leu
515 520 525
Phe Pro Phe Thr Ile Asn Asn Pro Pro Gly Ser Thr Lys Asn Gly Leu
530 535 540
Gly Arg Leu Gly Val Met Ala Ser Leu Val Pro Tyr Asn Leu Ser Pro
545 550 555 560
Asn Asn Val Phe Gly Glu Phe Asp Ser Asp Thr Gln Ala Ile Ile Ala
565 570 575
Lys Gly Phe Pro Ala Glu Lys Gly Tyr Ile Asp Asn Gly Ser Pro Val
580 585 590
Ser Val Val Lys Glu Trp Val Asn Gly Ala Asn Ala Val Gln Leu Pro
595 600 605
Ala Trp Asn Thr Phe Lys Ile Thr Ala Thr Asn Leu Ala Ala Gly Gln
610 615 620
Tyr Arg Ile Arg Val Arg Tyr Ala Asn Pro Gly Pro Asp Leu Leu Leu
625 630 635 640
His Gln Tyr Val Tyr Ala Gly Asp Asn Glu Val Gln Gly Gly Ala Trp
645 650 655
Thr Phe Lys Ser Thr Thr Asp Ser Asn Val Asn Thr Asn Phe Pro Asn
660 665 670
Gln Val Tyr Val Thr Gly Gln Gln Gly Asn Tyr Val Leu Glu Asp Ile
675 680 685
Thr Trp Pro Met Gly Gly Gly Pro Ser Asp Pro Leu Thr Leu Pro Ala
690 695 700
Gly Asp Val Thr Val Gln Leu Phe Leu Gly Asp Phe Ser Gln Gln Gly
705 710 715 720
Gln Met Thr Gln Val Phe Ile Asp Arg Ile Glu Phe Val Pro Val Gly
725 730 735
Ala Thr Phe Pro Ser Thr Ile Pro Ile Ser Val Asn Ala Gln Tyr Asn
740 745 750
Gln Gly Ser Val Leu Leu Trp Gln Ala Pro Ser Ser Ser Tyr Phe Ile
755 760 765
Tyr Asn Phe Asp Ile Thr Gly Thr Leu Tyr Gly Ser Gln Phe Thr Ser
770 775 780
Val Phe Phe Asp Leu Glu Asn Pro Gln Gly Ser Ser Gly Ile Val Pro
785 790 795 800
Gly Ile Asn Gln Ser Ile Thr Leu Asn Pro Asn Gly Thr Asp Ile Ser
805 810 815
Tyr Lys Asn Leu Thr Ala Asn Leu Asn Ser His Gly Gly Phe Asn Leu
820 825 830
Met Arg Met Glu Gly Ala Gly Ser Glu Asp Phe Gly Ser Asn Ile Thr
835 840 845
Gly Asn Leu Thr Ile Thr Ile His Lys Asp Ser Thr Asn Ser
850 855 860
<210> 54
<211> 735
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 54
Met Val Phe Leu Pro Thr Ile Thr Cys Ile Ser Arg Arg Tyr Phe Met
1 5 10 15
Ala Gln Leu Gln Glu Leu Gln Ile Gln Pro Val Ile Pro Tyr Asn Val
20 25 30
Leu Ala Ser Pro Pro Met Gly Asn Val Asp Asp Leu Pro Thr Leu Ala
35 40 45
Glu Glu Leu Lys Gln Ala Trp Glu Ser Phe Gln Lys Thr Gly Asn Leu
50 55 60
Asn Phe Thr Ala Phe Gln Gln Gly Leu Ala Ala Ala Gly Gly Gly Ser
65 70 75 80
Phe Asn Tyr Leu Ala Leu Leu Gln Ala Ala Ile Gly Leu Ala Gly Thr
85 90 95
Val Gly Ala Glu Ile Pro Gly Val Ala Ile Ala Ala Pro Leu Val Ser
100 105 110
Ile Leu Leu Gly Phe Leu Trp Pro Asn Lys Gln Lys Ser Asp Ile Lys
115 120 125
Gln Leu Ile Thr Val Ile Asp Ala Glu Val Lys Arg Ile Leu Asn Gln
130 135 140
Glu Leu Thr Asp Tyr His Ser Gln Glu Val Gln Gly Tyr Leu Glu Gly
145 150 155 160
Phe Phe Asp Thr Met Thr Asn Pro Ala Thr Ser Pro Asp Ile Lys Ile
165 170 175
Asn Glu Ser Ile Phe Thr Gly Met Pro Gly Asp Pro Asn Arg Thr Pro
180 185 190
Lys Asp Val Asp Ala Thr Ala Tyr Asp Ala Val Val Asp Arg Phe Leu
195 200 205
Thr Ala Asn Val Ser Phe Ile Ile Asp Leu Asp Gln Phe Leu Thr Gly
210 215 220
Lys Phe Gly Thr Asn Asn Asp Val Asn Phe Asp Val Ala Ser Leu Pro
225 230 235 240
Tyr Tyr Val Ile Gly Ala Thr Leu Gln Leu Met Ser Tyr His Ser Val
245 250 255
Ile Asn Phe Met Thr Ala Trp Phe Pro Lys Ile Trp Pro Gln Tyr Thr
260 265 270
Thr Glu Asp Ser Arg Tyr Lys Asp Ile Leu Asn Ala Lys Gln Ala Met
275 280 285
Arg Met Ala Ile Arg Asp Asn Thr Asn Lys Val Leu Thr Ile Val Gln
290 295 300
Ala Asn Leu Pro Ser Leu Gly Ser Thr Arg Asn Ser Leu Asn Asp Tyr
305 310 315 320
Asn Arg Tyr Ala Arg Thr Met Gln Leu Thr Cys Leu Asp Thr Val Ala
325 330 335
Gln Trp Pro Thr Leu Tyr Pro Asp Asp Tyr Pro Val Ser Thr Thr Leu
340 345 350
Glu Gln Thr Arg Arg Ile Phe Leu Asp Thr Ala Gly Ile Asp Glu Arg
355 360 365
Lys Asp Ser Asn Gly Lys Tyr Thr Asn Asp Ser Thr Ile Thr Ala Leu
370 375 380
Tyr Asn Ile Leu Asp Gly Asn Ser Tyr Asn Ser His Asn Asn Val Gly
385 390 395 400
Ile Asn Gly Ile Asn Ser Leu Leu Tyr Ser Gln Asp Glu Leu Lys Ser
405 410 415
Ile Thr Tyr Arg Met Thr Tyr Thr Asn Asn Asn Asp Tyr Gly Gln Tyr
420 425 430
Pro Cys Phe Pro Tyr Gly Val Val Leu Gly Tyr Thr Asn Asn Gln Tyr
435 440 445
Pro Ser Asn Pro Ala Tyr Gly Asn Gly Gly Asp Pro Asn Asn Gly Ser
450 455 460
Tyr Ile Asp Ser Pro Asn Thr Ser Ala Pro Phe His Thr Val Asp Ala
465 470 475 480
Ala Thr Gln Asn Thr Met Tyr Ile Asp Ala Gln Asn Met Phe Thr Asp
485 490 495
Ser Asn Pro Thr Gly Cys Asn Ile Leu Asn Tyr Gly Gly Pro Asn Gly
500 505 510
Gly Lys Ser Asn Asn Pro Thr Leu Ala Thr Gln Lys Ile Asp Thr Leu
515 520 525
Phe Pro Phe Thr Ile Asn Asn Pro Pro Gly Ser Thr Lys Asn Gly Leu
530 535 540
Gly Arg Leu Gly Val Met Ala Ser Leu Val Pro Tyr Asn Leu Ser Pro
545 550 555 560
Asn Asn Val Phe Gly Glu Phe Asp Ser Asp Thr Gln Ala Ile Ile Ala
565 570 575
Lys Gly Phe Pro Ala Glu Lys Gly Tyr Ile Asp Asn Gly Ser Pro Val
580 585 590
Ser Val Val Lys Glu Trp Val Asn Gly Ala Asn Ala Val Gln Leu Pro
595 600 605
Ala Trp Asn Thr Phe Lys Ile Thr Ala Thr Asn Leu Ala Ala Gly Gln
610 615 620
Tyr Arg Ile Arg Val Arg Tyr Ala Asn Pro Gly Pro Asp Leu Leu Leu
625 630 635 640
His Gln Tyr Val Tyr Ala Gly Asp Asn Glu Val Gln Gly Gly Ala Trp
645 650 655
Thr Phe Lys Ser Thr Thr Asp Ser Asn Val Asn Thr Asn Phe Pro Asn
660 665 670
Gln Val Tyr Val Thr Gly Gln Gln Gly Asn Tyr Val Leu Glu Asp Ile
675 680 685
Thr Trp Pro Met Gly Gly Gly Pro Ser Asp Pro Leu Thr Leu Pro Ala
690 695 700
Gly Asp Val Thr Val Gln Leu Phe Leu Gly Asp Phe Ser Gln Gln Gly
705 710 715 720
Gln Met Thr Gln Val Phe Ile Asp Arg Ile Glu Phe Val Pro Val
725 730 735
<210> 55
<211> 888
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 55
Met Ala Thr Val Ser Glu Leu Tyr Pro Val Pro Tyr Asn Val Leu Ala
1 5 10 15
Ile Ser Pro Ser Arg Leu Ser Ser Asn Trp Asp Glu Phe Gln Asp Ile
20 25 30
Phe Asn Gly Val Lys Glu Ala Phe Gly Glu Phe Gln Lys Thr Gly Gln
35 40 45
Leu Gln Lys Glu Ala Ile Gln Gln Gly Trp Asn Ala Tyr Arg Glu Gly
50 55 60
Lys Ile Asp Tyr Leu Ala Leu Val Lys Ala Ser Leu Ser Ile Val Gly
65 70 75 80
Leu Ile Val Pro Gly Gly Glu Ala Ala Val Pro Phe Ile Gly Met Phe
85 90 95
Leu Asp Phe Val Trp Pro Lys Leu Phe Gly Gly Ala Ser Asn Asp Glu
100 105 110
Thr Thr Leu Leu Asn Val Ile Asn Asp Val Val Asn Gln Ile Leu Asp
115 120 125
Lys Arg Leu Ser Glu Gln Gln Leu Gln Thr Val Asn Asn Val Leu Asn
130 135 140
Gly Phe Gln Thr Gln Met Leu Asp Leu Asn Arg Lys Ile Leu Ser Ala
145 150 155 160
Thr Phe Asp Ala Thr Thr Gln Lys Pro Arg Thr Pro Ser Ala Ser Asp
165 170 175
Met Ile Asn Val Tyr Asp Glu Phe Asn Leu Glu Gln Gly Leu Ile His
180 185 190
Asn Asp Leu Tyr Glu Phe Ile Leu Lys Asp Tyr Glu Asp Ile Thr Leu
195 200 205
Pro Met Tyr Val Ile Ala Ile Thr Tyr Ile Gly Lys Val Met Ser Tyr
210 215 220
Gln Ala Phe Ile Gln Phe Ala Asn Gln Trp Ile Asp Thr Val Tyr Pro
225 230 235 240
Asp Thr Thr Ser Gly Glu Gly Tyr Thr Phe Lys Thr Asn Leu Asp Ala
245 250 255
Ala Lys Thr Lys Met Gln Asp Asp Ile Asp Thr Ala Ser Gln His Val
260 265 270
Leu Thr Val Phe Gln Asp His Met Val Pro Ala Gly Val Asn Lys Leu
275 280 285
Gln His Asn Leu His Asn Gln Tyr Val Asn Thr Met Val Leu Gln Cys
290 295 300
Phe Asp Phe Val Ala Met Trp Pro Thr Leu Tyr Pro Asp Val Tyr Pro
305 310 315 320
Ala Asn Thr Ser Leu Asp Leu Thr Arg Val Leu Phe Ser Asp Ile Val
325 330 335
Gly Pro Asp Glu Val His Asp Gly Asn Val Thr Leu Tyr Asn Ile Phe
340 345 350
Asp Asn Pro Gln Gly Tyr Thr His Gly Ser Ile Asn Met Glu Asp Ile
355 360 365
Leu Tyr Leu Arg Lys Glu Leu Glu Gln Val Arg Ile Phe Gln His Glu
370 375 380
Ser Gly Thr Asn Cys Trp Ala Tyr Gly Ile Gly Leu Lys Tyr Ala Gly
385 390 395 400
Asp Pro Asn Ile Tyr Met Tyr Ser Pro Ser Ser Phe Asp Pro Ser Leu
405 410 415
Thr Val Pro Ser Thr Val Gln Ala Pro Ile Ile Val Val Asn Ala Ser
420 425 430
Thr Gln Tyr Thr Lys Gly Leu Leu Asp Lys Glu Ser Ile Gly Ile Leu
435 440 445
Ala Asp Asn Gly Asp Asn Ala Gly Thr Tyr Cys Pro Leu Pro Gly His
450 455 460
Asp Gly Val Glu Gly Tyr Gly Arg Ser His Asn Ser Pro Leu Ala Asn
465 470 475 480
Gln Lys Ile Asn Ala Leu Tyr Pro Val Thr Ala Glu His Val Ala Gly
485 490 495
Asp Gln Arg Lys Leu Gly Leu Ile Ala Thr His Val Pro Phe Asp Leu
500 505 510
Val Pro Gln Asn Ile Ile Gly Gln Thr Asp Thr Asp Glu Thr Ile Ser
515 520 525
Val Arg Gly Phe Pro Ala Glu Lys Gly Thr Ile Asp Asn Ile Gly Ser
530 535 540
Leu Gln Leu Val Pro Glu His Ile Asn Gly Ala Asn Ala Val Lys Leu
545 550 555 560
Asn Phe Gln Gln Ile Leu Ala Ile Pro Ile Thr Ser Val Thr Ser Gly
565 570 575
Tyr Tyr Phe Ile Arg Ile Arg Tyr Ala Ser Ser Ser Asp Ile Thr Gly
580 585 590
Tyr Phe His Val Gln Thr Pro Ala Gly Asp Ser Asn Arg Gly Ala Ile
595 600 605
Thr Phe Pro Asn Thr Gln Asn Met Glu Asn Thr Ile Asp Lys Met Tyr
610 615 620
Val Thr Gly Ala Asn Gly Lys Tyr Thr Leu Leu Thr Val Thr Gln Ala
625 630 635 640
Ile Lys Ile Pro Ser Gly Asn Ser Thr Ile Tyr Ile Gln Asn Asn Ser
645 650 655
Asn Ala Asp Phe Phe Leu Asp Arg Ile Glu Val Ile Pro Ile Thr Ala
660 665 670
Ser Gln Leu Pro Ala Pro Ser Ser Ala Asn Gln Phe Thr Leu Val Gly
675 680 685
Pro Tyr Thr Val Leu Pro Glu Pro Val Gly Lys Asp Phe Val Ile Trp
690 695 700
Gln Thr Ser Gln Thr Cys Ala Tyr Thr Pro Asn Ser Thr Leu Ser Phe
705 710 715 720
Thr Ile Leu Gly Ser Gly Glu Phe Gln Ala Leu Asp Lys Asn Asn Asn
725 730 735
Ile Ile Ala Arg Arg Val Glu Asn Gln Asn Asn Gln Thr Thr Glu Thr
740 745 750
Trp Asn Leu Pro Gln Asn Val Glu Lys Ile Arg Phe Val Ala Asn Asp
755 760 765
Tyr Val Val Leu Arg Asn Ile Met Gly Lys Ile Asn Cys Gln Glu Pro
770 775 780
Ile Lys Pro Cys Thr Leu Gly Gly Gln Thr Leu Val Asp Ile Asp Tyr
785 790 795 800
Asn Ala Asn Gly Glu Gln Val Leu Trp Ser Ser Thr Glu Pro Met Gly
805 810 815
Asn Cys Thr Val Asp Ile Val Leu Asp Ser Asp Phe Ile Ser Ser Ile
820 825 830
Ala Thr Leu Gln Phe Gln Asp Asn Thr Gly Thr Val Val Arg Ser Ile
835 840 845
Asp Phe Gly Pro Gly Glu Gln Asn Gln Thr Val Ser Val Ala Ser Thr
850 855 860
Thr Lys Ile Val Leu Lys Pro Thr Thr Phe Gly Pro Ile Ser Gly His
865 870 875 880
Leu Lys Ile Gln Val Val Thr Asn
885
<210> 56
<211> 670
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 56
Met Ala Thr Val Ser Glu Leu Tyr Pro Val Pro Tyr Asn Val Leu Ala
1 5 10 15
Ile Ser Pro Ser Arg Leu Ser Ser Asn Trp Asp Glu Phe Gln Asp Ile
20 25 30
Phe Asn Gly Val Lys Glu Ala Phe Gly Glu Phe Gln Lys Thr Gly Gln
35 40 45
Leu Gln Lys Glu Ala Ile Gln Gln Gly Trp Asn Ala Tyr Arg Glu Gly
50 55 60
Lys Ile Asp Tyr Leu Ala Leu Val Lys Ala Ser Leu Ser Ile Val Gly
65 70 75 80
Leu Ile Val Pro Gly Gly Glu Ala Ala Val Pro Phe Ile Gly Met Phe
85 90 95
Leu Asp Phe Val Trp Pro Lys Leu Phe Gly Gly Ala Ser Asn Asp Glu
100 105 110
Thr Thr Leu Leu Asn Val Ile Asn Asp Val Val Asn Gln Ile Leu Asp
115 120 125
Lys Arg Leu Ser Glu Gln Gln Leu Gln Thr Val Asn Asn Val Leu Asn
130 135 140
Gly Phe Gln Thr Gln Met Leu Asp Leu Asn Arg Lys Ile Leu Ser Ala
145 150 155 160
Thr Phe Asp Ala Thr Thr Gln Lys Pro Arg Thr Pro Ser Ala Ser Asp
165 170 175
Met Ile Asn Val Tyr Asp Glu Phe Asn Leu Glu Gln Gly Leu Ile His
180 185 190
Asn Asp Leu Tyr Glu Phe Ile Leu Lys Asp Tyr Glu Asp Ile Thr Leu
195 200 205
Pro Met Tyr Val Ile Ala Ile Thr Tyr Ile Gly Lys Val Met Ser Tyr
210 215 220
Gln Ala Phe Ile Gln Phe Ala Asn Gln Trp Ile Asp Thr Val Tyr Pro
225 230 235 240
Asp Thr Thr Ser Gly Glu Gly Tyr Thr Phe Lys Thr Asn Leu Asp Ala
245 250 255
Ala Lys Thr Lys Met Gln Asp Asp Ile Asp Thr Ala Ser Gln His Val
260 265 270
Leu Thr Val Phe Gln Asp His Met Val Pro Ala Gly Val Asn Lys Leu
275 280 285
Gln His Asn Leu His Asn Gln Tyr Val Asn Thr Met Val Leu Gln Cys
290 295 300
Phe Asp Phe Val Ala Met Trp Pro Thr Leu Tyr Pro Asp Val Tyr Pro
305 310 315 320
Ala Asn Thr Ser Leu Asp Leu Thr Arg Val Leu Phe Ser Asp Ile Val
325 330 335
Gly Pro Asp Glu Val His Asp Gly Asn Val Thr Leu Tyr Asn Ile Phe
340 345 350
Asp Asn Pro Gln Gly Tyr Thr His Gly Ser Ile Asn Met Glu Asp Ile
355 360 365
Leu Tyr Leu Arg Lys Glu Leu Glu Gln Val Arg Ile Phe Gln His Glu
370 375 380
Ser Gly Thr Asn Cys Trp Ala Tyr Gly Ile Gly Leu Lys Tyr Ala Gly
385 390 395 400
Asp Pro Asn Ile Tyr Met Tyr Ser Pro Ser Ser Phe Asp Pro Ser Leu
405 410 415
Thr Val Pro Ser Thr Val Gln Ala Pro Ile Ile Val Val Asn Ala Ser
420 425 430
Thr Gln Tyr Thr Lys Gly Leu Leu Asp Lys Glu Ser Ile Gly Ile Leu
435 440 445
Ala Asp Asn Gly Asp Asn Ala Gly Thr Tyr Cys Pro Leu Pro Gly His
450 455 460
Asp Gly Val Glu Gly Tyr Gly Arg Ser His Asn Ser Pro Leu Ala Asn
465 470 475 480
Gln Lys Ile Asn Ala Leu Tyr Pro Val Thr Ala Glu His Val Ala Gly
485 490 495
Asp Gln Arg Lys Leu Gly Leu Ile Ala Thr His Val Pro Phe Asp Leu
500 505 510
Val Pro Gln Asn Ile Ile Gly Gln Thr Asp Thr Asp Glu Thr Ile Ser
515 520 525
Val Arg Gly Phe Pro Ala Glu Lys Gly Thr Ile Asp Asn Ile Gly Ser
530 535 540
Leu Gln Leu Val Pro Glu His Ile Asn Gly Ala Asn Ala Val Lys Leu
545 550 555 560
Asn Phe Gln Gln Ile Leu Ala Ile Pro Ile Thr Ser Val Thr Ser Gly
565 570 575
Tyr Tyr Phe Ile Arg Ile Arg Tyr Ala Ser Ser Ser Asp Ile Thr Gly
580 585 590
Tyr Phe His Val Gln Thr Pro Ala Gly Asp Ser Asn Arg Gly Ala Ile
595 600 605
Thr Phe Pro Asn Thr Gln Asn Met Glu Asn Thr Ile Asp Lys Met Tyr
610 615 620
Val Thr Gly Ala Asn Gly Lys Tyr Thr Leu Leu Thr Val Thr Gln Ala
625 630 635 640
Ile Lys Ile Pro Ser Gly Asn Ser Thr Ile Tyr Ile Gln Asn Asn Ser
645 650 655
Asn Ala Asp Phe Phe Leu Asp Arg Ile Glu Val Ile Pro Ile
660 665 670
<210> 57
<211> 830
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 57
Met Asn Ser Leu Gln Thr Leu Gln Ala Gln Pro Ile Ile Pro Tyr Asn
1 5 10 15
Val Leu Ala Asn Pro Leu Ser Gly Asn Ala Asp Ala Gln Pro Gly Ser
20 25 30
Leu Gly Asp Leu Ile Ala Ser Leu Lys Glu Ala Trp Val Glu Phe Gln
35 40 45
Lys Thr Gly Ser Asp Lys Leu Leu Gln Ser Val Leu Gln Asn Gly Phe
50 55 60
Ser Ala Ala Ser Gly Gly Ser Phe Asn Tyr Leu Ala Val Leu Gln Ala
65 70 75 80
Val Ile Gly Val Leu Ser Thr Val Leu Gly Ala Glu Ile Pro Gly Val
85 90 95
Ala Val Ala Ala Pro Ile Leu Ser Met Val Ile Gly Trp Ile Trp Pro
100 105 110
Ser Lys Lys Lys Asp Glu Thr Gln Gln Leu Ile Asn Leu Ile Asp Ala
115 120 125
Glu Ile Gln Lys Gln Leu Asn Gln Ala Leu Ser Asn Gln Asp Lys Asn
130 135 140
Asp Trp Thr Gly Tyr Leu Asn Ala Ile Phe Gln Ile Ser His Asp Ala
145 150 155 160
Ala Arg Ala Val Val Asp Ala Gln Phe Thr Gly Ser Glu Gly Asp Thr
165 170 175
Asn Arg Gln Pro Arg Thr Pro Gly Val Ser Asp Tyr Glu Asn Val Tyr
180 185 190
Asn His Leu Leu Ala Thr Thr Gly Ser Leu Ile Val Ala Leu Ser Gln
195 200 205
Met Ile Asn Gly Asn Phe Asp Thr Leu Ala Ile Pro Phe Phe Val Ile
210 215 220
Gly Ala Thr Val Gln Leu Ser Thr Tyr Gln Thr Phe Ile Gln Phe Ala
225 230 235 240
Asn Lys Trp Leu Pro Ile Val Tyr Pro Asp Tyr Gln Thr Gln Gly Thr
245 250 255
Ala Gly Tyr Thr Gln Gln Gln Asn Leu Ile Asn Ala Lys Ser Asp Met
260 265 270
Arg Ala Ala Ile His Asp His Thr Gln Lys Val Phe Asn Ala Phe Lys
275 280 285
Thr Gly Met Pro Ser Leu Gly Ser Asp Lys Asn Ser Val Asn Ala Tyr
290 295 300
Asn Arg Tyr Val Arg Gly Met Val Leu Asn Gly Leu Asp Met Leu Ala
305 310 315 320
Thr Trp Pro Ser Met Tyr Pro Asp Asp Tyr Pro Ser Gln Thr Glu Leu
325 330 335
Glu Gln Thr Arg Val Ile Phe Ser Asp Leu Val Gly Lys Asp Gln Thr
340 345 350
Val Asn Asn Ala Val Thr Leu Ile Asn Met Val Asp Lys Ser Gly Pro
355 360 365
Asp Ala Trp Asn Gln His Ser Ser Ile Asp Ile Asn Ser Ile Ser Tyr
370 375 380
Thr Arg Arg Glu Leu Gln Gln Val Gln Phe Pro Ile His Asp Gly Asn
385 390 395 400
Arg Gly Asn Thr Thr Asn Cys Tyr Pro Tyr Ala Val Ile Leu Asn Tyr
405 410 415
Pro Asp Asn Ser Tyr Phe Tyr Gly Asp Leu Tyr Ala Pro Gln Phe Ser
420 425 430
Gly Asn Pro Thr Phe Gly Gly Asp Asn Asp Ile Leu Thr Ala Leu Asn
435 440 445
Ala Arg Thr Gln His Thr Arg Tyr Val Asp Ser Glu Ser Leu Gln Met
450 455 460
Asn Cys Tyr Pro Tyr Ala Ser Asn Asp Cys His Ile Leu Gly Tyr Cys
465 470 475 480
Thr Gly Gly Thr Cys Leu Ala Pro Ser Gly Cys Ser Asp Gly Tyr Gly
485 490 495
Lys Ser Cys Asn Glu Thr Leu Pro Asp Gln Lys Ile Asn Ala Leu Tyr
500 505 510
Pro Phe Arg Met Asp Leu Gly His Ser Gly Thr Pro Asp Lys Leu Gly
515 520 525
Val Met Ser Ser His Val Pro Phe Asn Leu Ile Pro Asn Asn Val Phe
530 535 540
Gly Ala Ile Asp Ser Asp Thr Asn Asn Ile Ile Gly Lys Gly Phe Pro
545 550 555 560
Ala Glu Lys Gly Tyr Trp Ser Asn Gly Gly Asp Arg Pro Ile Pro Ser
565 570 575
Ser Asn Ile Ala Lys Glu Trp Val Asn Gly Ala Asn Ala Val Lys Val
580 585 590
Glu Tyr Gly Ala Phe Leu Ala Met Lys Ala Thr Asn Met Lys Thr Gly
595 600 605
Lys Tyr Tyr Ile Arg Val Arg Tyr Ala Asn Pro Ser Asn Lys Asp Val
610 615 620
Asn Met Trp Gln Glu Val Trp Ala Gly Asn Gln Lys Leu Gln Gly Gly
625 630 635 640
Gly Trp Thr Phe Lys Ser Thr Ser Asp Asp Asn Val Ala Asp Asn Phe
645 650 655
Pro Lys Gln Val Tyr Val Thr Gly Gln Gln Gly Asp Pro Leu Asn Leu
660 665 670
Pro Ser Gly Glu Ile Met Val Lys Leu Ser Gly Gln Asn Ser Gly Asp
675 680 685
Ser Ile Tyr Val Asp Arg Ile Glu Phe Ile Pro Ile Ser Glu Thr Glu
690 695 700
Asn Ile Asn Tyr Met Ile Thr Ser Thr Val Pro Leu Leu Pro Ser Gly
705 710 715 720
Gln Tyr Val Pro Ser Leu Ser Asn Tyr Pro Thr Ile Trp Thr Ala Asp
725 730 735
Ser Gly Gln Gln Ala Ile Ser Ala Thr Leu Thr Phe Ser Gly Leu Asn
740 745 750
Ser Thr Leu Phe Leu Leu Ala Lys Thr Asn Pro Gly Asp Asp Asn Asp
755 760 765
Glu Asn Asn Trp Asp Trp Met Cys Pro Asp Gly Phe Leu Gly Ser Val
770 775 780
Leu Asn Gly Thr Tyr Thr Phe Pro Ile Asn Asn His Asp Ser Ala Cys
785 790 795 800
Ala Lys Lys Pro Phe Thr Ala Ile Lys Ile Gly Ala Glu Leu Ile Pro
805 810 815
Val Ser Ala Gly Thr Ile Thr Gly Ser Ile Thr His Gln Arg
820 825 830
<210> 58
<211> 700
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 58
Met Asn Ser Leu Gln Thr Leu Gln Ala Gln Pro Ile Ile Pro Tyr Asn
1 5 10 15
Val Leu Ala Asn Pro Leu Ser Gly Asn Ala Asp Ala Gln Pro Gly Ser
20 25 30
Leu Gly Asp Leu Ile Ala Ser Leu Lys Glu Ala Trp Val Glu Phe Gln
35 40 45
Lys Thr Gly Ser Asp Lys Leu Leu Gln Ser Val Leu Gln Asn Gly Phe
50 55 60
Ser Ala Ala Ser Gly Gly Ser Phe Asn Tyr Leu Ala Val Leu Gln Ala
65 70 75 80
Val Ile Gly Val Leu Ser Thr Val Leu Gly Ala Glu Ile Pro Gly Val
85 90 95
Ala Val Ala Ala Pro Ile Leu Ser Met Val Ile Gly Trp Ile Trp Pro
100 105 110
Ser Lys Lys Lys Asp Glu Thr Gln Gln Leu Ile Asn Leu Ile Asp Ala
115 120 125
Glu Ile Gln Lys Gln Leu Asn Gln Ala Leu Ser Asn Gln Asp Lys Asn
130 135 140
Asp Trp Thr Gly Tyr Leu Asn Ala Ile Phe Gln Ile Ser His Asp Ala
145 150 155 160
Ala Arg Ala Val Val Asp Ala Gln Phe Thr Gly Ser Glu Gly Asp Thr
165 170 175
Asn Arg Gln Pro Arg Thr Pro Gly Val Ser Asp Tyr Glu Asn Val Tyr
180 185 190
Asn His Leu Leu Ala Thr Thr Gly Ser Leu Ile Val Ala Leu Ser Gln
195 200 205
Met Ile Asn Gly Asn Phe Asp Thr Leu Ala Ile Pro Phe Phe Val Ile
210 215 220
Gly Ala Thr Val Gln Leu Ser Thr Tyr Gln Thr Phe Ile Gln Phe Ala
225 230 235 240
Asn Lys Trp Leu Pro Ile Val Tyr Pro Asp Tyr Gln Thr Gln Gly Thr
245 250 255
Ala Gly Tyr Thr Gln Gln Gln Asn Leu Ile Asn Ala Lys Ser Asp Met
260 265 270
Arg Ala Ala Ile His Asp His Thr Gln Lys Val Phe Asn Ala Phe Lys
275 280 285
Thr Gly Met Pro Ser Leu Gly Ser Asp Lys Asn Ser Val Asn Ala Tyr
290 295 300
Asn Arg Tyr Val Arg Gly Met Val Leu Asn Gly Leu Asp Met Leu Ala
305 310 315 320
Thr Trp Pro Ser Met Tyr Pro Asp Asp Tyr Pro Ser Gln Thr Glu Leu
325 330 335
Glu Gln Thr Arg Val Ile Phe Ser Asp Leu Val Gly Lys Asp Gln Thr
340 345 350
Val Asn Asn Ala Val Thr Leu Ile Asn Met Val Asp Lys Ser Gly Pro
355 360 365
Asp Ala Trp Asn Gln His Ser Ser Ile Asp Ile Asn Ser Ile Ser Tyr
370 375 380
Thr Arg Arg Glu Leu Gln Gln Val Gln Phe Pro Ile His Asp Gly Asn
385 390 395 400
Arg Gly Asn Thr Thr Asn Cys Tyr Pro Tyr Ala Val Ile Leu Asn Tyr
405 410 415
Pro Asp Asn Ser Tyr Phe Tyr Gly Asp Leu Tyr Ala Pro Gln Phe Ser
420 425 430
Gly Asn Pro Thr Phe Gly Gly Asp Asn Asp Ile Leu Thr Ala Leu Asn
435 440 445
Ala Arg Thr Gln His Thr Arg Tyr Val Asp Ser Glu Ser Leu Gln Met
450 455 460
Asn Cys Tyr Pro Tyr Ala Ser Asn Asp Cys His Ile Leu Gly Tyr Cys
465 470 475 480
Thr Gly Gly Thr Cys Leu Ala Pro Ser Gly Cys Ser Asp Gly Tyr Gly
485 490 495
Lys Ser Cys Asn Glu Thr Leu Pro Asp Gln Lys Ile Asn Ala Leu Tyr
500 505 510
Pro Phe Arg Met Asp Leu Gly His Ser Gly Thr Pro Asp Lys Leu Gly
515 520 525
Val Met Ser Ser His Val Pro Phe Asn Leu Ile Pro Asn Asn Val Phe
530 535 540
Gly Ala Ile Asp Ser Asp Thr Asn Asn Ile Ile Gly Lys Gly Phe Pro
545 550 555 560
Ala Glu Lys Gly Tyr Trp Ser Asn Gly Gly Asp Arg Pro Ile Pro Ser
565 570 575
Ser Asn Ile Ala Lys Glu Trp Val Asn Gly Ala Asn Ala Val Lys Val
580 585 590
Glu Tyr Gly Ala Phe Leu Ala Met Lys Ala Thr Asn Met Lys Thr Gly
595 600 605
Lys Tyr Tyr Ile Arg Val Arg Tyr Ala Asn Pro Ser Asn Lys Asp Val
610 615 620
Asn Met Trp Gln Glu Val Trp Ala Gly Asn Gln Lys Leu Gln Gly Gly
625 630 635 640
Gly Trp Thr Phe Lys Ser Thr Ser Asp Asp Asn Val Ala Asp Asn Phe
645 650 655
Pro Lys Gln Val Tyr Val Thr Gly Gln Gln Gly Asp Pro Leu Asn Leu
660 665 670
Pro Ser Gly Glu Ile Met Val Lys Leu Ser Gly Gln Asn Ser Gly Asp
675 680 685
Ser Ile Tyr Val Asp Arg Ile Glu Phe Ile Pro Ile
690 695 700
<210> 59
<211> 962
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 59
Met Ile Met Gln Met Val Thr Lys Ala Met Thr Pro Lys Leu Gly Ala
1 5 10 15
Leu Pro Asp Phe Ile Asp Asp Phe Asn Gly Ile Phe Gly Phe Met Gly
20 25 30
Asn Ile Thr Asp Val Met Ser Thr Ile Phe Gly Val Asp Thr Gly Asp
35 40 45
Ser Ser Ile Asp Asp Val Leu Asn Asn Gln Glu Leu Leu Gln Glu Met
50 55 60
Gln Ala Gln Met Gln Gln Ile Gln Glu Ser Ile Lys Asp Met Thr Asp
65 70 75 80
Lys Glu Ile Ile Thr Gln Ser Ile Glu Glu Gln Ile Leu Ala Leu Thr
85 90 95
Lys Asp Leu Ser Thr Ser Ile Thr Thr Glu Phe Glu Lys Ile Gly Asn
100 105 110
Ile Leu Thr Thr Tyr Leu Gln Ala Met Ser Gly Met Leu Lys Asp Ile
115 120 125
Tyr Ser Gln Thr Ser Val Ile Asp Gln Lys Val Asp Lys Leu Leu Ala
130 135 140
Met Met Thr Phe Ala Leu Lys Glu Leu Asp Tyr Ile Lys Asp Asn Val
145 150 155 160
Val Leu Asn Ser Ser Ile Val Glu Ile Thr Pro His Val Gln Lys Leu
165 170 175
Val Tyr Val Asn Lys Lys Phe Leu Ser Leu Thr Arg Gly Phe Leu Gln
180 185 190
Gly Glu Asp Leu Ser Ile Asp Ser Met Gln Glu Ile Gln Glu Trp Ala
195 200 205
Lys Ser Ile Leu Ala Thr Glu Met Asn Ser Phe Glu Phe Ser Val Asp
210 215 220
Thr Leu His Ser Ile Ile Ile Gly Asp Asn Leu Tyr Lys Arg Ser Ala
225 230 235 240
Leu Lys Thr Phe Ser Asp Val Leu Leu Asp Asp Ala Asp Gln Tyr Gly
245 250 255
Asn Phe Gly Thr Pro Leu Ala Lys Phe Tyr Thr Phe Phe Ser Ser Leu
260 265 270
Ala Thr Leu Gln Ile Asn Ala Tyr Leu Cys Leu Thr Phe Ala Arg Lys
275 280 285
Val Leu Gly Leu Ser Glu Ile Asp Tyr Gln Leu Ile Met Gln Asp Arg
290 295 300
Ile Glu Gln Gln Asn Gln Leu Phe Val Asn Leu Ile Gln Asp Lys Asn
305 310 315 320
Tyr Ser Asn Val Leu Glu Ile Lys Gly Val Tyr Pro Met Pro His Asn
325 330 335
Asn Glu Asp Gly Asn Ser Phe Ser Leu Gln Ala Lys Lys Gly Tyr Ala
340 345 350
Leu Ile Gly Leu Glu Phe Phe Met Asp Asn Gly Ile Tyr Lys Ala Lys
355 360 365
Ala Tyr Gln Gly Lys Ile Gly Lys Asn Phe Ser Val Tyr Ala Asp Thr
370 375 380
Val Glu Glu Ile Ile Ser Asp Asp Leu Ser Lys Val Phe Leu Asn Asn
385 390 395 400
Thr Lys Asp Phe Glu Gln Ser Tyr Lys Tyr Pro Leu Phe Gly Glu Leu
405 410 415
Lys Gly Glu Pro Asn Thr Leu Ile Thr Arg Ile Gly Phe Gly Thr Lys
420 425 430
Tyr Asp Glu Ser Lys Glu Arg Gln Ala Val Ala Tyr Ile Asp Ala Asp
435 440 445
Phe Ser Pro Tyr Glu Pro Lys Ser Gly Ser Ile Ser Thr Glu Gly Thr
450 455 460
Gln Thr Phe Gln Ile Glu Glu Thr Lys Asn Tyr Thr Tyr Cys Tyr Asp
465 470 475 480
Ala Trp Pro Ile Gly Ile Ile Gly Asp Leu Tyr Met Thr Pro Leu Lys
485 490 495
Ser Leu Ser Leu His Val Asp Gly Asp Ser Arg Lys Asn Leu Tyr Met
500 505 510
Ser Gly Glu Ser Tyr Phe Ser Thr Ile Leu Ser Arg Glu Tyr Asn Thr
515 520 525
Asn Phe Ile Leu Phe Pro Tyr Thr Asn Asn Ser Ser Pro Ile Val Glu
530 535 540
Asn Leu Ile Gln Asn Gly Asp Phe Glu Asp Gly Asp Lys Tyr Trp Glu
545 550 555 560
Val Ser Gly Asp Thr Ala Val Ile Ala Lys Gly Glu Gly Ile Tyr Gly
565 570 575
Ser Asn Ala Ile Ser Ile Lys Asn Thr Gly Asp Ala Phe Glu Pro Ala
580 585 590
Leu Lys Gln Glu Leu Asn Leu Lys Pro Tyr Thr Lys Tyr Lys Leu Thr
595 600 605
Ala Tyr Val Lys Ser Lys Thr Asn Asp Ser Gln Ala Gly Leu Ile Ala
610 615 620
Ile Lys Tyr Thr Asn Tyr Met Ile Ala Phe Ile Ser Lys Thr Phe Ser
625 630 635 640
Thr Ser Tyr Lys Gln Ile Glu Leu Lys Phe Arg Thr Gly Ala Asp Thr
645 650 655
Gln Asn Phe Tyr Val Thr Phe Glu Lys Ser Asn Thr Gly Tyr Leu Leu
660 665 670
Leu Asp Asn Ile Lys Leu His Glu Ile Pro Gln Thr Asp Asn Leu Ile
675 680 685
Asp Cys Gly Asp Phe Gly Gly Tyr Asp Tyr Thr Thr Lys Ile Asp Cys
690 695 700
Gln Tyr Tyr Glu Pro Tyr Leu Trp Glu Leu Asp Gly Gly Gly Ser Ile
705 710 715 720
Ser Asn Gly Phe Glu Glu Gly Leu Phe Gly Thr Asp Cys Leu Lys Ile
725 730 735
Ile Lys Ser Gly Gly Ala Asn Gln Lys Val Gln Leu Lys Ala Asn Thr
740 745 750
Asn Tyr Ile Leu Thr Ala Tyr Val Lys Val Asp Asp Pro Thr Thr Thr
755 760 765
Ala His Ile Gly Cys Gly Ser Asn Gln Val Thr Cys Asn Ser Thr Ser
770 775 780
Tyr Thr Leu Val Lys Val Lys Phe Lys Thr Gly Asp Asp Pro Ser Thr
785 790 795 800
Thr Glu Ser Ser Val Tyr Cys Ser Asn Pro Asn Asp Ser Gly Thr Ile
805 810 815
Trp Ala Asp Asn Phe Val Leu Cys Glu Val Pro Asn Leu Ile Val Asn
820 825 830
Gly Asp Phe Glu Gln Met Asp Leu Ser Ser Trp Ser Leu Ser Pro Ser
835 840 845
Glu Arg Gly Thr Ile Tyr Ile Gly Ile Gly Lys Gly Ile Ser Asn Ser
850 855 860
Asn Ala Ile Val Leu Gln Asp His Asn Gly Lys Ile Ser Gln Lys Leu
865 870 875 880
Pro Leu Lys Pro Ile Thr Lys Tyr Arg Leu Thr Ala Tyr Val Ser Val
885 890 895
Thr Gly Gly Gly Thr Ala Gln Ile Gly Tyr Gly Asp Asp Ser Ala Ser
900 905 910
Cys Thr Ser Asn Lys Phe Gln Gln Ile Ser Thr Asp Phe Thr Thr Gly
915 920 925
Ala Asp Pro Leu Gln Ser Asp Asp Val Ile Tyr Leu Ser Thr Asp Gly
930 935 940
Cys Thr Tyr Thr Ile Ala Asp Asn Phe Glu Leu Tyr Glu Leu Asp Gln
945 950 955 960
Ile Glu
<210> 60
<211> 860
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 60
Met Lys Asn Lys Lys Lys Tyr Met Lys Pro Leu Ala Val Gly Leu Leu
1 5 10 15
Ala Thr Asn Ile Ile Gly Phe Gly Thr Gln Thr Val Ala Phe Ala Ala
20 25 30
Thr Asp Lys Ala Pro Thr Lys Glu Gln Ala Gln Lys Gly Ser Phe Asn
35 40 45
Pro Thr Val Leu Ala Ser Met Pro Ser Asp Thr Ser Glu Leu Gln Glu
50 55 60
Ile Leu Glu Asp Ala Ile Thr Asn Pro Asn Thr Gln Ile Ser Pro Lys
65 70 75 80
Leu Val Lys Asp Leu Gln Asp Lys Gly Phe Asp Tyr Leu Ser Ile Val
85 90 95
Lys Ala Ala Thr Gly Gly Leu Val Lys Lys Ile Pro Tyr Ala Gly Ser
100 105 110
Phe Leu Ser Pro Leu Val Glu Phe Ile Phe Pro Gly Lys Lys Tyr Leu
115 120 125
Thr Lys Glu Thr Val Trp Gly Ala Ile Gln Asp Lys Val Ser Asn Leu
130 135 140
Ile Asp Glu Lys Leu Thr Glu Asn Gln Val Asn Glu Leu Ile Ala Ser
145 150 155 160
Leu Thr Gly Ile Gln Asp Asn Leu Lys Lys Tyr Gln Val Ala Val Gly
165 170 175
Leu Val Asn Gly Lys Glu Pro Pro Ile Asp Lys Ser Ile Gln Met Asn
180 185 190
Ala Glu Thr Asp Arg Thr Lys Glu Asp Leu Arg Gly Tyr Ile Asp His
195 200 205
Leu Asp Ser Asp Leu Gly Lys Glu Val Pro Lys Phe Ala Val Lys Asn
210 215 220
Tyr Glu Ala Ala Ser Ile Pro Tyr Tyr Val Gln Ile Ala Asn Ile His
225 230 235 240
Leu Phe Leu Leu Arg Asp Ala Val Thr His Ala Asp Glu Trp Gly Leu
245 250 255
Thr Ala Lys Glu Lys Asn Thr Tyr Leu Ser Arg Leu Gln Ser Lys Ile
260 265 270
Ile Glu Tyr Ser Asn Thr Val Tyr Asp Ser Phe Asn Lys Gly Val Ala
275 280 285
Ala Ala Lys Asn Lys Asp Gly Asn Trp Asp Asn Val Asn Lys Phe Val
290 295 300
Arg Thr Met Thr Val Tyr Gly Leu Asp Phe Val Ala Leu Trp Pro Thr
305 310 315 320
Phe Asp Pro Ser Arg Tyr Asp Glu Pro Thr Lys Leu Gln Gln Thr Arg
325 330 335
Gln Val Tyr Ser Asp Leu Ile Gly Asn Pro Glu Ala Gly Ser Asn Tyr
340 345 350
Asp Thr Leu Leu Pro Gln Ile His Ser Ser Ile Phe Ala Gly Tyr Pro
355 360 365
Gly Glu Leu Lys Gln Phe Asn Val Ser Gln Ala Asp Arg Ile Asp Ser
370 375 380
Ile Lys Glu Val Phe Asp Trp Thr Gly Asp Gly Ser Gln Asp Tyr Thr
385 390 395 400
Gln Gln Leu Gly Asn Pro Asn Pro Ser Gly Tyr Ser Val Arg Asn Val
405 410 415
Thr Arg Asp Ala Pro Val Ile Gly Leu Ser Ala Tyr Lys Ser Trp Asn
420 425 430
Asn Asn Trp Tyr Asn Met Phe Ser Leu Ser Tyr Pro Gly Lys Glu Val
435 440 445
Asp Phe Gln Ser Arg Phe Arg Gln Ala Leu Arg Asp Gly Tyr Val Thr
450 455 460
Lys Thr Ser Ala Pro Ser Gly Gln Lys Val Ser Asn Ile Lys Ala Ala
465 470 475 480
Glu Thr Gly Lys Gly Lys Gly Thr Ile Ser Ser Met Val Ala Ala Tyr
485 490 495
Ile Pro Glu Glu Val His Pro Glu Asn Ile Leu Ala Ala Arg Ser Ile
500 505 510
Thr Gly Ile Pro Ala Glu Lys Tyr Asn Ser Arg Ser Ala Val Asp Asn
515 520 525
Ala Ile Glu Tyr Thr Asn Gly Ser Asn Ala Met Val Phe Thr Gln Asn
530 535 540
Gly Ser Ser Ile Thr Tyr Asn Ile Gln Ser Pro Glu Lys Arg Lys Tyr
545 550 555 560
Lys Ile Arg Leu Arg Val Ala Thr Asn Ser Asp Thr Ser Val Gly Ile
565 570 575
Ser Val Asn Gly Asn Ala Gln Gln Met Asn Ile Glu Asn Thr Glu Ala
580 585 590
Lys Ala Thr Asp Arg Thr Gly Ile Lys Gly Val Asn Gly Thr Tyr Thr
595 600 605
Val Ile Glu Gly Pro Thr Val Glu Leu Gln Glu Gly Lys Asn Thr Ile
610 615 620
Thr Ile Asp His Lys Asp Ala Asn Lys Leu Val Leu Asp Arg Ile Glu
625 630 635 640
Phe Glu His Gln Pro Val Ala Pro Glu His Gly Asn Pro Gly Trp Gln
645 650 655
Thr Ile Asp Gly Lys Lys Tyr Leu Tyr Gly Lys Asp Gly Lys Leu Glu
660 665 670
Met Asn Ser Phe Gln Met Met Asp Gly Lys Gln Val Tyr Leu Asn Asp
675 680 685
Asp Gly Ala Met Val Lys Gly Trp Lys Thr Ile Asn Arg Glu Lys Tyr
690 695 700
Tyr Phe Asn Lys Thr Gly Asp Gly Ser Gly Met Glu Arg Glu Gly Glu
705 710 715 720
Val Ala Ile Gly Trp Lys Thr Ile Asn Gly Val Lys Tyr Tyr Phe Asn
725 730 735
Lys Thr Gly Asp Gly Ser Gly Met Glu Arg Glu Gly Ile Leu Ala Thr
740 745 750
Gly Trp Lys Thr Ile Asn Gly Val Lys Tyr Tyr Phe Asn Lys Thr Gly
755 760 765
Asp Gly Thr Gly Ser Asp His Glu Gly Glu Met Glu Ile Gly Trp Lys
770 775 780
Thr Ile Glu Gly Val Lys Tyr Tyr Leu Asp Pro Gln Asn Gly Lys Leu
785 790 795 800
Val Thr Gly Trp Lys Thr Ile Glu Gly Val Lys Tyr Tyr Phe Asn Arg
805 810 815
Ile Gly Asp Gly Ser Gly Met Thr His Glu Gly Glu Met Ala Ile Gly
820 825 830
Asn Met Thr Ile Asp Gly Val Lys His Tyr Phe Asn Lys Thr Gly Asp
835 840 845
Gly Thr Gly Tyr Asn Tyr Glu Gly Glu Leu Glu Trp
850 855 860
<210> 61
<211> 830
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 61
Met Ala Thr Asp Lys Ala Pro Thr Lys Glu Gln Ala Gln Lys Gly Ser
1 5 10 15
Phe Asn Pro Thr Val Leu Ala Ser Met Pro Ser Asp Thr Ser Glu Leu
20 25 30
Gln Glu Ile Leu Glu Asp Ala Ile Thr Asn Pro Asn Thr Gln Ile Ser
35 40 45
Pro Lys Leu Val Lys Asp Leu Gln Asp Lys Gly Phe Asp Tyr Leu Ser
50 55 60
Ile Val Lys Ala Ala Thr Gly Gly Leu Val Lys Lys Ile Pro Tyr Ala
65 70 75 80
Gly Ser Phe Leu Ser Pro Leu Val Glu Phe Ile Phe Pro Gly Lys Lys
85 90 95
Tyr Leu Thr Lys Glu Thr Val Trp Gly Ala Ile Gln Asp Lys Val Ser
100 105 110
Asn Leu Ile Asp Glu Lys Leu Thr Glu Asn Gln Val Asn Glu Leu Ile
115 120 125
Ala Ser Leu Thr Gly Ile Gln Asp Asn Leu Lys Lys Tyr Gln Val Ala
130 135 140
Val Gly Leu Val Asn Gly Lys Glu Pro Pro Ile Asp Lys Ser Ile Gln
145 150 155 160
Met Asn Ala Glu Thr Asp Arg Thr Lys Glu Asp Leu Arg Gly Tyr Ile
165 170 175
Asp His Leu Asp Ser Asp Leu Gly Lys Glu Val Pro Lys Phe Ala Val
180 185 190
Lys Asn Tyr Glu Ala Ala Ser Ile Pro Tyr Tyr Val Gln Ile Ala Asn
195 200 205
Ile His Leu Phe Leu Leu Arg Asp Ala Val Thr His Ala Asp Glu Trp
210 215 220
Gly Leu Thr Ala Lys Glu Lys Asn Thr Tyr Leu Ser Arg Leu Gln Ser
225 230 235 240
Lys Ile Ile Glu Tyr Ser Asn Thr Val Tyr Asp Ser Phe Asn Lys Gly
245 250 255
Val Ala Ala Ala Lys Asn Lys Asp Gly Asn Trp Asp Asn Val Asn Lys
260 265 270
Phe Val Arg Thr Met Thr Val Tyr Gly Leu Asp Phe Val Ala Leu Trp
275 280 285
Pro Thr Phe Asp Pro Ser Arg Tyr Asp Glu Pro Thr Lys Leu Gln Gln
290 295 300
Thr Arg Gln Val Tyr Ser Asp Leu Ile Gly Asn Pro Glu Ala Gly Ser
305 310 315 320
Asn Tyr Asp Thr Leu Leu Pro Gln Ile His Ser Ser Ile Phe Ala Gly
325 330 335
Tyr Pro Gly Glu Leu Lys Gln Phe Asn Val Ser Gln Ala Asp Arg Ile
340 345 350
Asp Ser Ile Lys Glu Val Phe Asp Trp Thr Gly Asp Gly Ser Gln Asp
355 360 365
Tyr Thr Gln Gln Leu Gly Asn Pro Asn Pro Ser Gly Tyr Ser Val Arg
370 375 380
Asn Val Thr Arg Asp Ala Pro Val Ile Gly Leu Ser Ala Tyr Lys Ser
385 390 395 400
Trp Asn Asn Asn Trp Tyr Asn Met Phe Ser Leu Ser Tyr Pro Gly Lys
405 410 415
Glu Val Asp Phe Gln Ser Arg Phe Arg Gln Ala Leu Arg Asp Gly Tyr
420 425 430
Val Thr Lys Thr Ser Ala Pro Ser Gly Gln Lys Val Ser Asn Ile Lys
435 440 445
Ala Ala Glu Thr Gly Lys Gly Lys Gly Thr Ile Ser Ser Met Val Ala
450 455 460
Ala Tyr Ile Pro Glu Glu Val His Pro Glu Asn Ile Leu Ala Ala Arg
465 470 475 480
Ser Ile Thr Gly Ile Pro Ala Glu Lys Tyr Asn Ser Arg Ser Ala Val
485 490 495
Asp Asn Ala Ile Glu Tyr Thr Asn Gly Ser Asn Ala Met Val Phe Thr
500 505 510
Gln Asn Gly Ser Ser Ile Thr Tyr Asn Ile Gln Ser Pro Glu Lys Arg
515 520 525
Lys Tyr Lys Ile Arg Leu Arg Val Ala Thr Asn Ser Asp Thr Ser Val
530 535 540
Gly Ile Ser Val Asn Gly Asn Ala Gln Gln Met Asn Ile Glu Asn Thr
545 550 555 560
Glu Ala Lys Ala Thr Asp Arg Thr Gly Ile Lys Gly Val Asn Gly Thr
565 570 575
Tyr Thr Val Ile Glu Gly Pro Thr Val Glu Leu Gln Glu Gly Lys Asn
580 585 590
Thr Ile Thr Ile Asp His Lys Asp Ala Asn Lys Leu Val Leu Asp Arg
595 600 605
Ile Glu Phe Glu His Gln Pro Val Ala Pro Glu His Gly Asn Pro Gly
610 615 620
Trp Gln Thr Ile Asp Gly Lys Lys Tyr Leu Tyr Gly Lys Asp Gly Lys
625 630 635 640
Leu Glu Met Asn Ser Phe Gln Met Met Asp Gly Lys Gln Val Tyr Leu
645 650 655
Asn Asp Asp Gly Ala Met Val Lys Gly Trp Lys Thr Ile Asn Arg Glu
660 665 670
Lys Tyr Tyr Phe Asn Lys Thr Gly Asp Gly Ser Gly Met Glu Arg Glu
675 680 685
Gly Glu Val Ala Ile Gly Trp Lys Thr Ile Asn Gly Val Lys Tyr Tyr
690 695 700
Phe Asn Lys Thr Gly Asp Gly Ser Gly Met Glu Arg Glu Gly Ile Leu
705 710 715 720
Ala Thr Gly Trp Lys Thr Ile Asn Gly Val Lys Tyr Tyr Phe Asn Lys
725 730 735
Thr Gly Asp Gly Thr Gly Ser Asp His Glu Gly Glu Met Glu Ile Gly
740 745 750
Trp Lys Thr Ile Glu Gly Val Lys Tyr Tyr Leu Asp Pro Gln Asn Gly
755 760 765
Lys Leu Val Thr Gly Trp Lys Thr Ile Glu Gly Val Lys Tyr Tyr Phe
770 775 780
Asn Arg Ile Gly Asp Gly Ser Gly Met Thr His Glu Gly Glu Met Ala
785 790 795 800
Ile Gly Asn Met Thr Ile Asp Gly Val Lys His Tyr Phe Asn Lys Thr
805 810 815
Gly Asp Gly Thr Gly Tyr Asn Tyr Glu Gly Glu Leu Glu Trp
820 825 830
<210> 62
<211> 644
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 62
Met Lys Asn Lys Lys Lys Tyr Met Lys Pro Leu Ala Val Gly Leu Leu
1 5 10 15
Ala Thr Asn Ile Ile Gly Phe Gly Thr Gln Thr Val Ala Phe Ala Ala
20 25 30
Thr Asp Lys Ala Pro Thr Lys Glu Gln Ala Gln Lys Gly Ser Phe Asn
35 40 45
Pro Thr Val Leu Ala Ser Met Pro Ser Asp Thr Ser Glu Leu Gln Glu
50 55 60
Ile Leu Glu Asp Ala Ile Thr Asn Pro Asn Thr Gln Ile Ser Pro Lys
65 70 75 80
Leu Val Lys Asp Leu Gln Asp Lys Gly Phe Asp Tyr Leu Ser Ile Val
85 90 95
Lys Ala Ala Thr Gly Gly Leu Val Lys Lys Ile Pro Tyr Ala Gly Ser
100 105 110
Phe Leu Ser Pro Leu Val Glu Phe Ile Phe Pro Gly Lys Lys Tyr Leu
115 120 125
Thr Lys Glu Thr Val Trp Gly Ala Ile Gln Asp Lys Val Ser Asn Leu
130 135 140
Ile Asp Glu Lys Leu Thr Glu Asn Gln Val Asn Glu Leu Ile Ala Ser
145 150 155 160
Leu Thr Gly Ile Gln Asp Asn Leu Lys Lys Tyr Gln Val Ala Val Gly
165 170 175
Leu Val Asn Gly Lys Glu Pro Pro Ile Asp Lys Ser Ile Gln Met Asn
180 185 190
Ala Glu Thr Asp Arg Thr Lys Glu Asp Leu Arg Gly Tyr Ile Asp His
195 200 205
Leu Asp Ser Asp Leu Gly Lys Glu Val Pro Lys Phe Ala Val Lys Asn
210 215 220
Tyr Glu Ala Ala Ser Ile Pro Tyr Tyr Val Gln Ile Ala Asn Ile His
225 230 235 240
Leu Phe Leu Leu Arg Asp Ala Val Thr His Ala Asp Glu Trp Gly Leu
245 250 255
Thr Ala Lys Glu Lys Asn Thr Tyr Leu Ser Arg Leu Gln Ser Lys Ile
260 265 270
Ile Glu Tyr Ser Asn Thr Val Tyr Asp Ser Phe Asn Lys Gly Val Ala
275 280 285
Ala Ala Lys Asn Lys Asp Gly Asn Trp Asp Asn Val Asn Lys Phe Val
290 295 300
Arg Thr Met Thr Val Tyr Gly Leu Asp Phe Val Ala Leu Trp Pro Thr
305 310 315 320
Phe Asp Pro Ser Arg Tyr Asp Glu Pro Thr Lys Leu Gln Gln Thr Arg
325 330 335
Gln Val Tyr Ser Asp Leu Ile Gly Asn Pro Glu Ala Gly Ser Asn Tyr
340 345 350
Asp Thr Leu Leu Pro Gln Ile His Ser Ser Ile Phe Ala Gly Tyr Pro
355 360 365
Gly Glu Leu Lys Gln Phe Asn Val Ser Gln Ala Asp Arg Ile Asp Ser
370 375 380
Ile Lys Glu Val Phe Asp Trp Thr Gly Asp Gly Ser Gln Asp Tyr Thr
385 390 395 400
Gln Gln Leu Gly Asn Pro Asn Pro Ser Gly Tyr Ser Val Arg Asn Val
405 410 415
Thr Arg Asp Ala Pro Val Ile Gly Leu Ser Ala Tyr Lys Ser Trp Asn
420 425 430
Asn Asn Trp Tyr Asn Met Phe Ser Leu Ser Tyr Pro Gly Lys Glu Val
435 440 445
Asp Phe Gln Ser Arg Phe Arg Gln Ala Leu Arg Asp Gly Tyr Val Thr
450 455 460
Lys Thr Ser Ala Pro Ser Gly Gln Lys Val Ser Asn Ile Lys Ala Ala
465 470 475 480
Glu Thr Gly Lys Gly Lys Gly Thr Ile Ser Ser Met Val Ala Ala Tyr
485 490 495
Ile Pro Glu Glu Val His Pro Glu Asn Ile Leu Ala Ala Arg Ser Ile
500 505 510
Thr Gly Ile Pro Ala Glu Lys Tyr Asn Ser Arg Ser Ala Val Asp Asn
515 520 525
Ala Ile Glu Tyr Thr Asn Gly Ser Asn Ala Met Val Phe Thr Gln Asn
530 535 540
Gly Ser Ser Ile Thr Tyr Asn Ile Gln Ser Pro Glu Lys Arg Lys Tyr
545 550 555 560
Lys Ile Arg Leu Arg Val Ala Thr Asn Ser Asp Thr Ser Val Gly Ile
565 570 575
Ser Val Asn Gly Asn Ala Gln Gln Met Asn Ile Glu Asn Thr Glu Ala
580 585 590
Lys Ala Thr Asp Arg Thr Gly Ile Lys Gly Val Asn Gly Thr Tyr Thr
595 600 605
Val Ile Glu Gly Pro Thr Val Glu Leu Gln Glu Gly Lys Asn Thr Ile
610 615 620
Thr Ile Asp His Lys Asp Ala Asn Lys Leu Val Leu Asp Arg Ile Glu
625 630 635 640
Phe Glu His Gln
<210> 63
<211> 614
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 63
Met Ala Thr Asp Lys Ala Pro Thr Lys Glu Gln Ala Gln Lys Gly Ser
1 5 10 15
Phe Asn Pro Thr Val Leu Ala Ser Met Pro Ser Asp Thr Ser Glu Leu
20 25 30
Gln Glu Ile Leu Glu Asp Ala Ile Thr Asn Pro Asn Thr Gln Ile Ser
35 40 45
Pro Lys Leu Val Lys Asp Leu Gln Asp Lys Gly Phe Asp Tyr Leu Ser
50 55 60
Ile Val Lys Ala Ala Thr Gly Gly Leu Val Lys Lys Ile Pro Tyr Ala
65 70 75 80
Gly Ser Phe Leu Ser Pro Leu Val Glu Phe Ile Phe Pro Gly Lys Lys
85 90 95
Tyr Leu Thr Lys Glu Thr Val Trp Gly Ala Ile Gln Asp Lys Val Ser
100 105 110
Asn Leu Ile Asp Glu Lys Leu Thr Glu Asn Gln Val Asn Glu Leu Ile
115 120 125
Ala Ser Leu Thr Gly Ile Gln Asp Asn Leu Lys Lys Tyr Gln Val Ala
130 135 140
Val Gly Leu Val Asn Gly Lys Glu Pro Pro Ile Asp Lys Ser Ile Gln
145 150 155 160
Met Asn Ala Glu Thr Asp Arg Thr Lys Glu Asp Leu Arg Gly Tyr Ile
165 170 175
Asp His Leu Asp Ser Asp Leu Gly Lys Glu Val Pro Lys Phe Ala Val
180 185 190
Lys Asn Tyr Glu Ala Ala Ser Ile Pro Tyr Tyr Val Gln Ile Ala Asn
195 200 205
Ile His Leu Phe Leu Leu Arg Asp Ala Val Thr His Ala Asp Glu Trp
210 215 220
Gly Leu Thr Ala Lys Glu Lys Asn Thr Tyr Leu Ser Arg Leu Gln Ser
225 230 235 240
Lys Ile Ile Glu Tyr Ser Asn Thr Val Tyr Asp Ser Phe Asn Lys Gly
245 250 255
Val Ala Ala Ala Lys Asn Lys Asp Gly Asn Trp Asp Asn Val Asn Lys
260 265 270
Phe Val Arg Thr Met Thr Val Tyr Gly Leu Asp Phe Val Ala Leu Trp
275 280 285
Pro Thr Phe Asp Pro Ser Arg Tyr Asp Glu Pro Thr Lys Leu Gln Gln
290 295 300
Thr Arg Gln Val Tyr Ser Asp Leu Ile Gly Asn Pro Glu Ala Gly Ser
305 310 315 320
Asn Tyr Asp Thr Leu Leu Pro Gln Ile His Ser Ser Ile Phe Ala Gly
325 330 335
Tyr Pro Gly Glu Leu Lys Gln Phe Asn Val Ser Gln Ala Asp Arg Ile
340 345 350
Asp Ser Ile Lys Glu Val Phe Asp Trp Thr Gly Asp Gly Ser Gln Asp
355 360 365
Tyr Thr Gln Gln Leu Gly Asn Pro Asn Pro Ser Gly Tyr Ser Val Arg
370 375 380
Asn Val Thr Arg Asp Ala Pro Val Ile Gly Leu Ser Ala Tyr Lys Ser
385 390 395 400
Trp Asn Asn Asn Trp Tyr Asn Met Phe Ser Leu Ser Tyr Pro Gly Lys
405 410 415
Glu Val Asp Phe Gln Ser Arg Phe Arg Gln Ala Leu Arg Asp Gly Tyr
420 425 430
Val Thr Lys Thr Ser Ala Pro Ser Gly Gln Lys Val Ser Asn Ile Lys
435 440 445
Ala Ala Glu Thr Gly Lys Gly Lys Gly Thr Ile Ser Ser Met Val Ala
450 455 460
Ala Tyr Ile Pro Glu Glu Val His Pro Glu Asn Ile Leu Ala Ala Arg
465 470 475 480
Ser Ile Thr Gly Ile Pro Ala Glu Lys Tyr Asn Ser Arg Ser Ala Val
485 490 495
Asp Asn Ala Ile Glu Tyr Thr Asn Gly Ser Asn Ala Met Val Phe Thr
500 505 510
Gln Asn Gly Ser Ser Ile Thr Tyr Asn Ile Gln Ser Pro Glu Lys Arg
515 520 525
Lys Tyr Lys Ile Arg Leu Arg Val Ala Thr Asn Ser Asp Thr Ser Val
530 535 540
Gly Ile Ser Val Asn Gly Asn Ala Gln Gln Met Asn Ile Glu Asn Thr
545 550 555 560
Glu Ala Lys Ala Thr Asp Arg Thr Gly Ile Lys Gly Val Asn Gly Thr
565 570 575
Tyr Thr Val Ile Glu Gly Pro Thr Val Glu Leu Gln Glu Gly Lys Asn
580 585 590
Thr Ile Thr Ile Asp His Lys Asp Ala Asn Lys Leu Val Leu Asp Arg
595 600 605
Ile Glu Phe Glu His Gln
610
<210> 64
<211> 962
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 64
Val Ile Lys Met Asn Asn Thr Ile Val Pro Tyr Asn Val Leu Arg Asp
1 5 10 15
Asn Asp Met Ser Asn Ile Ser Gly Thr Ser Trp Asp Lys Asp Gln Ile
20 25 30
Ser Lys Gly Met Glu Asn Ser Ser Ile Leu Leu Glu Leu Ile Glu Lys
35 40 45
Gly Leu Asn Asp Gly Asn Asp Ile Leu Ser Leu Leu Gly Phe Ile Gly
50 55 60
Met Lys Ala Ile Glu Ala Ile Pro Val Val Gly Gly Thr Ile Ser Ser
65 70 75 80
Leu Ile Ser Leu Ile Leu Phe Pro Leu Lys Ser Lys Val Asp Tyr Gln
85 90 95
Glu Ile Trp Asn Gln Leu Lys Lys Ser Ile Glu Lys Met Val Asp Gln
100 105 110
Lys Ile Glu Glu Ala Met Met Ser Gln Leu Met Gln Glu Ile Ala Gly
115 120 125
Leu Ala Ala Ile Leu Glu Gln Tyr Arg Ala Ala Tyr Asp Leu Tyr Asn
130 135 140
Gly Lys Lys Val Phe Asn Ile Pro Asp Gly Met Thr Pro Gly Asp His
145 150 155 160
Leu Asn Thr Val Phe Val Ser Ala Asn Met Gln Phe Ile Gln Arg Met
165 170 175
Lys Ile Phe Gln Asn Pro Lys Tyr Lys Val Asp Leu Leu Pro Phe Phe
180 185 190
Val His Ala Ala Glu Met His Ile Leu Leu Val Arg Asp Ala Ala Ile
195 200 205
Tyr Gly Leu Glu Trp Gly Met Asp Glu Ala Val Asn Gln Tyr Tyr Lys
210 215 220
Met Glu Leu Lys Arg Leu Ile Asn Glu Tyr Ser Lys Tyr Val Leu Asp
225 230 235 240
Thr Tyr Gln Lys Gly Leu Lys Glu Val Thr Glu Arg Glu Leu Glu Asp
245 250 255
Asn Asp Phe Pro Thr Thr Trp Gln Lys Asp Asn Tyr Lys Asn Thr Val
260 265 270
Arg Trp Asn Val Ile Asn Lys Tyr Lys Arg Gly Met Ser Leu Thr Val
275 280 285
Phe Asp Phe Ala Tyr Lys Trp Lys Tyr Tyr His Glu Ser Tyr Gln Lys
290 295 300
Asn Leu Met Leu Asn Pro Thr Arg Thr Ile Tyr Ser Asp Ile Glu Gly
305 310 315 320
Ser Val Tyr Pro Tyr Thr Lys Glu Thr Ser Glu Ile Asp Asn Asp Ile
325 330 335
Asn Phe Thr Asn Leu Glu Tyr Arg Gly Ile Leu Lys Glu Val Gln Val
340 345 350
Asn Asn Ala Asp Arg Ile Asp Ser Leu Gln Ser Ile Phe Ile Arg Glu
355 360 365
Asn Gln Lys Phe Glu Asn Arg Lys Thr Gly Gly Ser Gly Gly Arg Gly
370 375 380
Thr Phe Val Asn Ile Glu Ser Pro Lys Asn Asn Pro Leu Thr Arg Val
385 390 395 400
Glu Thr Trp Ser Glu Leu Val Pro Phe Ala Leu Lys Phe Gly Phe Tyr
405 410 415
Asn Gly Glu Glu Thr Arg Leu Met Gly Gln Lys Thr Tyr Gly Lys Lys
420 425 430
Tyr Thr Ile Tyr Gln Tyr Ser Gly Asn Lys Val Val Ser Ile Met Gly
435 440 445
Phe Gly Leu Asn Ala Thr Ser Gly Phe Asp Ser Leu Asp Ala Met Val
450 455 460
Val Gly Phe Lys Gln Asp Asp Tyr Ile Pro Glu Asn Lys Ile Val Pro
465 470 475 480
Ile Asn Thr Asn Gly Glu Pro Leu Ile Gln Val Val Glu Val Gly Asn
485 490 495
Phe Tyr Glu Gly Lys Phe Glu Ser Asn Leu Lys Met Ile Glu Glu Pro
500 505 510
Met Phe Gly Asp Gly Val Leu Gln Phe Glu Asn His Ser Asn Asn Leu
515 520 525
Asn Gln Asp Asn Tyr Val Thr Tyr Gln Ile His Ala Glu Ile Glu Gly
530 535 540
Thr Tyr Glu Leu Asn Ala Ile Ile Gly Ala Asn Lys Gln Lys Asp Arg
545 550 555 560
Ile Ala Phe Lys Met Ser Leu Asn Gly Glu Gln Pro Glu Asn Phe Ile
565 570 575
Thr Glu Ser Phe Asp Asp Gly Asp Ile Trp Glu Gly Met Ser Leu Asn
580 585 590
Glu Glu Leu Val Tyr Lys Lys Asn Ser Leu Gly Asn Phe Gln Leu Asn
595 600 605
Lys Gly Val Asn Asn Ile Thr Ile Tyr Asn Gly Ala Leu Gln Ala Ser
610 615 620
Ala Asn Ile Lys Ile Trp Asn Leu Ala Thr Ile Glu Leu Thr Ile Asn
625 630 635 640
Ala Asp Ser Leu Lys Asp Pro Asp Ile Thr Thr Leu Tyr Asp Gln Asp
645 650 655
Asn Tyr Thr Gly Thr Lys Lys Leu Ile Phe Glu Asp Thr Asn Arg Leu
660 665 670
Glu Asp Phe Asn Asp Lys Ala Ser Ser Ile Lys Ala Glu Ser His Ile
675 680 685
Ala Gly Ile Arg Phe Tyr Gln Asn Tyr Asp Tyr Thr Gly Asp Phe Ile
690 695 700
Asp Leu Val Gly Gly Glu Lys Ile Ser Leu Lys Asn His Leu Phe Asn
705 710 715 720
Asn Arg Ala Ser Ser Val Lys Phe Ala Asn Ile Val Leu Tyr Asn Lys
725 730 735
Asp Asn Tyr Glu Gly Ser Arg Lys Phe Val Phe Glu Asp Ile Pro Glu
740 745 750
Leu Thr Asp Phe Asn Asp Gln Thr Ser Ser Ile Val Ile Ser Ser Asn
755 760 765
Val Ser Gly Ala Arg Leu Tyr Glu His Ala Asn Tyr Glu Gly Asn Tyr
770 775 780
Val Asp Val Val Gly Gly Gln Arg Leu Ser Leu Lys Asn His Val Leu
785 790 795 800
Asn Lys Glu Ile Thr Ser Ile Lys Phe Phe Lys Gly Asp Asp Glu Ile
805 810 815
Leu Asn Gly Ile Tyr Gln Ile Val Pro Ala Ile Asn Asn Thr Ser Val
820 825 830
Ile Val Asn Leu Glu Ser Ser Ala Phe Val Trp Leu Trp Gly Lys Asp
835 840 845
Ser Glu Phe Ala His Gln Trp Arg Val Glu Tyr Asp Glu Ser Glu Leu
850 855 860
Ala Tyr Gln Ile Lys Asn Met Ser Asn Glu Asn Ile Val Leu Ala Ala
865 870 875 880
Met Ile Val Thr Glu Pro Ser Tyr Ala His Gly Ile Pro Asn Met Lys
885 890 895
Asn Ser Leu Gln Tyr Trp Ile Phe Glu Tyr Val Gly Asn Gly Tyr Tyr
900 905 910
Ile Ile Lys Leu Lys Ala Asn Pro Lys Leu Val Leu Asp Val Asn Gly
915 920 925
Gly Asn Pro Asn Asp Gly Thr Ile Ile Lys Val Asn Tyr Ser His Asp
930 935 940
Leu Asn Ser Pro Asn Ile Asn Ala Gln Lys Phe Arg Phe Asn Asn Met
945 950 955 960
Asn Asn
<210> 65
<211> 1396
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 65
Met Gly Gly Leu Leu Met Asn Gln Tyr Tyr Asn Asn Asn Glu Tyr Glu
1 5 10 15
Ile Leu Asp Asn Gly Gly Arg Gly Tyr Gln Pro Arg Tyr Pro Leu Ala
20 25 30
Gln Ala Pro Gly Ser Glu Phe Gln Gln Met Asn Tyr Lys Asp Trp Met
35 40 45
Asp Arg Tyr Thr Gly Glu Glu Gln Ala Gly Gly Leu Ile Arg Ala Glu
50 55 60
Ile Asp Ala Asn Thr Ala Leu Thr Ala Thr Phe Gly Val Ala Trp Ala
65 70 75 80
Val Leu Gly Val Ser Asn Pro Val Gly Ser Ala Ile Ala Gly Ile Leu
85 90 95
Asn Val Val Thr Pro Ile Ile Phe Asn Gln Val Asp Pro Asn Ser Pro
100 105 110
Leu Lys Val Trp Glu Ser Met Ile Gly Tyr Ala Gln Ala Leu Ile Lys
115 120 125
Lys Glu Leu Thr Glu Thr Val Arg Asn Leu Ala Leu Asp His Ile Asp
130 135 140
Arg Ile Asn Asp Tyr Gln Ile Asp Tyr Lys Asn Lys Ala Arg Ile Trp
145 150 155 160
Glu Gly Asn Pro Thr Pro Gly Asn Ala Ser Gln Leu Leu Asp Ala Phe
165 170 175
Lys Asn Leu Lys Lys Ser Cys Gln Asp Ala Met Pro His Leu Asp Thr
180 185 190
Arg Gly Tyr Glu Thr Ile Leu Leu Pro Leu Tyr Ala Gln Ala Ala Asn
195 200 205
Ile His Leu Ile Val Leu Arg Asp Gly Leu Met His Gly Arg Ser Trp
210 215 220
Gly Met Thr Asn Glu Glu Tyr Glu Asp Leu Tyr Ser Gly Phe Phe Gly
225 230 235 240
Phe Lys Asn Arg Ile Ala Thr Tyr Thr Asn Tyr Cys Thr Asn Thr Phe
245 250 255
Asn Gln Gly Ala Thr Thr Asn Tyr Thr Ala Asp Met Glu Asp Cys Ala
260 265 270
Lys Tyr Pro Trp Val Arg Tyr Asn Gln Asp Glu Phe Tyr Ser Gly Phe
275 280 285
Phe Gly Pro Glu Asn Ser Cys Arg Gln Ser Gln Thr Thr Asn Ala Tyr
290 295 300
Pro Gln Pro Lys Ser Gly Phe Asn Gln Pro Leu Asn Leu Arg Asn Ser
305 310 315 320
Arg Gln Ser Pro Gly Glu Tyr Arg Asp Val Glu Ala Trp Asn Leu Arg
325 330 335
Asn Glu Phe Ile Ser Ser Met Thr Ile Gln Val Leu Asp Thr Val Ala
340 345 350
Leu Trp Pro Thr Tyr Asp Pro Arg Val Tyr Thr Ser Ala Val Lys Thr
355 360 365
Glu Phe Thr Arg Glu Ile Tyr Thr Ser Ile Arg Gly Thr Thr Tyr Arg
370 375 380
Ser Asp Pro Ala Gln Asn Thr Met Ser Ala Ile Glu Ala Arg Met Ile
385 390 395 400
Arg Pro Pro His Leu Phe Glu Trp Pro Gln Thr Met Thr Phe Phe Phe
405 410 415
Glu Asp Val Asn Val Arg Tyr Thr Trp Val Gly Asn Ser Trp Thr Lys
420 425 430
Gly Gln Ala Leu Ile Gly Leu Lys Thr Glu Ser Ala Arg Thr Leu Asp
435 440 445
Thr Asn Met Ile Thr Ala Leu Gln Gly Leu Thr Asp Glu Tyr Glu Tyr
450 455 460
Tyr His Ile Pro Leu Asp Val Ser Thr Arg Gln Lys Asp Ile Thr Asn
465 470 475 480
Ile Arg Thr Lys Gln Trp Phe Glu Pro Arg Glu Phe Ile Phe Tyr Arg
485 490 495
Ser Gly Gly Gln Gln Val Leu Ala Leu Gly Thr Ile Glu Glu Asp Ile
500 505 510
Pro Gly Tyr Ser Val Tyr Leu Val Asn Asp Tyr Ile Tyr Thr Pro Ser
515 520 525
Ile Arg Gln Asn Thr Phe Pro Pro Ile Glu Val Pro Glu Gly Thr Pro
530 535 540
Pro Pro Ser His Arg Leu Ser Trp Val Lys Phe Glu Pro Val Arg Asp
545 550 555 560
Asn Ala Thr Thr Phe Val Asp Pro Lys Gln Ile Gly Ala Thr Ile Phe
565 570 575
Gly Trp Thr His Thr Ser Val Asp Pro Asn Asn Thr Ile Asp Ala Thr
580 585 590
Lys Ile Thr Gln Ile Pro Ala Val Lys Ala Ser Gly Gly Thr Asp Tyr
595 600 605
Gln Val Ile Lys Gly Pro Gly Ser Thr Gly Gly Asp Leu Val Ser Leu
610 615 620
Pro Thr Tyr Ala Lys Ile Glu Ile Pro Leu Thr Ser Ser Ala Asp Gln
625 630 635 640
Leu Tyr Glu Val Arg Ile Arg Tyr Ala Ala Ala Glu Gln Lys Arg Ile
645 650 655
Arg Ile Gly Lys Lys Ser Arg Asp Phe Gln Trp Arg Tyr Ile Asp Tyr
660 665 670
Asp Val Pro Ser Thr Pro Tyr Ser Gly Gly Asn Leu Thr Tyr Asn Ala
675 680 685
Phe Ser Tyr Lys Glu Leu Gly Thr Val Gln Gly Val Leu Glu Leu Ser
690 695 700
Ile Glu Arg Ile Asp Pro Phe Asp Glu Pro Ile Ile Ile Asp Lys Ile
705 710 715 720
Glu Phe Ile Pro Ile Gln Gly Ser Val Glu Thr Tyr Glu Ala Asp Gln
725 730 735
Asp Leu Glu Lys Ala Arg Lys Ala Val Asn Ala Leu Phe Thr Ser Asp
740 745 750
Ala Lys Asn Ala Leu Gln Leu Glu Val Thr Asp Tyr Leu Val Asp Gln
755 760 765
Ala Ala Lys Leu Val Glu Cys Met Ser Asp Glu Ile Tyr Pro Gln Glu
770 775 780
Lys Met Cys Leu Leu Asp Gln Val Lys Val Ala Lys Arg Leu Ser Gln
785 790 795 800
Ala Arg Asn Leu Leu His His Gly Asp Phe Glu Ser Pro Asp Trp Phe
805 810 815
Gly Glu Asn Gly Trp Lys Thr Ser Asn His Val Ser Val Thr Ser Gly
820 825 830
Asn Pro Ile Phe Lys Gly Arg Tyr Leu Asn Met Pro Gly Ala Asp Asn
835 840 845
Pro Gln Leu Ser Asp Thr Val Phe Pro Thr Tyr Ala Tyr Gln Lys Ile
850 855 860
Asp Glu Ser Lys Leu Lys Pro Tyr Thr Arg Tyr Met Ile Arg Gly Phe
865 870 875 880
Val Gly Asn Ser Lys Asp Leu Glu Val Phe Ile Thr Arg Tyr Asp Lys
885 890 895
Glu Val His Lys Asn Met Asn Val Pro His Asp Ile Leu Pro Thr Asp
900 905 910
Pro Cys Thr Gly Thr Tyr Pro Leu Gly Glu Gly Pro Met Leu Thr Asn
915 920 925
His Thr Ile Pro Gln Asp Met Ser Cys Asp Pro Cys Asp Ala Gly Thr
930 935 940
Val Met Lys Val Gln Gln Thr Leu Val Lys Cys Glu Ala Pro His Ala
945 950 955 960
Phe Thr Phe His Ile Asp Thr Gly Glu Leu Asn Arg Asp Gln Asn Leu
965 970 975
Gly Ile Trp Val Gly Phe Lys Ile Gly Thr Thr Asp Gly Arg Ala Thr
980 985 990
Phe Asp Asn Leu Glu Val Ile Glu Ala Asn Pro Leu Thr Gly Glu Ala
995 1000 1005
Leu Ala Arg Val Lys Lys Arg Glu His Lys Trp Lys Gln Lys Trp
1010 1015 1020
Met Glu Lys Arg Thr Lys Ile Asp Lys Ala Val Gln Ile Ala Gln
1025 1030 1035
Gly Ala Ile Gln Ala Leu Phe Thr Asp Ser Asn Gln Asn Arg Leu
1040 1045 1050
Lys Pro Asp Ile Thr Leu Asn His Ile Leu Gln Val Glu Thr Glu
1055 1060 1065
Val Gln Lys Ile Pro Tyr Val Tyr Asn Ala Phe Leu Gln Gly Ala
1070 1075 1080
Leu Pro Pro Val Pro Gly Glu Thr Tyr Asp Ile Phe Gln Gln Leu
1085 1090 1095
Ser Ser Ala Val Ala Ile Ala Arg Ser Leu Tyr Glu Gln Arg Asn
1100 1105 1110
Val Leu Arg Asn Gly Asp Phe Met Ala Gly Leu Ser Asn Trp Arg
1115 1120 1125
Gly Ala Lys Gly Ala Ile Val Gln Lys Ile Gly Asn Ala Ser Val
1130 1135 1140
Leu Val Ile Ser Asp Trp Ser Ala Asn Leu Ser Gln Asp Val Pro
1145 1150 1155
Val Asn Pro Gly His Gly Tyr Ile Leu Arg Val Thr Ala Lys Lys
1160 1165 1170
Glu Gly Ser Gly Glu Gly Tyr Val Thr Ile Ser Asp Gly Thr Glu
1175 1180 1185
Asp Asn Thr Glu Thr Leu Lys Phe Thr Val Gly Glu Val Ala Thr
1190 1195 1200
Arg Leu Ala Gln Ser Asp Met His Ser Pro Met Gln Gly Arg Tyr
1205 1210 1215
Gln Glu Arg Asn Met Thr Asn Ala Pro Ser Glu Ala Tyr Gly Thr
1220 1225 1230
Asn Gly Tyr Thr Ser His Thr Met Thr Asn Tyr Pro Ser Asp Asn
1235 1240 1245
Asp Gly Thr Asn Ala Tyr Pro Gly Asn His Asn Arg Asn Tyr Gln
1250 1255 1260
Ser Glu Ser Phe Gly Ile Thr Pro Tyr Gly Asp Glu Thr Ser Met
1265 1270 1275
Met Asn Gly Pro Ser Asn Lys Tyr Glu Met Asn Ala Tyr Ala Gly
1280 1285 1290
Ser Thr Tyr Met Thr Asn Asn Gly Val Gly Cys Ser Cys Gly Cys
1295 1300 1305
Ser Thr Asn Ala Tyr Ala Gly Glu Asn Met Met Arg Asn Asp Ser
1310 1315 1320
Ser Asn Glu Asn Gly Val Gly Tyr Gly Cys Gly Cys Arg Thr His
1325 1330 1335
Pro Asn Asn Arg Thr Asp Ser Tyr Pro Met Thr Ala His Gly Ser
1340 1345 1350
Ser Leu Ser Gly Tyr Val Thr Lys Thr Phe Glu Ile Phe Pro Glu
1355 1360 1365
Thr Asn Arg Val Cys Ile Glu Ile Gly Glu Thr Ser Gly Thr Phe
1370 1375 1380
Lys Val Glu Ser Ile Glu Leu Ile Arg Met Asp Cys Glu
1385 1390 1395
<210> 66
<211> 725
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 66
Met Gly Gly Leu Leu Met Asn Gln Tyr Tyr Asn Asn Asn Glu Tyr Glu
1 5 10 15
Ile Leu Asp Asn Gly Gly Arg Gly Tyr Gln Pro Arg Tyr Pro Leu Ala
20 25 30
Gln Ala Pro Gly Ser Glu Phe Gln Gln Met Asn Tyr Lys Asp Trp Met
35 40 45
Asp Arg Tyr Thr Gly Glu Glu Gln Ala Gly Gly Leu Ile Arg Ala Glu
50 55 60
Ile Asp Ala Asn Thr Ala Leu Thr Ala Thr Phe Gly Val Ala Trp Ala
65 70 75 80
Val Leu Gly Val Ser Asn Pro Val Gly Ser Ala Ile Ala Gly Ile Leu
85 90 95
Asn Val Val Thr Pro Ile Ile Phe Asn Gln Val Asp Pro Asn Ser Pro
100 105 110
Leu Lys Val Trp Glu Ser Met Ile Gly Tyr Ala Gln Ala Leu Ile Lys
115 120 125
Lys Glu Leu Thr Glu Thr Val Arg Asn Leu Ala Leu Asp His Ile Asp
130 135 140
Arg Ile Asn Asp Tyr Gln Ile Asp Tyr Lys Asn Lys Ala Arg Ile Trp
145 150 155 160
Glu Gly Asn Pro Thr Pro Gly Asn Ala Ser Gln Leu Leu Asp Ala Phe
165 170 175
Lys Asn Leu Lys Lys Ser Cys Gln Asp Ala Met Pro His Leu Asp Thr
180 185 190
Arg Gly Tyr Glu Thr Ile Leu Leu Pro Leu Tyr Ala Gln Ala Ala Asn
195 200 205
Ile His Leu Ile Val Leu Arg Asp Gly Leu Met His Gly Arg Ser Trp
210 215 220
Gly Met Thr Asn Glu Glu Tyr Glu Asp Leu Tyr Ser Gly Phe Phe Gly
225 230 235 240
Phe Lys Asn Arg Ile Ala Thr Tyr Thr Asn Tyr Cys Thr Asn Thr Phe
245 250 255
Asn Gln Gly Ala Thr Thr Asn Tyr Thr Ala Asp Met Glu Asp Cys Ala
260 265 270
Lys Tyr Pro Trp Val Arg Tyr Asn Gln Asp Glu Phe Tyr Ser Gly Phe
275 280 285
Phe Gly Pro Glu Asn Ser Cys Arg Gln Ser Gln Thr Thr Asn Ala Tyr
290 295 300
Pro Gln Pro Lys Ser Gly Phe Asn Gln Pro Leu Asn Leu Arg Asn Ser
305 310 315 320
Arg Gln Ser Pro Gly Glu Tyr Arg Asp Val Glu Ala Trp Asn Leu Arg
325 330 335
Asn Glu Phe Ile Ser Ser Met Thr Ile Gln Val Leu Asp Thr Val Ala
340 345 350
Leu Trp Pro Thr Tyr Asp Pro Arg Val Tyr Thr Ser Ala Val Lys Thr
355 360 365
Glu Phe Thr Arg Glu Ile Tyr Thr Ser Ile Arg Gly Thr Thr Tyr Arg
370 375 380
Ser Asp Pro Ala Gln Asn Thr Met Ser Ala Ile Glu Ala Arg Met Ile
385 390 395 400
Arg Pro Pro His Leu Phe Glu Trp Pro Gln Thr Met Thr Phe Phe Phe
405 410 415
Glu Asp Val Asn Val Arg Tyr Thr Trp Val Gly Asn Ser Trp Thr Lys
420 425 430
Gly Gln Ala Leu Ile Gly Leu Lys Thr Glu Ser Ala Arg Thr Leu Asp
435 440 445
Thr Asn Met Ile Thr Ala Leu Gln Gly Leu Thr Asp Glu Tyr Glu Tyr
450 455 460
Tyr His Ile Pro Leu Asp Val Ser Thr Arg Gln Lys Asp Ile Thr Asn
465 470 475 480
Ile Arg Thr Lys Gln Trp Phe Glu Pro Arg Glu Phe Ile Phe Tyr Arg
485 490 495
Ser Gly Gly Gln Gln Val Leu Ala Leu Gly Thr Ile Glu Glu Asp Ile
500 505 510
Pro Gly Tyr Ser Val Tyr Leu Val Asn Asp Tyr Ile Tyr Thr Pro Ser
515 520 525
Ile Arg Gln Asn Thr Phe Pro Pro Ile Glu Val Pro Glu Gly Thr Pro
530 535 540
Pro Pro Ser His Arg Leu Ser Trp Val Lys Phe Glu Pro Val Arg Asp
545 550 555 560
Asn Ala Thr Thr Phe Val Asp Pro Lys Gln Ile Gly Ala Thr Ile Phe
565 570 575
Gly Trp Thr His Thr Ser Val Asp Pro Asn Asn Thr Ile Asp Ala Thr
580 585 590
Lys Ile Thr Gln Ile Pro Ala Val Lys Ala Ser Gly Gly Thr Asp Tyr
595 600 605
Gln Val Ile Lys Gly Pro Gly Ser Thr Gly Gly Asp Leu Val Ser Leu
610 615 620
Pro Thr Tyr Ala Lys Ile Glu Ile Pro Leu Thr Ser Ser Ala Asp Gln
625 630 635 640
Leu Tyr Glu Val Arg Ile Arg Tyr Ala Ala Ala Glu Gln Lys Arg Ile
645 650 655
Arg Ile Gly Lys Lys Ser Arg Asp Phe Gln Trp Arg Tyr Ile Asp Tyr
660 665 670
Asp Val Pro Ser Thr Pro Tyr Ser Gly Gly Asn Leu Thr Tyr Asn Ala
675 680 685
Phe Ser Tyr Lys Glu Leu Gly Thr Val Gln Gly Val Leu Glu Leu Ser
690 695 700
Ile Glu Arg Ile Asp Pro Phe Asp Glu Pro Ile Ile Ile Asp Lys Ile
705 710 715 720
Glu Phe Ile Pro Ile
725
<210> 67
<211> 131
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 67
Met Ile Pro Thr Arg Ile His Asn Ser Met Leu Ser Ala Asn Asp Asp
1 5 10 15
Leu Ser Phe His Ser Phe Gln Thr Cys Phe Leu Val Arg Gly Ala Gln
20 25 30
Ala Glu Glu Val Lys Gly Leu Lys Pro Gly Gln Tyr Gly Lys Tyr Gly
35 40 45
Arg Glu Tyr Glu Met Val Gln Thr Gly Arg His His Pro Asp Leu Phe
50 55 60
Glu Gly Ala Met Gln Ser Ala Ala Thr Ile Asp Asp Pro Glu Gln Val
65 70 75 80
Gln Ser Ser Val Ser Phe Cys Ser Met Val His Val Pro Gln Gly Phe
85 90 95
Val Tyr Val Pro Lys Gly Thr Arg Lys Phe Ala Tyr Ser Leu Pro Asn
100 105 110
Leu Ser Val Val Lys Asp Thr Cys Gln Lys Thr Ile Ala Ala Gln Ser
115 120 125
Met Ser Arg
130
<210> 68
<211> 875
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 68
Met Ser Leu Leu Ile Lys Lys Glu Arg Asn Val Met Pro Asn Glu Met
1 5 10 15
Asn Ser Lys Asn Asn Met Asn Thr Thr Arg Ile Asp Asp Lys Asp Leu
20 25 30
Leu Asp Ser Lys Gly Arg Glu Cys Gln Pro Arg Tyr Pro Ser Ala Glu
35 40 45
Ala Pro Asp Val Met Asn Pro Ile Glu Lys Ile Tyr Thr Arg Gly Gln
50 55 60
Ser Ala Asn Ile Ser Asn Ala Thr Ser Thr Thr Ile Asp Val Thr Ser
65 70 75 80
Gln Ile Leu Arg Gly Ser Lys Lys Leu Ala Gly Leu Ile Ile Lys Val
85 90 95
Leu Ile Lys Ile Leu Trp Pro Lys Gly Thr Gln Gln Glu Trp Glu Asn
100 105 110
Phe Met Asp Ala Val Glu Gln Leu Ile His Glu Lys Leu Asp Thr Tyr
115 120 125
Ala Arg Asn Lys Ala Thr Ala Asp Leu Glu Gly Leu Gln Arg Val Leu
130 135 140
Ser Glu Tyr Ile Asp Lys Leu Thr Ile Tyr Thr Asp Asp Pro Asn Ala
145 150 155 160
Ala His Ala Gln Ser Leu Arg Thr Gln Val Ile Thr Thr Asp Asn Leu
165 170 175
Phe Glu Tyr Ser Met Pro Ser Phe Ala Ile Arg Asp Tyr Glu Met Gln
180 185 190
Leu Leu Thr Val Tyr Ala Gln Ala Ala Asn Leu His Leu Ala Phe Leu
195 200 205
Glu Asp Ile Val Arg Phe Gly Asp Glu Trp Gly Phe Thr Pro Ala Glu
210 215 220
Ile Ala Asp Phe His Lys Ser Met Lys Glu Arg Thr Asp Glu Tyr Thr
225 230 235 240
Asn His Cys Val Ser Thr Tyr Asn Lys Gly Leu Glu Lys Ala Lys Leu
245 250 255
Leu Thr Ala Asp Leu Cys Asn His Ser Thr Tyr Pro Trp Thr Arg Tyr
260 265 270
Asn Gln Gly Trp Arg Lys Glu Glu Thr Pro Asn Cys Ser Leu Gln Gln
275 280 285
Ile Asp Ser Cys Glu Gln Pro Leu Asn Glu Ser Asp Glu Gln Gly Thr
290 295 300
Pro Tyr Met Gly Pro Glu Arg Arg Ser Ile Asp Glu Tyr Gln Lys Leu
305 310 315 320
Glu Asn Trp Asn Leu Tyr Asn Ala Tyr Arg Arg Asp Met Thr Leu Met
325 330 335
Val Leu Asp Ile Val Ser Leu Trp Pro Thr Tyr Asp Pro Lys Leu Tyr
340 345 350
Pro Thr Ser Tyr Gly Val Thr Ser Glu Leu Thr Arg Glu Leu Tyr Thr
355 360 365
Asp Ile Arg Gly Thr Thr Tyr Arg Ser Asp Glu Ser Gln Asn Ser Leu
370 375 380
Asp Ala Ile Glu Gly Arg Met Ile Pro Gln Pro Ser Leu Phe Arg Trp
385 390 395 400
Leu Phe Gly Ile Ser Ile Lys Met Asn Gln Phe Ile Gly Thr Gly Gly
405 410 415
Ser Asp Asp Trp Thr Ser Gly Asn Leu Met Leu Gly Met Arg Ile Met
420 425 430
Trp Arg Lys Thr Leu Thr Glu Gln Trp Glu Gly Leu Tyr Phe Gly Gly
435 440 445
Gly Gly Asn Leu Tyr Ile Asn Leu Asn Pro Leu Asp Tyr Ala Glu Gly
450 455 460
Ile Thr His Val Asn Thr Arg Gln Trp Phe Glu Pro Arg Trp Phe Glu
465 470 475 480
Phe Tyr Ser Gly Gly Tyr Asp Thr Glu Leu Lys Phe Gln Asn Lys Cys
485 490 495
Gly Thr Ile Glu Glu Lys Ser Pro Phe Trp Ala Ala Gly Pro Tyr Asn
500 505 510
Tyr Ile Tyr Gln Pro Gly Val Arg Thr Ser Gln Ile Pro Glu His Arg
515 520 525
Leu Ser Trp Met Thr Tyr Glu Pro Val Arg Glu Asn Ala Pro Phe Val
530 535 540
Tyr Pro Gln Tyr Lys Gln Leu Gly Ala Val Ala Leu Gly Trp Thr Ser
545 550 555 560
Asn Ser Val Asp Gln Lys Asn Ala Ile Asp Leu Asp Lys Ile Thr Ser
565 570 575
Leu Pro Ala Val Lys Gly Asn Glu Ile Arg Gly Pro Gly Ser Val Val
580 585 590
Arg Gly Pro Gly Ser Thr Gly Gly Asn Leu Val Leu Leu Asn Pro Thr
595 600 605
Gly Glu Val Ser Ile Thr Val Ser Gln Pro Ile Leu Asn Lys Pro Gly
610 615 620
Ala Gly Tyr Tyr His Val Arg Ile Arg Tyr Ala Ala Ile Ala Asn Gly
625 630 635 640
Lys Leu Asn Val Lys Arg Ser Val Asn Ser Ser Thr Gln Glu Ser Val
645 650 655
Thr Tyr Asp Tyr Lys Gln Thr Thr Ala Arg Asp Leu Thr Tyr Ser Ser
660 665 670
Phe Gln Tyr Leu Glu Val Tyr Asp Phe Asn Pro Val Thr Thr Ala Ser
675 680 685
Gln Phe Glu Val Leu Leu Thr Asn Glu Ser Gly Gly Pro Ile Tyr Ile
690 695 700
Asp Lys Ile Glu Phe Ile Pro Phe Gln Gly Thr Thr Gly Pro Glu Ala
705 710 715 720
Pro Gly Ile Pro Ala Asp Ile Tyr Tyr Thr Ile Ser Pro Ala Ile Asp
725 730 735
Glu Asn Tyr Tyr Thr Ile Gly Gly Ser Thr Ile Trp Leu Ser Leu Lys
740 745 750
Glu Ser Phe Ser Asn Pro Pro Arg Asp Gly Gln Trp Arg Phe Val Tyr
755 760 765
Asp Thr Asn Lys Lys Ala Tyr Gln Ile Gly Thr Gln Leu Asn Glu Asp
770 775 780
Pro Ala Gly Gly Leu Phe Ala Val Leu Thr Ser Thr Glu Pro Asn Asn
785 790 795 800
Asn Leu Thr Met Tyr Pro Asn Leu Ser Leu Asp Gln Gln Tyr Trp Phe
805 810 815
Val Lys Asp Ala Gly Asp Gly Tyr Val Thr Leu Glu Asn Tyr Gly Tyr
820 825 830
Pro Gly Tyr Val Val Asn Val Pro Arg Val Tyr Gln Gly Ala Ile Leu
835 840 845
Thr Leu Ser Pro Phe Ile Gly Ser Met Asn His Lys Phe Lys Leu Thr
850 855 860
Lys Leu Ser Thr Asn Asp Glu Val Ile Leu Gly
865 870 875
<210> 69
<211> 712
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 69
Met Ser Leu Leu Ile Lys Lys Glu Arg Asn Val Met Pro Asn Glu Met
1 5 10 15
Asn Ser Lys Asn Asn Met Asn Thr Thr Arg Ile Asp Asp Lys Asp Leu
20 25 30
Leu Asp Ser Lys Gly Arg Glu Cys Gln Pro Arg Tyr Pro Ser Ala Glu
35 40 45
Ala Pro Asp Val Met Asn Pro Ile Glu Lys Ile Tyr Thr Arg Gly Gln
50 55 60
Ser Ala Asn Ile Ser Asn Ala Thr Ser Thr Thr Ile Asp Val Thr Ser
65 70 75 80
Gln Ile Leu Arg Gly Ser Lys Lys Leu Ala Gly Leu Ile Ile Lys Val
85 90 95
Leu Ile Lys Ile Leu Trp Pro Lys Gly Thr Gln Gln Glu Trp Glu Asn
100 105 110
Phe Met Asp Ala Val Glu Gln Leu Ile His Glu Lys Leu Asp Thr Tyr
115 120 125
Ala Arg Asn Lys Ala Thr Ala Asp Leu Glu Gly Leu Gln Arg Val Leu
130 135 140
Ser Glu Tyr Ile Asp Lys Leu Thr Ile Tyr Thr Asp Asp Pro Asn Ala
145 150 155 160
Ala His Ala Gln Ser Leu Arg Thr Gln Val Ile Thr Thr Asp Asn Leu
165 170 175
Phe Glu Tyr Ser Met Pro Ser Phe Ala Ile Arg Asp Tyr Glu Met Gln
180 185 190
Leu Leu Thr Val Tyr Ala Gln Ala Ala Asn Leu His Leu Ala Phe Leu
195 200 205
Glu Asp Ile Val Arg Phe Gly Asp Glu Trp Gly Phe Thr Pro Ala Glu
210 215 220
Ile Ala Asp Phe His Lys Ser Met Lys Glu Arg Thr Asp Glu Tyr Thr
225 230 235 240
Asn His Cys Val Ser Thr Tyr Asn Lys Gly Leu Glu Lys Ala Lys Leu
245 250 255
Leu Thr Ala Asp Leu Cys Asn His Ser Thr Tyr Pro Trp Thr Arg Tyr
260 265 270
Asn Gln Gly Trp Arg Lys Glu Glu Thr Pro Asn Cys Ser Leu Gln Gln
275 280 285
Ile Asp Ser Cys Glu Gln Pro Leu Asn Glu Ser Asp Glu Gln Gly Thr
290 295 300
Pro Tyr Met Gly Pro Glu Arg Arg Ser Ile Asp Glu Tyr Gln Lys Leu
305 310 315 320
Glu Asn Trp Asn Leu Tyr Asn Ala Tyr Arg Arg Asp Met Thr Leu Met
325 330 335
Val Leu Asp Ile Val Ser Leu Trp Pro Thr Tyr Asp Pro Lys Leu Tyr
340 345 350
Pro Thr Ser Tyr Gly Val Thr Ser Glu Leu Thr Arg Glu Leu Tyr Thr
355 360 365
Asp Ile Arg Gly Thr Thr Tyr Arg Ser Asp Glu Ser Gln Asn Ser Leu
370 375 380
Asp Ala Ile Glu Gly Arg Met Ile Pro Gln Pro Ser Leu Phe Arg Trp
385 390 395 400
Leu Phe Gly Ile Ser Ile Lys Met Asn Gln Phe Ile Gly Thr Gly Gly
405 410 415
Ser Asp Asp Trp Thr Ser Gly Asn Leu Met Leu Gly Met Arg Ile Met
420 425 430
Trp Arg Lys Thr Leu Thr Glu Gln Trp Glu Gly Leu Tyr Phe Gly Gly
435 440 445
Gly Gly Asn Leu Tyr Ile Asn Leu Asn Pro Leu Asp Tyr Ala Glu Gly
450 455 460
Ile Thr His Val Asn Thr Arg Gln Trp Phe Glu Pro Arg Trp Phe Glu
465 470 475 480
Phe Tyr Ser Gly Gly Tyr Asp Thr Glu Leu Lys Phe Gln Asn Lys Cys
485 490 495
Gly Thr Ile Glu Glu Lys Ser Pro Phe Trp Ala Ala Gly Pro Tyr Asn
500 505 510
Tyr Ile Tyr Gln Pro Gly Val Arg Thr Ser Gln Ile Pro Glu His Arg
515 520 525
Leu Ser Trp Met Thr Tyr Glu Pro Val Arg Glu Asn Ala Pro Phe Val
530 535 540
Tyr Pro Gln Tyr Lys Gln Leu Gly Ala Val Ala Leu Gly Trp Thr Ser
545 550 555 560
Asn Ser Val Asp Gln Lys Asn Ala Ile Asp Leu Asp Lys Ile Thr Ser
565 570 575
Leu Pro Ala Val Lys Gly Asn Glu Ile Arg Gly Pro Gly Ser Val Val
580 585 590
Arg Gly Pro Gly Ser Thr Gly Gly Asn Leu Val Leu Leu Asn Pro Thr
595 600 605
Gly Glu Val Ser Ile Thr Val Ser Gln Pro Ile Leu Asn Lys Pro Gly
610 615 620
Ala Gly Tyr Tyr His Val Arg Ile Arg Tyr Ala Ala Ile Ala Asn Gly
625 630 635 640
Lys Leu Asn Val Lys Arg Ser Val Asn Ser Ser Thr Gln Glu Ser Val
645 650 655
Thr Tyr Asp Tyr Lys Gln Thr Thr Ala Arg Asp Leu Thr Tyr Ser Ser
660 665 670
Phe Gln Tyr Leu Glu Val Tyr Asp Phe Asn Pro Val Thr Thr Ala Ser
675 680 685
Gln Phe Glu Val Leu Leu Thr Asn Glu Ser Gly Gly Pro Ile Tyr Ile
690 695 700
Asp Lys Ile Glu Phe Ile Pro Phe
705 710
<210> 70
<211> 1018
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 70
Met Asn Gln Asn Tyr Asn Asp Glu Tyr Glu Ile Met Asn Thr Gly Val
1 5 10 15
Met Gly Tyr Gln Ser Arg Tyr Pro Leu Ala Lys Thr Pro Gly Ser Glu
20 25 30
Ile Gln Gln Met Asn Tyr Lys Asp Trp Met Asp Met Tyr Thr His Gly
35 40 45
Glu Ser Gly Glu Thr Phe Ile Ser Ala Arg Asp Gly Val Ile Ile Gly
50 55 60
Thr Gly Ile Gly Trp Ala Ile Leu Gly Phe Val Pro Val Leu Gly Pro
65 70 75 80
Gly Leu Ser Ala Ile Ser Gly Ile Leu Asn Val Leu Leu Pro Ile Leu
85 90 95
Trp Pro Glu Asp Ala Val Thr Ala Glu Ser Gln Val Ser Trp Asp Lys
100 105 110
Leu Met Arg Thr Ala Glu Glu Ile Ala Asn Lys Ala Ile Asp Ala Gln
115 120 125
Val Arg Ala Glu Ala Ile Thr Glu Leu Gln Gly Ile Gln Glu Ala Ile
130 135 140
Gln Leu Tyr His Gly Ala Val Cys Asp Trp Lys Lys Ala Pro Thr Asn
145 150 155 160
Val Arg Leu Gln Glu Thr Val Arg Thr Gln Phe Ile Ala Thr Asn Thr
165 170 175
Val Ile Phe Asn Arg Met Pro Ser Phe Arg Val Gln Gly Phe Glu Thr
180 185 190
Ser Leu Leu Ser Thr Tyr Ala Gln Ala Ala Asn Leu His Leu Ala Phe
195 200 205
Leu Arg Asp Gly Val Gln Phe Gly Thr Glu Trp Gly Leu Asp Pro Thr
210 215 220
Thr Val Asp Arg Tyr Tyr Gly Tyr Leu Thr Ala Asn Thr Ala Ser Tyr
225 230 235 240
Thr Asn Tyr Cys Val Arg Trp Tyr Asp Lys Gly Leu Ser Asp Leu Arg
245 250 255
Thr Thr Asn Asn Trp Asn Lys Tyr Asn Asn Phe Arg Arg Asp Met Thr
260 265 270
Ile Met Ala Met Asp Leu Val Ser Leu Trp Pro Thr Tyr Asp Pro Lys
275 280 285
Arg Tyr Pro Thr Phe Thr Lys Ser Gln Leu Thr Arg Thr Val Tyr Thr
290 295 300
Pro Trp Ile Gly His Ile Lys Asn Arg Trp Met Gly His Pro Gln Glu
305 310 315 320
Val Pro Phe Ser Glu Ile Glu Ser Asn Leu Ile Asp Ile Pro Arg Leu
325 330 335
Phe Ala Trp Leu Arg Glu Leu Ile Val Tyr Ser Val Asp Pro Gln Arg
340 345 350
Arg Gly Tyr Ser Thr Leu Val Cys Gly Arg Lys Gln Arg Phe Gln Asn
355 360 365
Thr Leu Ser Asn Asn Leu Trp Asp Arg Gly Met Lys Gly Ser His Gly
370 375 380
Glu Glu Pro Pro Gln Thr Leu Thr Ile Pro Ser Pro Gln Ser Lys Asp
385 390 395 400
Asp Ile Trp Lys Ile Asn Thr Thr Ser Thr Leu Met Ala Gln Gly Pro
405 410 415
Glu Asn Thr Gln Ile Lys Gly Trp Asp Phe Ser Phe Thr Asn Ser Gln
420 425 430
Thr Gln Thr Ile Ser Thr Ile Tyr Gln Gly Gln Glu Ser Leu Val Tyr
435 440 445
Ala Val Ser Gly Leu Pro Cys Arg Ser Ser Tyr Asn Leu Cys Asp Pro
450 455 460
Cys Asp Pro Asp Cys Lys Val Asp Ile Leu Thr Pro Gly Asp Pro Cys
465 470 475 480
Asn Asp Lys Leu Leu Tyr Ser His Arg Phe Ala Tyr Leu Gly Ala Gly
485 490 495
Ile Asp Pro Tyr Met Gly Tyr Pro Gly Arg Ala Gly Gly Tyr Gly Tyr
500 505 510
Tyr Ser Tyr Gly Trp Thr His Ile Ser Ala Asp Ala Asn Asn Leu Ile
515 520 525
Asp Thr Glu Lys Ile Thr Gln Ile Pro Ala Val Lys Ala Ser Gly Gly
530 535 540
Thr Asp Tyr Gln Val Ile Arg Gly Pro Gly Ser Thr Gly Gly Asp Leu
545 550 555 560
Val Ser Leu Ala Thr Asp Ala Arg Ile Glu Leu Lys Leu Thr Ala Ser
565 570 575
Ile Ala Lys Tyr Tyr Gln Met Arg Ile Arg Tyr Ala Ser Pro Val Gln
580 585 590
Thr Arg Leu Arg Leu Glu Thr Tyr Gln Asp Asp Ile Ser Gly Gly Pro
595 600 605
Val Asn Tyr Asp Ile Pro Ser Thr Pro Tyr Ser Gly Asn Gln Asn Glu
610 615 620
Leu Ser Tyr Asn Ala Phe Gly Tyr Lys His Val Met Asn Val Tyr Ile
625 630 635 640
Ser Leu Thr Gly Thr Lys Ile Ser Ile Gln Arg Val Asp Pro Phe Asp
645 650 655
Tyr Pro Phe Ile Ile Asp Lys Ile Glu Phe Ile Pro Ile Glu Gly Ser
660 665 670
Leu Glu Ala Tyr Gln Ala Asp Gln Asp Leu Glu Lys Ala Arg Lys Lys
675 680 685
Val Asn Ala Leu Phe Met Ser Asp Ala Lys Asp Val Leu Lys Leu Asn
690 695 700
Val Thr Asp Tyr Ala Leu Asp Gln Ala Ala Asn Leu Val Glu Cys Leu
705 710 715 720
Ser Asp Glu Phe Cys Ala Gln Glu Lys Met Ile Leu Leu Asp Gln Val
725 730 735
Lys Phe Ala Lys Arg Leu Ser Gln Ser Arg Asn Leu Leu Asn Tyr Gly
740 745 750
Asp Phe Glu Ser Pro Tyr Trp Ser Gly Glu Asn Gly Trp Lys Thr Ser
755 760 765
Pro His Ile His Val Ala Ser Gly Asn Pro Ile Phe Lys Gly His Tyr
770 775 780
Leu His Met Pro Gly Ala Asn Gln Pro Gln Thr Ser Gly Thr Val Tyr
785 790 795 800
Pro Thr Tyr Ile Tyr Gln Lys Val Asp Glu Ser Lys Leu Lys Pro Tyr
805 810 815
Thr Arg Tyr His Val Arg Gly Phe Val Gly Asn Ser Lys Asp Leu Glu
820 825 830
Leu Leu Val Glu Arg Tyr Gly Lys Glu Ile His Val Glu Met Asp Val
835 840 845
Pro Asn Asp Ile Arg Tyr Thr Leu Pro Met Asn Glu Cys Gly Gly Phe
850 855 860
Glu Arg Cys Ser His Glu Pro His His Thr Cys Thr Cys Lys Asp Thr
865 870 875 880
Ala Arg Val Asp Thr Gly Cys Gln Cys Gln Asp Lys Met Asn Arg Met
885 890 895
Thr Thr Gly Val Tyr Thr Asn Ile Pro Thr Ala Ser Glu Met Tyr Pro
900 905 910
Asn Gly Tyr Leu Val His Lys Ser Cys Gly Cys Lys Asp Pro His Val
915 920 925
Phe Ser Phe His Ile Asp Thr Gly Cys Val Asp Leu Lys Glu Asn Leu
930 935 940
Gly Leu Leu Phe Ala Phe Lys Ile Ser Ser Thr Asp Gly Val Ala Asn
945 950 955 960
Val Asp Asn Leu Glu Ile Ile Glu Gly Gln Pro Leu Thr Gly Glu Ala
965 970 975
Leu Ala Arg Val Lys Lys Arg Glu His Lys Trp Lys Glu Thr Met Lys
980 985 990
Gln Lys Arg Cys Lys Thr Asp Glu Ala Val Gln Ala Val Gln Thr Ala
995 1000 1005
Ile Asn Ala Leu Phe Thr Asn Ala Gln Tyr
1010 1015
<210> 71
<211> 669
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 71
Met Asn Gln Asn Tyr Asn Asp Glu Tyr Glu Ile Met Asn Thr Gly Val
1 5 10 15
Met Gly Tyr Gln Ser Arg Tyr Pro Leu Ala Lys Thr Pro Gly Ser Glu
20 25 30
Ile Gln Gln Met Asn Tyr Lys Asp Trp Met Asp Met Tyr Thr His Gly
35 40 45
Glu Ser Gly Glu Thr Phe Ile Ser Ala Arg Asp Gly Val Ile Ile Gly
50 55 60
Thr Gly Ile Gly Trp Ala Ile Leu Gly Phe Val Pro Val Leu Gly Pro
65 70 75 80
Gly Leu Ser Ala Ile Ser Gly Ile Leu Asn Val Leu Leu Pro Ile Leu
85 90 95
Trp Pro Glu Asp Ala Val Thr Ala Glu Ser Gln Val Ser Trp Asp Lys
100 105 110
Leu Met Arg Thr Ala Glu Glu Ile Ala Asn Lys Ala Ile Asp Ala Gln
115 120 125
Val Arg Ala Glu Ala Ile Thr Glu Leu Gln Gly Ile Gln Glu Ala Ile
130 135 140
Gln Leu Tyr His Gly Ala Val Cys Asp Trp Lys Lys Ala Pro Thr Asn
145 150 155 160
Val Arg Leu Gln Glu Thr Val Arg Thr Gln Phe Ile Ala Thr Asn Thr
165 170 175
Val Ile Phe Asn Arg Met Pro Ser Phe Arg Val Gln Gly Phe Glu Thr
180 185 190
Ser Leu Leu Ser Thr Tyr Ala Gln Ala Ala Asn Leu His Leu Ala Phe
195 200 205
Leu Arg Asp Gly Val Gln Phe Gly Thr Glu Trp Gly Leu Asp Pro Thr
210 215 220
Thr Val Asp Arg Tyr Tyr Gly Tyr Leu Thr Ala Asn Thr Ala Ser Tyr
225 230 235 240
Thr Asn Tyr Cys Val Arg Trp Tyr Asp Lys Gly Leu Ser Asp Leu Arg
245 250 255
Thr Thr Asn Asn Trp Asn Lys Tyr Asn Asn Phe Arg Arg Asp Met Thr
260 265 270
Ile Met Ala Met Asp Leu Val Ser Leu Trp Pro Thr Tyr Asp Pro Lys
275 280 285
Arg Tyr Pro Thr Phe Thr Lys Ser Gln Leu Thr Arg Thr Val Tyr Thr
290 295 300
Pro Trp Ile Gly His Ile Lys Asn Arg Trp Met Gly His Pro Gln Glu
305 310 315 320
Val Pro Phe Ser Glu Ile Glu Ser Asn Leu Ile Asp Ile Pro Arg Leu
325 330 335
Phe Ala Trp Leu Arg Glu Leu Ile Val Tyr Ser Val Asp Pro Gln Arg
340 345 350
Arg Gly Tyr Ser Thr Leu Val Cys Gly Arg Lys Gln Arg Phe Gln Asn
355 360 365
Thr Leu Ser Asn Asn Leu Trp Asp Arg Gly Met Lys Gly Ser His Gly
370 375 380
Glu Glu Pro Pro Gln Thr Leu Thr Ile Pro Ser Pro Gln Ser Lys Asp
385 390 395 400
Asp Ile Trp Lys Ile Asn Thr Thr Ser Thr Leu Met Ala Gln Gly Pro
405 410 415
Glu Asn Thr Gln Ile Lys Gly Trp Asp Phe Ser Phe Thr Asn Ser Gln
420 425 430
Thr Gln Thr Ile Ser Thr Ile Tyr Gln Gly Gln Glu Ser Leu Val Tyr
435 440 445
Ala Val Ser Gly Leu Pro Cys Arg Ser Ser Tyr Asn Leu Cys Asp Pro
450 455 460
Cys Asp Pro Asp Cys Lys Val Asp Ile Leu Thr Pro Gly Asp Pro Cys
465 470 475 480
Asn Asp Lys Leu Leu Tyr Ser His Arg Phe Ala Tyr Leu Gly Ala Gly
485 490 495
Ile Asp Pro Tyr Met Gly Tyr Pro Gly Arg Ala Gly Gly Tyr Gly Tyr
500 505 510
Tyr Ser Tyr Gly Trp Thr His Ile Ser Ala Asp Ala Asn Asn Leu Ile
515 520 525
Asp Thr Glu Lys Ile Thr Gln Ile Pro Ala Val Lys Ala Ser Gly Gly
530 535 540
Thr Asp Tyr Gln Val Ile Arg Gly Pro Gly Ser Thr Gly Gly Asp Leu
545 550 555 560
Val Ser Leu Ala Thr Asp Ala Arg Ile Glu Leu Lys Leu Thr Ala Ser
565 570 575
Ile Ala Lys Tyr Tyr Gln Met Arg Ile Arg Tyr Ala Ser Pro Val Gln
580 585 590
Thr Arg Leu Arg Leu Glu Thr Tyr Gln Asp Asp Ile Ser Gly Gly Pro
595 600 605
Val Asn Tyr Asp Ile Pro Ser Thr Pro Tyr Ser Gly Asn Gln Asn Glu
610 615 620
Leu Ser Tyr Asn Ala Phe Gly Tyr Lys His Val Met Asn Val Tyr Ile
625 630 635 640
Ser Leu Thr Gly Thr Lys Ile Ser Ile Gln Arg Val Asp Pro Phe Asp
645 650 655
Tyr Pro Phe Ile Ile Asp Lys Ile Glu Phe Ile Pro Ile
660 665
<210> 72
<211> 662
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 72
Met Gly Gly Leu Leu Met Asn Gln Tyr Tyr Asn Asn Asn Glu Tyr Glu
1 5 10 15
Ile Leu Asp Asn Gly Val Gln Cys Cys Ser Pro Arg Tyr Pro Leu Ala
20 25 30
Gln Ala Pro Gly Ser Glu Trp Gln Asn Met Asn Tyr Gln Asp Val Met
35 40 45
Asn Thr Ser Val Gly Arg Glu Tyr Gln Ala Val Gln Gln Val Asn Val
50 55 60
Gly Glu Ala Val Gly Ala Ala Leu Gly Ile Leu Thr Thr Ile Leu Lys
65 70 75 80
Ala Val Asn Pro Thr Leu Gly Thr Ala Ala Gly Val Ile Ser Ser Ile
85 90 95
Phe Gly Phe Leu Trp Lys Arg Phe Gly Thr Asp Pro Gln Ala Gln Trp
100 105 110
Lys Gln Phe Met Glu Ala Val Glu Tyr Leu Val Ser Gln Lys Ile Thr
115 120 125
Asp Ala Val Arg Ser Lys Ala Val Ser Glu Leu Glu Gly Val Gln Arg
130 135 140
Ala Val Glu Leu Tyr Gln Glu Ala Ala Asn Ala Trp Asn Met Asn Pro
145 150 155 160
Asp Asn Ala Ala Ala Lys Glu Arg Ile Arg Arg Gln Tyr Thr Ser Thr
165 170 175
Asn Thr Val Ile Glu Tyr Ala Met Pro Ser Phe Arg Val Gly Ser Phe
180 185 190
Glu Val Pro Leu Leu Thr Val Tyr Ala Gln Ala Ala Asn Leu His Leu
195 200 205
Leu Leu Leu Arg Asp Ala Val Lys Phe Gly Ala Gly Trp Gly Phe Ser
210 215 220
Pro Thr Glu Val Glu Asp Asn Tyr Thr Arg Leu Gln Ala Arg Thr Ala
225 230 235 240
Glu Tyr Thr Asn His Cys Thr Asn Thr Tyr Asp Lys Gly Leu Lys Gln
245 250 255
Ala Tyr Asp Leu Ala Pro Asn Pro Thr Asp Tyr Asn Lys Tyr Pro Tyr
260 265 270
Leu Asn Pro Tyr Ser Lys Asp Pro Ile Tyr Gly Lys Tyr Tyr Thr Ala
275 280 285
Pro Val Asp Trp Asn Leu Phe Asn Asp Phe Arg Arg Asp Met Thr Leu
290 295 300
Met Val Leu Asp Ile Val Ala Val Trp Pro Thr Tyr Asn Pro Arg Leu
305 310 315 320
Tyr Asn Asn Pro Asn Gly Val Gln Ile Gln Leu Ser Arg Glu Val Tyr
325 330 335
Ser Thr Val Tyr Gly Arg Gly Trp Ser Asn Asn Ser Thr Val Asp Ala
340 345 350
Ile Glu Ser Thr Leu Val Arg Pro Pro His Leu Val Thr Glu Leu Thr
355 360 365
Lys Leu Thr Phe Asp Glu Arg Asn Leu Tyr Glu Ala Glu Thr Val Pro
370 375 380
Val Ala Phe Arg Lys Val Thr Asn Thr Leu His Tyr Val Gly Ser Ser
385 390 395 400
Thr Thr Trp Glu Gln Ser Phe Ser Ala Thr Pro Val Gly Ser Leu Lys
405 410 415
Ala Val His Asn Val Leu Ala Thr Asp Ile Gly Asn Leu Ala Leu Ser
420 425 430
Leu Gly Ala Val Pro Leu Gly Phe Ser Phe Tyr Asn Lys Asn Asp Gln
435 440 445
His Ile Thr Thr Val Gly Tyr Ser Gly Gly Trp Trp Asn Gly Ile Pro
450 455 460
Lys Asp Glu Gly Ser Asn Gln Asn Ser His His Leu Ser Tyr Val Ala
465 470 475 480
Ala Leu Glu Thr Gln Thr Thr Ala Gly Trp Phe Pro Thr Tyr Pro Val
485 490 495
Pro Leu Leu Gly Glu Trp Gly Phe Gly Trp Gln His Asn Ser Leu Lys
500 505 510
Tyr Thr Asn Arg Leu Val Thr Asn Gln Ile Asn Gln Leu Pro Ala Val
515 520 525
Lys Ala Ser Ala Leu Val Asn His Ala Arg Val Ser Lys Gly Thr Thr
530 535 540
Gly Thr Gly Gly Asn Ile Ile Glu Leu Val Asn Pro Asn Cys Gly Phe
545 550 555 560
Ile Phe Asn Phe Tyr Ala Glu Pro Glu Asn Pro Gln Gly Gln Ala Phe
565 570 575
Asn Ile Arg Ile Arg Tyr Ser Ala Thr Val Pro Gly Lys Leu Asn Ile
580 585 590
Arg His Leu Glu Ser Gly Lys Glu Phe Met Asn Thr Pro Arg Asp Phe
595 600 605
Lys Gly Thr Thr Thr Gly Thr Asn Trp Ile Pro Lys Phe Gln Glu Tyr
610 615 620
Gln Tyr Leu Asp Leu Gly Pro Ile Val Leu Arg Thr Gly Met Asn Tyr
625 630 635 640
Gly Phe Lys Ile Glu Leu Ala Ser Gly Gln Leu Val Ser Ile Asp Lys
645 650 655
Ile Glu Phe Leu Pro Phe
660
<210> 73
<211> 197
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 73
Val Lys Arg Leu Lys Pro Gly Gln Tyr Gly Lys Tyr Gly Arg Glu Tyr
1 5 10 15
Glu Met Val Gln Ala Gly Ser Gln His Pro Pro Leu Phe Glu Gly Ala
20 25 30
Met Gln Ser Ala Ala Thr Ile Asp Asp Arg Glu Gln Val Gln Ser Ser
35 40 45
Val Ser Phe Cys Ser Met Val His Val Pro Gln Gly Phe Val Tyr Val
50 55 60
Pro Lys Gly Thr Arg Lys Phe Ala Tyr Ser Leu Ser Gly Ile Ser Ile
65 70 75 80
Val Lys Glu Ala Ser Gln Lys Thr Ile Thr Val Asp Asn Cys Gly Pro
85 90 95
Val Asp Val Thr Leu Asn Leu Leu Lys Val Val Gly Ser Ile Pro Tyr
100 105 110
Leu Val Asn Ala Gln Val Gln Gly Glu Cys Gly Glu Lys Tyr Gly Ala
115 120 125
Ala Gln Gly Arg Asp Asn Gln Ile Glu Leu Ser His Thr Gly His Ile
130 135 140
Pro Val Asn Thr Val Leu Lys Phe Ser Val Ala Ser Leu Pro Asp Tyr
145 150 155 160
Gln Ile Glu Glu Lys Asn Ile Val Ile Ser Ala Phe Asp Val Thr Pro
165 170 175
Val Gln Glu Gln Asp Asn Gln Phe Leu Gln Phe Ile Gly Thr Leu Thr
180 185 190
Phe Gln Asn Ile Pro
195
<210> 74
<211> 731
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 74
Met Glu Ala Gly Tyr Ser Gln Glu Gly Arg Arg Phe Ala Phe His Asp
1 5 10 15
Ile Ser Leu Ile Gln Trp Lys His Ile Cys Tyr Lys Asn Gln Lys Pro
20 25 30
Lys Ser Pro Arg Arg Asn Glu Ile Leu Ala Ile Tyr Cys Val Cys Phe
35 40 45
Met Asp Val Thr Leu Met Gln Lys Gly Gly Phe Phe Met Asn Gln Tyr
50 55 60
Val Thr Arg Ala Arg Glu Ala Val Asn Ala Leu Phe Leu Asn Asn Thr
65 70 75 80
Leu Gln Ala Lys Val Thr Asp Tyr Gln Ile Asp Gln Ala Ala Lys Leu
85 90 95
Val Glu Cys Ile Pro Asp Glu Ile Tyr Pro Lys Glu Lys Met Cys Leu
100 105 110
Leu Asp Gln Val Lys Val Ala Lys Arg Leu Ser Gln Ala Arg Asn Leu
115 120 125
Leu Asn Tyr Gly Asp Phe Glu Ser Ser Asp Trp Ser Gly Val Asp Gly
130 135 140
Trp Asn Ile Ser Ala His Val Tyr Ala Val Ala Asp Asn Pro Thr Phe
145 150 155 160
Lys Gly Gln Tyr Leu Asn Leu Pro Ser Ala Asn Asn Pro Gln Phe Ser
165 170 175
Asp Lys Ile Phe Pro Ser Tyr Ala Tyr Gln Lys Ile Asp Glu Ser Lys
180 185 190
Leu Lys Pro Tyr Thr Arg Tyr Met Val Arg Gly Phe Val Gly Thr Ser
195 200 205
Lys Asp Leu Glu Leu Leu Val Ala Arg Tyr Asp Lys Glu Val His Lys
210 215 220
Lys Met Asn Val Pro Asn Asp Ile Ile Pro Thr Asp Pro Cys Thr Gly
225 230 235 240
Gly Tyr Leu Leu Glu Gln Gly Ser Tyr Pro Val Leu Thr Asn Gln Met
245 250 255
Met Pro Gln Asn Met Ser Cys Asp Pro Cys Asp Thr Gly Tyr Arg Asn
260 265 270
Glu Val Gly Met Met Val Pro Asn Thr Asn Met Met Cys Gln Asp Pro
275 280 285
His Ala Phe Gln Phe His Ile Asp Ile Gly Glu Leu Asp Met Asp Arg
290 295 300
Asn Leu Gly Ile Trp Val Gly Phe Lys Ile Gly Thr Thr Asp Gly Met
305 310 315 320
Ala Thr Leu Asp Asn Val Glu Val Val Glu Val Gly Pro Leu Thr Gly
325 330 335
Glu Ala Leu Val Arg Met Gln Lys Arg Glu Pro Lys Trp Lys Gln Lys
340 345 350
Trp Ala Glu Lys Arg Met Lys Ile Glu Lys Ala Val Gln Ala Ala Arg
355 360 365
Asp Ala Ile Gln Ala Leu Phe Thr Ser Pro Asp Gln Asn Arg Leu Gln
370 375 380
Ser Ala Ile Thr Leu His Thr Ile Leu His Ala Glu Lys Leu Val Gln
385 390 395 400
Lys Ile Pro Tyr Ile Tyr Asn Gln Phe Leu Leu Gly Ala Leu Pro Ser
405 410 415
Val Pro Gly Glu Gln Tyr Asp Ile Phe Arg Lys Leu Ser Asp Ala Val
420 425 430
Ala Ile Ala Gln Ser Leu Tyr Gly Gln Arg Asn Val Leu Asn Asn Gly
435 440 445
Asp Phe Ser Ala Gly Leu Ser Gly Trp Asn Ala Thr Asp Gly Ala Asp
450 455 460
Val Gln Gln Ile Gly Asn Thr Ser Val Leu Val Ile Ser Asp Trp Ser
465 470 475 480
Ala Ser Leu Ser Gln His Val Cys Val Lys Pro Glu His Ser Tyr Leu
485 490 495
Leu Arg Val Thr Ala Lys Lys Glu Gly Ser Gly Glu Gly Tyr Val Thr
500 505 510
Ile Ser Asp Gly Thr Glu Glu Asn Thr Glu Thr Leu Arg Phe Thr Val
515 520 525
Gly Glu Glu Ala Thr Ser Ser Pro Leu Ser Asp Ile Arg Ser Thr Leu
530 535 540
Arg Glu Arg Tyr Asn Glu Gln Asn Val Pro Asn Ala Ser Glu Glu Ala
545 550 555 560
Tyr Gly Ile Asn Gly Tyr Ala Ser Asn Asn Met Thr Asn Tyr Pro Ser
565 570 575
Asp Asn Tyr Gly Val Asn Ala Tyr Pro Gly Val Asn Asn Asn Met Thr
580 585 590
Asn Tyr Gln Ser Glu Ser Phe Gly Ile Thr Pro Tyr Gly Asp Glu Asn
595 600 605
Ser Met Met Asn Tyr Pro Ser Asn Asn Tyr Glu Met Asn Ala Tyr Pro
610 615 620
Gly Ser Thr Asn Met Thr Asn Pro Asn Gly Gly Cys Ser Cys Gly Cys
625 630 635 640
Ser Thr Asn Ala Tyr Ala Gly Glu Asn Met Arg Arg Asn Asp Ala Ser
645 650 655
Asn Glu Asn Gly Phe Gly Tyr Gly Cys Gly Cys Arg Thr His Thr Asn
660 665 670
Asn Arg Thr Asp Ser Tyr Ala Pro Pro Met Ile Ala Gln Asn Ser Ser
675 680 685
Ser Leu Ser Gly Tyr Val Thr Lys Thr Val Glu Ile Phe Pro Glu Thr
690 695 700
Asn Arg Val Cys Val Glu Ile Gly Glu Thr Ser Gly Thr Phe Met Val
705 710 715 720
Glu Ser Ile Glu Leu Ile Arg Met Asp Cys Glu
725 730
<210> 75
<211> 1060
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 75
Met Pro Ile Ser Lys Glu Pro Ile Leu Tyr Pro Leu Val Ser Ser Thr
1 5 10 15
Ser Asn Ile Pro Thr Ser Glu Ser Phe Thr Phe Asp Trp Asn Trp Gly
20 25 30
Leu His Gln Ala Leu Ser Ile Gly Thr Gly Leu Met Lys Phe Ile Pro
35 40 45
Gly Leu Gly Val Val Ala Gly Pro Tyr Gly Ala Leu Leu Asn Tyr Val
50 55 60
Tyr Gly Gly Val Ile Asp Glu Leu Phe Pro Lys Lys Asp Ala Gly Gly
65 70 75 80
Gln Asp Thr Val Asp Pro Thr Val Leu Phe Gln Thr Phe Met Ala Ala
85 90 95
Ala Glu Gln Met Ile Asp Glu Lys Leu Asn Asp Thr Val Lys Ser Thr
100 105 110
Ala Val Thr Lys Leu Ala Gly Leu Lys Asn Asn Leu Thr Asp Leu Lys
115 120 125
Thr Lys Leu Asp Thr Leu Lys Asp Asn Ser Asp Pro Lys Asp Thr Asp
130 135 140
Glu Leu Lys Lys Gln Val Arg Asp Lys Phe Glu Ala Val His Gly Leu
145 150 155 160
Phe Arg Ser Asp Met Pro Gln Phe Arg Val Ala Ser Tyr Glu Ser Ile
165 170 175
Leu Leu Pro Val Tyr Ala Gln Ala Ala Asn Leu His Leu Ile Phe Leu
180 185 190
Arg Glu Ala Ile Ile Lys Gly Val Ser Glu Trp Gly Phe Pro Gln Ala
195 200 205
Thr Ile Asp Lys Phe Tyr Asp Asn Pro Thr Asp Lys Asn Gly Phe Lys
210 215 220
Gln Leu Met Glu Glu Tyr Thr Thr Asn Cys Met Ser Ala Tyr Gln Gln
225 230 235 240
Gly Leu Asp Lys Leu Ala Thr Gln Ser Ile Asp Leu Cys Thr Ser Asp
245 250 255
Gln Ser Tyr Trp Asn Pro Ala Tyr Tyr Asn Gln Tyr Leu Phe Leu Asp
260 265 270
Asp Leu Phe Ser Tyr Ala Thr Cys Gln His Ser Leu Gly Asp Met Ser
275 280 285
Tyr Gly Cys Gly Gly Gly Glu Leu Lys Pro Ile Cys Lys Asn Tyr Asn
290 295 300
Ser Arg Ile Tyr Ser Lys Tyr Met Gln Ser Ile Thr Asn Trp Asn Gln
305 310 315 320
Tyr Asn Asp Tyr Val Met Ser Met Gln Val Thr Ala Leu Asp Leu Val
325 330 335
Ala Met Trp Pro Tyr Trp Asp Pro Lys Val Tyr Pro Asn Gly Ala Lys
340 345 350
Lys Glu Leu Thr Arg Glu Ile Tyr Thr Asp Ile Leu Gly Ser Pro Asp
355 360 365
Gly Gln Thr Lys Gly Val Thr Thr Ile Ser Gln Glu Leu Leu Pro Gln
370 375 380
Pro Arg Leu Phe Thr Gln Leu Thr Gln Leu Asp Leu Asn Thr Glu Pro
385 390 395 400
His Thr Ile Gln Gln Val Thr Arg Asn Pro Val Ser Gly Lys Val Asn
405 410 415
Ala Thr Tyr Lys Tyr Leu Asp Gly Asp Ile Ile Val Gly Ile Thr Gln
420 425 430
His Leu Gln Thr Thr Leu Gly Glu Ser Lys Leu His Ala Gln Gly Ser
435 440 445
Gly Gly Ser Lys Thr Gln Ser Tyr Asp Asn Ser Tyr Thr Asn Asp Phe
450 455 460
Thr Met Pro Leu Lys Val Ala Ser Trp Phe Glu Pro Arg Thr Ile Gly
465 470 475 480
Phe Ala Lys Ser Ser Gly Glu Ile Tyr Pro Val Gly Thr Ile Gly Asp
485 490 495
Glu Thr Pro Phe Val Glu Ile Ser Lys Ser Ile Gly His Asn Ser Gly
500 505 510
Thr Ile Thr Ala Ile Ala His His Ala Val Lys Ile Ser Glu Ser Thr
515 520 525
Gln Ser Gly Tyr Tyr Glu Glu Tyr Gln Gly Asp Thr Ala Tyr Ser Ile
530 535 540
Asp Glu His Arg Leu Ser Trp Met Asn Tyr Thr Pro Thr Ser Ala Ser
545 550 555 560
Ser Phe Val Val Thr Gly Lys Gln Ile Gly Ser Leu Ser Leu Gly Trp
565 570 575
Thr Ser Ser Thr Val Asp Pro Thr Asn Lys Val Gln Lys Asp Thr Ile
580 585 590
Thr Ser Ile Pro Ala Val Lys Ser His Ser Ile Thr Asp Trp Gly Thr
595 600 605
Gly Thr Val Val Lys Gly Pro Gly His Thr Gly Gly Asp Leu Val Lys
610 615 620
Leu Pro Ser Asn Thr Arg Ala Lys Met Ile Val Thr Ile Glu Asp Thr
625 630 635 640
Ser Thr Ser Tyr Asp Val Arg Ile Arg Tyr Ala Ala Pro Ser Gly Gly
645 650 655
His Ile Gln Phe Ser Tyr Trp Asn Gly Gly Ser Asp Val Lys Val Ala
660 665 670
Asp Thr Gln Leu Gln Asn Thr Gly Gly Asn Thr Asn Phe Glu His Phe
675 680 685
Gln Tyr Ala Met Leu Thr Thr Asp Asn Ala Lys Phe Lys Ala Pro Phe
690 695 700
Ser Pro Val Glu Ile Leu Ile Glu Asn Ile Gly Asp Ser Asp Val Tyr
705 710 715 720
Leu Asp Lys Val Glu Phe Leu Val Leu Ser Gln Pro Leu Ser Glu Gln
725 730 735
Leu Pro Pro Gly Ala Thr Pro Leu Thr Ser Gln Tyr Met Asn Pro Tyr
740 745 750
Gly Tyr Asn Ile Leu Trp Gln Ser Lys Asn Gly Glu Ala Ala Asn Gln
755 760 765
Gly Met Val Thr Phe Ser Gln Asn Asp Ser Ala Ile Lys Thr Phe Tyr
770 775 780
Leu Tyr Asn Thr Glu Val Val His Gln Thr Thr Gly Ser Pro Ser Ser
785 790 795 800
Trp Asn Gly Lys Phe Asp Thr Leu Tyr Val Gln Ser Pro Thr Ser Ile
805 810 815
Asn Leu Thr Gln Gly Phe Ile Val Ile Asp Thr Thr Lys Asn Asn Pro
820 825 830
Glu Pro Pro Ile Ala Leu Pro Asn Gln Asp Ile Leu Arg Pro Phe Thr
835 840 845
Asn Glu His Glu Ile Trp Asn Gly Arg Tyr Ser Thr Lys Ala Leu Asn
850 855 860
Leu Ser Leu Ser Thr Ser Ser Gly Ala Glu Gly Lys Ile Lys Phe Tyr
865 870 875 880
Asn Asn Thr Asn Leu Val His Glu Ser Pro Ala Leu Ser Gly Ser Pro
885 890 895
Ser Glu Pro Tyr Thr Trp Glu Gly Ser Phe Asn Arg Ile Thr Val Tyr
900 905 910
Gln Ser Asn Ser Ser Asp Ser Asn Tyr Phe Asn Leu Phe Gly Gly Phe
915 920 925
Leu Lys Leu Asp Pro Asn Ser Asn Asp Asn Gly Asn Asn Gly Gly Pro
930 935 940
His Asn Asp Cys Glu Asn Ile Ala His Pro Asp Gln Pro Leu Tyr Tyr
945 950 955 960
Glu Ser Asn Asn Leu Ile Trp Thr Ala Ala Pro Asn Lys Leu Ala Tyr
965 970 975
Ser Thr Thr Glu Met Gln Phe Val Leu Ser Gly Gly Leu Gly Phe Thr
980 985 990
Pro Gly Ala Ile Gly Ala Lys Ile Lys Leu Asn Phe Trp Lys Asn Asp
995 1000 1005
Ser Ile Gln Tyr Val Ala Tyr Gly Ala Val Ala Asn Leu Asp Phe
1010 1015 1020
Ile Ser Ala Pro Ala Gln Ser Ile Pro Gly Gly Phe Asp Lys Ile
1025 1030 1035
Thr Met Asp Asn Asp Glu Tyr Ser Phe Pro Thr Ser Leu Arg Met
1040 1045 1050
Thr Leu Gly Ser Val Cys Tyr
1055 1060
<210> 76
<211> 729
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 76
Met Pro Ile Ser Lys Glu Pro Ile Leu Tyr Pro Leu Val Ser Ser Thr
1 5 10 15
Ser Asn Ile Pro Thr Ser Glu Ser Phe Thr Phe Asp Trp Asn Trp Gly
20 25 30
Leu His Gln Ala Leu Ser Ile Gly Thr Gly Leu Met Lys Phe Ile Pro
35 40 45
Gly Leu Gly Val Val Ala Gly Pro Tyr Gly Ala Leu Leu Asn Tyr Val
50 55 60
Tyr Gly Gly Val Ile Asp Glu Leu Phe Pro Lys Lys Asp Ala Gly Gly
65 70 75 80
Gln Asp Thr Val Asp Pro Thr Val Leu Phe Gln Thr Phe Met Ala Ala
85 90 95
Ala Glu Gln Met Ile Asp Glu Lys Leu Asn Asp Thr Val Lys Ser Thr
100 105 110
Ala Val Thr Lys Leu Ala Gly Leu Lys Asn Asn Leu Thr Asp Leu Lys
115 120 125
Thr Lys Leu Asp Thr Leu Lys Asp Asn Ser Asp Pro Lys Asp Thr Asp
130 135 140
Glu Leu Lys Lys Gln Val Arg Asp Lys Phe Glu Ala Val His Gly Leu
145 150 155 160
Phe Arg Ser Asp Met Pro Gln Phe Arg Val Ala Ser Tyr Glu Ser Ile
165 170 175
Leu Leu Pro Val Tyr Ala Gln Ala Ala Asn Leu His Leu Ile Phe Leu
180 185 190
Arg Glu Ala Ile Ile Lys Gly Val Ser Glu Trp Gly Phe Pro Gln Ala
195 200 205
Thr Ile Asp Lys Phe Tyr Asp Asn Pro Thr Asp Lys Asn Gly Phe Lys
210 215 220
Gln Leu Met Glu Glu Tyr Thr Thr Asn Cys Met Ser Ala Tyr Gln Gln
225 230 235 240
Gly Leu Asp Lys Leu Ala Thr Gln Ser Ile Asp Leu Cys Thr Ser Asp
245 250 255
Gln Ser Tyr Trp Asn Pro Ala Tyr Tyr Asn Gln Tyr Leu Phe Leu Asp
260 265 270
Asp Leu Phe Ser Tyr Ala Thr Cys Gln His Ser Leu Gly Asp Met Ser
275 280 285
Tyr Gly Cys Gly Gly Gly Glu Leu Lys Pro Ile Cys Lys Asn Tyr Asn
290 295 300
Ser Arg Ile Tyr Ser Lys Tyr Met Gln Ser Ile Thr Asn Trp Asn Gln
305 310 315 320
Tyr Asn Asp Tyr Val Met Ser Met Gln Val Thr Ala Leu Asp Leu Val
325 330 335
Ala Met Trp Pro Tyr Trp Asp Pro Lys Val Tyr Pro Asn Gly Ala Lys
340 345 350
Lys Glu Leu Thr Arg Glu Ile Tyr Thr Asp Ile Leu Gly Ser Pro Asp
355 360 365
Gly Gln Thr Lys Gly Val Thr Thr Ile Ser Gln Glu Leu Leu Pro Gln
370 375 380
Pro Arg Leu Phe Thr Gln Leu Thr Gln Leu Asp Leu Asn Thr Glu Pro
385 390 395 400
His Thr Ile Gln Gln Val Thr Arg Asn Pro Val Ser Gly Lys Val Asn
405 410 415
Ala Thr Tyr Lys Tyr Leu Asp Gly Asp Ile Ile Val Gly Ile Thr Gln
420 425 430
His Leu Gln Thr Thr Leu Gly Glu Ser Lys Leu His Ala Gln Gly Ser
435 440 445
Gly Gly Ser Lys Thr Gln Ser Tyr Asp Asn Ser Tyr Thr Asn Asp Phe
450 455 460
Thr Met Pro Leu Lys Val Ala Ser Trp Phe Glu Pro Arg Thr Ile Gly
465 470 475 480
Phe Ala Lys Ser Ser Gly Glu Ile Tyr Pro Val Gly Thr Ile Gly Asp
485 490 495
Glu Thr Pro Phe Val Glu Ile Ser Lys Ser Ile Gly His Asn Ser Gly
500 505 510
Thr Ile Thr Ala Ile Ala His His Ala Val Lys Ile Ser Glu Ser Thr
515 520 525
Gln Ser Gly Tyr Tyr Glu Glu Tyr Gln Gly Asp Thr Ala Tyr Ser Ile
530 535 540
Asp Glu His Arg Leu Ser Trp Met Asn Tyr Thr Pro Thr Ser Ala Ser
545 550 555 560
Ser Phe Val Val Thr Gly Lys Gln Ile Gly Ser Leu Ser Leu Gly Trp
565 570 575
Thr Ser Ser Thr Val Asp Pro Thr Asn Lys Val Gln Lys Asp Thr Ile
580 585 590
Thr Ser Ile Pro Ala Val Lys Ser His Ser Ile Thr Asp Trp Gly Thr
595 600 605
Gly Thr Val Val Lys Gly Pro Gly His Thr Gly Gly Asp Leu Val Lys
610 615 620
Leu Pro Ser Asn Thr Arg Ala Lys Met Ile Val Thr Ile Glu Asp Thr
625 630 635 640
Ser Thr Ser Tyr Asp Val Arg Ile Arg Tyr Ala Ala Pro Ser Gly Gly
645 650 655
His Ile Gln Phe Ser Tyr Trp Asn Gly Gly Ser Asp Val Lys Val Ala
660 665 670
Asp Thr Gln Leu Gln Asn Thr Gly Gly Asn Thr Asn Phe Glu His Phe
675 680 685
Gln Tyr Ala Met Leu Thr Thr Asp Asn Ala Lys Phe Lys Ala Pro Phe
690 695 700
Ser Pro Val Glu Ile Leu Ile Glu Asn Ile Gly Asp Ser Asp Val Tyr
705 710 715 720
Leu Asp Lys Val Glu Phe Leu Val Leu
725
<210> 77
<211> 522
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 77
Met Leu Leu Thr Gln Lys Gly Gly His Val Met Ala Ile Ser Arg Ile
1 5 10 15
Ile Asn Pro Ile Phe Thr Lys Gln Glu His Leu Asp Gln Ile Thr Asn
20 25 30
Leu Val Asn Asn Leu Phe Ser Ser Gly Ser Ser Ser Leu Ser Gln Ser
35 40 45
Val Ser Asp Tyr Trp Ile Asp Gln Val Leu Leu Lys Val Asn Ala Leu
50 55 60
Ser Asp Thr Val Phe Pro Thr Gln Lys Glu Gln Leu Arg Gln Arg Leu
65 70 75 80
Ala Gln Ala Lys Gln Ile Ser Lys Ala Arg Asn Leu Leu Val Gly Gly
85 90 95
Asn Phe Glu Thr Leu Asn Lys Trp Lys Leu Ser Arg Asn Ala Val Leu
100 105 110
Val Ala Gly His Asp Leu Phe Gln Gly Tyr His Leu Glu Leu Pro Pro
115 120 125
Ala Ile Asp Ser Val Lys Tyr Pro Ser Tyr Ala Tyr Gln Lys Ile Asp
130 135 140
Glu Ser Lys Leu Lys Ala Asn Thr Arg Tyr Tyr Val Ser Ala Phe Ile
145 150 155 160
Ala Gln Gly Ser Gln Leu Glu Ile Met Val Ser Arg Tyr Gly Gln Glu
165 170 175
Tyr Ser Gln Ile Leu Tyr Val Pro Ala Glu Met Ala Lys Pro Ile Ser
180 185 190
Pro Asp Gly Gly Pro Asn Cys Cys Phe Pro His Pro Cys Asn Cys Thr
195 200 205
Ala Cys Asn Glu Glu Glu Val Asp Ser His Phe Phe Gln Val Pro Ile
210 215 220
Asp Val Gly Thr Leu Gln Ser Ser Gln Asn Leu Gly Ile Glu Ile Gly
225 230 235 240
Phe Lys Val Ala Ser Thr Asp Gly Phe Ala Lys Leu Ser Asn Ile Glu
245 250 255
Val Phe Glu Gly Arg Pro Leu Thr Ala Ala Glu Gln Arg Lys Val Ser
260 265 270
Arg Leu Glu Asn Glu Trp Lys Glu Glu Gln Gln Thr Lys Glu Thr Glu
275 280 285
Arg Thr Gln Leu Leu Gln Gln Ile Gln Gln Arg Phe Asn Met Leu Tyr
290 295 300
Thr Thr Pro Glu His His Thr Leu Arg Ala Glu Thr Gly Tyr Gln His
305 310 315 320
Leu Leu Glu Thr Met Leu Pro Ser Leu His His Val Tyr His Trp Phe
325 330 335
Met Pro Asp Val Pro Asp Ser Asp Tyr Ala Leu Tyr Tyr Glu Leu Gln
340 345 350
Gln Lys Leu Glu Arg Gly Trp Asp Gln Tyr Phe Ser Arg Asn Leu Leu
355 360 365
Glu Asn Gly Asp Phe Leu Glu Pro Leu Asp Asp Ser Trp Tyr Thr Gln
370 375 380
Gly Asn Val Ser Leu His Ile Ile Asn Asn Asn Thr Met Leu Arg Leu
385 390 395 400
His His Trp Asp Ser Leu Ile Arg Lys Asn Ala Ser Leu Pro Val Val
405 410 415
Asn Glu Asp Ala Glu Tyr Val Ile Arg Val Ile Gly Lys Gly Thr Gly
420 425 430
Ser Val Leu Ile Gln Asn Gly Thr Thr Thr Asn Lys Leu Val Phe Thr
435 440 445
Asn Ser Arg Gln Met Glu Thr Lys Glu Phe His Leu Gln Pro Glu Lys
450 455 460
Glu Gln Leu Ser Leu Thr Ile Arg Ser Asp Ala Asn Glu Phe Leu Val
465 470 475 480
Asp Ala Ile Glu Val Ile Leu Met Asn Asp Gly Ala Glu Glu Glu Glu
485 490 495
Gln Leu Pro Gly Met Phe Pro Pro Ile Asn Ser Asn Met Gly Ser Thr
500 505 510
Pro Asn Ser Asn Met Met Asn Asn Asn Gln
515 520
<210> 78
<211> 804
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 78
Met Asn Gln Asn Tyr Asn Asn Asn Glu Tyr Glu Ile Ile Asp Asn Gly
1 5 10 15
Asn Met Gly Tyr Gln Pro Arg Tyr Pro Leu Ala Arg Ala Pro Gly Ser
20 25 30
Glu Cys Pro Lys Pro Asn Ala Lys Asp Gly Leu Tyr Thr Pro Leu Gly
35 40 45
Arg Val Asp Thr Val Leu Gln Asn Leu Asp Ile Gly Leu Ser Ile Arg
50 55 60
Thr Ala Leu Gly Ile Leu Gln Met Ile Leu Ala Ala Arg Ser Pro Ala
65 70 75 80
Leu Gly Arg Ala Ala Gly Leu Ile Asn Leu Ile Phe Gly Phe Leu Trp
85 90 95
Gly Thr Leu Gly Gly Gln Lys Thr Trp Glu Ser Phe Met Lys Ala Val
100 105 110
Glu Ala Leu Val Asp Gln Lys Ile Thr Asp Ala Val Arg Ala Lys Ala
115 120 125
Leu Ser Glu Leu Glu Gly Val Gln Asn Ala Val Ile Leu Tyr Gln Glu
130 135 140
Ala Ala Asp Asp Trp Asn Glu Asn Pro Asn Asp Glu Ser Asn Lys Glu
145 150 155 160
Arg Val Arg Arg Gln Phe Thr Ser Thr Asn Thr Val Met Glu Phe Ala
165 170 175
Ile Pro Ser Phe Arg Ala Leu Gly Phe Glu Val Pro Leu Leu Thr Val
180 185 190
Tyr Ala Gln Ala Thr Asn Leu His Leu Gln Leu Leu Arg Asp Ala Val
195 200 205
Lys Phe Gly Thr Glu Trp Gly Met Pro Ser Ala Glu Val Glu Asp Leu
210 215 220
Tyr Thr Arg Leu Thr Arg Arg Thr Ala Glu Tyr Thr Asp His Cys Val
225 230 235 240
Thr Thr Tyr Asp Lys Gly Leu Lys Gln Ala Tyr Asp Leu Ala Pro Asn
245 250 255
Pro Thr Asp Tyr Asn Lys Tyr Pro Tyr Leu Asn Pro Tyr Ser Lys Asp
260 265 270
Pro Ile Tyr Gly Lys Tyr Tyr Thr Ala Pro Val Asp Trp Asn Leu Phe
275 280 285
Asn Asp Tyr Arg Arg Asn Met Thr Leu Met Val Leu Asp Ile Val Ala
290 295 300
Val Trp Pro Thr Tyr Asn Pro Arg Ile Tyr Thr Asn Pro Asn Gly Thr
305 310 315 320
Gln Ile Glu Leu Ser Arg Glu Val Tyr Ser Thr Val Tyr Gly Arg Gly
325 330 335
Gly Ser Asn Asn Ser Ser Val Asp Ala Ile Glu Ser Gln Ile Val Arg
340 345 350
Pro Pro His Leu Val Thr Glu Leu Thr Thr Leu Lys Ile Glu Gln Gly
355 360 365
Ser Thr Leu Asp Met Glu Gln Ile Gln Tyr Pro Lys Tyr Met Lys Val
370 375 380
Thr Asn Thr Leu His Tyr Ile Gly Ser Ser Ser Thr Trp Glu Gln Ser
385 390 395 400
Ser Ser Ala Pro Pro Ile Arg Pro Ile Thr Lys Ile His Thr Ile Pro
405 410 415
Ala Asn Asn Ile Gly Asn Leu Ser Leu Ser Gln Leu Asp Val Pro Tyr
420 425 430
Arg Phe Ser Phe Tyr Asn Lys Asp Asp Thr Leu Ile Ala Glu Val Gly
435 440 445
Ala Glu Phe Pro Pro Asn Thr Val Thr Trp Asn Gly Ile Glu Lys Ala
450 455 460
Glu Asp Ser Asn Gln Asn Ser His Arg Leu Ser Tyr Val Gly Ala Leu
465 470 475 480
Gly Thr Gln Ser Ser Ala Gly Phe Pro Trp Thr Tyr Pro Thr Glu Leu
485 490 495
Leu Gly Glu Trp Gly Phe Gly Trp Leu His Asn Ser Leu Thr Pro Glu
500 505 510
Asn Arg Val Gly Val Thr Thr Ile Thr Gln Ile Pro Ala Val Lys Ala
515 520 525
Tyr Trp Met Glu Gln Gly Ala Gln Val Val Lys Gly Pro Gly Ser Thr
530 535 540
Gly Gly Asp Val Ile Gln Leu Pro Val Asn Gly Ile Ile Arg Ile Asn
545 550 555 560
Leu Thr Ser Ile Asn Pro Gln Arg Asn Tyr Ser Val Arg Val Arg Tyr
565 570 575
Ala Ala Thr Ala Asn Gly Arg Leu Asn Phe Ser Arg Ile Asp Ser Phe
580 585 590
Gly Thr Ile Gln Ser Thr Ser Thr Glu Tyr Val Thr Thr Gly Ala Gly
595 600 605
Ala Asn Ala Ser Ser Phe Thr Tyr Asn Gln Phe Gln Tyr Val Asp Ile
610 615 620
Asp Pro Ser Asp Ser Thr Gln Val Tyr Thr Gly Ile Ile Leu Val Asn
625 630 635 640
Glu Met Gly Gly Pro Ile Tyr Ile Asp Lys Ile Glu Phe Ile Pro Leu
645 650 655
Ser Gln Ile Ile Glu Pro Pro Gln Leu Ile Pro Asp Gly Ser Tyr Gln
660 665 670
Ile Ile Thr Ala Leu Asn Asn Arg Ser Ala Val Gln Ala Ile Phe Lys
675 680 685
Asn Val Tyr Pro Arg Leu Asn Asn Leu Glu Leu Trp Glu Gly Ser Ser
690 695 700
Phe Lys Trp Asn Phe Val Tyr Asn Ala Gly Arg Asp Ala Tyr Gln Ile
705 710 715 720
Phe Thr Ala Asp Gly Gly Ala Leu Thr Arg Phe Asn Glu Ile Ser Thr
725 730 735
Asp Arg Asp Pro Val Phe Val Arg Gln Asn Leu Ser Thr Asp Asn Gln
740 745 750
Phe Trp Lys Ile Tyr Tyr Gln Asn Asp Gly Tyr Phe Val Ile Arg Asn
755 760 765
Leu Asp Ser Ala Leu Asp Val Trp Gly Tyr Ser Thr Ala Asn Gly Thr
770 775 780
Thr Val Asp Ala Tyr Arg Tyr His Gly Gly Ile Asn Gln Lys Phe Arg
785 790 795 800
Leu Leu Lys Val
<210> 79
<211> 656
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 79
Met Asn Gln Asn Tyr Asn Asn Asn Glu Tyr Glu Ile Ile Asp Asn Gly
1 5 10 15
Asn Met Gly Tyr Gln Pro Arg Tyr Pro Leu Ala Arg Ala Pro Gly Ser
20 25 30
Glu Cys Pro Lys Pro Asn Ala Lys Asp Gly Leu Tyr Thr Pro Leu Gly
35 40 45
Arg Val Asp Thr Val Leu Gln Asn Leu Asp Ile Gly Leu Ser Ile Arg
50 55 60
Thr Ala Leu Gly Ile Leu Gln Met Ile Leu Ala Ala Arg Ser Pro Ala
65 70 75 80
Leu Gly Arg Ala Ala Gly Leu Ile Asn Leu Ile Phe Gly Phe Leu Trp
85 90 95
Gly Thr Leu Gly Gly Gln Lys Thr Trp Glu Ser Phe Met Lys Ala Val
100 105 110
Glu Ala Leu Val Asp Gln Lys Ile Thr Asp Ala Val Arg Ala Lys Ala
115 120 125
Leu Ser Glu Leu Glu Gly Val Gln Asn Ala Val Ile Leu Tyr Gln Glu
130 135 140
Ala Ala Asp Asp Trp Asn Glu Asn Pro Asn Asp Glu Ser Asn Lys Glu
145 150 155 160
Arg Val Arg Arg Gln Phe Thr Ser Thr Asn Thr Val Met Glu Phe Ala
165 170 175
Ile Pro Ser Phe Arg Ala Leu Gly Phe Glu Val Pro Leu Leu Thr Val
180 185 190
Tyr Ala Gln Ala Thr Asn Leu His Leu Gln Leu Leu Arg Asp Ala Val
195 200 205
Lys Phe Gly Thr Glu Trp Gly Met Pro Ser Ala Glu Val Glu Asp Leu
210 215 220
Tyr Thr Arg Leu Thr Arg Arg Thr Ala Glu Tyr Thr Asp His Cys Val
225 230 235 240
Thr Thr Tyr Asp Lys Gly Leu Lys Gln Ala Tyr Asp Leu Ala Pro Asn
245 250 255
Pro Thr Asp Tyr Asn Lys Tyr Pro Tyr Leu Asn Pro Tyr Ser Lys Asp
260 265 270
Pro Ile Tyr Gly Lys Tyr Tyr Thr Ala Pro Val Asp Trp Asn Leu Phe
275 280 285
Asn Asp Tyr Arg Arg Asn Met Thr Leu Met Val Leu Asp Ile Val Ala
290 295 300
Val Trp Pro Thr Tyr Asn Pro Arg Ile Tyr Thr Asn Pro Asn Gly Thr
305 310 315 320
Gln Ile Glu Leu Ser Arg Glu Val Tyr Ser Thr Val Tyr Gly Arg Gly
325 330 335
Gly Ser Asn Asn Ser Ser Val Asp Ala Ile Glu Ser Gln Ile Val Arg
340 345 350
Pro Pro His Leu Val Thr Glu Leu Thr Thr Leu Lys Ile Glu Gln Gly
355 360 365
Ser Thr Leu Asp Met Glu Gln Ile Gln Tyr Pro Lys Tyr Met Lys Val
370 375 380
Thr Asn Thr Leu His Tyr Ile Gly Ser Ser Ser Thr Trp Glu Gln Ser
385 390 395 400
Ser Ser Ala Pro Pro Ile Arg Pro Ile Thr Lys Ile His Thr Ile Pro
405 410 415
Ala Asn Asn Ile Gly Asn Leu Ser Leu Ser Gln Leu Asp Val Pro Tyr
420 425 430
Arg Phe Ser Phe Tyr Asn Lys Asp Asp Thr Leu Ile Ala Glu Val Gly
435 440 445
Ala Glu Phe Pro Pro Asn Thr Val Thr Trp Asn Gly Ile Glu Lys Ala
450 455 460
Glu Asp Ser Asn Gln Asn Ser His Arg Leu Ser Tyr Val Gly Ala Leu
465 470 475 480
Gly Thr Gln Ser Ser Ala Gly Phe Pro Trp Thr Tyr Pro Thr Glu Leu
485 490 495
Leu Gly Glu Trp Gly Phe Gly Trp Leu His Asn Ser Leu Thr Pro Glu
500 505 510
Asn Arg Val Gly Val Thr Thr Ile Thr Gln Ile Pro Ala Val Lys Ala
515 520 525
Tyr Trp Met Glu Gln Gly Ala Gln Val Val Lys Gly Pro Gly Ser Thr
530 535 540
Gly Gly Asp Val Ile Gln Leu Pro Val Asn Gly Ile Ile Arg Ile Asn
545 550 555 560
Leu Thr Ser Ile Asn Pro Gln Arg Asn Tyr Ser Val Arg Val Arg Tyr
565 570 575
Ala Ala Thr Ala Asn Gly Arg Leu Asn Phe Ser Arg Ile Asp Ser Phe
580 585 590
Gly Thr Ile Gln Ser Thr Ser Thr Glu Tyr Val Thr Thr Gly Ala Gly
595 600 605
Ala Asn Ala Ser Ser Phe Thr Tyr Asn Gln Phe Gln Tyr Val Asp Ile
610 615 620
Asp Pro Ser Asp Ser Thr Gln Val Tyr Thr Gly Ile Ile Leu Val Asn
625 630 635 640
Glu Met Gly Gly Pro Ile Tyr Ile Asp Lys Ile Glu Phe Ile Pro Leu
645 650 655
<210> 80
<211> 614
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 80
Met Asn Lys Asp Ile Thr Arg Ala Arg Glu Thr Val Gln Ser Leu Phe
1 5 10 15
Ala Asn Asp Thr Arg Asn Ala Leu Lys Leu Ser Ile Thr Asp Tyr Ala
20 25 30
Val Asp Gln Ala Ala Asn Leu Val Glu Cys Val Ser Glu Asp Phe His
35 40 45
Ala Gln Glu Lys Met Ile Leu Leu Asp Gln Val Lys Phe Ala Lys Arg
50 55 60
Leu Ser Gln Ala Arg Asn Leu Leu Asn Tyr Gly Asp Phe Glu Ser Ser
65 70 75 80
Asp Trp Ser Gly Glu Asn Gly Trp Arg Thr Ser Pro His Val His Val
85 90 95
Lys Ser Asn Asn Pro Ile Phe Lys Gly His Tyr Leu His Ile Pro Gly
100 105 110
Ala Met Ser Pro Gln Phe Ser Asn Asn Thr Tyr Ser Thr Tyr Val Tyr
115 120 125
Gln Lys Val Asp Glu Ser Lys Leu Lys Ser Tyr Thr Arg Tyr Leu Val
130 135 140
Arg Gly Phe Val Gly Asn Ser Lys Asp Leu Gln Leu Leu Val Glu Arg
145 150 155 160
Tyr Gly Lys Glu Val His Val Glu Met Asp Val Pro Asp Asp Met Gln
165 170 175
Tyr Thr Leu Pro Met Asn Glu Cys Gly Gly Phe Asp Arg Cys Lys Pro
180 185 190
Thr Ser Tyr Gln Thr Arg Asn Pro His Thr Cys Thr Cys Lys Asp Thr
195 200 205
Ala Tyr Thr Asp Thr Asp Cys Gln Cys Lys Asp Lys Met Asn Arg Leu
210 215 220
Thr Thr Gly Val Tyr Thr Asn Met Pro Val Asp Ser Ser Met Tyr Ala
225 230 235 240
Asp Gly Tyr His Thr His Thr Ser Cys Gly Cys Gly Asn Lys Asn Met
245 250 255
Tyr Gln Asn Gly Lys His Thr His Thr Ser Cys Gly Cys Lys Asn Pro
260 265 270
His Val Phe Ser Phe His Ile Asp Thr Gly Tyr Val Asp Leu Glu Glu
275 280 285
Asn Val Gly Leu Leu Phe Ala Leu Lys Ile Val Ser Thr Asn Gly Val
290 295 300
Ala Asn Ile Asp Asn Leu Glu Ile Ile Glu Gly Lys Pro Leu Thr Gly
305 310 315 320
Glu Ala Leu Ala Arg Val Lys Lys Arg Glu His Lys Trp Lys Glu Lys
325 330 335
Met Lys Gln Lys Arg Cys Lys Thr Glu Glu Ala Val Gln Ala Ala Gln
340 345 350
Thr Ala Ile Gln Ala Leu Phe Thr Asn Ala Glu Tyr Asn Arg Leu Lys
355 360 365
Phe Glu Thr Leu Phe Pro His Ile Leu Tyr Ala Asp Glu Leu Val Gln
370 375 380
Gly Ile Pro Tyr Leu Tyr His Pro Phe Phe Lys Gly Ala Leu Pro Pro
385 390 395 400
Val Pro Gly Met Asn Asp Lys Leu Phe Gln Gln Leu Ser Asp Leu Val
405 410 415
Gly Arg Ala Arg Gly Phe Tyr Glu Met Arg Asn Leu Val Gln Asn Gly
420 425 430
Thr Phe Ser Ser Gly Thr Gly Asn Trp His Val Thr Asp Gly Val Thr
435 440 445
Thr Gln Pro Glu Gly Asn Thr Ser Val Leu Val Leu Arg Glu Trp Ser
450 455 460
Asp Gln Val Ile Gln His Leu Arg Ile Asp Ala Asp Arg Gly Tyr Val
465 470 475 480
Leu Arg Val Thr Ala Arg Lys Glu Gly Gly Gly Lys Gly Thr Val Thr
485 490 495
Met Ser Asp Cys Ala Ala Tyr Thr Glu Thr Leu Thr Phe Thr Ser Cys
500 505 510
Asp Tyr Asn Thr Leu Gly Ser Gln Thr Met Thr Ser Gly Thr Leu Ser
515 520 525
Gly Phe Val Thr Lys Thr Leu Glu Ile Phe Pro Asp Thr Asp Arg Ile
530 535 540
Arg Ile Asp Ile Gly Ala Thr Glu Gly Val Phe Lys Val Glu Ser Val
545 550 555 560
Glu Leu Ile Cys Met Glu Gln Met Glu Asp Arg Met Tyr Asp Met Ala
565 570 575
Gly Asn Val Glu Glu Glu Met Gln Tyr Leu Gln Glu Ser Gly Ser Val
580 585 590
Arg Asn Thr Ser Gly Pro Asp Cys Asn Thr Ile Pro Asp Phe Leu Gly
595 600 605
Gly Pro Cys Pro Ser Pro
610
<210> 81
<211> 382
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 81
Met Val Asn Ile Asp Lys Asp Lys Phe Tyr Lys Ile Glu Asn Ile Lys
1 5 10 15
Tyr Asn Lys Val Leu Ala Ile Ser Ala Gly Asn Thr Asp Asn Ala Phe
20 25 30
Ile Tyr Thr Asp Leu Gly Ser Asp Asp Gln Arg Trp Ala Phe Phe Lys
35 40 45
Leu Asp Asp Glu Ser Tyr Arg Ile Val Asn Lys Ser Asn Gly Lys Thr
50 55 60
Leu Ala Met Thr Asn Asp Asp Val Phe Val Tyr Pro His Leu Val Thr
65 70 75 80
Ala Asp Gln His Trp Thr Lys Tyr Asp Ile Ala Glu Asn Met Lys Phe
85 90 95
Arg Asn Leu Asn Asn Tyr Lys Val Leu Ala Ala Gly Pro Ser Ser Asp
100 105 110
Val Phe Val Trp Asp Asp Leu Pro Ser Asp Val Leu Asp Gln Gln Trp
115 120 125
Lys Ile Ser Pro Val Glu Asn Leu Val Met Pro Asn Leu Pro Thr Asn
130 135 140
Phe Lys Lys Leu Gly Pro Ser Pro Thr Tyr Ser Ser Pro Asn Asp Gln
145 150 155 160
Phe Gln Glu Asp Met Pro Arg Val Leu Val Gly Ala Val Leu Val Pro
165 170 175
Tyr Phe Ala Val Ala Asp Asn Asn Ile Asp Pro Gly Thr Gln Val Lys
180 185 190
Arg Asn Pro Tyr Tyr Val Met Glu His Tyr Gln Asn Trp His Leu Leu
195 200 205
Ser Glu Asn Ser Leu Pro Ala Gly Gly Glu Glu Thr Leu Thr Leu His
210 215 220
Thr Gly Ile Arg Thr Ile Asp Gln Gln Thr Ile Thr Gln Lys Ser Gly
225 230 235 240
Ile Ser Ile Ala Ser Asp Ala Gly Phe Thr Phe Pro Ile Val Pro Ala
245 250 255
Gly Arg Ala Asp Gly Pro Phe Pro Thr Val Thr Gly Asp Leu Lys Thr
260 265 270
Thr Ile Thr Glu Glu Leu Asp Val Ser Ile Ser His Thr Thr Glu Leu
275 280 285
Met Lys Glu Val Thr Gln Thr Gln Thr Val Arg Asn Pro Phe Asn Gln
290 295 300
Gln Leu Gln Tyr Ala Lys Phe Ala Val Lys Asp Thr Tyr Ile Leu Arg
305 310 315 320
Arg Ala Asp Asn Ser Val Val Tyr Glu Val Ser Ala Val Asp Thr Thr
325 330 335
Ser Ile Lys Gly Ala Ser Tyr Pro Glu Asp Lys Gln Pro Ile Thr Asp
340 345 350
Ala Ala Lys Ile Val Ala Ser Arg Tyr Asp Val Thr Asn Pro Asn Gln
355 360 365
Thr Asn Cys Gln Cys Gln Val Thr Cys Asn Cys Pro Ser Lys
370 375 380
<210> 82
<211> 1209
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 82
Met Tyr Asn Ile Glu Ile Gln Ile Arg Thr Gly Gly Thr Phe Met Asn
1 5 10 15
His Asn Asn Asn Asn Asn Glu Tyr Glu Ile Ile Asp Ser His Thr Ser
20 25 30
Ser Tyr Pro Ser Asn Arg Asn Ser Asn His Ser Arg Tyr Pro Tyr Thr
35 40 45
Asn Asn Pro Asn Gln Pro Leu Ser Asn Thr Asn Tyr Lys Asp Trp Leu
50 55 60
Thr Met Cys Gln Gly Asn Gln Pro Tyr Arg Asp Lys Pro Gln Thr Leu
65 70 75 80
Val Asp Thr Gly Val Ala Ile Ser Thr Gly Leu Ile Ile Ile Gly Thr
85 90 95
Ile Met Thr Gly Phe Pro Ala Leu Thr Pro Trp Gly Ile Gly Ile Val
100 105 110
Ser Phe Gly Thr Leu Leu Pro Val Leu Trp Pro Ser Gly Glu Thr Asp
115 120 125
Lys Lys Val Trp Ile Asp Phe Met Asn Tyr Gly Gly Thr Ile Phe Lys
130 135 140
Pro Leu Leu Asn Glu Gln Lys Asp Gln Ala Gln Arg Ile Leu Gln Gly
145 150 155 160
Phe His Leu Pro Ile Asn Ser Tyr Gln Asn Gly Ile Thr Thr Trp Gln
165 170 175
Asn Leu Arg Lys Ser His Ile Pro Gly Thr Pro Pro Ser Ser Ala Leu
180 185 190
Arg Glu Ala Ala Arg Val Ala Arg Thr Arg Ile Asp Ile Ala Asn Ser
195 200 205
Ala Leu Ser Arg Ile Ser Glu Ile Gln Ser Ala Gln Phe Gly Ala Ala
210 215 220
Ile Leu Pro Phe Tyr Ala Gln Ala Ala Thr Leu His Leu Asn Leu Leu
225 230 235 240
Ser Gln Ala Val Gln Phe Ala Asp Gln Leu Lys Ala Asp Ala Glu Asn
245 250 255
Gln Pro Thr Gln Asp Ala Gly Thr Ser Asp Glu Tyr His Arg Leu Leu
260 265 270
Lys Gln Tyr Thr Gln Gln Tyr Thr Asp Tyr Cys Val Lys Thr Tyr Gln
275 280 285
Glu Gly Leu Thr Thr Leu Lys Asn Ala Pro Asp Met Thr Trp Asn Ile
290 295 300
Tyr Asn Thr Tyr Arg Arg Asp Met Thr Leu Leu Val Leu Asp Tyr Ile
305 310 315 320
Ala Leu Phe Pro Asn Phe Asp Ile Arg Gln Tyr Pro Thr Gly Thr His
325 330 335
Ser Glu Leu Thr Arg Glu Ile Tyr Thr Asp Ala Leu Ile Lys Glu Gln
340 345 350
Pro Asn Pro Pro Ile Pro Ile Pro Val Ala Gln Arg Glu Thr Glu Leu
355 360 365
Thr Arg Pro Pro His Leu Phe Ser Arg Leu Asn Arg Leu Leu Leu Phe
370 375 380
Ser Asn Thr Asp Pro Ser Pro Pro Ala Leu Ser Ala Val Gln Asn Ser
385 390 395 400
Phe Thr Tyr Thr Asn Asn Asn Ile Val Gln Tyr Ser Gln Ile Tyr Arg
405 410 415
Leu Ile Asn Ser Pro Glu Pro Thr Gly Ser Thr Gln Gln Ile Ile Ile
420 425 430
Ser Asn Thr Asp Ile Tyr Arg Ile Glu Leu Thr Leu Arg Gln Asn Tyr
435 440 445
Phe Ile Thr His Asn Ile Asn Phe Tyr Thr Leu Asp Gly Ile Thr Arg
450 455 460
Arg Tyr Asn Ser Gly Ser Thr Pro Glu Thr Pro Ile Ile Ile Asn Phe
465 470 475 480
Gln Leu Pro Glu Phe Pro Ala Asp Lys Ile Pro Pro Tyr Gly Leu Gly
485 490 495
Thr Lys Tyr Ser His Ile Leu Ser Tyr Met Lys Ile Ser Pro Asn Arg
500 505 510
His Asn Ile Val Leu Pro Asn Glu Arg His Leu Ser Phe Ala Trp Thr
515 520 525
His Thr Ser Val Asn Phe Asp Asn Glu Ile Ser Asn Gln Ile Val Thr
530 535 540
Gln Ile Pro Ala Val Lys Ala Arg Ser Leu Asp Thr Ser Ser Ser Val
545 550 555 560
Ile Ser Gly Pro Gly His Thr Gly Gly Asp Leu Val Leu Phe Arg Ser
565 570 575
Arg Met Glu Phe Gln Cys Phe Val Pro Thr Gln Thr Lys Tyr Lys Ile
580 585 590
Arg Met Arg Tyr Val Ala Tyr Ser Pro Asn Asn Ser Ile Ser Leu Thr
595 600 605
Leu Asn Ile Met Gly Gln Val Asn Tyr Gly Leu Arg Ala Leu Asn Ile
610 615 620
Lys Ser Thr Val Ala Ser Glu Lys Asp Thr Val Asn Pro Lys Tyr Asn
625 630 635 640
Asp Phe Gln Tyr Leu Tyr Phe Tyr Thr Gln Asp Pro Asn Glu Ile Ala
645 650 655
Ile Val Asn Leu Tyr Ser Leu Thr Asn Met Thr Leu Gln Ser Asn Asn
660 665 670
Phe Ala Gly Asn Thr Leu Val Ile Asp Lys Ile Glu Phe Leu Pro Val
675 680 685
Thr Gln Thr Leu Glu Glu Asn Leu Glu Thr Gln Lys Leu Glu Glu Ile
690 695 700
Gln Gln Thr Val Asn Thr Phe Phe Thr Asn His Thr Lys Asn Ile Leu
705 710 715 720
Asn Ile Glu Thr Thr Asp Tyr Glu Ile Asp Gln Thr Ala Ile Leu Val
725 730 735
Glu Ser Leu Ser Glu Glu Gln Tyr Pro Gln Glu Lys Met Ile Leu Leu
740 745 750
Asp Thr Met Lys Gln Ala Lys Gln Phe Ser Gln Ser Arg Asn Leu Leu
755 760 765
Gln Asn Gly Asp Phe Ser Ser Phe Ile Gly Trp Thr Ile Asn Asn Asp
770 775 780
Leu Thr Leu Gln Thr Asn Asn Pro Ile Phe Lys Gly Gln Tyr Leu His
785 790 795 800
Ile Leu Gly Ala Arg Thr Thr Glu Ile Asp Asn Thr Ile Phe Pro Thr
805 810 815
Tyr Val Tyr Gln Lys Ile Asp Glu Ser Lys Leu Lys Pro Tyr Thr Arg
820 825 830
Tyr Val Val Arg Gly Cys Ile Ser Asn Ser Lys Asp Leu Glu Leu Phe
835 840 845
Val Thr Arg Tyr Asn Lys Glu Ile His Lys Ile Leu Asn Ile Pro Asn
850 855 860
Asp Ser Ser His Met Asn Ala Glu Ile Gln Thr Asn Ser Tyr Glu Ala
865 870 875 880
Gln Ser Tyr Ser Phe Arg Asn Glu Asn Asn Asn Leu Thr Asn Gln Gln
885 890 895
Ser Pro Tyr Pro Thr Thr Ser Asn His Ser Gly Tyr Ser Glu Ser Thr
900 905 910
Leu Ile Ser Ser Asn Phe Glu Asn Glu Asn Ala Phe Ser Phe Ser Ile
915 920 925
Asp Thr Gly Asn Leu Asp Phe Asn Glu Asn Leu Gly Ile Gly Ile Leu
930 935 940
Phe Lys Ile Ser Asn Pro Asp Gly Tyr Ala Ala Leu Gly Asn Leu Glu
945 950 955 960
Val Ile Glu Glu Gln Pro Leu Thr Glu Glu Glu Val Asn Arg Val Thr
965 970 975
Glu Lys Glu Asn Arg Trp Lys Gln Val Arg Asn Asn Gln Gln Lys Glu
980 985 990
Thr Glu Gln Ala Tyr Tyr Gln Ala Gln Gln Ala Ile Asp Ala Leu Phe
995 1000 1005
Ile Asn Ser Gln His Gln Met Leu Lys Ile Glu Thr Thr Ile Gln
1010 1015 1020
Asp Ile Val Val Ala Glu Thr Phe Ile Asp Lys Ile Pro Tyr Ile
1025 1030 1035
Tyr Asn Glu Trp Leu Pro Asn Glu Pro Gly Met Asn Tyr Thr Leu
1040 1045 1050
Tyr Gln Gly Leu Lys Gln Gln Ile Ser Gln Ala Tyr Met Leu Tyr
1055 1060 1065
Arg Asn Arg Asn Leu Ile Gln Asn Gly Asn Phe Ile Asn Ser Thr
1070 1075 1080
Thr Asn Trp Phe Thr Ser Pro Asp Ala Lys Ile Gln Gln Ile Asp
1085 1090 1095
Asn Thr Ser Ile Leu Val Ile Pro Asn Trp Ser Thr Gln Val Ser
1100 1105 1110
Gln Asn Ile Asn Leu Pro Gln Gln Asn His Gln Tyr Leu Leu Arg
1115 1120 1125
Val Ile Ala Lys Lys Glu Gly Ile Gly Asn Gly Tyr Val Lys Ile
1130 1135 1140
Ser Asp Cys Thr Lys His Val Glu Thr Leu Ile Phe Thr Ser Cys
1145 1150 1155
Asp Tyr Asn Asn Asn Ala Met Trp Asn Glu Pro Met Gly Tyr Val
1160 1165 1170
Thr Lys Thr Met His Ile Thr Pro His Thr Asn Gln Val Arg Ile
1175 1180 1185
Asp Ile Gly Glu Thr Glu Gly Thr Phe Lys Ile Asn Ser Val Glu
1190 1195 1200
Leu Ile Gly Ile Lys Asn
1205
<210> 83
<211> 688
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 83
Met Tyr Asn Ile Glu Ile Gln Ile Arg Thr Gly Gly Thr Phe Met Asn
1 5 10 15
His Asn Asn Asn Asn Asn Glu Tyr Glu Ile Ile Asp Ser His Thr Ser
20 25 30
Ser Tyr Pro Ser Asn Arg Asn Ser Asn His Ser Arg Tyr Pro Tyr Thr
35 40 45
Asn Asn Pro Asn Gln Pro Leu Ser Asn Thr Asn Tyr Lys Asp Trp Leu
50 55 60
Thr Met Cys Gln Gly Asn Gln Pro Tyr Arg Asp Lys Pro Gln Thr Leu
65 70 75 80
Val Asp Thr Gly Val Ala Ile Ser Thr Gly Leu Ile Ile Ile Gly Thr
85 90 95
Ile Met Thr Gly Phe Pro Ala Leu Thr Pro Trp Gly Ile Gly Ile Val
100 105 110
Ser Phe Gly Thr Leu Leu Pro Val Leu Trp Pro Ser Gly Glu Thr Asp
115 120 125
Lys Lys Val Trp Ile Asp Phe Met Asn Tyr Gly Gly Thr Ile Phe Lys
130 135 140
Pro Leu Leu Asn Glu Gln Lys Asp Gln Ala Gln Arg Ile Leu Gln Gly
145 150 155 160
Phe His Leu Pro Ile Asn Ser Tyr Gln Asn Gly Ile Thr Thr Trp Gln
165 170 175
Asn Leu Arg Lys Ser His Ile Pro Gly Thr Pro Pro Ser Ser Ala Leu
180 185 190
Arg Glu Ala Ala Arg Val Ala Arg Thr Arg Ile Asp Ile Ala Asn Ser
195 200 205
Ala Leu Ser Arg Ile Ser Glu Ile Gln Ser Ala Gln Phe Gly Ala Ala
210 215 220
Ile Leu Pro Phe Tyr Ala Gln Ala Ala Thr Leu His Leu Asn Leu Leu
225 230 235 240
Ser Gln Ala Val Gln Phe Ala Asp Gln Leu Lys Ala Asp Ala Glu Asn
245 250 255
Gln Pro Thr Gln Asp Ala Gly Thr Ser Asp Glu Tyr His Arg Leu Leu
260 265 270
Lys Gln Tyr Thr Gln Gln Tyr Thr Asp Tyr Cys Val Lys Thr Tyr Gln
275 280 285
Glu Gly Leu Thr Thr Leu Lys Asn Ala Pro Asp Met Thr Trp Asn Ile
290 295 300
Tyr Asn Thr Tyr Arg Arg Asp Met Thr Leu Leu Val Leu Asp Tyr Ile
305 310 315 320
Ala Leu Phe Pro Asn Phe Asp Ile Arg Gln Tyr Pro Thr Gly Thr His
325 330 335
Ser Glu Leu Thr Arg Glu Ile Tyr Thr Asp Ala Leu Ile Lys Glu Gln
340 345 350
Pro Asn Pro Pro Ile Pro Ile Pro Val Ala Gln Arg Glu Thr Glu Leu
355 360 365
Thr Arg Pro Pro His Leu Phe Ser Arg Leu Asn Arg Leu Leu Leu Phe
370 375 380
Ser Asn Thr Asp Pro Ser Pro Pro Ala Leu Ser Ala Val Gln Asn Ser
385 390 395 400
Phe Thr Tyr Thr Asn Asn Asn Ile Val Gln Tyr Ser Gln Ile Tyr Arg
405 410 415
Leu Ile Asn Ser Pro Glu Pro Thr Gly Ser Thr Gln Gln Ile Ile Ile
420 425 430
Ser Asn Thr Asp Ile Tyr Arg Ile Glu Leu Thr Leu Arg Gln Asn Tyr
435 440 445
Phe Ile Thr His Asn Ile Asn Phe Tyr Thr Leu Asp Gly Ile Thr Arg
450 455 460
Arg Tyr Asn Ser Gly Ser Thr Pro Glu Thr Pro Ile Ile Ile Asn Phe
465 470 475 480
Gln Leu Pro Glu Phe Pro Ala Asp Lys Ile Pro Pro Tyr Gly Leu Gly
485 490 495
Thr Lys Tyr Ser His Ile Leu Ser Tyr Met Lys Ile Ser Pro Asn Arg
500 505 510
His Asn Ile Val Leu Pro Asn Glu Arg His Leu Ser Phe Ala Trp Thr
515 520 525
His Thr Ser Val Asn Phe Asp Asn Glu Ile Ser Asn Gln Ile Val Thr
530 535 540
Gln Ile Pro Ala Val Lys Ala Arg Ser Leu Asp Thr Ser Ser Ser Val
545 550 555 560
Ile Ser Gly Pro Gly His Thr Gly Gly Asp Leu Val Leu Phe Arg Ser
565 570 575
Arg Met Glu Phe Gln Cys Phe Val Pro Thr Gln Thr Lys Tyr Lys Ile
580 585 590
Arg Met Arg Tyr Val Ala Tyr Ser Pro Asn Asn Ser Ile Ser Leu Thr
595 600 605
Leu Asn Ile Met Gly Gln Val Asn Tyr Gly Leu Arg Ala Leu Asn Ile
610 615 620
Lys Ser Thr Val Ala Ser Glu Lys Asp Thr Val Asn Pro Lys Tyr Asn
625 630 635 640
Asp Phe Gln Tyr Leu Tyr Phe Tyr Thr Gln Asp Pro Asn Glu Ile Ala
645 650 655
Ile Val Asn Leu Tyr Ser Leu Thr Asn Met Thr Leu Gln Ser Asn Asn
660 665 670
Phe Ala Gly Asn Thr Leu Val Ile Asp Lys Ile Glu Phe Leu Pro Val
675 680 685
<210> 84
<211> 387
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 84
Met Ala Asn Leu Lys Leu Asp Lys Phe Tyr Gln Ile Ser Pro Val Ile
1 5 10 15
Ser Pro Gly Lys Ile Val Gln Val Gly Trp Gly Ala Asp Ser Asp Gly
20 25 30
Asp Val Leu Glu Gln Phe Pro Pro Asp Arg Tyr Gln Asp Ala Trp His
35 40 45
Gln Phe Leu Phe Val Pro Leu Asp Tyr Gly Asn Tyr Ala Ile Val Cys
50 55 60
Lys Asn Ser Gly Lys Val Val Ser Val Ser His Arg Asp Asn Gly Asp
65 70 75 80
Gly Arg Trp Ile Glu Gln Trp Asp Trp Leu Ser Asp Val Asn Asp Gln
85 90 95
Glu Ala Gln His Ile Lys Leu Asn Asp Thr Pro Asn Ser Gly Thr Tyr
100 105 110
Tyr Phe Gln Phe Gln His Ser Gly Lys Tyr Met Ala Val Ser Trp Arg
115 120 125
Asn Ser Asn Asp Gly Asn Tyr Met Glu Gln Trp Gln Phe Glu Gly Lys
130 135 140
Asp Trp Gln Gln Phe Lys Val Glu Glu Leu Ser Asn Ser Ile Thr Leu
145 150 155 160
Pro Thr Ala Thr Ile Lys Pro Arg Pro Val Ala Pro Asp Tyr Thr Gly
165 170 175
Thr Asn Ile Asn Glu Thr Leu Pro Glu Ser Ser Glu Lys Ala Ile Thr
180 185 190
Ala Ala Ser Leu Val Pro Phe Phe Met Val Asp Asp Pro Phe Tyr Asn
195 200 205
Asp Ser Ala Thr Gln Phe Asn Ala Ser Pro Tyr Tyr Leu Leu Glu Lys
210 215 220
Thr Glu Gln Trp Ile Arg Ile Ser Thr Asn Ile Val Ser Pro Gly Asp
225 230 235 240
Lys Ile Val Tyr Thr Arg Lys Val Gly Ile Ser Glu Ala His Gln Thr
245 250 255
Thr Met Gln Ser Thr Val Asp Met Ser Leu Gly Val Asp Ile Gly Leu
260 265 270
Gln Phe Arg Ala Leu Ser Gly Ala Leu Ser Ala Asn Ile Ser Thr Gln
275 280 285
Leu Ser Met Thr Glu Ser Thr Thr Thr Thr Gln Met Lys Glu Glu Thr
290 295 300
Ile Thr Arg Glu Ile Leu Asn Asn Ala Asp Glu Ser Asp Ser Phe Thr
305 310 315 320
Ile Tyr Gln Leu Glu Thr Gln Phe His Leu Lys Arg Ser Asp Arg Ser
325 330 335
Tyr Val Ile Asp Pro Trp Ala Ala Arg Phe Glu Arg Asp Thr Val Thr
340 345 350
Arg Ser Tyr Leu Gln Ser Ile Gln Thr Thr Asp Asn Thr Leu Arg Ala
355 360 365
Glu Thr Ser Ala Glu Leu Gln Lys Arg Leu Ile Asn Lys Lys Ser Tyr
370 375 380
Leu Arg Ser
385
<210> 85
<211> 777
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 85
Met Gln Arg Asn Asn Lys Gln Phe Thr Asn Trp Lys Lys Gln Leu Met
1 5 10 15
Lys Ala Val Pro Val Cys Val Leu Ser Thr Gly Phe Leu Cys Gly Val
20 25 30
Ala Asn Thr Ala Leu Ala Ala Glu Lys Thr Asp Val Lys Thr Ala Val
35 40 45
Thr Ala Lys Gln Val His Thr Val Gln Asn Gly Asn Ile Lys Glu Ala
50 55 60
Thr Lys Ala Pro Ala Val Gly Ser Phe Ala Ser Asn Pro Glu Asp Asp
65 70 75 80
Leu Ser Val Leu Gly Lys Ala Ile Lys Asp Ala Ala Gln Ala Gly Ser
85 90 95
Ala Gly Ile Asp Pro Asn Ile Leu Gln Glu Ala Ala Asn Asn Asp Tyr
100 105 110
Thr Asp Ile Ala Lys Ser Ile Leu Lys Ile Thr Ala Met Ala Gly Ser
115 120 125
Ala Gly Phe Ala Pro Leu Gly Val Leu Val Pro Leu Ile Asp Ala Phe
130 135 140
Phe Pro Ser Gly Asp Lys Pro Thr Leu Ile Gln Gln Ile Thr Pro Ala
145 150 155 160
Val Gln Lys Met Ile Asp Ala Asn Asp Val Lys Asn Arg Ala Asn Thr
165 170 175
Leu Asp Gly Tyr Ile Asn Gly Leu Lys Ala Pro Leu Ala Asp Tyr Gly
180 185 190
Asn Tyr Val Arg Ala Val Gln Asn Gly Lys Phe Leu Asp Gly Arg Thr
195 200 205
Pro Thr Gln Lys Asp Arg Gln Glu Ala Met Glu Arg Met Ser Lys Asn
210 215 220
Ile Glu Asn Ile Lys Gln Thr Ile Asn Lys Asp Gln Lys Leu Lys Ser
225 230 235 240
Val Leu Lys Ser Glu Thr Leu Lys Gln Leu Asn Asp Asn Gln Asn Gln
245 250 255
Lys Asn Gly Ala Asn Ser Asp Val Asp Ile Leu Ala Gly Glu Ile Asn
260 265 270
Thr Val Asn Thr Ala Phe Glu Thr Ala Leu Ala Gln Phe Glu Gln Gly
275 280 285
Val Ser Asp Ser Ser Thr Arg Ile Ala Glu Thr Thr Val Pro Tyr Tyr
290 295 300
Ala Gln Phe Val Thr Met Tyr Thr Thr Phe Leu His Asp Val Val Thr
305 310 315 320
Asn Gly Ser Lys Trp Gly Ile Ser Asn Ala Val Ile Ser Thr Met His
325 330 335
Asp Asp Leu Gln Lys Leu Ile Pro His Ala Thr Gln Gln Ile His Asp
340 345 350
Met Tyr Leu Thr Ala Gln Lys Tyr Ala Glu Gln Ile Glu His Ser Glu
355 360 365
Pro Gly Ile Thr Thr His Lys Arg Glu Ala Asp Glu Ala Leu Ile Pro
370 375 380
Ser Leu Asn Leu Ala Ala Met Trp Thr Thr Met Ser Asp Asp Tyr Lys
385 390 395 400
Gly His Ala Val His Leu Asp Gln Ser Lys Thr Ile Thr Thr His Glu
405 410 415
Ile Gly Gln Ile Gln Asp Ser Trp Ser Pro Ser Phe Arg Asp Thr Ala
420 425 430
Thr Glu Phe Tyr Asp Gly Thr Leu Leu Met His Leu Asp Asp Glu Tyr
435 440 445
Phe Gln Thr Thr Thr Asn Ile Gly Glu Ser Phe Arg Asp Ser Thr Gly
450 455 460
Ala Arg His Gly Tyr Val Tyr Gly Glu Ala Thr Pro Ser Gly Tyr Ala
465 470 475 480
Val Lys Thr Glu Gly Gly Glu Ile Asp Pro Asn Asn Pro Ile Thr Ser
485 490 495
Val Trp Met Thr Gly Val Asn Asp Ala Thr Ser Arg Val His Phe Gly
500 505 510
Pro Ala Gly Val Gly Ser Asp Pro Gln Met Ala Gly Ser Val Asn Ile
515 520 525
Pro Gly Tyr Lys Leu Asn Trp Leu Asp Ala Asn Tyr Thr Thr Gly Val
530 535 540
Ser Gly Ser Glu Gly Arg Ser Val Ile Thr Asn Ala Phe Ala Arg Phe
545 550 555 560
Ile Pro Lys Asp Leu Val Pro Glu Asn Thr Val Arg Ser Glu Asn Val
565 570 575
Ile Thr Gly Ile Pro Phe Glu Lys Tyr Asp Arg Ser Lys Ala Phe Asn
580 585 590
Tyr Tyr Gly Arg Ala His Glu Ser Val Asn Gly Ala Asp Ala Val Thr
595 600 605
Thr Arg Pro Lys Gln Pro Gly Glu Asn Gly Pro Trp Met Ser Val Lys
610 615 620
Phe His Val Asp Lys Asp Gly Asp Tyr Asn Leu Arg Tyr Arg Ala Ala
625 630 635 640
Thr Thr Arg Asp Gly Asn Leu Phe Ile Ala Gln Asp Gly Ala Met Asn
645 650 655
Ala Asn Ser Val Lys Ile Gln Asn Thr Ser Ala Asp Ala Asn Ala Val
660 665 670
Gln Gly Asp Leu Gly Lys Tyr Gly Leu Ser Glu Asn Leu Pro Val His
675 680 685
Leu Thr Ala Gly Asp His Ser Val Glu Ile Met Val Pro Asp Glu Met
690 695 700
Leu Gly Asn Met Thr Leu Asp Arg Leu Glu Phe Val Pro Val Asp Thr
705 710 715 720
Ser Ala Gln Lys Ser Phe Lys Pro Ala Thr Pro Thr Val Lys Ile Ser
725 730 735
Lys Glu Leu Ala Ala Asn Phe Thr Lys Lys Val Asn Ser Glu Asp Lys
740 745 750
Thr Lys Lys Asp Asp Val Lys Ala Ser Ala Asp Thr Ser Lys Asn Pro
755 760 765
Glu Asp Ser Lys Asn Arg Glu Lys Gln
770 775
<210> 86
<211> 740
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 86
Met Ala Glu Lys Thr Asp Val Lys Thr Ala Val Thr Ala Lys Gln Val
1 5 10 15
His Thr Val Gln Asn Gly Asn Ile Lys Glu Ala Thr Lys Ala Pro Ala
20 25 30
Val Gly Ser Phe Ala Ser Asn Pro Glu Asp Asp Leu Ser Val Leu Gly
35 40 45
Lys Ala Ile Lys Asp Ala Ala Gln Ala Gly Ser Ala Gly Ile Asp Pro
50 55 60
Asn Ile Leu Gln Glu Ala Ala Asn Asn Asp Tyr Thr Asp Ile Ala Lys
65 70 75 80
Ser Ile Leu Lys Ile Thr Ala Met Ala Gly Ser Ala Gly Phe Ala Pro
85 90 95
Leu Gly Val Leu Val Pro Leu Ile Asp Ala Phe Phe Pro Ser Gly Asp
100 105 110
Lys Pro Thr Leu Ile Gln Gln Ile Thr Pro Ala Val Gln Lys Met Ile
115 120 125
Asp Ala Asn Asp Val Lys Asn Arg Ala Asn Thr Leu Asp Gly Tyr Ile
130 135 140
Asn Gly Leu Lys Ala Pro Leu Ala Asp Tyr Gly Asn Tyr Val Arg Ala
145 150 155 160
Val Gln Asn Gly Lys Phe Leu Asp Gly Arg Thr Pro Thr Gln Lys Asp
165 170 175
Arg Gln Glu Ala Met Glu Arg Met Ser Lys Asn Ile Glu Asn Ile Lys
180 185 190
Gln Thr Ile Asn Lys Asp Gln Lys Leu Lys Ser Val Leu Lys Ser Glu
195 200 205
Thr Leu Lys Gln Leu Asn Asp Asn Gln Asn Gln Lys Asn Gly Ala Asn
210 215 220
Ser Asp Val Asp Ile Leu Ala Gly Glu Ile Asn Thr Val Asn Thr Ala
225 230 235 240
Phe Glu Thr Ala Leu Ala Gln Phe Glu Gln Gly Val Ser Asp Ser Ser
245 250 255
Thr Arg Ile Ala Glu Thr Thr Val Pro Tyr Tyr Ala Gln Phe Val Thr
260 265 270
Met Tyr Thr Thr Phe Leu His Asp Val Val Thr Asn Gly Ser Lys Trp
275 280 285
Gly Ile Ser Asn Ala Val Ile Ser Thr Met His Asp Asp Leu Gln Lys
290 295 300
Leu Ile Pro His Ala Thr Gln Gln Ile His Asp Met Tyr Leu Thr Ala
305 310 315 320
Gln Lys Tyr Ala Glu Gln Ile Glu His Ser Glu Pro Gly Ile Thr Thr
325 330 335
His Lys Arg Glu Ala Asp Glu Ala Leu Ile Pro Ser Leu Asn Leu Ala
340 345 350
Ala Met Trp Thr Thr Met Ser Asp Asp Tyr Lys Gly His Ala Val His
355 360 365
Leu Asp Gln Ser Lys Thr Ile Thr Thr His Glu Ile Gly Gln Ile Gln
370 375 380
Asp Ser Trp Ser Pro Ser Phe Arg Asp Thr Ala Thr Glu Phe Tyr Asp
385 390 395 400
Gly Thr Leu Leu Met His Leu Asp Asp Glu Tyr Phe Gln Thr Thr Thr
405 410 415
Asn Ile Gly Glu Ser Phe Arg Asp Ser Thr Gly Ala Arg His Gly Tyr
420 425 430
Val Tyr Gly Glu Ala Thr Pro Ser Gly Tyr Ala Val Lys Thr Glu Gly
435 440 445
Gly Glu Ile Asp Pro Asn Asn Pro Ile Thr Ser Val Trp Met Thr Gly
450 455 460
Val Asn Asp Ala Thr Ser Arg Val His Phe Gly Pro Ala Gly Val Gly
465 470 475 480
Ser Asp Pro Gln Met Ala Gly Ser Val Asn Ile Pro Gly Tyr Lys Leu
485 490 495
Asn Trp Leu Asp Ala Asn Tyr Thr Thr Gly Val Ser Gly Ser Glu Gly
500 505 510
Arg Ser Val Ile Thr Asn Ala Phe Ala Arg Phe Ile Pro Lys Asp Leu
515 520 525
Val Pro Glu Asn Thr Val Arg Ser Glu Asn Val Ile Thr Gly Ile Pro
530 535 540
Phe Glu Lys Tyr Asp Arg Ser Lys Ala Phe Asn Tyr Tyr Gly Arg Ala
545 550 555 560
His Glu Ser Val Asn Gly Ala Asp Ala Val Thr Thr Arg Pro Lys Gln
565 570 575
Pro Gly Glu Asn Gly Pro Trp Met Ser Val Lys Phe His Val Asp Lys
580 585 590
Asp Gly Asp Tyr Asn Leu Arg Tyr Arg Ala Ala Thr Thr Arg Asp Gly
595 600 605
Asn Leu Phe Ile Ala Gln Asp Gly Ala Met Asn Ala Asn Ser Val Lys
610 615 620
Ile Gln Asn Thr Ser Ala Asp Ala Asn Ala Val Gln Gly Asp Leu Gly
625 630 635 640
Lys Tyr Gly Leu Ser Glu Asn Leu Pro Val His Leu Thr Ala Gly Asp
645 650 655
His Ser Val Glu Ile Met Val Pro Asp Glu Met Leu Gly Asn Met Thr
660 665 670
Leu Asp Arg Leu Glu Phe Val Pro Val Asp Thr Ser Ala Gln Lys Ser
675 680 685
Phe Lys Pro Ala Thr Pro Thr Val Lys Ile Ser Lys Glu Leu Ala Ala
690 695 700
Asn Phe Thr Lys Lys Val Asn Ser Glu Asp Lys Thr Lys Lys Asp Asp
705 710 715 720
Val Lys Ala Ser Ala Asp Thr Ser Lys Asn Pro Glu Asp Ser Lys Asn
725 730 735
Arg Glu Lys Gln
740
<210> 87
<211> 718
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 87
Met Gln Arg Asn Asn Lys Gln Phe Thr Asn Trp Lys Lys Gln Leu Met
1 5 10 15
Lys Ala Val Pro Val Cys Val Leu Ser Thr Gly Phe Leu Cys Gly Val
20 25 30
Ala Asn Thr Ala Leu Ala Ala Glu Lys Thr Asp Val Lys Thr Ala Val
35 40 45
Thr Ala Lys Gln Val His Thr Val Gln Asn Gly Asn Ile Lys Glu Ala
50 55 60
Thr Lys Ala Pro Ala Val Gly Ser Phe Ala Ser Asn Pro Glu Asp Asp
65 70 75 80
Leu Ser Val Leu Gly Lys Ala Ile Lys Asp Ala Ala Gln Ala Gly Ser
85 90 95
Ala Gly Ile Asp Pro Asn Ile Leu Gln Glu Ala Ala Asn Asn Asp Tyr
100 105 110
Thr Asp Ile Ala Lys Ser Ile Leu Lys Ile Thr Ala Met Ala Gly Ser
115 120 125
Ala Gly Phe Ala Pro Leu Gly Val Leu Val Pro Leu Ile Asp Ala Phe
130 135 140
Phe Pro Ser Gly Asp Lys Pro Thr Leu Ile Gln Gln Ile Thr Pro Ala
145 150 155 160
Val Gln Lys Met Ile Asp Ala Asn Asp Val Lys Asn Arg Ala Asn Thr
165 170 175
Leu Asp Gly Tyr Ile Asn Gly Leu Lys Ala Pro Leu Ala Asp Tyr Gly
180 185 190
Asn Tyr Val Arg Ala Val Gln Asn Gly Lys Phe Leu Asp Gly Arg Thr
195 200 205
Pro Thr Gln Lys Asp Arg Gln Glu Ala Met Glu Arg Met Ser Lys Asn
210 215 220
Ile Glu Asn Ile Lys Gln Thr Ile Asn Lys Asp Gln Lys Leu Lys Ser
225 230 235 240
Val Leu Lys Ser Glu Thr Leu Lys Gln Leu Asn Asp Asn Gln Asn Gln
245 250 255
Lys Asn Gly Ala Asn Ser Asp Val Asp Ile Leu Ala Gly Glu Ile Asn
260 265 270
Thr Val Asn Thr Ala Phe Glu Thr Ala Leu Ala Gln Phe Glu Gln Gly
275 280 285
Val Ser Asp Ser Ser Thr Arg Ile Ala Glu Thr Thr Val Pro Tyr Tyr
290 295 300
Ala Gln Phe Val Thr Met Tyr Thr Thr Phe Leu His Asp Val Val Thr
305 310 315 320
Asn Gly Ser Lys Trp Gly Ile Ser Asn Ala Val Ile Ser Thr Met His
325 330 335
Asp Asp Leu Gln Lys Leu Ile Pro His Ala Thr Gln Gln Ile His Asp
340 345 350
Met Tyr Leu Thr Ala Gln Lys Tyr Ala Glu Gln Ile Glu His Ser Glu
355 360 365
Pro Gly Ile Thr Thr His Lys Arg Glu Ala Asp Glu Ala Leu Ile Pro
370 375 380
Ser Leu Asn Leu Ala Ala Met Trp Thr Thr Met Ser Asp Asp Tyr Lys
385 390 395 400
Gly His Ala Val His Leu Asp Gln Ser Lys Thr Ile Thr Thr His Glu
405 410 415
Ile Gly Gln Ile Gln Asp Ser Trp Ser Pro Ser Phe Arg Asp Thr Ala
420 425 430
Thr Glu Phe Tyr Asp Gly Thr Leu Leu Met His Leu Asp Asp Glu Tyr
435 440 445
Phe Gln Thr Thr Thr Asn Ile Gly Glu Ser Phe Arg Asp Ser Thr Gly
450 455 460
Ala Arg His Gly Tyr Val Tyr Gly Glu Ala Thr Pro Ser Gly Tyr Ala
465 470 475 480
Val Lys Thr Glu Gly Gly Glu Ile Asp Pro Asn Asn Pro Ile Thr Ser
485 490 495
Val Trp Met Thr Gly Val Asn Asp Ala Thr Ser Arg Val His Phe Gly
500 505 510
Pro Ala Gly Val Gly Ser Asp Pro Gln Met Ala Gly Ser Val Asn Ile
515 520 525
Pro Gly Tyr Lys Leu Asn Trp Leu Asp Ala Asn Tyr Thr Thr Gly Val
530 535 540
Ser Gly Ser Glu Gly Arg Ser Val Ile Thr Asn Ala Phe Ala Arg Phe
545 550 555 560
Ile Pro Lys Asp Leu Val Pro Glu Asn Thr Val Arg Ser Glu Asn Val
565 570 575
Ile Thr Gly Ile Pro Phe Glu Lys Tyr Asp Arg Ser Lys Ala Phe Asn
580 585 590
Tyr Tyr Gly Arg Ala His Glu Ser Val Asn Gly Ala Asp Ala Val Thr
595 600 605
Thr Arg Pro Lys Gln Pro Gly Glu Asn Gly Pro Trp Met Ser Val Lys
610 615 620
Phe His Val Asp Lys Asp Gly Asp Tyr Asn Leu Arg Tyr Arg Ala Ala
625 630 635 640
Thr Thr Arg Asp Gly Asn Leu Phe Ile Ala Gln Asp Gly Ala Met Asn
645 650 655
Ala Asn Ser Val Lys Ile Gln Asn Thr Ser Ala Asp Ala Asn Ala Val
660 665 670
Gln Gly Asp Leu Gly Lys Tyr Gly Leu Ser Glu Asn Leu Pro Val His
675 680 685
Leu Thr Ala Gly Asp His Ser Val Glu Ile Met Val Pro Asp Glu Met
690 695 700
Leu Gly Asn Met Thr Leu Asp Arg Leu Glu Phe Val Pro Val
705 710 715
<210> 88
<211> 681
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 88
Met Ala Glu Lys Thr Asp Val Lys Thr Ala Val Thr Ala Lys Gln Val
1 5 10 15
His Thr Val Gln Asn Gly Asn Ile Lys Glu Ala Thr Lys Ala Pro Ala
20 25 30
Val Gly Ser Phe Ala Ser Asn Pro Glu Asp Asp Leu Ser Val Leu Gly
35 40 45
Lys Ala Ile Lys Asp Ala Ala Gln Ala Gly Ser Ala Gly Ile Asp Pro
50 55 60
Asn Ile Leu Gln Glu Ala Ala Asn Asn Asp Tyr Thr Asp Ile Ala Lys
65 70 75 80
Ser Ile Leu Lys Ile Thr Ala Met Ala Gly Ser Ala Gly Phe Ala Pro
85 90 95
Leu Gly Val Leu Val Pro Leu Ile Asp Ala Phe Phe Pro Ser Gly Asp
100 105 110
Lys Pro Thr Leu Ile Gln Gln Ile Thr Pro Ala Val Gln Lys Met Ile
115 120 125
Asp Ala Asn Asp Val Lys Asn Arg Ala Asn Thr Leu Asp Gly Tyr Ile
130 135 140
Asn Gly Leu Lys Ala Pro Leu Ala Asp Tyr Gly Asn Tyr Val Arg Ala
145 150 155 160
Val Gln Asn Gly Lys Phe Leu Asp Gly Arg Thr Pro Thr Gln Lys Asp
165 170 175
Arg Gln Glu Ala Met Glu Arg Met Ser Lys Asn Ile Glu Asn Ile Lys
180 185 190
Gln Thr Ile Asn Lys Asp Gln Lys Leu Lys Ser Val Leu Lys Ser Glu
195 200 205
Thr Leu Lys Gln Leu Asn Asp Asn Gln Asn Gln Lys Asn Gly Ala Asn
210 215 220
Ser Asp Val Asp Ile Leu Ala Gly Glu Ile Asn Thr Val Asn Thr Ala
225 230 235 240
Phe Glu Thr Ala Leu Ala Gln Phe Glu Gln Gly Val Ser Asp Ser Ser
245 250 255
Thr Arg Ile Ala Glu Thr Thr Val Pro Tyr Tyr Ala Gln Phe Val Thr
260 265 270
Met Tyr Thr Thr Phe Leu His Asp Val Val Thr Asn Gly Ser Lys Trp
275 280 285
Gly Ile Ser Asn Ala Val Ile Ser Thr Met His Asp Asp Leu Gln Lys
290 295 300
Leu Ile Pro His Ala Thr Gln Gln Ile His Asp Met Tyr Leu Thr Ala
305 310 315 320
Gln Lys Tyr Ala Glu Gln Ile Glu His Ser Glu Pro Gly Ile Thr Thr
325 330 335
His Lys Arg Glu Ala Asp Glu Ala Leu Ile Pro Ser Leu Asn Leu Ala
340 345 350
Ala Met Trp Thr Thr Met Ser Asp Asp Tyr Lys Gly His Ala Val His
355 360 365
Leu Asp Gln Ser Lys Thr Ile Thr Thr His Glu Ile Gly Gln Ile Gln
370 375 380
Asp Ser Trp Ser Pro Ser Phe Arg Asp Thr Ala Thr Glu Phe Tyr Asp
385 390 395 400
Gly Thr Leu Leu Met His Leu Asp Asp Glu Tyr Phe Gln Thr Thr Thr
405 410 415
Asn Ile Gly Glu Ser Phe Arg Asp Ser Thr Gly Ala Arg His Gly Tyr
420 425 430
Val Tyr Gly Glu Ala Thr Pro Ser Gly Tyr Ala Val Lys Thr Glu Gly
435 440 445
Gly Glu Ile Asp Pro Asn Asn Pro Ile Thr Ser Val Trp Met Thr Gly
450 455 460
Val Asn Asp Ala Thr Ser Arg Val His Phe Gly Pro Ala Gly Val Gly
465 470 475 480
Ser Asp Pro Gln Met Ala Gly Ser Val Asn Ile Pro Gly Tyr Lys Leu
485 490 495
Asn Trp Leu Asp Ala Asn Tyr Thr Thr Gly Val Ser Gly Ser Glu Gly
500 505 510
Arg Ser Val Ile Thr Asn Ala Phe Ala Arg Phe Ile Pro Lys Asp Leu
515 520 525
Val Pro Glu Asn Thr Val Arg Ser Glu Asn Val Ile Thr Gly Ile Pro
530 535 540
Phe Glu Lys Tyr Asp Arg Ser Lys Ala Phe Asn Tyr Tyr Gly Arg Ala
545 550 555 560
His Glu Ser Val Asn Gly Ala Asp Ala Val Thr Thr Arg Pro Lys Gln
565 570 575
Pro Gly Glu Asn Gly Pro Trp Met Ser Val Lys Phe His Val Asp Lys
580 585 590
Asp Gly Asp Tyr Asn Leu Arg Tyr Arg Ala Ala Thr Thr Arg Asp Gly
595 600 605
Asn Leu Phe Ile Ala Gln Asp Gly Ala Met Asn Ala Asn Ser Val Lys
610 615 620
Ile Gln Asn Thr Ser Ala Asp Ala Asn Ala Val Gln Gly Asp Leu Gly
625 630 635 640
Lys Tyr Gly Leu Ser Glu Asn Leu Pro Val His Leu Thr Ala Gly Asp
645 650 655
His Ser Val Glu Ile Met Val Pro Asp Glu Met Leu Gly Asn Met Thr
660 665 670
Leu Asp Arg Leu Glu Phe Val Pro Val
675 680
<210> 89
<211> 657
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 89
Val Lys Asn Met Asn Ser Tyr Gln Asn Thr Asn Glu Tyr Glu Ile Leu
1 5 10 15
Asp Gly Ser Pro Asn Asn Thr Asn Met Ser Asn Arg Tyr Pro Phe Ala
20 25 30
Lys Asp Pro Asn Ile Phe Pro Ile Asn Leu Asp Ala Cys Gln Gly Arg
35 40 45
Pro Trp Gln Asp Thr Trp Glu Ser Val Ser Asp Ile Val Thr Ile Gly
50 55 60
Thr Tyr Leu Ile Gln Phe Leu Leu Glu Pro Gly Ile Gly Gly Ile Pro
65 70 75 80
Val Ile Leu Ser Ile Ile Asn Lys Leu Ile Pro Ser Ser Gly Gln Ser
85 90 95
Val Ala Ala Leu Ser Ile Cys Asp Leu Leu Ser Ile Ile Arg Lys Glu
100 105 110
Val Asp Glu Ser Val Leu Ser Asp Gly Val Ala Asp Phe Lys Gly Glu
115 120 125
Met Thr Ala Tyr Arg Asp Tyr Tyr Leu Pro Tyr Leu Glu Asp Trp Leu
130 135 140
Thr Asp Lys Ser Asn Pro Glu Lys Leu Ala Asp Val Val Lys Gln Phe
145 150 155 160
Gln Ala Arg Glu Glu Asp Phe Thr Lys Leu Leu Ala Gly Ser Leu Ser
165 170 175
Arg Gln Lys Ala Glu Ile Leu Leu Leu Pro Thr Tyr Val Gln Ala Ala
180 185 190
Asn Val His Leu Leu Leu Leu Arg Asp Ala Val Lys Tyr Lys Lys Glu
195 200 205
Trp Gly Leu Val Cys Pro Pro Leu Tyr Pro Gly Ser Gly Arg Thr Asp
210 215 220
Cys Asn Glu Arg Leu Lys Ala Lys Ile Lys Glu Tyr Thr Asn Tyr Cys
225 230 235 240
Val Glu Trp Tyr Asn Lys Gly Leu Asp Gln Ile Lys Gln Ala Gly Thr
245 250 255
Ser Thr Glu Thr Trp Leu Lys Phe Asn Lys Phe Arg Arg Glu Met Thr
260 265 270
Leu Ala Val Leu Asp Val Ile Ala Ile Phe Pro Thr Tyr Asp Phe Glu
275 280 285
Lys Tyr Pro Leu Ala Thr Asn Val Glu Leu Thr Arg Glu Val Tyr Thr
290 295 300
Asp Pro Val Gly Tyr Ser Gly Ser Ser Ile Arg Trp Glu Arg Thr Phe
305 310 315 320
Pro Asn Ala Phe Asn Thr Leu Glu Ala Asn Gly Thr Arg Gly Pro Gly
325 330 335
Leu Val Thr Trp Leu Glu Val Leu Gly Ile Tyr Asn His Asp Phe Gln
340 345 350
Asp Ile Thr Val Tyr Phe Ser Gly Trp Val Gly Ser Arg His Ser Glu
355 360 365
Asp Tyr Thr Arg Gly Asn Gly Thr Phe Ser Arg Ile Ser Gly Thr Thr
370 375 380
Ser Asn Asp Ile Arg Asn Gln Thr Phe Tyr Ser Arg Asp Val Phe Lys
385 390 395 400
Ile Asp Ser Leu Gly Ile Tyr Glu Thr Arg Ala Glu Leu Gly Tyr Thr
405 410 415
Arg Gln Arg Phe Cys Val Ser Arg Ala Val Phe Ser Ser Thr Leu Tyr
420 425 430
Asn Tyr Val Tyr Asp Ala Arg Asn Asn Gly Gln Gly Arg Met Thr Ile
435 440 445
Glu Ser Met Leu Pro Gly Ile Lys Asn Pro Ile Pro Ser Ser Gln Asp
450 455 460
Tyr Ser His Arg Leu Ser Asn Ala Ala Cys Val Gln Phe Asn Asp Ser
465 470 475 480
Arg Ile Asn Val Tyr Gly Trp Thr His Thr Ser Met Thr Lys Asn Asn
485 490 495
Pro Ile Tyr Thr Asp Lys Ile Ser Gln Ile Pro Ala Val Lys Ala Phe
500 505 510
Ala Leu Glu Ser Gly Ala Tyr Val Ser Arg Gly Pro Gly His Thr Gly
515 520 525
Gly Asp Val Val Thr Leu Pro Asn Arg Gly Arg Leu Lys Ile Arg Leu
530 535 540
Thr Ser Ala Pro Thr Asn Lys Asn Tyr Arg Val Arg Val Arg Tyr Ala
545 550 555 560
Thr Ser Tyr Gly Ala Asp Leu Met Val Gln Arg Trp Ser Pro Ser Gly
565 570 575
Asn His Ala Ser Gly Tyr Phe Ser Gly Ser Pro Thr Gly Ser Tyr Pro
580 585 590
Thr Phe Gly Tyr Met Asn Thr Leu Val Thr Thr Phe Asn Gln Ser Gly
595 600 605
Val Glu Ile Ile Ile Glu Asn Arg His Tyr Ser Asn Ile Ile Ile Asp
610 615 620
Lys Ile Glu Phe Ile Pro Asp Asp Ile Thr Thr Leu Glu Tyr Glu Gly
625 630 635 640
Glu Arg Asp Leu Glu Lys Thr Lys Asn Ala Val Asn Asp Leu Phe Thr
645 650 655
Asn
<210> 90
<211> 631
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 90
Val Lys Asn Met Asn Ser Tyr Gln Asn Thr Asn Glu Tyr Glu Ile Leu
1 5 10 15
Asp Gly Ser Pro Asn Asn Thr Asn Met Ser Asn Arg Tyr Pro Phe Ala
20 25 30
Lys Asp Pro Asn Ile Phe Pro Ile Asn Leu Asp Ala Cys Gln Gly Arg
35 40 45
Pro Trp Gln Asp Thr Trp Glu Ser Val Ser Asp Ile Val Thr Ile Gly
50 55 60
Thr Tyr Leu Ile Gln Phe Leu Leu Glu Pro Gly Ile Gly Gly Ile Pro
65 70 75 80
Val Ile Leu Ser Ile Ile Asn Lys Leu Ile Pro Ser Ser Gly Gln Ser
85 90 95
Val Ala Ala Leu Ser Ile Cys Asp Leu Leu Ser Ile Ile Arg Lys Glu
100 105 110
Val Asp Glu Ser Val Leu Ser Asp Gly Val Ala Asp Phe Lys Gly Glu
115 120 125
Met Thr Ala Tyr Arg Asp Tyr Tyr Leu Pro Tyr Leu Glu Asp Trp Leu
130 135 140
Thr Asp Lys Ser Asn Pro Glu Lys Leu Ala Asp Val Val Lys Gln Phe
145 150 155 160
Gln Ala Arg Glu Glu Asp Phe Thr Lys Leu Leu Ala Gly Ser Leu Ser
165 170 175
Arg Gln Lys Ala Glu Ile Leu Leu Leu Pro Thr Tyr Val Gln Ala Ala
180 185 190
Asn Val His Leu Leu Leu Leu Arg Asp Ala Val Lys Tyr Lys Lys Glu
195 200 205
Trp Gly Leu Val Cys Pro Pro Leu Tyr Pro Gly Ser Gly Arg Thr Asp
210 215 220
Cys Asn Glu Arg Leu Lys Ala Lys Ile Lys Glu Tyr Thr Asn Tyr Cys
225 230 235 240
Val Glu Trp Tyr Asn Lys Gly Leu Asp Gln Ile Lys Gln Ala Gly Thr
245 250 255
Ser Thr Glu Thr Trp Leu Lys Phe Asn Lys Phe Arg Arg Glu Met Thr
260 265 270
Leu Ala Val Leu Asp Val Ile Ala Ile Phe Pro Thr Tyr Asp Phe Glu
275 280 285
Lys Tyr Pro Leu Ala Thr Asn Val Glu Leu Thr Arg Glu Val Tyr Thr
290 295 300
Asp Pro Val Gly Tyr Ser Gly Ser Ser Ile Arg Trp Glu Arg Thr Phe
305 310 315 320
Pro Asn Ala Phe Asn Thr Leu Glu Ala Asn Gly Thr Arg Gly Pro Gly
325 330 335
Leu Val Thr Trp Leu Glu Val Leu Gly Ile Tyr Asn His Asp Phe Gln
340 345 350
Asp Ile Thr Val Tyr Phe Ser Gly Trp Val Gly Ser Arg His Ser Glu
355 360 365
Asp Tyr Thr Arg Gly Asn Gly Thr Phe Ser Arg Ile Ser Gly Thr Thr
370 375 380
Ser Asn Asp Ile Arg Asn Gln Thr Phe Tyr Ser Arg Asp Val Phe Lys
385 390 395 400
Ile Asp Ser Leu Gly Ile Tyr Glu Thr Arg Ala Glu Leu Gly Tyr Thr
405 410 415
Arg Gln Arg Phe Cys Val Ser Arg Ala Val Phe Ser Ser Thr Leu Tyr
420 425 430
Asn Tyr Val Tyr Asp Ala Arg Asn Asn Gly Gln Gly Arg Met Thr Ile
435 440 445
Glu Ser Met Leu Pro Gly Ile Lys Asn Pro Ile Pro Ser Ser Gln Asp
450 455 460
Tyr Ser His Arg Leu Ser Asn Ala Ala Cys Val Gln Phe Asn Asp Ser
465 470 475 480
Arg Ile Asn Val Tyr Gly Trp Thr His Thr Ser Met Thr Lys Asn Asn
485 490 495
Pro Ile Tyr Thr Asp Lys Ile Ser Gln Ile Pro Ala Val Lys Ala Phe
500 505 510
Ala Leu Glu Ser Gly Ala Tyr Val Ser Arg Gly Pro Gly His Thr Gly
515 520 525
Gly Asp Val Val Thr Leu Pro Asn Arg Gly Arg Leu Lys Ile Arg Leu
530 535 540
Thr Ser Ala Pro Thr Asn Lys Asn Tyr Arg Val Arg Val Arg Tyr Ala
545 550 555 560
Thr Ser Tyr Gly Ala Asp Leu Met Val Gln Arg Trp Ser Pro Ser Gly
565 570 575
Asn His Ala Ser Gly Tyr Phe Ser Gly Ser Pro Thr Gly Ser Tyr Pro
580 585 590
Thr Phe Gly Tyr Met Asn Thr Leu Val Thr Thr Phe Asn Gln Ser Gly
595 600 605
Val Glu Ile Ile Ile Glu Asn Arg His Tyr Ser Asn Ile Ile Ile Asp
610 615 620
Lys Ile Glu Phe Ile Pro Asp
625 630
<210> 91
<211> 568
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 91
Leu Leu Phe Asn Lys Tyr Ala Lys Lys Val Val Ser Pro Met Phe Thr
1 5 10 15
Ser Gly Thr Lys Asn Thr Leu Lys Ile Glu Thr Thr Asp Tyr Glu Ile
20 25 30
Asp Gln Ala Ala Ile Ser Ile Glu Cys Met Ser Asp Glu His Ser Pro
35 40 45
Lys Glu Lys Met Met Leu Trp Asp Glu Val Lys Arg Ala Lys Gln Phe
50 55 60
Ser Gln Ser Arg Asn Leu Leu Gln Asn Gly Asp Phe Gly Asp Phe Pro
65 70 75 80
Arg Asn Asp Trp Thr Phe Gly Asn Asp Ile Ile Ile Gly Ser Asn Asn
85 90 95
Ser Ile Phe Lys Gly Asn Phe Leu Gln Met Ser Gly Ala Arg Asp Ile
100 105 110
Tyr Gly Thr Ile Phe Pro Thr Tyr Ile Tyr Gln Lys Ile Asp Glu Ser
115 120 125
Lys Leu Lys Thr Tyr Thr Arg Tyr Arg Val Arg Gly Val Val Gly Ser
130 135 140
Ser Lys Asp Leu Arg Leu Met Val Thr Arg Tyr Gly Lys Glu Ile Asp
145 150 155 160
Ala Ile Met Asn Val Pro Asn Asp Leu Val Tyr Thr Gln Pro Asn Pro
165 170 175
Ser Cys Gly Asp Tyr Arg Cys Glu Ser Ser Ser Gln Tyr Val Asn Gln
180 185 190
Gly Tyr Pro Thr Pro Tyr Thr Asp Gly Tyr Gly Val Gly Tyr Pro Ser
195 200 205
Asn Gln Gly Glu Lys Arg Val Lys Cys His Asp Arg His Ser Phe Asp
210 215 220
Phe His Ile Asp Thr Gly Glu Leu Asp Ile Asn Thr Asn Leu Gly Ile
225 230 235 240
Leu Val Leu Phe Lys Ile Ser Asn Pro Asp Gly Tyr Ala Thr Leu Gly
245 250 255
Asn Leu Glu Val Ile Glu Glu Gly Pro Leu Thr Asp Glu Ala Leu Ala
260 265 270
His Val Lys Gln Lys Glu Lys Lys Trp Asn Gln His Leu Glu Lys Ala
275 280 285
Arg Met Glu Thr Gln Gln Ala Asn Glu Pro Ala Lys Gln Ala Val Asp
290 295 300
Thr Leu Phe Thr Asn Glu Gln Glu Leu His Tyr His Ile Thr Leu Asp
305 310 315 320
His Ile Gln Asn Ala Asp Arg Leu Val Gln Leu Ile Pro Tyr Val His
325 330 335
His Ala Trp Leu Pro Asp Ala Pro Gly Met Asn Tyr Asp Val Tyr Gln
340 345 350
Gly Leu Asn Ala Arg Ile Met Gln Ala Tyr Asn Leu Tyr Asp Ala Arg
355 360 365
Asn Val Ile Thr Asn Gly Asp Phe Thr Gln Gly Leu Gln Gly Trp His
370 375 380
Ala Ala Gly Lys Ala Thr Val Gln Gln Met Asp Gly Ser Ser Val Leu
385 390 395 400
Val Leu Ser Asn Trp Ser Ala Gly Val Ser Gln Asn Leu His Ala Gln
405 410 415
Asp His His Gly Tyr Met Leu Arg Val Ile Ala Lys Lys Glu Gly Pro
420 425 430
Gly Lys Gly Tyr Val Thr Met Met Asp Cys Asn Gly Asn Gln Glu Thr
435 440 445
Leu Lys Phe Thr Ser Cys Glu Glu Gly Tyr Met Thr Lys Thr Val Glu
450 455 460
Val Phe Pro Asp Ser Asp Arg Val Arg Ile Glu Ile Gly Glu Thr Glu
465 470 475 480
Gly Thr Phe Tyr Ile Asp Ser Ile Glu Leu Ile Cys Met Gln Gly Tyr
485 490 495
Ala Ser Asn Asn Asn Pro His Thr Gly Asn Met Tyr Glu Gln Ser Tyr
500 505 510
Thr Arg Asn Phe Asn Gln Asn Thr Ser Asp Val Tyr His Gln Gly Tyr
515 520 525
Thr Asn Asn Tyr Asn Gln Asp Ser Ser Asn Met Tyr Asn Gln Asn Tyr
530 535 540
Thr Asn Asn Asp Asn Leu His Ser Gly Cys Thr Cys Asn Gln Gly His
545 550 555 560
Asn Ser Gly Cys Thr Cys Asn Arg
565
<210> 92
<211> 711
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 92
Met Asn Gln Asn Asn Asn Glu Tyr Glu Ile Ala Asp Val Asn Pro Pro
1 5 10 15
Ser Tyr Pro Ser Asn Arg Ile Asn Asn Tyr Ser Arg Tyr Pro Phe Ala
20 25 30
Asn Asp Pro Asn Gln Leu Leu Gln Gln Thr Asn Tyr Lys Asp Trp Leu
35 40 45
Asn Ile Cys Gln Gly Asn Asn Gln Asn Ser Gly Asn Leu Ile Thr Asp
50 55 60
Ala Gly Ala Ala Val Ser Ala Gly Ile Ile Val Thr Gly Thr Met Ile
65 70 75 80
Ala Ala Phe Ala Ala Leu Ala Val Ala Pro Phe Pro Ile Val Gly Ala
85 90 95
Ile Val Gly Gly Ile Val Ile Ala Phe Gly Thr Leu Leu Pro Leu Val
100 105 110
Trp Pro Gly Gly Lys Glu Asp Arg Thr Val Trp Thr Gln Phe Phe Ala
115 120 125
Leu Gly Glu Thr Leu Leu Gly Lys Arg Leu Thr Asp Asp Ala Gln Lys
130 135 140
Thr Ala Ala Glu Thr Val Asp Gly Phe Gly Arg Val Leu Asp Asn Tyr
145 150 155 160
Lys Val Ala Leu Asn Ser Trp Met Asp Leu Lys Lys Lys Gln Ser Pro
165 170 175
Gly Ser Pro Pro Thr Thr Glu Leu Lys Glu Ala Ala Asp Asn Val Lys
180 185 190
Gln Arg Phe Glu Ile Val His Asn Glu Phe Asp Ala Asn Met Pro His
195 200 205
Leu Gly Asn Pro Lys Ser Asn Tyr Lys Glu Val Leu Leu Ser Ser Tyr
210 215 220
Ala His Ala Ala Asn Leu His Leu Asn Leu Leu Gln Gln Gly Val Arg
225 230 235 240
Phe Ala Asp Gln Trp Asn Gln Asp Ile Tyr Ser Ser Gln Ile Val Pro
245 250 255
Ala Ala Gly Thr Ser Thr Glu Tyr Leu Lys Phe Leu Gln Ala Arg Ile
260 265 270
Gln Glu Tyr Val Asn Tyr Cys Thr Ala Thr Tyr Arg Ser Gly Leu Asn
275 280 285
Ile Leu Lys Asn Ser Asn Asp Ile Thr Trp Asp Ile Tyr Asn Thr Tyr
290 295 300
Arg Arg Asp Met Thr Leu Thr Val Leu Asp Leu Val Ala Ala Phe Pro
305 310 315 320
Asn Tyr Asp Pro Thr Tyr Tyr Pro Ile Asp Thr Lys Ile Gln Leu Thr
325 330 335
Arg Thr Ile Tyr Ser Asn Leu Leu Gly Ile Thr Gln Ser Gly Arg Gln
340 345 350
Tyr Phe Glu Ala Leu Glu Asn Ala Leu Thr Met Arg Pro Asn Leu Tyr
355 360 365
Arg Ile Leu Arg Gly Phe Asn Phe Asn Thr Thr Phe Gly Ser Gly Val
370 375 380
Asn Tyr Leu Ser Gly Ile Ala Asn Arg Arg Gly Leu Ile Lys Thr Val
385 390 395 400
Pro Ser Glu Glu Asp Glu Gln Pro Phe Tyr Gly Gln Ser Asn Gly Arg
405 410 415
His Asp Pro Met Ile Phe Ala Asn Gly Leu Asn Ile Phe Arg Val Thr
420 425 430
Leu Leu Arg Asn Lys His Tyr Ile Ala Leu Pro Val Gly Pro Pro Leu
435 440 445
Phe Ala Ala Asn Gln Ile Thr Asp Leu Lys Leu Tyr Tyr Gly Asn Ala
450 455 460
Thr Gln Tyr His His Tyr Gln Pro Ser Asn Glu Gly Leu Thr Thr Glu
465 470 475 480
Ser Thr Asp Leu Ser Phe Pro Pro Lys Asn Glu Glu Tyr Leu Asn Ala
485 490 495
Ile Leu Met Thr Ser Pro Arg Thr Ile Tyr Glu Asn Pro Thr Asn Ile
500 505 510
Tyr Ser Phe Ala Trp Thr Lys Lys Asp Ile Asp Gln Gln Asn Asn Ile
515 520 525
Ile Ile Asn Ala Ile Thr Gln Ile Pro Ala Val Lys Gly Ile Ala Leu
530 535 540
Gly Asn Gln Ser Arg Val Ile Gln Gly Pro Gly His Thr Ser Gly Asp
545 550 555 560
Leu Val Tyr Leu Lys Asp Ser Leu Glu Leu Arg Cys Gln Tyr Thr Gly
565 570 575
Pro Gln Gln Ser Tyr Asp Val Arg Ile Arg Tyr Ala Ser Asn Gly Ile
580 585 590
Leu Asn Asp Arg Ala Met Ile Ala Ile Thr Ile Pro Gly Val Thr Gly
595 600 605
Gln Ser Ile Ser Leu Ser Glu Thr Phe Ser Gly Thr Asp Tyr Asn Val
610 615 620
Leu Glu Phe Gln Asn Phe Lys Ser Val Gln Phe Thr Ser Ala Ile Arg
625 630 635 640
Leu Asn Pro Ser Gln Asn Ile Phe Ile Tyr Leu Asn Arg Leu Asp Gln
645 650 655
Asn Ala Asn Thr Thr Leu Leu Ile Asp Lys Ile Glu Phe Ile Pro His
660 665 670
Pro Leu Pro Arg Ile Leu Ser Glu Gln Lys Leu Lys Lys Val Lys Gln
675 680 685
Lys Val Asn Asp Leu Phe Ile Asp Ala Glu Asn Gly Cys Phe Cys Pro
690 695 700
Ser His Glu Lys Asp Leu Arg
705 710
<210> 93
<211> 672
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 93
Met Asn Gln Asn Asn Asn Glu Tyr Glu Ile Ala Asp Val Asn Pro Pro
1 5 10 15
Ser Tyr Pro Ser Asn Arg Ile Asn Asn Tyr Ser Arg Tyr Pro Phe Ala
20 25 30
Asn Asp Pro Asn Gln Leu Leu Gln Gln Thr Asn Tyr Lys Asp Trp Leu
35 40 45
Asn Ile Cys Gln Gly Asn Asn Gln Asn Ser Gly Asn Leu Ile Thr Asp
50 55 60
Ala Gly Ala Ala Val Ser Ala Gly Ile Ile Val Thr Gly Thr Met Ile
65 70 75 80
Ala Ala Phe Ala Ala Leu Ala Val Ala Pro Phe Pro Ile Val Gly Ala
85 90 95
Ile Val Gly Gly Ile Val Ile Ala Phe Gly Thr Leu Leu Pro Leu Val
100 105 110
Trp Pro Gly Gly Lys Glu Asp Arg Thr Val Trp Thr Gln Phe Phe Ala
115 120 125
Leu Gly Glu Thr Leu Leu Gly Lys Arg Leu Thr Asp Asp Ala Gln Lys
130 135 140
Thr Ala Ala Glu Thr Val Asp Gly Phe Gly Arg Val Leu Asp Asn Tyr
145 150 155 160
Lys Val Ala Leu Asn Ser Trp Met Asp Leu Lys Lys Lys Gln Ser Pro
165 170 175
Gly Ser Pro Pro Thr Thr Glu Leu Lys Glu Ala Ala Asp Asn Val Lys
180 185 190
Gln Arg Phe Glu Ile Val His Asn Glu Phe Asp Ala Asn Met Pro His
195 200 205
Leu Gly Asn Pro Lys Ser Asn Tyr Lys Glu Val Leu Leu Ser Ser Tyr
210 215 220
Ala His Ala Ala Asn Leu His Leu Asn Leu Leu Gln Gln Gly Val Arg
225 230 235 240
Phe Ala Asp Gln Trp Asn Gln Asp Ile Tyr Ser Ser Gln Ile Val Pro
245 250 255
Ala Ala Gly Thr Ser Thr Glu Tyr Leu Lys Phe Leu Gln Ala Arg Ile
260 265 270
Gln Glu Tyr Val Asn Tyr Cys Thr Ala Thr Tyr Arg Ser Gly Leu Asn
275 280 285
Ile Leu Lys Asn Ser Asn Asp Ile Thr Trp Asp Ile Tyr Asn Thr Tyr
290 295 300
Arg Arg Asp Met Thr Leu Thr Val Leu Asp Leu Val Ala Ala Phe Pro
305 310 315 320
Asn Tyr Asp Pro Thr Tyr Tyr Pro Ile Asp Thr Lys Ile Gln Leu Thr
325 330 335
Arg Thr Ile Tyr Ser Asn Leu Leu Gly Ile Thr Gln Ser Gly Arg Gln
340 345 350
Tyr Phe Glu Ala Leu Glu Asn Ala Leu Thr Met Arg Pro Asn Leu Tyr
355 360 365
Arg Ile Leu Arg Gly Phe Asn Phe Asn Thr Thr Phe Gly Ser Gly Val
370 375 380
Asn Tyr Leu Ser Gly Ile Ala Asn Arg Arg Gly Leu Ile Lys Thr Val
385 390 395 400
Pro Ser Glu Glu Asp Glu Gln Pro Phe Tyr Gly Gln Ser Asn Gly Arg
405 410 415
His Asp Pro Met Ile Phe Ala Asn Gly Leu Asn Ile Phe Arg Val Thr
420 425 430
Leu Leu Arg Asn Lys His Tyr Ile Ala Leu Pro Val Gly Pro Pro Leu
435 440 445
Phe Ala Ala Asn Gln Ile Thr Asp Leu Lys Leu Tyr Tyr Gly Asn Ala
450 455 460
Thr Gln Tyr His His Tyr Gln Pro Ser Asn Glu Gly Leu Thr Thr Glu
465 470 475 480
Ser Thr Asp Leu Ser Phe Pro Pro Lys Asn Glu Glu Tyr Leu Asn Ala
485 490 495
Ile Leu Met Thr Ser Pro Arg Thr Ile Tyr Glu Asn Pro Thr Asn Ile
500 505 510
Tyr Ser Phe Ala Trp Thr Lys Lys Asp Ile Asp Gln Gln Asn Asn Ile
515 520 525
Ile Ile Asn Ala Ile Thr Gln Ile Pro Ala Val Lys Gly Ile Ala Leu
530 535 540
Gly Asn Gln Ser Arg Val Ile Gln Gly Pro Gly His Thr Ser Gly Asp
545 550 555 560
Leu Val Tyr Leu Lys Asp Ser Leu Glu Leu Arg Cys Gln Tyr Thr Gly
565 570 575
Pro Gln Gln Ser Tyr Asp Val Arg Ile Arg Tyr Ala Ser Asn Gly Ile
580 585 590
Leu Asn Asp Arg Ala Met Ile Ala Ile Thr Ile Pro Gly Val Thr Gly
595 600 605
Gln Ser Ile Ser Leu Ser Glu Thr Phe Ser Gly Thr Asp Tyr Asn Val
610 615 620
Leu Glu Phe Gln Asn Phe Lys Ser Val Gln Phe Thr Ser Ala Ile Arg
625 630 635 640
Leu Asn Pro Ser Gln Asn Ile Phe Ile Tyr Leu Asn Arg Leu Asp Gln
645 650 655
Asn Ala Asn Thr Thr Leu Leu Ile Asp Lys Ile Glu Phe Ile Pro His
660 665 670
<210> 94
<211> 657
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 94
Met Asn Gln Tyr Tyr Asn Asn Asn Glu Tyr Glu Ile Leu Asp Asn Gly
1 5 10 15
Val Gln Cys Cys Ser Pro Arg Tyr Pro Leu Ala Gln Ala Pro Gly Ser
20 25 30
Glu Trp Gln Asn Met Asn Tyr Gln Asp Val Met Asn Thr Ser Val Gly
35 40 45
Arg Glu Tyr Gln Ala Val Gln Gln Val Asn Val Gly Glu Ala Val Gly
50 55 60
Ala Ala Leu Gly Ile Leu Thr Thr Ile Leu Lys Ala Val Asn Pro Thr
65 70 75 80
Leu Gly Thr Ala Ala Gly Val Ile Ser Ser Ile Phe Gly Phe Leu Trp
85 90 95
Lys Arg Phe Gly Thr Asp Pro Gln Ala Gln Trp Lys Gln Phe Met Glu
100 105 110
Ala Val Glu Tyr Leu Val Ser Gln Lys Ile Thr Asp Ala Val Arg Ser
115 120 125
Lys Ala Val Ser Glu Leu Glu Gly Val Gln Arg Ala Val Glu Leu Tyr
130 135 140
Gln Glu Ala Ala Asn Ala Trp Asn Met Asn Pro Asp Asn Ala Ala Ala
145 150 155 160
Lys Glu Arg Ile Arg Arg Gln Tyr Thr Ser Thr Asn Thr Val Ile Glu
165 170 175
Tyr Ala Met Pro Ser Phe Arg Val Gly Ser Phe Glu Val Pro Leu Leu
180 185 190
Thr Val Tyr Ala Gln Ala Ala Asn Leu His Leu Leu Leu Leu Arg Asp
195 200 205
Ala Val Lys Phe Gly Ala Gly Trp Gly Phe Ser Pro Thr Glu Val Glu
210 215 220
Asp Asn Tyr Thr Arg Leu Gln Ala Arg Thr Ala Glu Tyr Thr Asn His
225 230 235 240
Cys Thr Asn Thr Tyr Asp Lys Gly Leu Lys Gln Ala Tyr Asp Leu Ala
245 250 255
Pro Asn Pro Thr Asp Tyr Asn Lys Tyr Pro Tyr Leu Asn Pro Tyr Ser
260 265 270
Lys Asp Pro Ile Tyr Gly Lys Tyr Tyr Thr Ala Pro Val Asp Trp Asn
275 280 285
Leu Phe Asn Asp Phe Arg Arg Asp Met Thr Leu Met Val Leu Asp Ile
290 295 300
Val Ala Val Trp Pro Thr Tyr Asn Pro Arg Leu Tyr Asn Asn Pro Asn
305 310 315 320
Gly Val Gln Ile Gln Leu Ser Arg Glu Val Tyr Ser Thr Val Tyr Gly
325 330 335
Arg Gly Trp Ser Asn Asn Ser Thr Val Asp Ala Ile Glu Ser Thr Leu
340 345 350
Val Arg Pro Pro His Leu Val Thr Glu Leu Thr Lys Leu Thr Phe Asp
355 360 365
Glu Arg Asn Leu Tyr Glu Ala Glu Thr Val Pro Val Ala Phe Arg Lys
370 375 380
Val Thr Asn Thr Leu His Tyr Val Gly Ser Ser Thr Thr Trp Glu Gln
385 390 395 400
Ser Phe Ser Ala Thr Pro Val Gly Ser Leu Lys Ala Val His Asn Val
405 410 415
Leu Ala Thr Asp Ile Gly Asn Leu Ala Leu Ser Leu Gly Ala Val Pro
420 425 430
Leu Gly Phe Ser Phe Tyr Asn Lys Asn Asp Gln His Ile Thr Thr Val
435 440 445
Gly Tyr Ser Gly Gly Trp Trp Asn Gly Ile Pro Lys Asp Glu Gly Ser
450 455 460
Asn Gln Asn Ser His His Leu Ser Tyr Val Ala Ala Leu Glu Thr Gln
465 470 475 480
Thr Thr Ala Gly Trp Phe Pro Thr Tyr Pro Val Pro Leu Leu Gly Glu
485 490 495
Trp Gly Phe Gly Trp Gln His Asn Ser Leu Lys Tyr Thr Asn Arg Leu
500 505 510
Val Thr Asn Gln Ile Asn Gln Leu Pro Ala Val Lys Ala Ser Ala Leu
515 520 525
Val Asn His Ala Arg Val Ser Lys Gly Thr Thr Gly Thr Gly Gly Asn
530 535 540
Ile Ile Glu Leu Val Asn Pro Asn Cys Gly Phe Ile Phe Asn Phe Tyr
545 550 555 560
Ala Glu Pro Glu Asn Pro Gln Gly Gln Ala Phe Asn Ile Arg Ile Arg
565 570 575
Tyr Ser Ala Thr Val Pro Gly Lys Leu Asn Ile Arg His Leu Glu Ser
580 585 590
Gly Lys Glu Phe Met Asn Thr Pro Arg Asp Phe Lys Gly Thr Thr Thr
595 600 605
Gly Thr Asn Trp Ile Pro Lys Phe Gln Glu Tyr Gln Tyr Leu Asp Leu
610 615 620
Gly Pro Ile Val Leu Arg Thr Gly Met Asn Tyr Gly Phe Lys Ile Glu
625 630 635 640
Leu Ala Ser Gly Gln Leu Val Ser Ile Asp Lys Ile Glu Phe Leu Pro
645 650 655
Phe
<210> 95
<211> 820
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 95
Met Ala Gln Leu Pro Glu Leu Gln Ala Gln Ser Phe Ile Pro Tyr Asn
1 5 10 15
Val Leu Ala Asn Pro Val Thr Gly Asn Ser Ala Gly Thr Asp Trp Gly
20 25 30
Glu Trp Leu Ser Ser Leu Lys Glu Ala Trp Thr Glu Phe Gln Lys Thr
35 40 45
Gly Ser Asp Lys Phe Leu Gln Ser Val Leu Ser Thr Gly Tyr Asp Ala
50 55 60
Ala Gln Gly Gly Ser Phe Asn Tyr Ile Ala Val Leu Gln Ala Gly Val
65 70 75 80
Ala Ile Leu Gly Thr Val Leu Gly Ala Glu Phe Pro Gly Ile Thr Ala
85 90 95
Ile Ala Val Pro Ala Leu Thr Phe Leu Ile Lys Trp Val Trp Pro Lys
100 105 110
Leu Thr Gly Gly Ser Ala Gly Glu Ser Asp Thr Gln Lys Met Leu Asp
115 120 125
Leu Ile Asp Lys Glu Ile Gln Gln Lys Leu Asp Gln Ala Leu Ser Gln
130 135 140
Gln Asp Ile Asn Asn Trp Arg Ala Tyr Leu Thr Ser Ile Phe Glu Ala
145 150 155 160
Ala Thr Asn Leu Ser Gln Ser Ile Ile Asp Ala Gln Phe Thr Gly Thr
165 170 175
Glu Arg Ser Ser Asp Arg Gln Pro Arg Val Pro Thr Ser Ser Asp Tyr
180 185 190
Glu Asp Val Tyr Thr Lys Leu Asn Ile Ala Asp Ala Val Leu Ser Pro
195 200 205
Ser Thr Asn Asn Ile Leu Thr Gly Asn Phe Asp Val Leu Ala Ile Pro
210 215 220
Tyr Tyr Thr Ile Gly Ala Thr Val Arg Ile Thr Ala Tyr Gln Ser Phe
225 230 235 240
Val Lys Phe Ala Asn Gln Trp Ile Asp Val Val Tyr Pro Asp Gln Thr
245 250 255
Ser Asn Thr Trp Leu Lys Ala His Gln Asn Leu Asp Asn Ala Lys Asp
260 265 270
Lys Met Ile Ala Met Ile Gln Gln Tyr Thr Ala Asn Val Leu Arg Val
275 280 285
Phe Thr Asp Pro Lys Asn Met Pro Pro Leu Gly Lys Asp Lys Lys Ser
290 295 300
Ile Asn Ala Tyr Asn Lys Phe Val Arg Gly Met Val Ile Ser Gly Leu
305 310 315 320
Asp Leu Val Ala Val Trp Pro Ser Leu Tyr Pro Asp Asp Tyr Gln Val
325 330 335
Gln Thr Asp Leu Glu Lys Thr Arg Val Ile Phe Ser Asp Leu Ile Gly
340 345 350
Lys Asp Gln Thr Ile Ser Asn Asp Val Thr Leu Ile Asn Met Ala Asp
355 360 365
Gly Lys Asp Gly Ser Gly Ala Phe Asn Gln His Ala Ala Ile Asp Leu
370 375 380
Asn Ser Ile Phe Tyr Tyr His Glu Glu Leu Ser Ser Met Gln Leu Gly
385 390 395 400
Ile His Asp Gly Ser Arg Gly Asn Thr Thr Asn Cys Tyr Asp Tyr Gly
405 410 415
Ile Ile Ala Asn Tyr Pro Asn His Ser Tyr Phe Tyr Gly Asp Leu Tyr
420 425 430
Ser Pro Ser Gly Asp Ala Thr Phe Ser Ala Pro Leu Asn Val Leu Asn
435 440 445
Ala Gln Thr Gln Tyr Ser Ser Gly Tyr Val Asp Ser Glu Phe Ile Tyr
450 455 460
Asn Ser Asp Pro Ser Ala Tyr Gly Leu Gly Cys His Ile Leu Gly Gly
465 470 475 480
Cys Thr Lys Gly Asp Cys Tyr Asn Gln Ser Ser Cys Ser Glu Gly Phe
485 490 495
Gly Tyr Ser Cys Asn Thr Ser Leu Pro Ser Gln Lys Ile Asn Ala Phe
500 505 510
Tyr Pro Phe Lys Met Asn Leu Asn Arg Ser Gly Thr Ser Asp Lys Leu
515 520 525
Gly Leu Met Tyr Ser Leu Val Pro Leu Asp Leu Ser Pro Asn Asn Val
530 535 540
Phe Gly Glu Leu Asp Ser Asp Thr Arg Asn Val Ile Gly Lys Gly Phe
545 550 555 560
Pro Ala Glu Lys Gly Thr Leu Ser Tyr Gly Arg Gln Pro Met Leu Val
565 570 575
Arg Glu Trp Val Asn Gly Ala Asn Ala Val Lys Leu Ser Asp Val Pro
580 585 590
Leu Thr Leu Gln Met Thr Asn Met Thr Ser Gly Lys Tyr Tyr Ile Arg
595 600 605
Val Arg Tyr Ala Asn Thr Ser Asn Ala Glu Ile Ser Ile Asn Gln Gln
610 615 620
Val Tyr Ala Gly Asn Gln Asn Val Gln Asn Gly Gly Trp Ser Leu Pro
625 630 635 640
Ser Thr Thr Ala Ser Asn Val Ala Thr Asn Phe Pro Asn Gln Leu Tyr
645 650 655
Ile Thr Gly Ile Asn Gly Asn Tyr Val Leu Glu Asp Ile Thr Trp Gly
660 665 670
Val Asp Gly Gln Gly Lys Pro Ile Thr Ile Pro Ser Gly Asp Val Thr
675 680 685
Val Thr Leu Ser Arg Thr Asn Met Ser Gln Ser Gly Asp Leu Phe Ile
690 695 700
Asp Arg Val Glu Leu Val Pro Ala Ala Pro Gln Ser Asp Tyr Ile Asn
705 710 715 720
Tyr Ser Ile Thr Ser Gln Ile Pro Asn Cys Asn Asn Phe Asp Thr Ile
725 730 735
Ile Trp Asp Gly Asn Gly Ala Gln Arg Ser Thr Ala Thr Val Thr Val
740 745 750
Ser Gly Leu Pro Asp Asn Asn Gly Ile Leu Ile Ser Gly Leu Ser Ser
755 760 765
Tyr Gly Trp Gln Pro Leu Gln Asp Val Asn Gly His Gly Asn Asn Trp
770 775 780
Val Thr Asn Gly Gly Pro Phe Thr Leu Ser Asn Asp His Thr Phe Thr
785 790 795 800
Ala Ile Ser Ile Cys Gly Phe Gln Asn Val Ser Ser Gly Lys Ile Thr
805 810 815
Gly Lys Leu Ser
820
<210> 96
<211> 712
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 96
Met Ala Gln Leu Pro Glu Leu Gln Ala Gln Ser Phe Ile Pro Tyr Asn
1 5 10 15
Val Leu Ala Asn Pro Val Thr Gly Asn Ser Ala Gly Thr Asp Trp Gly
20 25 30
Glu Trp Leu Ser Ser Leu Lys Glu Ala Trp Thr Glu Phe Gln Lys Thr
35 40 45
Gly Ser Asp Lys Phe Leu Gln Ser Val Leu Ser Thr Gly Tyr Asp Ala
50 55 60
Ala Gln Gly Gly Ser Phe Asn Tyr Ile Ala Val Leu Gln Ala Gly Val
65 70 75 80
Ala Ile Leu Gly Thr Val Leu Gly Ala Glu Phe Pro Gly Ile Thr Ala
85 90 95
Ile Ala Val Pro Ala Leu Thr Phe Leu Ile Lys Trp Val Trp Pro Lys
100 105 110
Leu Thr Gly Gly Ser Ala Gly Glu Ser Asp Thr Gln Lys Met Leu Asp
115 120 125
Leu Ile Asp Lys Glu Ile Gln Gln Lys Leu Asp Gln Ala Leu Ser Gln
130 135 140
Gln Asp Ile Asn Asn Trp Arg Ala Tyr Leu Thr Ser Ile Phe Glu Ala
145 150 155 160
Ala Thr Asn Leu Ser Gln Ser Ile Ile Asp Ala Gln Phe Thr Gly Thr
165 170 175
Glu Arg Ser Ser Asp Arg Gln Pro Arg Val Pro Thr Ser Ser Asp Tyr
180 185 190
Glu Asp Val Tyr Thr Lys Leu Asn Ile Ala Asp Ala Val Leu Ser Pro
195 200 205
Ser Thr Asn Asn Ile Leu Thr Gly Asn Phe Asp Val Leu Ala Ile Pro
210 215 220
Tyr Tyr Thr Ile Gly Ala Thr Val Arg Ile Thr Ala Tyr Gln Ser Phe
225 230 235 240
Val Lys Phe Ala Asn Gln Trp Ile Asp Val Val Tyr Pro Asp Gln Thr
245 250 255
Ser Asn Thr Trp Leu Lys Ala His Gln Asn Leu Asp Asn Ala Lys Asp
260 265 270
Lys Met Ile Ala Met Ile Gln Gln Tyr Thr Ala Asn Val Leu Arg Val
275 280 285
Phe Thr Asp Pro Lys Asn Met Pro Pro Leu Gly Lys Asp Lys Lys Ser
290 295 300
Ile Asn Ala Tyr Asn Lys Phe Val Arg Gly Met Val Ile Ser Gly Leu
305 310 315 320
Asp Leu Val Ala Val Trp Pro Ser Leu Tyr Pro Asp Asp Tyr Gln Val
325 330 335
Gln Thr Asp Leu Glu Lys Thr Arg Val Ile Phe Ser Asp Leu Ile Gly
340 345 350
Lys Asp Gln Thr Ile Ser Asn Asp Val Thr Leu Ile Asn Met Ala Asp
355 360 365
Gly Lys Asp Gly Ser Gly Ala Phe Asn Gln His Ala Ala Ile Asp Leu
370 375 380
Asn Ser Ile Phe Tyr Tyr His Glu Glu Leu Ser Ser Met Gln Leu Gly
385 390 395 400
Ile His Asp Gly Ser Arg Gly Asn Thr Thr Asn Cys Tyr Asp Tyr Gly
405 410 415
Ile Ile Ala Asn Tyr Pro Asn His Ser Tyr Phe Tyr Gly Asp Leu Tyr
420 425 430
Ser Pro Ser Gly Asp Ala Thr Phe Ser Ala Pro Leu Asn Val Leu Asn
435 440 445
Ala Gln Thr Gln Tyr Ser Ser Gly Tyr Val Asp Ser Glu Phe Ile Tyr
450 455 460
Asn Ser Asp Pro Ser Ala Tyr Gly Leu Gly Cys His Ile Leu Gly Gly
465 470 475 480
Cys Thr Lys Gly Asp Cys Tyr Asn Gln Ser Ser Cys Ser Glu Gly Phe
485 490 495
Gly Tyr Ser Cys Asn Thr Ser Leu Pro Ser Gln Lys Ile Asn Ala Phe
500 505 510
Tyr Pro Phe Lys Met Asn Leu Asn Arg Ser Gly Thr Ser Asp Lys Leu
515 520 525
Gly Leu Met Tyr Ser Leu Val Pro Leu Asp Leu Ser Pro Asn Asn Val
530 535 540
Phe Gly Glu Leu Asp Ser Asp Thr Arg Asn Val Ile Gly Lys Gly Phe
545 550 555 560
Pro Ala Glu Lys Gly Thr Leu Ser Tyr Gly Arg Gln Pro Met Leu Val
565 570 575
Arg Glu Trp Val Asn Gly Ala Asn Ala Val Lys Leu Ser Asp Val Pro
580 585 590
Leu Thr Leu Gln Met Thr Asn Met Thr Ser Gly Lys Tyr Tyr Ile Arg
595 600 605
Val Arg Tyr Ala Asn Thr Ser Asn Ala Glu Ile Ser Ile Asn Gln Gln
610 615 620
Val Tyr Ala Gly Asn Gln Asn Val Gln Asn Gly Gly Trp Ser Leu Pro
625 630 635 640
Ser Thr Thr Ala Ser Asn Val Ala Thr Asn Phe Pro Asn Gln Leu Tyr
645 650 655
Ile Thr Gly Ile Asn Gly Asn Tyr Val Leu Glu Asp Ile Thr Trp Gly
660 665 670
Val Asp Gly Gln Gly Lys Pro Ile Thr Ile Pro Ser Gly Asp Val Thr
675 680 685
Val Thr Leu Ser Arg Thr Asn Met Ser Gln Ser Gly Asp Leu Phe Ile
690 695 700
Asp Arg Val Glu Leu Val Pro Ala
705 710
<210> 97
<211> 787
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 97
Met Asn Ser Tyr Gln Asn Lys Asn Glu Tyr Glu Ile Leu Asp Ala Ser
1 5 10 15
Glu Asn Thr Val Asn Ala Leu Asn Arg Tyr Pro Phe Ala Asn Asn Pro
20 25 30
Tyr Ser Ser Ile Phe Ser Ser Cys Pro Arg Ser Gly Pro Gly Asn Trp
35 40 45
Ile Asn Ile Leu Gly Asn Ala Val Ser Glu Ala Val Ser Ile Ser Gln
50 55 60
Asp Ile Ile Ser Leu Leu Thr Gln Pro Ser Ile Ser Gly Ile Ile Ser
65 70 75 80
Met Ala Phe Ser Leu Leu Ser Arg Met Ile Gly Ser Asn Gly Arg Ser
85 90 95
Ile Ser Glu Leu Ser Met Cys Asp Leu Leu Ala Ile Ile Asp Leu Arg
100 105 110
Val Asn Gln Ser Val Leu Asp Asp Gly Val Ala Asp Phe Asn Gly Ser
115 120 125
Leu Val Ile Tyr Arg Asn Tyr Leu Glu Ala Leu Gln Arg Trp Asn Asn
130 135 140
Asn Pro Asn Pro Ala Asn Ala Glu Glu Val Arg Thr Arg Phe Arg Glu
145 150 155 160
Ser Asp Thr Ile Phe Asp Leu Ile Leu Thr Gln Gly Ser Leu Thr Asn
165 170 175
Gly Gly Ser Leu Ala Arg Asn Asn Ala Gln Ile Leu Leu Leu Pro Ser
180 185 190
Phe Ala Asn Ala Ala Tyr Phe His Leu Leu Leu Leu Arg Asp Ala Asn
195 200 205
Val Tyr Gly Asn Asn Trp Gly Leu Phe Gly Val Thr Pro Asn Ile Asn
210 215 220
Tyr Glu Ser Lys Leu Leu Asn Leu Ile Arg Leu Tyr Thr Asn Tyr Cys
225 230 235 240
Thr His Trp Tyr Asn Gln Gly Leu Asn Glu Leu Arg Asn Arg Gly Ser
245 250 255
Asn Ala Thr Ala Trp Leu Glu Phe His Arg Phe Arg Arg Asp Met Thr
260 265 270
Leu Met Val Leu Asp Ile Val Ser Ser Phe Ser Ser Leu Asp Ile Thr
275 280 285
Arg Tyr Pro Arg Ala Thr Asp Phe Gln Leu Ser Arg Ile Ile Tyr Thr
290 295 300
Asp Pro Ile Gly Phe Val Asn Arg Ser Asp Pro Ser Ala Pro Arg Thr
305 310 315 320
Trp Phe Ser Phe His Asn Gln Ala Asn Phe Ser Ala Leu Glu Ser Gly
325 330 335
Ile Pro Ser Pro Ser Phe Ser Gln Phe Leu Asp Ser Met Arg Ile Ser
340 345 350
Thr Gly Pro Leu Ser Leu Pro Ala Ser Pro Asn Ile His Arg Ala Arg
355 360 365
Val Trp Tyr Gly Asn Gln Asn Asn Phe Asn Gly Ser Ser Ser Gln Thr
370 375 380
Phe Gly Glu Ile Thr Asn Asp Asn Gln Thr Ile Ser Gly Leu Asn Ile
385 390 395 400
Phe Arg Ile Asp Ser Gln Ala Val Asn Leu Asn Asn Thr Thr Phe Gly
405 410 415
Val Ser Arg Ala Glu Phe Tyr His Asp Ala Ser Gln Gly Ser Gln Arg
420 425 430
Ser Ile Tyr Gln Gly Phe Val Asp Thr Gly Gly Ala Ser Thr Ala Val
435 440 445
Ala Gln Asn Ile Gln Thr Phe Phe Pro Gly Glu Asn Ser Ser Ile Pro
450 455 460
Thr Pro Gln Asp Tyr Thr His Ile Leu Ser Arg Ser Thr Asn Leu Thr
465 470 475 480
Gly Gly Leu Arg Gln Val Ala Ser Gly Arg Arg Ser Ser Leu Val Leu
485 490 495
His Gly Trp Thr His Lys Ser Leu Ser Arg Gln Asn Arg Val Glu Pro
500 505 510
Asn Arg Ile Thr Gln Val Pro Ala Val Lys Ala Ser Ser Pro Ser Asn
515 520 525
Cys Thr Val Ile Ala Gly Pro Gly Phe Thr Gly Gly Asp Leu Val Arg
530 535 540
Met Ser Ser Asn Cys Ser Val Ser Tyr Asn Phe Thr Pro Ala Asp Gln
545 550 555 560
Gln Val Val Ile Arg Leu Arg Tyr Ala Cys Gln Gly Thr Ala Ser Leu
565 570 575
Arg Ile Thr Phe Gly Asn Gly Ser Ser Gln Ile Ile Pro Leu Val Ser
580 585 590
Thr Thr Ser Ser Ile Asn Asn Leu Gln Tyr Glu Asn Phe Ser Phe Ala
595 600 605
Ser Gly Pro Asn Ser Val Asn Phe Leu Ser Ala Gly Thr Ser Ile Thr
610 615 620
Ile Gln Asn Ile Ser Thr Asn Ser Asn Val Val Leu Asp Arg Ile Glu
625 630 635 640
Ile Val Pro Glu Gln Pro Ile Pro Ile Ile Pro Gly Asp Tyr Gln Ile
645 650 655
Val Thr Ala Leu Asn Asn Ser Ser Val Phe Asp Leu Asn Ser Gly Thr
660 665 670
Arg Val Thr Leu Trp Ser Asn Asn Arg Gly Ala His Gln Ile Trp Asn
675 680 685
Phe Met Tyr Asp Gln Gln Arg Asn Ala Tyr Val Ile Arg Asn Val Ser
690 695 700
Asn Pro Ser Leu Val Leu Thr Trp Asp Phe Thr Ser Pro Asn Ser Ile
705 710 715 720
Val Phe Ala Ala Pro Phe Ser Pro Gly Arg Gln Glu Gln Tyr Trp Ile
725 730 735
Ala Glu Ser Phe Gln Asn Ser Tyr Val Phe Glu Asn Leu Arg Asn Thr
740 745 750
Asn Met Val Leu Asp Val Ala Gly Gly Ser Thr Ala Ile Gly Thr Asn
755 760 765
Ile Ile Ala Phe Pro Arg His Asn Gly Asn Ala Gln Arg Phe Phe Ile
770 775 780
Arg Arg Pro
785
<210> 98
<211> 644
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 98
Met Asn Ser Tyr Gln Asn Lys Asn Glu Tyr Glu Ile Leu Asp Ala Ser
1 5 10 15
Glu Asn Thr Val Asn Ala Leu Asn Arg Tyr Pro Phe Ala Asn Asn Pro
20 25 30
Tyr Ser Ser Ile Phe Ser Ser Cys Pro Arg Ser Gly Pro Gly Asn Trp
35 40 45
Ile Asn Ile Leu Gly Asn Ala Val Ser Glu Ala Val Ser Ile Ser Gln
50 55 60
Asp Ile Ile Ser Leu Leu Thr Gln Pro Ser Ile Ser Gly Ile Ile Ser
65 70 75 80
Met Ala Phe Ser Leu Leu Ser Arg Met Ile Gly Ser Asn Gly Arg Ser
85 90 95
Ile Ser Glu Leu Ser Met Cys Asp Leu Leu Ala Ile Ile Asp Leu Arg
100 105 110
Val Asn Gln Ser Val Leu Asp Asp Gly Val Ala Asp Phe Asn Gly Ser
115 120 125
Leu Val Ile Tyr Arg Asn Tyr Leu Glu Ala Leu Gln Arg Trp Asn Asn
130 135 140
Asn Pro Asn Pro Ala Asn Ala Glu Glu Val Arg Thr Arg Phe Arg Glu
145 150 155 160
Ser Asp Thr Ile Phe Asp Leu Ile Leu Thr Gln Gly Ser Leu Thr Asn
165 170 175
Gly Gly Ser Leu Ala Arg Asn Asn Ala Gln Ile Leu Leu Leu Pro Ser
180 185 190
Phe Ala Asn Ala Ala Tyr Phe His Leu Leu Leu Leu Arg Asp Ala Asn
195 200 205
Val Tyr Gly Asn Asn Trp Gly Leu Phe Gly Val Thr Pro Asn Ile Asn
210 215 220
Tyr Glu Ser Lys Leu Leu Asn Leu Ile Arg Leu Tyr Thr Asn Tyr Cys
225 230 235 240
Thr His Trp Tyr Asn Gln Gly Leu Asn Glu Leu Arg Asn Arg Gly Ser
245 250 255
Asn Ala Thr Ala Trp Leu Glu Phe His Arg Phe Arg Arg Asp Met Thr
260 265 270
Leu Met Val Leu Asp Ile Val Ser Ser Phe Ser Ser Leu Asp Ile Thr
275 280 285
Arg Tyr Pro Arg Ala Thr Asp Phe Gln Leu Ser Arg Ile Ile Tyr Thr
290 295 300
Asp Pro Ile Gly Phe Val Asn Arg Ser Asp Pro Ser Ala Pro Arg Thr
305 310 315 320
Trp Phe Ser Phe His Asn Gln Ala Asn Phe Ser Ala Leu Glu Ser Gly
325 330 335
Ile Pro Ser Pro Ser Phe Ser Gln Phe Leu Asp Ser Met Arg Ile Ser
340 345 350
Thr Gly Pro Leu Ser Leu Pro Ala Ser Pro Asn Ile His Arg Ala Arg
355 360 365
Val Trp Tyr Gly Asn Gln Asn Asn Phe Asn Gly Ser Ser Ser Gln Thr
370 375 380
Phe Gly Glu Ile Thr Asn Asp Asn Gln Thr Ile Ser Gly Leu Asn Ile
385 390 395 400
Phe Arg Ile Asp Ser Gln Ala Val Asn Leu Asn Asn Thr Thr Phe Gly
405 410 415
Val Ser Arg Ala Glu Phe Tyr His Asp Ala Ser Gln Gly Ser Gln Arg
420 425 430
Ser Ile Tyr Gln Gly Phe Val Asp Thr Gly Gly Ala Ser Thr Ala Val
435 440 445
Ala Gln Asn Ile Gln Thr Phe Phe Pro Gly Glu Asn Ser Ser Ile Pro
450 455 460
Thr Pro Gln Asp Tyr Thr His Ile Leu Ser Arg Ser Thr Asn Leu Thr
465 470 475 480
Gly Gly Leu Arg Gln Val Ala Ser Gly Arg Arg Ser Ser Leu Val Leu
485 490 495
His Gly Trp Thr His Lys Ser Leu Ser Arg Gln Asn Arg Val Glu Pro
500 505 510
Asn Arg Ile Thr Gln Val Pro Ala Val Lys Ala Ser Ser Pro Ser Asn
515 520 525
Cys Thr Val Ile Ala Gly Pro Gly Phe Thr Gly Gly Asp Leu Val Arg
530 535 540
Met Ser Ser Asn Cys Ser Val Ser Tyr Asn Phe Thr Pro Ala Asp Gln
545 550 555 560
Gln Val Val Ile Arg Leu Arg Tyr Ala Cys Gln Gly Thr Ala Ser Leu
565 570 575
Arg Ile Thr Phe Gly Asn Gly Ser Ser Gln Ile Ile Pro Leu Val Ser
580 585 590
Thr Thr Ser Ser Ile Asn Asn Leu Gln Tyr Glu Asn Phe Ser Phe Ala
595 600 605
Ser Gly Pro Asn Ser Val Asn Phe Leu Ser Ala Gly Thr Ser Ile Thr
610 615 620
Ile Gln Asn Ile Ser Thr Asn Ser Asn Val Val Leu Asp Arg Ile Glu
625 630 635 640
Ile Val Pro Glu
<210> 99
<211> 846
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 99
Met Ala Gln Leu Gln Glu Leu Gln Ile Gln Pro Val Ile Pro Tyr Asn
1 5 10 15
Val Leu Ala Ser Pro Gln Ile Gly Asn Leu Asp Thr Gln Pro Gly Ser
20 25 30
Leu Asp Asp Leu Ile Glu Ser Leu Lys Glu Ala Trp Thr Glu Phe Gln
35 40 45
Lys Thr Gly Ser Asp Lys Leu Leu Gln Thr Thr Leu Gln Asn Gly Phe
50 55 60
Ser Ala Ala Ser Gly Gly Ser Phe Asn Tyr Leu Ala Val Leu Gln Ala
65 70 75 80
Gly Ile Gly Leu Phe Gly Thr Leu Gly Ala Glu Ile Pro Gly Val Ala
85 90 95
Val Ala Ala Pro Ile Leu Ser Met Val Ile Gly Trp Leu Trp Pro His
100 105 110
Lys Ser Gly Asp Asp Gly Gln Gln Leu Val Asp Leu Ile Asp Ala Glu
115 120 125
Ile Gln Lys Gln Leu Asn Gln Ala Leu Ser Thr Gln Asp Lys His Asn
130 135 140
Trp Val Gly Tyr Leu Asp Glu Ser Phe Leu Asn Ala Ser His Asn Val
145 150 155 160
Ala Arg Ala Val Thr Asn Ala Gln Phe Val Gly Thr Glu Gly Asn Tyr
165 170 175
Thr Ala Pro Arg Thr Pro Thr Ala Ser Asp Tyr Thr Thr Val Glu Thr
180 185 190
Thr Leu Phe Ser Thr Glu Gly Ser Tyr His Thr Gly Leu Ser Gln Met
195 200 205
Leu Asn Gly Asn Phe Asp Thr Leu Ala Leu Pro Phe Tyr Val Ile Gly
210 215 220
Val Thr Met Gln Leu Ala Thr Tyr Gln Thr Phe Ile Gln Phe Gly Glu
225 230 235 240
Lys Trp Ile Asp Val Val Trp Pro Asp Asn Gln Thr Pro Gly Thr Thr
245 250 255
Gly Tyr Thr His Tyr Gln Thr Leu Gln Asp Lys Lys Thr Asp Met Arg
260 265 270
Thr Phe Ile Gln Gln His Thr Thr Thr Val Tyr Asn Ala Phe Lys Lys
275 280 285
Gly Met Pro Pro Leu Glu Ser Asn Lys Asn Ser Ile Asn Ala Tyr Asn
290 295 300
Val Tyr Val Arg Gly Met Val Leu Asn Gly Leu Asp Met Val Ala Thr
305 310 315 320
Trp Pro Ser Met Tyr Pro Asp Asp Tyr Pro Thr Gln Met Ala Leu Glu
325 330 335
Gln Thr Arg Val Ile Phe Ser Asp Met Val Gly Gln Asp Gln Thr Arg
340 345 350
Asp Gly Asn Thr Thr Ile Tyr Asn Ile Phe Asp Ser Thr Ser Asn Phe
355 360 365
Gln His Gly Thr Thr Asp Ile Asn Ser Ile His Tyr Phe Pro Asp Glu
370 375 380
Leu Gln Gln Ala Gln Ile Ala Glu Tyr Ser Pro Ser Arg Gly Asn His
385 390 395 400
Cys Tyr Pro Tyr Gly Ile Ile Leu Asn Tyr Gln His Asn Thr Tyr Arg
405 410 415
Tyr Gly Asp Asn Asp Pro Gly Thr Ser Gly Thr Val Phe Pro Ala Pro
420 425 430
Met Arg Ser Ile Asn Ala Lys Thr Gln Asn Thr Ser Tyr Leu Asn Ser
435 440 445
Glu Asn Leu Phe Thr Asp Leu Gly Ser Ser Thr Asn Cys Tyr Ile Pro
450 455 460
Gly Thr Cys Thr Asn Asn Cys Ser Pro Gly Ser Ser Cys Thr Asn Gly
465 470 475 480
Ser Gly Tyr Gly Glu Ser Cys Asn Val Pro Leu Pro Asn Gln Lys Ile
485 490 495
Asn Ala Leu Tyr Pro Phe Thr Gln Glu His Val Ser Gly Asp Ser Gly
500 505 510
Lys Leu Gly Leu Met Ala Ser Leu Val Pro Tyr Gly Leu Ser Pro Asn
515 520 525
Asn Val Phe Gly Glu Phe Asp Ser Asp Thr Gln Ala Ile Ile Gly Lys
530 535 540
Gly Phe Pro Ala Glu Lys Gly Tyr Ile Gly Asn Gly Ser Pro Val Ser
545 550 555 560
Val Val Arg Glu Trp Val Asn Gly Ala Asn Ala Val Gln Leu Pro Ala
565 570 575
Trp Asp Thr Phe Lys Ile Thr Ala Thr Asn Leu Ala Ala Asp Gln Tyr
580 585 590
His Ile Arg Val Arg Tyr Ala Ser Thr Ala Ala Tyr Pro Thr Asn Ala
595 600 605
Lys Tyr Pro Leu Ser Tyr Tyr Val Leu Ala Gly Asp Thr Ile Leu Gln
610 615 620
Gln Gly Lys Trp Glu Ala Ile Asp Thr Ser Gln Ala Asp Ile Thr Lys
625 630 635 640
Gln Val Tyr Val Thr Gly Gln Gln Gly Asn Tyr Ile Leu Thr Asp Ile
645 650 655
Leu Gly Glu Asn Pro Ile Thr Leu Pro Lys Asp Glu Ile Thr Val Gln
660 665 670
Leu Leu Gln Gly Asn Pro Ala Gly Gly Met Thr Lys Val Leu Ile Asp
675 680 685
Arg Ile Glu Phe Val Pro Ala Val Ile Asn Val Leu Asn Pro Gln Leu
690 695 700
Asn Tyr Gln Ile Ser Ser Gln Leu Pro Ala Trp Asn Asn Asn Gln Ser
705 710 715 720
Met Asp Pro Phe Pro Thr Ile Trp Arg Ala Asn Pro Gly Ile Val Gly
725 730 735
Thr Ser Ala Asn Ile Thr Val Ala Asp Leu Leu Asp Met Ser Gly Phe
740 745 750
Lys Tyr Thr Asn Ile Thr Ala Leu Asp Leu Gly Gly Gln Val Gly Leu
755 760 765
Phe Ala Lys Ile Gly Ile Gly Thr Asp Ser Asp Pro Asn Asn Trp Lys
770 775 780
Trp Met Cys Pro Asp Gly Gln Gly Glu Asn Pro Trp Ile Gly Asp Gly
785 790 795 800
Thr Tyr Asn Phe Asn Pro Lys Asn Ser Gly Ser Cys Phe Asp Lys Ser
805 810 815
Phe Thr Ala Ile Lys Leu Val Gly Gly Thr Ser Phe Asp Thr Phe Ile
820 825 830
Ala Ala Gly Ile Leu Thr Gly Thr Val Thr Thr Gln Lys Ser
835 840 845
<210> 100
<211> 696
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 100
Met Ala Gln Leu Gln Glu Leu Gln Ile Gln Pro Val Ile Pro Tyr Asn
1 5 10 15
Val Leu Ala Ser Pro Gln Ile Gly Asn Leu Asp Thr Gln Pro Gly Ser
20 25 30
Leu Asp Asp Leu Ile Glu Ser Leu Lys Glu Ala Trp Thr Glu Phe Gln
35 40 45
Lys Thr Gly Ser Asp Lys Leu Leu Gln Thr Thr Leu Gln Asn Gly Phe
50 55 60
Ser Ala Ala Ser Gly Gly Ser Phe Asn Tyr Leu Ala Val Leu Gln Ala
65 70 75 80
Gly Ile Gly Leu Phe Gly Thr Leu Gly Ala Glu Ile Pro Gly Val Ala
85 90 95
Val Ala Ala Pro Ile Leu Ser Met Val Ile Gly Trp Leu Trp Pro His
100 105 110
Lys Ser Gly Asp Asp Gly Gln Gln Leu Val Asp Leu Ile Asp Ala Glu
115 120 125
Ile Gln Lys Gln Leu Asn Gln Ala Leu Ser Thr Gln Asp Lys His Asn
130 135 140
Trp Val Gly Tyr Leu Asp Glu Ser Phe Leu Asn Ala Ser His Asn Val
145 150 155 160
Ala Arg Ala Val Thr Asn Ala Gln Phe Val Gly Thr Glu Gly Asn Tyr
165 170 175
Thr Ala Pro Arg Thr Pro Thr Ala Ser Asp Tyr Thr Thr Val Glu Thr
180 185 190
Thr Leu Phe Ser Thr Glu Gly Ser Tyr His Thr Gly Leu Ser Gln Met
195 200 205
Leu Asn Gly Asn Phe Asp Thr Leu Ala Leu Pro Phe Tyr Val Ile Gly
210 215 220
Val Thr Met Gln Leu Ala Thr Tyr Gln Thr Phe Ile Gln Phe Gly Glu
225 230 235 240
Lys Trp Ile Asp Val Val Trp Pro Asp Asn Gln Thr Pro Gly Thr Thr
245 250 255
Gly Tyr Thr His Tyr Gln Thr Leu Gln Asp Lys Lys Thr Asp Met Arg
260 265 270
Thr Phe Ile Gln Gln His Thr Thr Thr Val Tyr Asn Ala Phe Lys Lys
275 280 285
Gly Met Pro Pro Leu Glu Ser Asn Lys Asn Ser Ile Asn Ala Tyr Asn
290 295 300
Val Tyr Val Arg Gly Met Val Leu Asn Gly Leu Asp Met Val Ala Thr
305 310 315 320
Trp Pro Ser Met Tyr Pro Asp Asp Tyr Pro Thr Gln Met Ala Leu Glu
325 330 335
Gln Thr Arg Val Ile Phe Ser Asp Met Val Gly Gln Asp Gln Thr Arg
340 345 350
Asp Gly Asn Thr Thr Ile Tyr Asn Ile Phe Asp Ser Thr Ser Asn Phe
355 360 365
Gln His Gly Thr Thr Asp Ile Asn Ser Ile His Tyr Phe Pro Asp Glu
370 375 380
Leu Gln Gln Ala Gln Ile Ala Glu Tyr Ser Pro Ser Arg Gly Asn His
385 390 395 400
Cys Tyr Pro Tyr Gly Ile Ile Leu Asn Tyr Gln His Asn Thr Tyr Arg
405 410 415
Tyr Gly Asp Asn Asp Pro Gly Thr Ser Gly Thr Val Phe Pro Ala Pro
420 425 430
Met Arg Ser Ile Asn Ala Lys Thr Gln Asn Thr Ser Tyr Leu Asn Ser
435 440 445
Glu Asn Leu Phe Thr Asp Leu Gly Ser Ser Thr Asn Cys Tyr Ile Pro
450 455 460
Gly Thr Cys Thr Asn Asn Cys Ser Pro Gly Ser Ser Cys Thr Asn Gly
465 470 475 480
Ser Gly Tyr Gly Glu Ser Cys Asn Val Pro Leu Pro Asn Gln Lys Ile
485 490 495
Asn Ala Leu Tyr Pro Phe Thr Gln Glu His Val Ser Gly Asp Ser Gly
500 505 510
Lys Leu Gly Leu Met Ala Ser Leu Val Pro Tyr Gly Leu Ser Pro Asn
515 520 525
Asn Val Phe Gly Glu Phe Asp Ser Asp Thr Gln Ala Ile Ile Gly Lys
530 535 540
Gly Phe Pro Ala Glu Lys Gly Tyr Ile Gly Asn Gly Ser Pro Val Ser
545 550 555 560
Val Val Arg Glu Trp Val Asn Gly Ala Asn Ala Val Gln Leu Pro Ala
565 570 575
Trp Asp Thr Phe Lys Ile Thr Ala Thr Asn Leu Ala Ala Asp Gln Tyr
580 585 590
His Ile Arg Val Arg Tyr Ala Ser Thr Ala Ala Tyr Pro Thr Asn Ala
595 600 605
Lys Tyr Pro Leu Ser Tyr Tyr Val Leu Ala Gly Asp Thr Ile Leu Gln
610 615 620
Gln Gly Lys Trp Glu Ala Ile Asp Thr Ser Gln Ala Asp Ile Thr Lys
625 630 635 640
Gln Val Tyr Val Thr Gly Gln Gln Gly Asn Tyr Ile Leu Thr Asp Ile
645 650 655
Leu Gly Glu Asn Pro Ile Thr Leu Pro Lys Asp Glu Ile Thr Val Gln
660 665 670
Leu Leu Gln Gly Asn Pro Ala Gly Gly Met Thr Lys Val Leu Ile Asp
675 680 685
Arg Ile Glu Phe Val Pro Ala Val
690 695
<210> 101
<211> 1236
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 101
Met Asn Gln Asn Asp Gln Asn Glu Ile Gln Ile Ile Asp Ser Ser Ser
1 5 10 15
Asn Asp Phe Asn Gln Ser Asn Arg Tyr Pro Arg Tyr Pro Leu Ala Glu
20 25 30
Glu Ser Asn Tyr Lys Asp Trp Leu Ala Asn Cys Lys Glu Ser Asn Leu
35 40 45
Asp Arg Leu Ser Thr Pro Ser Ser Val Gln Asp Ala Val Val Thr Ser
50 55 60
Leu Asn Ile Phe Ser Tyr Ile Phe Gly Phe Leu Asp Phe Gly Ala Thr
65 70 75 80
Ser Ala Gly Leu Gly Ile Leu Gly Ala Leu Phe Gly Val Phe Trp Pro
85 90 95
Ser Ser Asn Asn Gly Ile Ser Glu Asp Leu Leu Arg Ser Ile Glu Ile
100 105 110
Leu Ile Asp Arg Arg Ile Asn Glu Val Glu Arg Asn Arg Ile Thr Val
115 120 125
Gln Phe Glu Gly Leu Arg Ala Val Met Ser Asn Tyr Asn Thr Ala Leu
130 135 140
Arg Asn Trp Asn Ala Asn Arg Asn Asn Pro Ala Leu Gln Ser Glu Val
145 150 155 160
Arg Ser Arg Phe Asp Asn Ala Asp Asp Ala Phe Ala Gly Arg Met Pro
165 170 175
Glu Phe Arg Ile Pro Gly Phe Glu Ile Gln Ser Leu Ser Val Tyr Ala
180 185 190
Gln Ala Ala Thr Leu His Leu Leu Leu Leu Arg Asp Ala Val Val Asn
195 200 205
Gly Ser Leu Trp Gly Phe Asp Gly Glu Thr Ile Glu Ser Tyr Tyr Thr
210 215 220
Lys Leu Val Cys Leu Ile Asp Ala Tyr Ala Asp His Cys Ala Ser Phe
225 230 235 240
Tyr Arg Gln Gly Leu Gln Glu Leu Arg Asp Arg Gly Asn Trp Asn Ala
245 250 255
Phe Asn Asn Tyr Arg Arg Asp Met Thr Ile Thr Val Leu Asp Ile Ile
260 265 270
Ser Leu Phe Ser Asn Tyr Asp Pro Arg Arg Tyr Thr Ile Asn Thr Asn
275 280 285
Thr Gln Leu Thr Arg Glu Val Tyr Thr Glu Pro Val Ala Thr Pro Gly
290 295 300
Trp Leu Asn Leu Asn Ser Asn Pro Asp Gln Phe Gln Gln Ile Glu Asn
305 310 315 320
Asp Leu Ile Arg Pro Leu Arg Pro Ser Ser Thr Leu Ala Ala Leu Ser
325 330 335
Ala Glu Thr Ala Phe Ala Phe Phe Gln Ser Gly Gln Thr Pro Leu Arg
340 345 350
Ala Ile Leu Arg Thr Arg Thr Ser His Phe Asn Ile Ser Gly Gly Phe
355 360 365
Gly Thr Asp Pro Trp Gln Gly Ser Pro Asn Pro Thr Gly Pro Ser Glu
370 375 380
Ser Gln Leu Gln Phe Glu Asn Phe Asp Val Tyr His Val Ser Ser Glu
385 390 395 400
Val Thr Arg Glu Val Ser Ser Glu Asn Ser Gly Leu Phe Gly Pro Gln
405 410 415
Arg Ile Asn Phe Arg Met Val Ser Ile Ser Gly Val Val Ala Ser Arg
420 425 430
Glu Ile Ser Ser Pro Pro Phe Gly Arg Tyr Ile Thr Asn Ile Ser Ser
435 440 445
Ser Ile Pro Gly Ile Arg Ser Ala Asn Pro Thr Ala Ser Asp Tyr Thr
450 455 460
His Arg Leu Ser Ser Ile Arg Ser Thr Ser Leu Arg Thr Met Tyr Arg
465 470 475 480
Asp Arg Thr Asn Ile Ile Ala Tyr Gly Trp Thr His Thr Ser Leu Glu
485 490 495
Leu Thr Asn Arg Leu Leu Pro Asn Arg Ile Thr Gln Ile Pro Ala Val
500 505 510
Lys Ser Ser Ile Thr Pro Leu Pro Ile Val Gly Gly Pro Gly His Thr
515 520 525
Gly Gly Gly Leu Val Ser Leu Ala Gly Tyr Gly Arg Tyr Met Leu Ile
530 535 540
Asn Phe Ile Ser Pro Tyr Ser Pro Ser Arg Gln Arg Tyr Arg Leu Arg
545 550 555 560
Ile Arg Phe Val Lys Asp Phe Asp Asp Ala His Leu Arg Val Ser Ile
565 570 575
Ser Ala Ala Asn Phe Tyr Gln Asn Val Val Leu Pro Gly Ser Arg Thr
580 585 590
Ser Val Asn Pro Asn Leu Ser Phe Arg Asp Phe Arg Tyr Val Asp Val
595 600 605
Pro Gly Glu Phe Glu Leu Val Ala Asn Arg Ser Tyr Ser Ile Asn Leu
610 615 620
Gly Ala Pro Thr Ser Ser Ile Pro Gly Thr Cys Leu Ile Asp Lys Ile
625 630 635 640
Glu Phe Ile Pro Val Asn Ser Ala Ala Leu Glu Tyr Glu Gly Glu Gln
645 650 655
Ser Leu Lys Lys Ala Lys Lys Ala Val Asp Asp Leu Phe Thr Asn Asp
660 665 670
Leu Lys Asn Asp Leu Lys Ile Glu Thr Thr Asp Tyr Asp Val Asp Gln
675 680 685
Ala Ala Asn Leu Val Glu Cys Val Pro Glu Glu Leu Tyr Ala Lys Glu
690 695 700
Lys Met Ile Leu Leu Asp Glu Val Lys His Ala Lys Gln Leu Ser Glu
705 710 715 720
Ser Arg Asn Leu Ile Gln Asn Gly Asn Phe Glu Phe Tyr Thr Asp Lys
725 730 735
Trp Thr Thr Ser Asn Asn Ala Ser Ile Gln Ala Asp Asn Pro Ile Phe
740 745 750
Lys Gly Asn Tyr Leu Asn Met Pro Gly Ala Arg Glu Thr Glu Ala Gly
755 760 765
Ala Thr Thr Phe Pro Thr Tyr Val Phe Gln Lys Ile Asp Glu Ser Lys
770 775 780
Leu Lys Pro Tyr Thr Arg Tyr Lys Ala Arg Gly Phe Val Gly Gly Ser
785 790 795 800
Lys Glu Val Arg Leu Ile Val Ala Arg Tyr Ala Glu Glu Val Asp Ala
805 810 815
Ile Met Asn Val Arg Asn Asp Leu Thr Pro Gly Val Pro Val Ala Ser
820 825 830
Cys Gly Glu Phe Asp Arg Cys His Pro Gln Ala Tyr Pro Ile Leu Gln
835 840 845
Asp Arg Cys His Asp Asp Arg Ile Glu Gln His Ala Asp Gly Gly Ser
850 855 860
His Ser Cys His Gly Lys Ser Arg Glu Lys His Ala Ile Cys His Asp
865 870 875 880
Ser His Gln Phe Asp Phe His Ile Asp Thr Gly Glu Val Tyr Leu Thr
885 890 895
Glu Asn Pro Gly Ile Trp Val Leu Phe Lys Ile Ser Ser Pro Glu Gly
900 905 910
Tyr Ala Thr Leu Asp Asn Ile Glu Leu Ile Glu Glu Gly Pro Leu Val
915 920 925
Gly Glu Ser Leu Ala Leu Val Lys Lys Arg Glu Lys Lys Trp Lys Asn
930 935 940
Glu Met Glu Thr Lys Trp Ile His Thr Lys Glu Ala Tyr Glu Lys Ala
945 950 955 960
Lys Trp Ala Val Asp Ala Leu Phe Thr Thr Pro Gln Asp Thr Ala Leu
965 970 975
Lys Phe Glu Thr Ile Ile Ser Asn Ile Ile Ser Ala Glu His Leu Val
980 985 990
Gln Ser Ile Pro Tyr Val Tyr Asn Lys Trp Leu Ala Asp Val Pro Gly
995 1000 1005
Met Asn Tyr Asp Ile Tyr Thr Glu Leu Lys Arg Arg Ile Thr Gln
1010 1015 1020
Ala Tyr Ser Leu Tyr Glu Arg Arg Asn Ile Ile Lys Asn Gly Asp
1025 1030 1035
Phe His Gln Gly Leu Ala His Trp His Ala Thr Pro His Ala Arg
1040 1045 1050
Val Gln Gln Ile Asp Gly Thr Ala Val Leu Val Ile Pro Asn Trp
1055 1060 1065
Gly Ser Asn Val Ser Gln Asn Leu Cys Val Glu His Asn Arg Gly
1070 1075 1080
Tyr Val Leu Arg Val Thr Ala Lys Lys Glu Asp Met Gly Lys Gly
1085 1090 1095
Tyr Val Thr Ile Ser Asp Cys Ala Lys Asn Thr Glu Thr Leu Thr
1100 1105 1110
Phe Thr Ser Cys Asp Tyr Val Ser Asn Thr Ser Thr Ser Asp His
1115 1120 1125
Pro Arg Thr Cys Asn Ser Cys Asp Asp Tyr Val Ser Asn Glu Ser
1130 1135 1140
Ile Asn Asp Gln Pro Asp Tyr Leu Leu Asn Gln Glu Arg Asn Glu
1145 1150 1155
Ser Arg Cys Tyr Asn Arg Asn Ala Ser Thr Ser Glu Glu Ser Asp
1160 1165 1170
Tyr Leu Leu Asn Gln Lys Arg Asn Glu Gln Trp Ser Asn His Ser
1175 1180 1185
Asp Ala Thr Val Asn Glu Gln Met Asn Tyr Val Thr Arg Thr Ile
1190 1195 1200
Asp Phe Phe Pro Asp Thr Asp Arg Val Arg Ile Asp Ile Gly Glu
1205 1210 1215
Thr Glu Gly Ile Phe Lys Val Glu Ser Val Glu Leu Ile Cys Met
1220 1225 1230
Glu Gly Gln
1235
<210> 102
<211> 645
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 102
Met Asn Gln Asn Asp Gln Asn Glu Ile Gln Ile Ile Asp Ser Ser Ser
1 5 10 15
Asn Asp Phe Asn Gln Ser Asn Arg Tyr Pro Arg Tyr Pro Leu Ala Glu
20 25 30
Glu Ser Asn Tyr Lys Asp Trp Leu Ala Asn Cys Lys Glu Ser Asn Leu
35 40 45
Asp Arg Leu Ser Thr Pro Ser Ser Val Gln Asp Ala Val Val Thr Ser
50 55 60
Leu Asn Ile Phe Ser Tyr Ile Phe Gly Phe Leu Asp Phe Gly Ala Thr
65 70 75 80
Ser Ala Gly Leu Gly Ile Leu Gly Ala Leu Phe Gly Val Phe Trp Pro
85 90 95
Ser Ser Asn Asn Gly Ile Ser Glu Asp Leu Leu Arg Ser Ile Glu Ile
100 105 110
Leu Ile Asp Arg Arg Ile Asn Glu Val Glu Arg Asn Arg Ile Thr Val
115 120 125
Gln Phe Glu Gly Leu Arg Ala Val Met Ser Asn Tyr Asn Thr Ala Leu
130 135 140
Arg Asn Trp Asn Ala Asn Arg Asn Asn Pro Ala Leu Gln Ser Glu Val
145 150 155 160
Arg Ser Arg Phe Asp Asn Ala Asp Asp Ala Phe Ala Gly Arg Met Pro
165 170 175
Glu Phe Arg Ile Pro Gly Phe Glu Ile Gln Ser Leu Ser Val Tyr Ala
180 185 190
Gln Ala Ala Thr Leu His Leu Leu Leu Leu Arg Asp Ala Val Val Asn
195 200 205
Gly Ser Leu Trp Gly Phe Asp Gly Glu Thr Ile Glu Ser Tyr Tyr Thr
210 215 220
Lys Leu Val Cys Leu Ile Asp Ala Tyr Ala Asp His Cys Ala Ser Phe
225 230 235 240
Tyr Arg Gln Gly Leu Gln Glu Leu Arg Asp Arg Gly Asn Trp Asn Ala
245 250 255
Phe Asn Asn Tyr Arg Arg Asp Met Thr Ile Thr Val Leu Asp Ile Ile
260 265 270
Ser Leu Phe Ser Asn Tyr Asp Pro Arg Arg Tyr Thr Ile Asn Thr Asn
275 280 285
Thr Gln Leu Thr Arg Glu Val Tyr Thr Glu Pro Val Ala Thr Pro Gly
290 295 300
Trp Leu Asn Leu Asn Ser Asn Pro Asp Gln Phe Gln Gln Ile Glu Asn
305 310 315 320
Asp Leu Ile Arg Pro Leu Arg Pro Ser Ser Thr Leu Ala Ala Leu Ser
325 330 335
Ala Glu Thr Ala Phe Ala Phe Phe Gln Ser Gly Gln Thr Pro Leu Arg
340 345 350
Ala Ile Leu Arg Thr Arg Thr Ser His Phe Asn Ile Ser Gly Gly Phe
355 360 365
Gly Thr Asp Pro Trp Gln Gly Ser Pro Asn Pro Thr Gly Pro Ser Glu
370 375 380
Ser Gln Leu Gln Phe Glu Asn Phe Asp Val Tyr His Val Ser Ser Glu
385 390 395 400
Val Thr Arg Glu Val Ser Ser Glu Asn Ser Gly Leu Phe Gly Pro Gln
405 410 415
Arg Ile Asn Phe Arg Met Val Ser Ile Ser Gly Val Val Ala Ser Arg
420 425 430
Glu Ile Ser Ser Pro Pro Phe Gly Arg Tyr Ile Thr Asn Ile Ser Ser
435 440 445
Ser Ile Pro Gly Ile Arg Ser Ala Asn Pro Thr Ala Ser Asp Tyr Thr
450 455 460
His Arg Leu Ser Ser Ile Arg Ser Thr Ser Leu Arg Thr Met Tyr Arg
465 470 475 480
Asp Arg Thr Asn Ile Ile Ala Tyr Gly Trp Thr His Thr Ser Leu Glu
485 490 495
Leu Thr Asn Arg Leu Leu Pro Asn Arg Ile Thr Gln Ile Pro Ala Val
500 505 510
Lys Ser Ser Ile Thr Pro Leu Pro Ile Val Gly Gly Pro Gly His Thr
515 520 525
Gly Gly Gly Leu Val Ser Leu Ala Gly Tyr Gly Arg Tyr Met Leu Ile
530 535 540
Asn Phe Ile Ser Pro Tyr Ser Pro Ser Arg Gln Arg Tyr Arg Leu Arg
545 550 555 560
Ile Arg Phe Val Lys Asp Phe Asp Asp Ala His Leu Arg Val Ser Ile
565 570 575
Ser Ala Ala Asn Phe Tyr Gln Asn Val Val Leu Pro Gly Ser Arg Thr
580 585 590
Ser Val Asn Pro Asn Leu Ser Phe Arg Asp Phe Arg Tyr Val Asp Val
595 600 605
Pro Gly Glu Phe Glu Leu Val Ala Asn Arg Ser Tyr Ser Ile Asn Leu
610 615 620
Gly Ala Pro Thr Ser Ser Ile Pro Gly Thr Cys Leu Ile Asp Lys Ile
625 630 635 640
Glu Phe Ile Pro Val
645
<210> 103
<211> 700
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 103
Met Ser Ser Tyr Asn Lys Asn Pro Phe Glu Thr Ile Lys Glu Leu Leu
1 5 10 15
Ser Asn Ala Asn Val Ser Ile Thr Cys Pro Leu His Pro Leu Ala Lys
20 25 30
Glu Pro Thr Ala Val Leu Gln Gly Met Asn Tyr Lys Asp Trp Leu Val
35 40 45
Met Ser Glu Glu Glu Asn Ser Leu Gly Lys Pro Glu Asn Phe Asp Ser
50 55 60
Glu Thr Ile Gln Asn Ala Leu Ile Gly Thr Leu Ala Val Ile Gly Phe
65 70 75 80
Ile Ala Ser Val Leu Ile Thr Pro Ile Thr Gly Gly Ala Ser Val Ala
85 90 95
Ile Gly Ser Ala Leu Leu Leu Glu Thr Ile Pro Tyr Tyr Trp Gly Leu
100 105 110
Ala Glu Ser Glu Asp Pro Thr Ile Gln Glu Lys Ala Met Phe Asp Asp
115 120 125
Ile Met Lys Ala Thr Glu Lys Thr Ile Lys Ser Glu Ile Glu Lys Val
130 135 140
Val Lys Thr Lys Ala Glu Ser Glu Leu Thr Ser Leu Arg Asp Val Phe
145 150 155 160
Lys Tyr Tyr Thr Arg Ala Leu Asn Leu Trp Lys Glu Asn Lys Asp Asn
165 170 175
Gln Asp Ala Arg Lys Glu Val Ser Ser Arg Phe Arg Ala Ala His Val
180 185 190
Ala Phe Ile Gly Ala Met Ser Gln Phe Lys Met Pro Gly Tyr Glu Leu
195 200 205
Val Met Leu Pro Ile Tyr Ala Ala Ala Ala Asn Phe His Leu Met Leu
210 215 220
Leu Gln Asp Gly Met Ile Tyr Glu Lys Glu Trp Asn Glu Ser Asn Asn
225 230 235 240
Leu Thr Glu Asp Phe Tyr Tyr Lys Glu Tyr Gln Asn Trp Thr Ser Gln
245 250 255
Tyr Ile Asp His Cys Thr Ser Trp Tyr Gln Lys Gly Leu Lys Asp Ile
260 265 270
Lys Asp Asn Lys Glu Lys Asn Trp Val Asp Tyr Asn Ser Tyr Arg Asn
275 280 285
Asn Met Thr Ile Leu Val Leu Asp Leu Ile Ala Leu Phe Pro Ser Tyr
290 295 300
Asp Val Arg Gln Tyr Asn Asn Met Gly Val Lys Leu Glu Thr His Thr
305 310 315 320
Arg Thr Leu Tyr Ser Asn Pro Leu Lys Phe Ser Lys Gly Ser Arg Ile
325 330 335
Asp Asp Asp Glu Arg Val Tyr Thr Arg Glu Pro Asn Leu Phe Ser Ser
340 345 350
Leu Ser His Leu Glu Phe His Thr Arg Thr Thr Asn Leu Gly Ser Asp
355 360 365
Gly Ile Glu Phe Leu Ser Gly Ile Glu Lys Lys Tyr Gln Leu Thr Glu
370 375 380
Lys Glu Ser Tyr Arg Thr Lys Phe Glu Gly Lys Pro Thr Asp Ile Gly
385 390 395 400
Lys Lys His Glu Ile Pro Phe Lys Lys Asp Glu Ser Glu Phe Tyr Ser
405 410 415
Leu Tyr Glu Leu Ser Thr Glu Gly Lys Gly Phe Lys Asp Ser Thr Lys
420 425 430
Leu Pro Leu Ile Gln Lys Met Thr Phe Asn Asn Glu Glu Gly Lys Lys
435 440 445
Phe Glu Tyr Ser Val Asp Lys Phe Gly Asn Gly Ala Leu Glu Lys Glu
450 455 460
Ser Phe Ser Ile Thr Asn Pro Asp Asp Leu Leu Ala Asp Gln Lys Ala
465 470 475 480
Arg His Val Leu Ser Asp Ile Ile Phe Val Asp Asp Lys Asn Ile Ser
485 490 495
Pro Glu Gln Ile Gln Thr Arg Asp Tyr Ser Phe Ala Trp Thr Ser Glu
500 505 510
Gln Ile Ser Pro Tyr Asn Arg Ile Thr Asp Gly Thr Lys Asn Lys Asp
515 520 525
Gly Ser Ile Ser Tyr Gln Ile Thr Gln Val Pro Ala Val Lys Ala Tyr
530 535 540
Ser Leu Lys Ser Ser Lys Asp Thr Ser His Ile Asp Val Ile Ser Gly
545 550 555 560
Pro Gly Phe Thr Gly Gly Asp Leu Val Arg Cys Ala Ile Lys Lys Thr
565 570 575
Ala Ser Ser Thr Gly Val Thr Gln Leu Ser Ile Val Phe Pro Ile Thr
580 585 590
Phe Asn His Val Lys Thr Asp Lys Tyr Arg Val Arg Met Asn Tyr Ala
595 600 605
Ala Asp His Thr Lys Leu Ile Thr Leu Val Gly Gly Ser Asn Pro Ile
610 615 620
Ser Gly Thr Ile Glu Asn Thr Phe Asp Ala Lys Asp Thr Gly Lys Ser
625 630 635 640
Leu Lys Tyr Asn Asn Phe Lys Met Met Glu Phe Asn Ile Ala Thr Asn
645 650 655
Phe Lys Asp Glu Glu Lys Gln Lys Ile Ile Thr Leu Lys Leu Thr Tyr
660 665 670
Ala Asp Asp Asp Gln Ala Ile Ala Asp Gly Lys Val Ala Gln Leu Trp
675 680 685
Ile Asp Lys Leu Glu Phe Ile Pro Val Pro Thr Lys
690 695 700
<210> 104
<211> 502
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 104
Met Asn Ser Glu Gln Arg Lys His Gly Gln Asn Asn Glu Tyr Lys His
1 5 10 15
Gln Asn His Gly Val Ser Cys Asn Cys Gly Cys Gln Gln Gly Lys Tyr
20 25 30
Glu Tyr Glu Gln Lys Gln Asp Arg Glu Glu Tyr Asn Asn Ser Tyr Met
35 40 45
Ser Asn Glu Gln Thr Gln Asn His Glu Gly Met Gly Tyr Leu Val Ala
50 55 60
Lys Glu Gly Thr Arg Asn Asp Met Tyr Asn Ser Val Lys Arg Glu Phe
65 70 75 80
Arg Glu His Asp Met Tyr His Glu Asn Lys Asn Tyr Tyr Ser Cys Ser
85 90 95
Pro Arg Gln Phe Asn Thr Leu Asn Leu Pro Asp Glu Ser Lys Arg Phe
100 105 110
Gln Thr Ile Thr His Val Gln Asn Thr Asn Ser Asn Pro Ser Leu Asn
115 120 125
Ala Thr Ser Thr Thr Arg Gly Gln Asn Ile Asp Gly Ser Asn Asn Phe
130 135 140
Cys Arg Ser Ile Ser Leu Asp Phe Gln Leu Ile Asn Gln Asp Leu Asn
145 150 155 160
Thr Thr Gln Asp Thr Phe Tyr Tyr Leu Phe Tyr Leu Leu Asp Ser Gly
165 170 175
Asp Phe Ile Ile Ala Asn Tyr Gly Thr Gly Arg Val Leu Glu Asn Arg
180 185 190
Thr Val Glu Pro Asp Glu Arg Ala Ile Ile Ser Ser Leu Tyr Thr Gly
195 200 205
Ser Val Ser Gln Phe Phe Arg Lys Arg Ser Ser Val Asn Asn Gln Phe
210 215 220
Ile Leu Ser Pro Gln Gln Glu Leu Asn Arg Ala Val Asn Val Cys Asn
225 230 235 240
Asn Thr Ile Asn Leu Ala Thr Lys Ile Thr Ser Gly Phe Asn Ile Pro
245 250 255
Asp Asn Thr Ser Asn Val Phe Arg His Ile Val Ala Ser Asn Arg Pro
260 265 270
Ile Thr Ile Pro Ser Phe Ser Asn Glu Ser Thr Leu Gly Pro Ala Pro
275 280 285
Val Leu Thr Ser Leu Ser Asp Tyr Gly Pro Ser Pro Ala Gln Ala Glu
290 295 300
Arg Ala Val Ile Gly Ser Ala Leu Ile Pro Cys Ile Phe Val Asn Asp
305 310 315 320
Val Leu Pro Leu Glu Arg Arg Ile Lys Glu Ser Pro Tyr Tyr Val Leu
325 330 335
Glu Tyr Arg Gln Tyr Trp His Arg Leu Trp Thr Asp Glu Ile Pro Ser
340 345 350
Gly Asp Arg Arg Ser Phe Gly Glu Ile Thr Gly Ile Leu Pro Asn Ala
355 360 365
Gln Ala Ser Met Arg Asp Met Ile Asp Met Ser Ile Gly Ala Asp Trp
370 375 380
Gly Leu Arg Phe Gly Asp Ser Ser Thr Pro Phe Lys Arg Gly Ile Leu
385 390 395 400
Asn Ser Leu Asn Thr Gln Gly Ser Tyr Ala Asp Val Asp Leu Gly Thr
405 410 415
Gln Thr Gly Ser Asn Gln Leu Thr Asn Phe Asn Ser Phe Ser Val Arg
420 425 430
Tyr Ala Arg Phe Val Lys Ala His Glu Tyr Arg Leu Thr Arg Ser Asn
435 440 445
Gly Thr Val Val Thr Thr Pro Trp Val Ala Leu Asp Tyr Arg Val Pro
450 455 460
Gln Thr Ala Arg Phe Pro Leu Asn Ala Gln Leu Ser Gln Asn Tyr Lys
465 470 475 480
Ile Ile Ser Ser Tyr Asn Ser Tyr Asp Leu Pro Val Leu Glu His Thr
485 490 495
Asp Asn Lys Lys Lys Trp
500
<210> 105
<211> 806
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 105
Met His Gln Thr Asn Thr Ser Lys Asp Lys Asn Arg Met Ala Ser Phe
1 5 10 15
Glu Lys Thr Asn Met Ser Asn Glu Gly Pro Ile Tyr Pro Leu Ala Asn
20 25 30
Ser Ser Asn Thr Gly Leu Gln Gly Met Asn Tyr Asn Asp Ser Val Asn
35 40 45
Met His Ala Thr Lys Asn Glu Asp Cys Leu Leu Ser Ser Arg Val Ser
50 55 60
Val Asp Pro Lys Thr Ala Val Ala Thr Gly Ile Lys Ile Ser Thr Phe
65 70 75 80
Ile Met Ser Ala Leu Gly Ile Pro Leu Ala Ser Gln Ile Gly Gln Leu
85 90 95
Trp Asn Phe Val Leu Asp Gln Leu Trp Pro Asn Gly Asp Asn Gln Trp
100 105 110
Glu Glu Phe Met Lys His Val Glu Glu Leu Ile Asn Gln Lys Ile Thr
115 120 125
Glu Tyr Ala Arg Asp Lys Ala Leu Ala Glu Leu Glu Gly Leu Gly Asn
130 135 140
Val Leu Glu Leu Tyr Gln Glu Ala Ile Glu Asp Trp Lys Lys Asp Pro
145 150 155 160
Thr Asn Val Ala Ser Gln Glu Arg Val Arg Thr Arg Phe Arg Asn Ala
165 170 175
Asp Ala Phe Phe Glu Asn Gly Met Pro Ser Phe Arg Ile Lys Asn Tyr
180 185 190
Glu Val Pro Leu Leu Ser Val Tyr Ala Ser Ala Ala Asn Leu His Leu
195 200 205
Leu Leu Leu Arg Asp Ala Ser Ile Tyr Gly Lys Asp Trp Gly Leu Asn
210 215 220
Glu Thr Asn Val Lys Asp Asn Tyr Asn Arg Gln Leu Lys Arg Thr Glu
225 230 235 240
Ser Tyr Ser Asn His Cys Val Thr Trp Tyr Gln Lys Gly Leu Gln Arg
245 250 255
Leu Lys Gly Ser Asp Ala Asp Ser Trp Leu Lys Phe Asn Ala Phe Arg
260 265 270
Arg Asp Met Thr Phe Thr Val Leu Asp Ile Val Val Leu Phe Pro Cys
275 280 285
Tyr Asp Ile Gln Lys Tyr Pro Met Gly Val Lys Thr Glu Ile Thr Arg
290 295 300
Glu Ile Tyr Thr Asp Pro Ile Gly Asn Leu Pro Arg Arg Pro Gly Phe
305 310 315 320
Gln Val Leu Asp Trp Val Asn His Ala Pro Ser Phe Glu Glu Ile Glu
325 330 335
Lys Glu Thr Thr Gly Asp Pro Ser Met Val Thr Trp Ile Asn Tyr Leu
340 345 350
Glu Ile Ser Arg Asn Lys Leu Gly Gly Trp Asn Ser Ser Asn Asp Tyr
355 360 365
Trp Glu Gly His Ser Val His Tyr Lys Tyr Ser Asn Asp Gly Ile Val
370 375 380
Ile Asp Ser Lys Ala Tyr Ala His Gly Asp Phe Ser Gly Lys Phe Pro
385 390 395 400
Thr Glu Ala Ile His Leu Glu Asn Cys Asp Val Tyr Ala Ser Ser Ser
405 410 415
Leu Pro Val Ala Ala Ser Asn Pro Tyr Gly Asp Lys Ala Ser Phe Gly
420 425 430
Val Thr Lys Val Tyr Phe Asp Met Leu Asn Leu Asn Asp Asn Thr Leu
435 440 445
Gln Arg Gly Ser Trp Asp His Pro Lys Asn Trp Gly Gly Ser Trp Val
450 455 460
Asp Leu Lys Phe Leu Asn Glu Thr Pro Asn Ile Lys Glu Ile Gln Lys
465 470 475 480
Tyr Ser His Lys Leu Thr Tyr Val Ser Ser Phe Ser Val Asp Tyr Gly
485 490 495
Ser Val Leu Cys Tyr Arg Trp Arg His Ser Ser Val Asp Arg Asn Asn
500 505 510
Thr Leu Asp Pro Gln Lys Ile Thr Gln Ile Ser Ala Val Lys Gly His
515 520 525
Thr Pro Ile Asn Cys Glu Val Val Arg Gly Asn Gly Leu Thr Asp Gly
530 535 540
Asp Trp Leu Gln Leu Lys Gly Gly Gly Ser Phe Thr Leu Asn Val Ser
545 550 555 560
Ser Val Ile Pro Lys Thr Tyr Arg Ile Arg Leu Arg Tyr Ala Cys Gly
565 570 575
Pro Lys Asp Val Asn Leu Thr Val Ser Cys Pro Asp Asn Gly Ile Asp
580 585 590
Lys Thr Ile Thr Ile Pro Ser Thr Ser Thr Ala Ser Leu Ala Asn Asn
595 600 605
Leu Met Tyr Lys Asp Phe Lys Tyr Ile Asp Leu Pro Ile Glu Phe Ile
610 615 620
Thr Ser Ala Ala Pro Lys Arg Cys Arg Ile Asn Leu Arg Leu Glu Gly
625 630 635 640
Val Gln Asn Thr Ser Phe Phe Met Asp Lys Phe Glu Phe Ile Pro Glu
645 650 655
Asn Glu Thr Glu Thr Arg Pro Trp Lys Ile Val Thr Ala Leu Asn Asn
660 665 670
Ser Ser Val Leu Ala Phe Ala Lys Asn Ser Leu Tyr Lys Glu Ala Ile
675 680 685
Leu Ser Asn Asp Thr Gly Glu Glu Ser Gln Lys Trp Arg Phe Ile Tyr
690 695 700
Asp Glu Glu Lys Lys Ala Tyr Gln Ile Glu Asp Val Lys Ser Pro Arg
705 710 715 720
Leu Ile Leu Ala Trp Ile Ile Asp Gly Pro Glu Ser Arg Lys Val Trp
725 730 735
Ala Thr Arg Ser Gln Asn Ile Asp Glu His Tyr Trp Ile Leu Glu Glu
740 745 750
Gln Glu Asp Ser Tyr Phe Ile Ile Lys Asn Lys Lys Asn Pro Ser Leu
755 760 765
Val Leu Asp Val Asp Gly Ala Gln Thr Asn Gln Gly Thr Lys Ile Lys
770 775 780
Val Asn Glu Arg His Pro Leu Tyr Ser Ser Arg Thr Asn Ala Gln Lys
785 790 795 800
Phe Lys Leu Glu Arg Val
805
<210> 106
<211> 656
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 106
Met His Gln Thr Asn Thr Ser Lys Asp Lys Asn Arg Met Ala Ser Phe
1 5 10 15
Glu Lys Thr Asn Met Ser Asn Glu Gly Pro Ile Tyr Pro Leu Ala Asn
20 25 30
Ser Ser Asn Thr Gly Leu Gln Gly Met Asn Tyr Asn Asp Ser Val Asn
35 40 45
Met His Ala Thr Lys Asn Glu Asp Cys Leu Leu Ser Ser Arg Val Ser
50 55 60
Val Asp Pro Lys Thr Ala Val Ala Thr Gly Ile Lys Ile Ser Thr Phe
65 70 75 80
Ile Met Ser Ala Leu Gly Ile Pro Leu Ala Ser Gln Ile Gly Gln Leu
85 90 95
Trp Asn Phe Val Leu Asp Gln Leu Trp Pro Asn Gly Asp Asn Gln Trp
100 105 110
Glu Glu Phe Met Lys His Val Glu Glu Leu Ile Asn Gln Lys Ile Thr
115 120 125
Glu Tyr Ala Arg Asp Lys Ala Leu Ala Glu Leu Glu Gly Leu Gly Asn
130 135 140
Val Leu Glu Leu Tyr Gln Glu Ala Ile Glu Asp Trp Lys Lys Asp Pro
145 150 155 160
Thr Asn Val Ala Ser Gln Glu Arg Val Arg Thr Arg Phe Arg Asn Ala
165 170 175
Asp Ala Phe Phe Glu Asn Gly Met Pro Ser Phe Arg Ile Lys Asn Tyr
180 185 190
Glu Val Pro Leu Leu Ser Val Tyr Ala Ser Ala Ala Asn Leu His Leu
195 200 205
Leu Leu Leu Arg Asp Ala Ser Ile Tyr Gly Lys Asp Trp Gly Leu Asn
210 215 220
Glu Thr Asn Val Lys Asp Asn Tyr Asn Arg Gln Leu Lys Arg Thr Glu
225 230 235 240
Ser Tyr Ser Asn His Cys Val Thr Trp Tyr Gln Lys Gly Leu Gln Arg
245 250 255
Leu Lys Gly Ser Asp Ala Asp Ser Trp Leu Lys Phe Asn Ala Phe Arg
260 265 270
Arg Asp Met Thr Phe Thr Val Leu Asp Ile Val Val Leu Phe Pro Cys
275 280 285
Tyr Asp Ile Gln Lys Tyr Pro Met Gly Val Lys Thr Glu Ile Thr Arg
290 295 300
Glu Ile Tyr Thr Asp Pro Ile Gly Asn Leu Pro Arg Arg Pro Gly Phe
305 310 315 320
Gln Val Leu Asp Trp Val Asn His Ala Pro Ser Phe Glu Glu Ile Glu
325 330 335
Lys Glu Thr Thr Gly Asp Pro Ser Met Val Thr Trp Ile Asn Tyr Leu
340 345 350
Glu Ile Ser Arg Asn Lys Leu Gly Gly Trp Asn Ser Ser Asn Asp Tyr
355 360 365
Trp Glu Gly His Ser Val His Tyr Lys Tyr Ser Asn Asp Gly Ile Val
370 375 380
Ile Asp Ser Lys Ala Tyr Ala His Gly Asp Phe Ser Gly Lys Phe Pro
385 390 395 400
Thr Glu Ala Ile His Leu Glu Asn Cys Asp Val Tyr Ala Ser Ser Ser
405 410 415
Leu Pro Val Ala Ala Ser Asn Pro Tyr Gly Asp Lys Ala Ser Phe Gly
420 425 430
Val Thr Lys Val Tyr Phe Asp Met Leu Asn Leu Asn Asp Asn Thr Leu
435 440 445
Gln Arg Gly Ser Trp Asp His Pro Lys Asn Trp Gly Gly Ser Trp Val
450 455 460
Asp Leu Lys Phe Leu Asn Glu Thr Pro Asn Ile Lys Glu Ile Gln Lys
465 470 475 480
Tyr Ser His Lys Leu Thr Tyr Val Ser Ser Phe Ser Val Asp Tyr Gly
485 490 495
Ser Val Leu Cys Tyr Arg Trp Arg His Ser Ser Val Asp Arg Asn Asn
500 505 510
Thr Leu Asp Pro Gln Lys Ile Thr Gln Ile Ser Ala Val Lys Gly His
515 520 525
Thr Pro Ile Asn Cys Glu Val Val Arg Gly Asn Gly Leu Thr Asp Gly
530 535 540
Asp Trp Leu Gln Leu Lys Gly Gly Gly Ser Phe Thr Leu Asn Val Ser
545 550 555 560
Ser Val Ile Pro Lys Thr Tyr Arg Ile Arg Leu Arg Tyr Ala Cys Gly
565 570 575
Pro Lys Asp Val Asn Leu Thr Val Ser Cys Pro Asp Asn Gly Ile Asp
580 585 590
Lys Thr Ile Thr Ile Pro Ser Thr Ser Thr Ala Ser Leu Ala Asn Asn
595 600 605
Leu Met Tyr Lys Asp Phe Lys Tyr Ile Asp Leu Pro Ile Glu Phe Ile
610 615 620
Thr Ser Ala Ala Pro Lys Arg Cys Arg Ile Asn Leu Arg Leu Glu Gly
625 630 635 640
Val Gln Asn Thr Ser Phe Phe Met Asp Lys Phe Glu Phe Ile Pro Glu
645 650 655
<210> 107
<211> 613
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 107
Met Leu Arg Thr Ile Gln Tyr Lys Asn Ala Ser Glu Ile Val Tyr Asp
1 5 10 15
Gln Pro Leu Pro Tyr Gly Gly Val Val Lys Ile Ala Gly Lys Ile Gly
20 25 30
Leu Gly Val Val Lys Thr Val Ile Thr Ser Ile Ile Arg Arg Gly Arg
35 40 45
Asp Asn Glu Ile Ala Gly Arg Ile Leu Ser Asp Val Tyr Ser Val Leu
50 55 60
Trp Ser Asn Arg Lys Gly Tyr Trp Asn Glu Met Ile Glu Ala Val Glu
65 70 75 80
Thr Leu Ile Gln Gln Asp Ile Ser Glu Asn Ile Lys Asn Asn Ala Leu
85 90 95
Ala Ala Leu Thr Asp Ile Arg Asn Ala Leu Leu Leu Tyr Gln Gln Ala
100 105 110
Val Glu Asp Trp Glu Ser Asn Arg Thr Asp Ser Gln Leu Gln Glu Arg
115 120 125
Val Arg Asn Gln Leu Ile Ser Thr Ile Thr Phe Ile Glu Phe Ala Met
130 135 140
Pro Ser Phe Thr Val Gln His Tyr Glu Val Ile Leu Leu Pro Ile Phe
145 150 155 160
Thr Gln Val Ala Asn Leu His Leu Leu Leu Leu Arg Asp Ala Glu Ile
165 170 175
Phe Gly Leu Lys Trp Arg Met Ser Lys Ala Glu Ile Asp Asp Tyr Tyr
180 185 190
Phe Ala Lys Ser Gly Leu Thr Arg Leu Thr Gln Lys Tyr Thr Asn His
195 200 205
Ser Val Lys Trp Tyr Arg Glu Gly Leu Cys Ile Ala Thr Asn Ile Asp
210 215 220
Leu Asp Gln Cys Pro Glu Phe Tyr Gln Leu Asp Lys Trp Asn Ala Met
225 230 235 240
Asn Asp Phe Arg Arg Glu Met Thr Phe Met Val Leu Asp Ile Ile Ala
245 250 255
Leu Trp Pro Thr Tyr Asp Pro Ile Arg Tyr Pro Leu Gly Ile Lys Thr
260 265 270
Glu Leu Thr Arg Glu Val Phe Thr Pro Leu Leu Gly Ile Asn Pro Asn
275 280 285
Ser Ser Trp Leu Ile Arg Thr Met Glu Glu Ile Glu Ala Lys Leu Thr
290 295 300
Val Leu Ser Pro Phe Leu Ser Trp Ile Ser Phe Glu Gln Leu Val Lys
305 310 315 320
Gln Gly Asp Gly Ile Ser Thr Phe Thr Asp Trp Gly Asn Phe Thr Leu
325 330 335
Ser Asn Thr Met Leu Pro Leu Ser Tyr Ile Leu Gly Gly Ala Gly Pro
340 345 350
Gly Thr Gly Asp Ser Thr Asn Ile Pro Met Gln Ser Glu Asn Tyr Asp
355 360 365
Val Tyr Lys Val Leu Val Gly Thr Asp Tyr Ser His Pro Ser Asn Val
370 375 380
Pro Ile Arg Lys Leu Glu Tyr Tyr Cys Thr Asn Gly Met Met Glu Asn
385 390 395 400
Val Ile Thr Val Gly Thr Gly Thr Thr Asn Ala Leu Phe Glu Leu Pro
405 410 415
Asn Asn Gly Cys Val Asp Tyr Ser His Arg Val Ser Arg Leu Ser Cys
420 425 430
Ser Asn Val Glu Val Tyr Glu Trp Glu Gly Gly Pro Lys Tyr Ala Leu
435 440 445
Lys Asn Ile Ala Tyr Gly Trp Thr His Ile Ser Val Asp Ser Lys Asn
450 455 460
Thr Leu Ser His Asn Ala Ile Thr Gln Ile Pro Ala Arg Lys Gly Tyr
465 470 475 480
Ser Ser Ser Glu Ser Asn Leu Ser Ile Glu Gly Pro Tyr Phe Thr Gly
485 490 495
Gly Asp Leu Ile Ala Leu Pro Pro Asn Gly Ala Gln Leu Gln Met Arg
500 505 510
Val Ser Pro Pro Val Ser Ser Ser Ser Ile Asn Tyr Cys Val Arg Leu
515 520 525
Arg Tyr Ala Ser Ser Gly Asn Thr Asn Ile Tyr Val Glu Arg Val Leu
530 535 540
Ser Ser Gly Asp Thr Tyr Gly Glu Thr His Asp Val Pro Ala Thr Tyr
545 550 555 560
Ser Gly Gly Ser Leu Ser Tyr Ser Ser Phe Ala Tyr Val Val Asn Leu
565 570 575
Thr Ala Ile Phe Glu Gly Phe Asn Val Glu Ile Lys Ile Lys Asn Ile
580 585 590
Gly Ser Ser Gln Ile Ile Leu Asp Lys Ile Glu Phe Leu Pro Ile Lys
595 600 605
Glu Ser Leu Lys Glu
610
<210> 108
<211> 375
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 108
Met Ser Asn Glu Ser Lys Asn Asn Glu Asn Met Asn Leu Pro Ile Ile
1 5 10 15
Asn Asp Glu Asp Ser Ile Pro Ser Glu Ile Arg Asp Ser Arg Ser Lys
20 25 30
Ile Thr Phe Pro Asp Phe Asn Ile Tyr Lys Ile Gly Thr Phe Cys Gly
35 40 45
Lys Tyr Val Asp Ile Pro Gly Gly Ala Asn Glu Asp Ser Ser Ala Ile
50 55 60
Gln Phe Gln Thr Gln Asp Gly Asp Asn Gln Lys Phe Leu Ile Phe Thr
65 70 75 80
Leu Asp Asp Ser Tyr Ser Ile Ile Ala Ala Arg His Ser Gly Lys Val
85 90 95
Met Asp Ala Tyr Tyr Asn Ser Ser Gln Asn Arg Ile Ile Gln Phe Asp
100 105 110
Phe His Asn Thr Asn Asn Gln Lys Phe Phe Leu Ser Asp Asn Gly Thr
115 120 125
Ile Ala Ser Lys Gln Asp Arg Gln Val Trp Asp Val Gln Gly Glu Ser
130 135 140
Thr Gln Asn Gly Ala Arg Ile Val Val Phe Lys Phe Ile Asp Val Arg
145 150 155 160
Asn Gln Lys Phe Thr Leu Asn Lys Leu Gly Ser Val Pro Val His Gln
165 170 175
Pro Thr Leu Ser Pro Leu Pro Ser Ala Pro Asp Phe Arg Thr Asn Asp
180 185 190
Ile Asn Glu Gln Leu Pro Asp Gln Thr Asn Pro Val Asn Thr His Phe
195 200 205
Thr Tyr Leu Pro Tyr Phe Met Val Lys Asp Pro Leu Tyr Asn Ala Gln
210 215 220
Gln Gln Ile Lys Asn Ser Pro Tyr Tyr Thr Leu Val Arg Arg Gln Tyr
225 230 235 240
Trp Glu Lys Lys Thr Gln Arg Ile Leu Ala Pro Ser Asp Thr Tyr Glu
245 250 255
Tyr Ser Glu Thr Ile Gly Val Ser Arg Thr Asp Gln Ser Ser Met Thr
260 265 270
Asp Ile Thr Glu Ile Ser Ile Gly Val Asp Leu Gly Phe Val Phe Lys
275 280 285
Gly Phe Ser Ala Gly Leu Ser Thr Asn Val Thr Arg Thr Leu Ser Val
290 295 300
Thr Lys Ser Thr Ser Asn Thr Glu Ser Ser Glu Ser Thr Arg Lys Leu
305 310 315 320
Ser Tyr Lys Asn Glu Phe Arg His Thr Ile Ala Tyr Ala Lys Tyr Met
325 330 335
Leu Ile Asn Glu Tyr Tyr Val Thr Arg Ala Asn Gly Glu Lys Ile Thr
340 345 350
Thr Gly Asp Ala Thr Tyr Trp Arg Val Pro Asn Pro Glu Gln Thr Val
355 360 365
Ala Arg Ile Ile Pro Arg Ser
370 375
<210> 109
<211> 641
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 109
Met Asn Lys Asn Asn Asn Glu Ile Thr Glu Val Pro Cys Ser Glu Cys
1 5 10 15
Gly Asp Ile Asp Tyr Arg Asp Ile Met Asn Met Ser Thr Gln Gly Glu
20 25 30
Tyr Val Ala Gln Gln Ser Val Val Leu Thr Ala Met Asn Thr Ser Phe
35 40 45
Gln Val Ile Arg Thr Leu Tyr Gln Ala Asn Lys Gly Thr Leu Ser Lys
50 55 60
Ala Gly Gly Val Ile Lys Ser Ile Phe Thr Tyr Leu Trp Arg Gln Lys
65 70 75 80
Asn Ser Gln Glu Glu Trp Asp Asn Phe Met Lys Ala Val Glu Ile Leu
85 90 95
Ile Ala Glu Glu Leu Ser Thr Tyr Ala Arg Asn Asp Ala Ile Ala Ser
100 105 110
Val Arg Gly Leu Glu Asn Val Leu Asn Glu Tyr Gln Glu Ala Leu Asp
115 120 125
Asp Leu Ala Leu Asp Pro Thr Asn Ser Ala Leu Lys Asp Glu Val Lys
130 135 140
Arg Trp Val Thr Thr Val His Gly Gln Phe Glu Ala Thr Met Pro Thr
145 150 155 160
Leu Ala Val Gly Gly Tyr Glu Val Gln Gln Leu Val Ala Tyr Ala Glu
165 170 175
Ala Ala Asn Leu His Leu Ala Phe Leu Arg Asp Val Ala Lys Phe Gly
180 185 190
Tyr Gln Trp Asp Leu Asp Lys Thr Thr Val Ala Asn Tyr Tyr Lys Asp
195 200 205
Leu Lys Glu Asn Thr Gln Ile Tyr Thr Asp His Cys Val Arg Thr Tyr
210 215 220
Asn Thr Gly Leu Glu Lys Thr Lys Ser Leu Met Gly Ala Ile Asn Pro
225 230 235 240
Lys Asp Tyr Asn Lys Tyr Pro Tyr Leu Asn Pro Tyr Ala Asn Gly Ser
245 250 255
Thr Asn Phe Tyr Lys Lys Val Leu Gln Trp Asn Val Trp Asn Asp Phe
260 265 270
Arg Arg Asp Met Thr Ile Met Val Leu Asp Leu Val Ala Leu Trp Pro
275 280 285
Thr Phe Asp Pro Glu Val Tyr Thr Asn Pro Glu Gly Val Asp Leu Glu
290 295 300
Leu Ser Arg Glu Val Tyr Ser Thr Ala Tyr Gly Arg Tyr Gly Ser Ser
305 310 315 320
Thr His Gly Gly Asp Trp Lys Ser Trp Glu Val Met Glu Thr Ala Val
325 330 335
Asp Arg Ser Pro His Leu Val Ser Phe Leu Thr Asn Leu Ala Ile Tyr
340 345 350
Gln Lys Thr Tyr Ser Ser Ser Ser Thr Asn Thr Lys Leu Asp Gln Tyr
355 360 365
Asp Gly Val Ile Asn Thr Leu Lys Thr Val Gly Ser Thr Ser Thr Phe
370 375 380
Glu Asp Gly Phe Ala Pro Ile Ser Lys Glu Ala Tyr Arg Ser Val Lys
385 390 395 400
Asn Ile Pro Gly Glu Tyr Ile Asn Gly Val Asp Leu Arg Leu Gly Met
405 410 415
Val Pro Cys Ser Leu Thr Phe Lys Thr Tyr Ser Ser Gly Thr Phe Phe
420 425 430
Ala Gly Gln Cys Tyr Ser His Leu Ala Gln Ser Thr Phe Tyr Asn Leu
435 440 445
Ser Ala Thr Ile Ala Tyr His Arg Leu Ser Tyr Val Asn Ala Ile Asp
450 455 460
Asp Asn Phe Ser Gly Tyr Tyr Ser Gly Trp Gly Ile Gly Thr Trp Gly
465 470 475 480
Met Gly Trp Leu His Glu Asn Leu Lys Thr Leu Asn Ile Ile His Ala
485 490 495
Thr Lys Ser Ser Cys Ile Pro Ala Val Lys Ala Arg Ile Leu Thr Asn
500 505 510
Ser Ala Thr Val Ile Lys Gly Pro Gly Ser Thr Gly Gly Asp Leu Val
515 520 525
Lys Leu Pro Gly Val Pro Ser Thr Ser Asp Ile Gln Val Arg Met Asn
530 535 540
Met Thr Arg Glu Val Trp Lys Asn Tyr Arg Ile Arg Ile Arg Tyr Ala
545 550 555 560
Thr Ser Ser Ser Ala Gln Leu Phe Val Gly Lys Trp Thr Gly Arg Trp
565 570 575
Ala Ala Thr Ala Asn Ile Asn Leu Ala Ala Thr Tyr Ser Gly Glu Leu
580 585 590
Thr Tyr Ser Ala Phe Lys Leu Ala Asp Val Pro Phe Asn Ile Thr Thr
595 600 605
Gly Glu Pro Glu Phe Ser Phe Glu Leu Arg Asn Asn Ser Gly Gly Pro
610 615 620
Ile Leu Ile Asp Arg Ile Glu Phe Ile Pro Leu Glu Gly Thr Phe Ala
625 630 635 640
Glu
<210> 110
<211> 635
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 110
Met Asn Lys Asn Asn Asn Glu Ile Thr Glu Val Pro Cys Ser Glu Cys
1 5 10 15
Gly Asp Ile Asp Tyr Arg Asp Ile Met Asn Met Ser Thr Gln Gly Glu
20 25 30
Tyr Val Ala Gln Gln Ser Val Val Leu Thr Ala Met Asn Thr Ser Phe
35 40 45
Gln Val Ile Arg Thr Leu Tyr Gln Ala Asn Lys Gly Thr Leu Ser Lys
50 55 60
Ala Gly Gly Val Ile Lys Ser Ile Phe Thr Tyr Leu Trp Arg Gln Lys
65 70 75 80
Asn Ser Gln Glu Glu Trp Asp Asn Phe Met Lys Ala Val Glu Ile Leu
85 90 95
Ile Ala Glu Glu Leu Ser Thr Tyr Ala Arg Asn Asp Ala Ile Ala Ser
100 105 110
Val Arg Gly Leu Glu Asn Val Leu Asn Glu Tyr Gln Glu Ala Leu Asp
115 120 125
Asp Leu Ala Leu Asp Pro Thr Asn Ser Ala Leu Lys Asp Glu Val Lys
130 135 140
Arg Trp Val Thr Thr Val His Gly Gln Phe Glu Ala Thr Met Pro Thr
145 150 155 160
Leu Ala Val Gly Gly Tyr Glu Val Gln Gln Leu Val Ala Tyr Ala Glu
165 170 175
Ala Ala Asn Leu His Leu Ala Phe Leu Arg Asp Val Ala Lys Phe Gly
180 185 190
Tyr Gln Trp Asp Leu Asp Lys Thr Thr Val Ala Asn Tyr Tyr Lys Asp
195 200 205
Leu Lys Glu Asn Thr Gln Ile Tyr Thr Asp His Cys Val Arg Thr Tyr
210 215 220
Asn Thr Gly Leu Glu Lys Thr Lys Ser Leu Met Gly Ala Ile Asn Pro
225 230 235 240
Lys Asp Tyr Asn Lys Tyr Pro Tyr Leu Asn Pro Tyr Ala Asn Gly Ser
245 250 255
Thr Asn Phe Tyr Lys Lys Val Leu Gln Trp Asn Val Trp Asn Asp Phe
260 265 270
Arg Arg Asp Met Thr Ile Met Val Leu Asp Leu Val Ala Leu Trp Pro
275 280 285
Thr Phe Asp Pro Glu Val Tyr Thr Asn Pro Glu Gly Val Asp Leu Glu
290 295 300
Leu Ser Arg Glu Val Tyr Ser Thr Ala Tyr Gly Arg Tyr Gly Ser Ser
305 310 315 320
Thr His Gly Gly Asp Trp Lys Ser Trp Glu Val Met Glu Thr Ala Val
325 330 335
Asp Arg Ser Pro His Leu Val Ser Phe Leu Thr Asn Leu Ala Ile Tyr
340 345 350
Gln Lys Thr Tyr Ser Ser Ser Ser Thr Asn Thr Lys Leu Asp Gln Tyr
355 360 365
Asp Gly Val Ile Asn Thr Leu Lys Thr Val Gly Ser Thr Ser Thr Phe
370 375 380
Glu Asp Gly Phe Ala Pro Ile Ser Lys Glu Ala Tyr Arg Ser Val Lys
385 390 395 400
Asn Ile Pro Gly Glu Tyr Ile Asn Gly Val Asp Leu Arg Leu Gly Met
405 410 415
Val Pro Cys Ser Leu Thr Phe Lys Thr Tyr Ser Ser Gly Thr Phe Phe
420 425 430
Ala Gly Gln Cys Tyr Ser His Leu Ala Gln Ser Thr Phe Tyr Asn Leu
435 440 445
Ser Ala Thr Ile Ala Tyr His Arg Leu Ser Tyr Val Asn Ala Ile Asp
450 455 460
Asp Asn Phe Ser Gly Tyr Tyr Ser Gly Trp Gly Ile Gly Thr Trp Gly
465 470 475 480
Met Gly Trp Leu His Glu Asn Leu Lys Thr Leu Asn Ile Ile His Ala
485 490 495
Thr Lys Ser Ser Cys Ile Pro Ala Val Lys Ala Arg Ile Leu Thr Asn
500 505 510
Ser Ala Thr Val Ile Lys Gly Pro Gly Ser Thr Gly Gly Asp Leu Val
515 520 525
Lys Leu Pro Gly Val Pro Ser Thr Ser Asp Ile Gln Val Arg Met Asn
530 535 540
Met Thr Arg Glu Val Trp Lys Asn Tyr Arg Ile Arg Ile Arg Tyr Ala
545 550 555 560
Thr Ser Ser Ser Ala Gln Leu Phe Val Gly Lys Trp Thr Gly Arg Trp
565 570 575
Ala Ala Thr Ala Asn Ile Asn Leu Ala Ala Thr Tyr Ser Gly Glu Leu
580 585 590
Thr Tyr Ser Ala Phe Lys Leu Ala Asp Val Pro Phe Asn Ile Thr Thr
595 600 605
Gly Glu Pro Glu Phe Ser Phe Glu Leu Arg Asn Asn Ser Gly Gly Pro
610 615 620
Ile Leu Ile Asp Arg Ile Glu Phe Ile Pro Leu
625 630 635
<210> 111
<211> 847
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 111
Met Ala Gln Leu Gln Glu Leu Gln Ile Gln Pro Val Ile Pro Tyr Asn
1 5 10 15
Val Leu Ala Ser Pro Pro Met Gly Asn Val Asp Asp Leu Pro Thr Leu
20 25 30
Ala Glu Glu Leu Lys Gln Ala Trp Glu Ser Phe Gln Lys Thr Gly Asn
35 40 45
Leu Asn Phe Thr Ala Phe Gln Gln Gly Leu Ala Ala Ala Gly Gly Gly
50 55 60
Ser Phe Asn Tyr Leu Ala Leu Leu Gln Ala Ala Ile Gly Leu Ala Gly
65 70 75 80
Thr Val Gly Ala Glu Ile Pro Gly Val Ala Ile Ala Ala Pro Leu Val
85 90 95
Ser Ile Leu Leu Gly Phe Leu Trp Pro Asn Lys Gln Lys Ser Asp Ile
100 105 110
Lys Gln Leu Ile Thr Val Ile Asp Ala Glu Val Lys Arg Ile Leu Asn
115 120 125
Gln Glu Leu Thr Asp Tyr His Ser Gln Glu Val Gln Gly Tyr Leu Glu
130 135 140
Gly Phe Phe Asp Thr Met Thr Asn Pro Ala Thr Ser Pro Asp Ile Lys
145 150 155 160
Ile Asn Glu Ser Ile Phe Thr Gly Met Pro Gly Asp Pro Asn Arg Thr
165 170 175
Pro Lys Asp Val Asp Ala Thr Ala Tyr Asp Ala Val Val Asp Arg Phe
180 185 190
Leu Thr Ala Asn Val Ser Phe Ile Ile Asp Leu Asp Gln Phe Leu Thr
195 200 205
Gly Lys Phe Gly Thr Asn Asn Asp Val Asn Phe Asp Val Ala Ser Leu
210 215 220
Pro Tyr Tyr Val Ile Gly Ala Thr Leu Gln Leu Met Ser Tyr His Ser
225 230 235 240
Val Ile Asn Phe Met Thr Ala Trp Phe Pro Lys Ile Trp Pro Gln Tyr
245 250 255
Thr Thr Glu Asp Ser Arg Tyr Lys Asp Ile Leu Asn Ala Lys Gln Ala
260 265 270
Met Arg Met Ala Ile Arg Asp Asn Thr Asn Lys Val Leu Thr Ile Val
275 280 285
Gln Ala Asn Leu Pro Ser Leu Gly Ser Thr Arg Asn Ser Leu Asn Asp
290 295 300
Tyr Asn Arg Tyr Ala Arg Thr Met Gln Leu Thr Cys Leu Asp Thr Val
305 310 315 320
Ala Gln Trp Pro Thr Leu Tyr Pro Asp Asp Tyr Pro Val Ser Thr Thr
325 330 335
Leu Glu Gln Thr Arg Arg Ile Phe Leu Asp Thr Ala Gly Ile Asp Glu
340 345 350
Arg Lys Asp Ser Asn Gly Lys Tyr Thr Asn Asp Ser Thr Ile Thr Ala
355 360 365
Leu Tyr Asn Ile Leu Asp Gly Asn Ser Tyr Asn Ser His Asn Asn Val
370 375 380
Gly Ile Asn Gly Ile Asn Ser Leu Leu Tyr Ser Gln Asp Glu Leu Lys
385 390 395 400
Ser Ile Thr Tyr Arg Met Thr Tyr Thr Asn Asn Asn Asp Tyr Gly Gln
405 410 415
Tyr Pro Cys Phe Pro Tyr Gly Val Val Leu Gly Tyr Thr Asn Asn Gln
420 425 430
Tyr Pro Ser Asn Pro Ala Tyr Gly Asn Gly Gly Asp Pro Asn Asn Gly
435 440 445
Ser Tyr Ile Asp Ser Pro Asn Thr Ser Ala Pro Phe His Thr Val Asp
450 455 460
Ala Ala Thr Gln Asn Thr Met Tyr Ile Asp Ala Gln Asn Met Phe Thr
465 470 475 480
Asp Ser Asn Pro Thr Gly Cys Asn Ile Leu Asn Tyr Gly Gly Pro Asn
485 490 495
Gly Gly Lys Ser Asn Asn Pro Thr Leu Ala Thr Gln Lys Ile Asp Thr
500 505 510
Leu Phe Pro Phe Thr Ile Asn Asn Pro Pro Gly Ser Thr Lys Asn Gly
515 520 525
Leu Gly Arg Leu Gly Val Met Ala Ser Leu Val Pro Tyr Asn Leu Ser
530 535 540
Pro Asn Asn Val Phe Gly Glu Phe Asp Ser Asp Thr Gln Ala Ile Ile
545 550 555 560
Ala Lys Gly Phe Pro Ala Glu Lys Gly Tyr Ile Asp Asn Gly Ser Pro
565 570 575
Val Ser Val Val Lys Glu Trp Val Asn Gly Ala Asn Ala Val Gln Leu
580 585 590
Pro Ala Trp Asn Thr Phe Lys Ile Thr Ala Thr Asn Leu Ala Ala Gly
595 600 605
Gln Tyr Arg Ile Arg Val Arg Tyr Ala Asn Pro Gly Pro Asp Leu Leu
610 615 620
Leu His Gln Tyr Val Tyr Ala Gly Asp Asn Glu Val Gln Gly Gly Ala
625 630 635 640
Trp Thr Phe Lys Ser Thr Thr Asp Ser Asn Val Asn Thr Asn Phe Pro
645 650 655
Asn Gln Val Tyr Val Thr Gly Gln Gln Gly Asn Tyr Val Leu Glu Asp
660 665 670
Ile Thr Trp Pro Met Gly Gly Gly Pro Ser Asp Pro Leu Thr Leu Pro
675 680 685
Ala Gly Asp Val Thr Val Gln Leu Phe Leu Gly Asp Phe Ser Gln Gln
690 695 700
Gly Gln Met Thr Gln Val Phe Ile Asp Arg Ile Glu Phe Val Pro Val
705 710 715 720
Gly Ala Thr Phe Pro Ser Thr Ile Pro Ile Ser Val Asn Ala Gln Tyr
725 730 735
Asn Gln Gly Ser Val Leu Leu Trp Gln Ala Pro Ser Ser Ser Tyr Phe
740 745 750
Ile Tyr Asn Phe Asp Ile Thr Gly Thr Leu Tyr Gly Ser Gln Phe Thr
755 760 765
Ser Val Phe Phe Asp Leu Glu Asn Pro Gln Gly Ser Ser Gly Ile Val
770 775 780
Pro Gly Ile Asn Gln Ser Ile Thr Leu Asn Pro Asn Gly Thr Asp Ile
785 790 795 800
Ser Tyr Lys Asn Leu Thr Ala Asn Leu Asn Ser His Gly Gly Phe Asn
805 810 815
Leu Met Arg Met Glu Gly Ala Gly Ser Glu Asp Phe Gly Ser Asn Ile
820 825 830
Thr Gly Asn Leu Thr Ile Thr Ile His Lys Asp Ser Thr Asn Ser
835 840 845
<210> 112
<211> 720
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 112
Met Ala Gln Leu Gln Glu Leu Gln Ile Gln Pro Val Ile Pro Tyr Asn
1 5 10 15
Val Leu Ala Ser Pro Pro Met Gly Asn Val Asp Asp Leu Pro Thr Leu
20 25 30
Ala Glu Glu Leu Lys Gln Ala Trp Glu Ser Phe Gln Lys Thr Gly Asn
35 40 45
Leu Asn Phe Thr Ala Phe Gln Gln Gly Leu Ala Ala Ala Gly Gly Gly
50 55 60
Ser Phe Asn Tyr Leu Ala Leu Leu Gln Ala Ala Ile Gly Leu Ala Gly
65 70 75 80
Thr Val Gly Ala Glu Ile Pro Gly Val Ala Ile Ala Ala Pro Leu Val
85 90 95
Ser Ile Leu Leu Gly Phe Leu Trp Pro Asn Lys Gln Lys Ser Asp Ile
100 105 110
Lys Gln Leu Ile Thr Val Ile Asp Ala Glu Val Lys Arg Ile Leu Asn
115 120 125
Gln Glu Leu Thr Asp Tyr His Ser Gln Glu Val Gln Gly Tyr Leu Glu
130 135 140
Gly Phe Phe Asp Thr Met Thr Asn Pro Ala Thr Ser Pro Asp Ile Lys
145 150 155 160
Ile Asn Glu Ser Ile Phe Thr Gly Met Pro Gly Asp Pro Asn Arg Thr
165 170 175
Pro Lys Asp Val Asp Ala Thr Ala Tyr Asp Ala Val Val Asp Arg Phe
180 185 190
Leu Thr Ala Asn Val Ser Phe Ile Ile Asp Leu Asp Gln Phe Leu Thr
195 200 205
Gly Lys Phe Gly Thr Asn Asn Asp Val Asn Phe Asp Val Ala Ser Leu
210 215 220
Pro Tyr Tyr Val Ile Gly Ala Thr Leu Gln Leu Met Ser Tyr His Ser
225 230 235 240
Val Ile Asn Phe Met Thr Ala Trp Phe Pro Lys Ile Trp Pro Gln Tyr
245 250 255
Thr Thr Glu Asp Ser Arg Tyr Lys Asp Ile Leu Asn Ala Lys Gln Ala
260 265 270
Met Arg Met Ala Ile Arg Asp Asn Thr Asn Lys Val Leu Thr Ile Val
275 280 285
Gln Ala Asn Leu Pro Ser Leu Gly Ser Thr Arg Asn Ser Leu Asn Asp
290 295 300
Tyr Asn Arg Tyr Ala Arg Thr Met Gln Leu Thr Cys Leu Asp Thr Val
305 310 315 320
Ala Gln Trp Pro Thr Leu Tyr Pro Asp Asp Tyr Pro Val Ser Thr Thr
325 330 335
Leu Glu Gln Thr Arg Arg Ile Phe Leu Asp Thr Ala Gly Ile Asp Glu
340 345 350
Arg Lys Asp Ser Asn Gly Lys Tyr Thr Asn Asp Ser Thr Ile Thr Ala
355 360 365
Leu Tyr Asn Ile Leu Asp Gly Asn Ser Tyr Asn Ser His Asn Asn Val
370 375 380
Gly Ile Asn Gly Ile Asn Ser Leu Leu Tyr Ser Gln Asp Glu Leu Lys
385 390 395 400
Ser Ile Thr Tyr Arg Met Thr Tyr Thr Asn Asn Asn Asp Tyr Gly Gln
405 410 415
Tyr Pro Cys Phe Pro Tyr Gly Val Val Leu Gly Tyr Thr Asn Asn Gln
420 425 430
Tyr Pro Ser Asn Pro Ala Tyr Gly Asn Gly Gly Asp Pro Asn Asn Gly
435 440 445
Ser Tyr Ile Asp Ser Pro Asn Thr Ser Ala Pro Phe His Thr Val Asp
450 455 460
Ala Ala Thr Gln Asn Thr Met Tyr Ile Asp Ala Gln Asn Met Phe Thr
465 470 475 480
Asp Ser Asn Pro Thr Gly Cys Asn Ile Leu Asn Tyr Gly Gly Pro Asn
485 490 495
Gly Gly Lys Ser Asn Asn Pro Thr Leu Ala Thr Gln Lys Ile Asp Thr
500 505 510
Leu Phe Pro Phe Thr Ile Asn Asn Pro Pro Gly Ser Thr Lys Asn Gly
515 520 525
Leu Gly Arg Leu Gly Val Met Ala Ser Leu Val Pro Tyr Asn Leu Ser
530 535 540
Pro Asn Asn Val Phe Gly Glu Phe Asp Ser Asp Thr Gln Ala Ile Ile
545 550 555 560
Ala Lys Gly Phe Pro Ala Glu Lys Gly Tyr Ile Asp Asn Gly Ser Pro
565 570 575
Val Ser Val Val Lys Glu Trp Val Asn Gly Ala Asn Ala Val Gln Leu
580 585 590
Pro Ala Trp Asn Thr Phe Lys Ile Thr Ala Thr Asn Leu Ala Ala Gly
595 600 605
Gln Tyr Arg Ile Arg Val Arg Tyr Ala Asn Pro Gly Pro Asp Leu Leu
610 615 620
Leu His Gln Tyr Val Tyr Ala Gly Asp Asn Glu Val Gln Gly Gly Ala
625 630 635 640
Trp Thr Phe Lys Ser Thr Thr Asp Ser Asn Val Asn Thr Asn Phe Pro
645 650 655
Asn Gln Val Tyr Val Thr Gly Gln Gln Gly Asn Tyr Val Leu Glu Asp
660 665 670
Ile Thr Trp Pro Met Gly Gly Gly Pro Ser Asp Pro Leu Thr Leu Pro
675 680 685
Ala Gly Asp Val Thr Val Gln Leu Phe Leu Gly Asp Phe Ser Gln Gln
690 695 700
Gly Gln Met Thr Gln Val Phe Ile Asp Arg Ile Glu Phe Val Pro Val
705 710 715 720
<210> 113
<211> 960
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 113
Met Gln Met Val Thr Lys Ala Met Thr Pro Lys Leu Gly Ala Leu Pro
1 5 10 15
Asp Phe Ile Asp Asp Phe Asn Gly Ile Phe Gly Phe Met Gly Asn Ile
20 25 30
Thr Asp Val Met Ser Thr Ile Phe Gly Val Asp Thr Gly Asp Ser Ser
35 40 45
Ile Asp Asp Val Leu Asn Asn Gln Glu Leu Leu Gln Glu Met Gln Ala
50 55 60
Gln Met Gln Gln Ile Gln Glu Ser Ile Lys Asp Met Thr Asp Lys Glu
65 70 75 80
Ile Ile Thr Gln Ser Ile Glu Glu Gln Ile Leu Ala Leu Thr Lys Asp
85 90 95
Leu Ser Thr Ser Ile Thr Thr Glu Phe Glu Lys Ile Gly Asn Ile Leu
100 105 110
Thr Thr Tyr Leu Gln Ala Met Ser Gly Met Leu Lys Asp Ile Tyr Ser
115 120 125
Gln Thr Ser Val Ile Asp Gln Lys Val Asp Lys Leu Leu Ala Met Met
130 135 140
Thr Phe Ala Leu Lys Glu Leu Asp Tyr Ile Lys Asp Asn Val Val Leu
145 150 155 160
Asn Ser Ser Ile Val Glu Ile Thr Pro His Val Gln Lys Leu Val Tyr
165 170 175
Val Asn Lys Lys Phe Leu Ser Leu Thr Arg Gly Phe Leu Gln Gly Glu
180 185 190
Asp Leu Ser Ile Asp Ser Met Gln Glu Ile Gln Glu Trp Ala Lys Ser
195 200 205
Ile Leu Ala Thr Glu Met Asn Ser Phe Glu Phe Ser Val Asp Thr Leu
210 215 220
His Ser Ile Ile Ile Gly Asp Asn Leu Tyr Lys Arg Ser Ala Leu Lys
225 230 235 240
Thr Phe Ser Asp Val Leu Leu Asp Asp Ala Asp Gln Tyr Gly Asn Phe
245 250 255
Gly Thr Pro Leu Ala Lys Phe Tyr Thr Phe Phe Ser Ser Leu Ala Thr
260 265 270
Leu Gln Ile Asn Ala Tyr Leu Cys Leu Thr Phe Ala Arg Lys Val Leu
275 280 285
Gly Leu Ser Glu Ile Asp Tyr Gln Leu Ile Met Gln Asp Arg Ile Glu
290 295 300
Gln Gln Asn Gln Leu Phe Val Asn Leu Ile Gln Asp Lys Asn Tyr Ser
305 310 315 320
Asn Val Leu Glu Ile Lys Gly Val Tyr Pro Met Pro His Asn Asn Glu
325 330 335
Asp Gly Asn Ser Phe Ser Leu Gln Ala Lys Lys Gly Tyr Ala Leu Ile
340 345 350
Gly Leu Glu Phe Phe Met Asp Asn Gly Ile Tyr Lys Ala Lys Ala Tyr
355 360 365
Gln Gly Lys Ile Gly Lys Asn Phe Ser Val Tyr Ala Asp Thr Val Glu
370 375 380
Glu Ile Ile Ser Asp Asp Leu Ser Lys Val Phe Leu Asn Asn Thr Lys
385 390 395 400
Asp Phe Glu Gln Ser Tyr Lys Tyr Pro Leu Phe Gly Glu Leu Lys Gly
405 410 415
Glu Pro Asn Thr Leu Ile Thr Arg Ile Gly Phe Gly Thr Lys Tyr Asp
420 425 430
Glu Ser Lys Glu Arg Gln Ala Val Ala Tyr Ile Asp Ala Asp Phe Ser
435 440 445
Pro Tyr Glu Pro Lys Ser Gly Ser Ile Ser Thr Glu Gly Thr Gln Thr
450 455 460
Phe Gln Ile Glu Glu Thr Lys Asn Tyr Thr Tyr Cys Tyr Asp Ala Trp
465 470 475 480
Pro Ile Gly Ile Ile Gly Asp Leu Tyr Met Thr Pro Leu Lys Ser Leu
485 490 495
Ser Leu His Val Asp Gly Asp Ser Arg Lys Asn Leu Tyr Met Ser Gly
500 505 510
Glu Ser Tyr Phe Ser Thr Ile Leu Ser Arg Glu Tyr Asn Thr Asn Phe
515 520 525
Ile Leu Phe Pro Tyr Thr Asn Asn Ser Ser Pro Ile Val Glu Asn Leu
530 535 540
Ile Gln Asn Gly Asp Phe Glu Asp Gly Asp Lys Tyr Trp Glu Val Ser
545 550 555 560
Gly Asp Thr Ala Val Ile Ala Lys Gly Glu Gly Ile Tyr Gly Ser Asn
565 570 575
Ala Ile Ser Ile Lys Asn Thr Gly Asp Ala Phe Glu Pro Ala Leu Lys
580 585 590
Gln Glu Leu Asn Leu Lys Pro Tyr Thr Lys Tyr Lys Leu Thr Ala Tyr
595 600 605
Val Lys Ser Lys Thr Asn Asp Ser Gln Ala Gly Leu Ile Ala Ile Lys
610 615 620
Tyr Thr Asn Tyr Met Ile Ala Phe Ile Ser Lys Thr Phe Ser Thr Ser
625 630 635 640
Tyr Lys Gln Ile Glu Leu Lys Phe Arg Thr Gly Ala Asp Thr Gln Asn
645 650 655
Phe Tyr Val Thr Phe Glu Lys Ser Asn Thr Gly Tyr Leu Leu Leu Asp
660 665 670
Asn Ile Lys Leu His Glu Ile Pro Gln Thr Asp Asn Leu Ile Asp Cys
675 680 685
Gly Asp Phe Gly Gly Tyr Asp Tyr Thr Thr Lys Ile Asp Cys Gln Tyr
690 695 700
Tyr Glu Pro Tyr Leu Trp Glu Leu Asp Gly Gly Gly Ser Ile Ser Asn
705 710 715 720
Gly Phe Glu Glu Gly Leu Phe Gly Thr Asp Cys Leu Lys Ile Ile Lys
725 730 735
Ser Gly Gly Ala Asn Gln Lys Val Gln Leu Lys Ala Asn Thr Asn Tyr
740 745 750
Ile Leu Thr Ala Tyr Val Lys Val Asp Asp Pro Thr Thr Thr Ala His
755 760 765
Ile Gly Cys Gly Ser Asn Gln Val Thr Cys Asn Ser Thr Ser Tyr Thr
770 775 780
Leu Val Lys Val Lys Phe Lys Thr Gly Asp Asp Pro Ser Thr Thr Glu
785 790 795 800
Ser Ser Val Tyr Cys Ser Asn Pro Asn Asp Ser Gly Thr Ile Trp Ala
805 810 815
Asp Asn Phe Val Leu Cys Glu Val Pro Asn Leu Ile Val Asn Gly Asp
820 825 830
Phe Glu Gln Met Asp Leu Ser Ser Trp Ser Leu Ser Pro Ser Glu Arg
835 840 845
Gly Thr Ile Tyr Ile Gly Ile Gly Lys Gly Ile Ser Asn Ser Asn Ala
850 855 860
Ile Val Leu Gln Asp His Asn Gly Lys Ile Ser Gln Lys Leu Pro Leu
865 870 875 880
Lys Pro Ile Thr Lys Tyr Arg Leu Thr Ala Tyr Val Ser Val Thr Gly
885 890 895
Gly Gly Thr Ala Gln Ile Gly Tyr Gly Asp Asp Ser Ala Ser Cys Thr
900 905 910
Ser Asn Lys Phe Gln Gln Ile Ser Thr Asp Phe Thr Thr Gly Ala Asp
915 920 925
Pro Leu Gln Ser Asp Asp Val Ile Tyr Leu Ser Thr Asp Gly Cys Thr
930 935 940
Tyr Thr Ile Ala Asp Asn Phe Glu Leu Tyr Glu Leu Asp Gln Ile Glu
945 950 955 960
<210> 114
<211> 959
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 114
Met Asn Asn Thr Ile Val Pro Tyr Asn Val Leu Arg Asp Asn Asp Met
1 5 10 15
Ser Asn Ile Ser Gly Thr Ser Trp Asp Lys Asp Gln Ile Ser Lys Gly
20 25 30
Met Glu Asn Ser Ser Ile Leu Leu Glu Leu Ile Glu Lys Gly Leu Asn
35 40 45
Asp Gly Asn Asp Ile Leu Ser Leu Leu Gly Phe Ile Gly Met Lys Ala
50 55 60
Ile Glu Ala Ile Pro Val Val Gly Gly Thr Ile Ser Ser Leu Ile Ser
65 70 75 80
Leu Ile Leu Phe Pro Leu Lys Ser Lys Val Asp Tyr Gln Glu Ile Trp
85 90 95
Asn Gln Leu Lys Lys Ser Ile Glu Lys Met Val Asp Gln Lys Ile Glu
100 105 110
Glu Ala Met Met Ser Gln Leu Met Gln Glu Ile Ala Gly Leu Ala Ala
115 120 125
Ile Leu Glu Gln Tyr Arg Ala Ala Tyr Asp Leu Tyr Asn Gly Lys Lys
130 135 140
Val Phe Asn Ile Pro Asp Gly Met Thr Pro Gly Asp His Leu Asn Thr
145 150 155 160
Val Phe Val Ser Ala Asn Met Gln Phe Ile Gln Arg Met Lys Ile Phe
165 170 175
Gln Asn Pro Lys Tyr Lys Val Asp Leu Leu Pro Phe Phe Val His Ala
180 185 190
Ala Glu Met His Ile Leu Leu Val Arg Asp Ala Ala Ile Tyr Gly Leu
195 200 205
Glu Trp Gly Met Asp Glu Ala Val Asn Gln Tyr Tyr Lys Met Glu Leu
210 215 220
Lys Arg Leu Ile Asn Glu Tyr Ser Lys Tyr Val Leu Asp Thr Tyr Gln
225 230 235 240
Lys Gly Leu Lys Glu Val Thr Glu Arg Glu Leu Glu Asp Asn Asp Phe
245 250 255
Pro Thr Thr Trp Gln Lys Asp Asn Tyr Lys Asn Thr Val Arg Trp Asn
260 265 270
Val Ile Asn Lys Tyr Lys Arg Gly Met Ser Leu Thr Val Phe Asp Phe
275 280 285
Ala Tyr Lys Trp Lys Tyr Tyr His Glu Ser Tyr Gln Lys Asn Leu Met
290 295 300
Leu Asn Pro Thr Arg Thr Ile Tyr Ser Asp Ile Glu Gly Ser Val Tyr
305 310 315 320
Pro Tyr Thr Lys Glu Thr Ser Glu Ile Asp Asn Asp Ile Asn Phe Thr
325 330 335
Asn Leu Glu Tyr Arg Gly Ile Leu Lys Glu Val Gln Val Asn Asn Ala
340 345 350
Asp Arg Ile Asp Ser Leu Gln Ser Ile Phe Ile Arg Glu Asn Gln Lys
355 360 365
Phe Glu Asn Arg Lys Thr Gly Gly Ser Gly Gly Arg Gly Thr Phe Val
370 375 380
Asn Ile Glu Ser Pro Lys Asn Asn Pro Leu Thr Arg Val Glu Thr Trp
385 390 395 400
Ser Glu Leu Val Pro Phe Ala Leu Lys Phe Gly Phe Tyr Asn Gly Glu
405 410 415
Glu Thr Arg Leu Met Gly Gln Lys Thr Tyr Gly Lys Lys Tyr Thr Ile
420 425 430
Tyr Gln Tyr Ser Gly Asn Lys Val Val Ser Ile Met Gly Phe Gly Leu
435 440 445
Asn Ala Thr Ser Gly Phe Asp Ser Leu Asp Ala Met Val Val Gly Phe
450 455 460
Lys Gln Asp Asp Tyr Ile Pro Glu Asn Lys Ile Val Pro Ile Asn Thr
465 470 475 480
Asn Gly Glu Pro Leu Ile Gln Val Val Glu Val Gly Asn Phe Tyr Glu
485 490 495
Gly Lys Phe Glu Ser Asn Leu Lys Met Ile Glu Glu Pro Met Phe Gly
500 505 510
Asp Gly Val Leu Gln Phe Glu Asn His Ser Asn Asn Leu Asn Gln Asp
515 520 525
Asn Tyr Val Thr Tyr Gln Ile His Ala Glu Ile Glu Gly Thr Tyr Glu
530 535 540
Leu Asn Ala Ile Ile Gly Ala Asn Lys Gln Lys Asp Arg Ile Ala Phe
545 550 555 560
Lys Met Ser Leu Asn Gly Glu Gln Pro Glu Asn Phe Ile Thr Glu Ser
565 570 575
Phe Asp Asp Gly Asp Ile Trp Glu Gly Met Ser Leu Asn Glu Glu Leu
580 585 590
Val Tyr Lys Lys Asn Ser Leu Gly Asn Phe Gln Leu Asn Lys Gly Val
595 600 605
Asn Asn Ile Thr Ile Tyr Asn Gly Ala Leu Gln Ala Ser Ala Asn Ile
610 615 620
Lys Ile Trp Asn Leu Ala Thr Ile Glu Leu Thr Ile Asn Ala Asp Ser
625 630 635 640
Leu Lys Asp Pro Asp Ile Thr Thr Leu Tyr Asp Gln Asp Asn Tyr Thr
645 650 655
Gly Thr Lys Lys Leu Ile Phe Glu Asp Thr Asn Arg Leu Glu Asp Phe
660 665 670
Asn Asp Lys Ala Ser Ser Ile Lys Ala Glu Ser His Ile Ala Gly Ile
675 680 685
Arg Phe Tyr Gln Asn Tyr Asp Tyr Thr Gly Asp Phe Ile Asp Leu Val
690 695 700
Gly Gly Glu Lys Ile Ser Leu Lys Asn His Leu Phe Asn Asn Arg Ala
705 710 715 720
Ser Ser Val Lys Phe Ala Asn Ile Val Leu Tyr Asn Lys Asp Asn Tyr
725 730 735
Glu Gly Ser Arg Lys Phe Val Phe Glu Asp Ile Pro Glu Leu Thr Asp
740 745 750
Phe Asn Asp Gln Thr Ser Ser Ile Val Ile Ser Ser Asn Val Ser Gly
755 760 765
Ala Arg Leu Tyr Glu His Ala Asn Tyr Glu Gly Asn Tyr Val Asp Val
770 775 780
Val Gly Gly Gln Arg Leu Ser Leu Lys Asn His Val Leu Asn Lys Glu
785 790 795 800
Ile Thr Ser Ile Lys Phe Phe Lys Gly Asp Asp Glu Ile Leu Asn Gly
805 810 815
Ile Tyr Gln Ile Val Pro Ala Ile Asn Asn Thr Ser Val Ile Val Asn
820 825 830
Leu Glu Ser Ser Ala Phe Val Trp Leu Trp Gly Lys Asp Ser Glu Phe
835 840 845
Ala His Gln Trp Arg Val Glu Tyr Asp Glu Ser Glu Leu Ala Tyr Gln
850 855 860
Ile Lys Asn Met Ser Asn Glu Asn Ile Val Leu Ala Ala Met Ile Val
865 870 875 880
Thr Glu Pro Ser Tyr Ala His Gly Ile Pro Asn Met Lys Asn Ser Leu
885 890 895
Gln Tyr Trp Ile Phe Glu Tyr Val Gly Asn Gly Tyr Tyr Ile Ile Lys
900 905 910
Leu Lys Ala Asn Pro Lys Leu Val Leu Asp Val Asn Gly Gly Asn Pro
915 920 925
Asn Asp Gly Thr Ile Ile Lys Val Asn Tyr Ser His Asp Leu Asn Ser
930 935 940
Pro Asn Ile Asn Ala Gln Lys Phe Arg Phe Asn Asn Met Asn Asn
945 950 955
<210> 115
<211> 1391
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 115
Met Asn Gln Tyr Tyr Asn Asn Asn Glu Tyr Glu Ile Leu Asp Asn Gly
1 5 10 15
Gly Arg Gly Tyr Gln Pro Arg Tyr Pro Leu Ala Gln Ala Pro Gly Ser
20 25 30
Glu Phe Gln Gln Met Asn Tyr Lys Asp Trp Met Asp Arg Tyr Thr Gly
35 40 45
Glu Glu Gln Ala Gly Gly Leu Ile Arg Ala Glu Ile Asp Ala Asn Thr
50 55 60
Ala Leu Thr Ala Thr Phe Gly Val Ala Trp Ala Val Leu Gly Val Ser
65 70 75 80
Asn Pro Val Gly Ser Ala Ile Ala Gly Ile Leu Asn Val Val Thr Pro
85 90 95
Ile Ile Phe Asn Gln Val Asp Pro Asn Ser Pro Leu Lys Val Trp Glu
100 105 110
Ser Met Ile Gly Tyr Ala Gln Ala Leu Ile Lys Lys Glu Leu Thr Glu
115 120 125
Thr Val Arg Asn Leu Ala Leu Asp His Ile Asp Arg Ile Asn Asp Tyr
130 135 140
Gln Ile Asp Tyr Lys Asn Lys Ala Arg Ile Trp Glu Gly Asn Pro Thr
145 150 155 160
Pro Gly Asn Ala Ser Gln Leu Leu Asp Ala Phe Lys Asn Leu Lys Lys
165 170 175
Ser Cys Gln Asp Ala Met Pro His Leu Asp Thr Arg Gly Tyr Glu Thr
180 185 190
Ile Leu Leu Pro Leu Tyr Ala Gln Ala Ala Asn Ile His Leu Ile Val
195 200 205
Leu Arg Asp Gly Leu Met His Gly Arg Ser Trp Gly Met Thr Asn Glu
210 215 220
Glu Tyr Glu Asp Leu Tyr Ser Gly Phe Phe Gly Phe Lys Asn Arg Ile
225 230 235 240
Ala Thr Tyr Thr Asn Tyr Cys Thr Asn Thr Phe Asn Gln Gly Ala Thr
245 250 255
Thr Asn Tyr Thr Ala Asp Met Glu Asp Cys Ala Lys Tyr Pro Trp Val
260 265 270
Arg Tyr Asn Gln Asp Glu Phe Tyr Ser Gly Phe Phe Gly Pro Glu Asn
275 280 285
Ser Cys Arg Gln Ser Gln Thr Thr Asn Ala Tyr Pro Gln Pro Lys Ser
290 295 300
Gly Phe Asn Gln Pro Leu Asn Leu Arg Asn Ser Arg Gln Ser Pro Gly
305 310 315 320
Glu Tyr Arg Asp Val Glu Ala Trp Asn Leu Arg Asn Glu Phe Ile Ser
325 330 335
Ser Met Thr Ile Gln Val Leu Asp Thr Val Ala Leu Trp Pro Thr Tyr
340 345 350
Asp Pro Arg Val Tyr Thr Ser Ala Val Lys Thr Glu Phe Thr Arg Glu
355 360 365
Ile Tyr Thr Ser Ile Arg Gly Thr Thr Tyr Arg Ser Asp Pro Ala Gln
370 375 380
Asn Thr Met Ser Ala Ile Glu Ala Arg Met Ile Arg Pro Pro His Leu
385 390 395 400
Phe Glu Trp Pro Gln Thr Met Thr Phe Phe Phe Glu Asp Val Asn Val
405 410 415
Arg Tyr Thr Trp Val Gly Asn Ser Trp Thr Lys Gly Gln Ala Leu Ile
420 425 430
Gly Leu Lys Thr Glu Ser Ala Arg Thr Leu Asp Thr Asn Met Ile Thr
435 440 445
Ala Leu Gln Gly Leu Thr Asp Glu Tyr Glu Tyr Tyr His Ile Pro Leu
450 455 460
Asp Val Ser Thr Arg Gln Lys Asp Ile Thr Asn Ile Arg Thr Lys Gln
465 470 475 480
Trp Phe Glu Pro Arg Glu Phe Ile Phe Tyr Arg Ser Gly Gly Gln Gln
485 490 495
Val Leu Ala Leu Gly Thr Ile Glu Glu Asp Ile Pro Gly Tyr Ser Val
500 505 510
Tyr Leu Val Asn Asp Tyr Ile Tyr Thr Pro Ser Ile Arg Gln Asn Thr
515 520 525
Phe Pro Pro Ile Glu Val Pro Glu Gly Thr Pro Pro Pro Ser His Arg
530 535 540
Leu Ser Trp Val Lys Phe Glu Pro Val Arg Asp Asn Ala Thr Thr Phe
545 550 555 560
Val Asp Pro Lys Gln Ile Gly Ala Thr Ile Phe Gly Trp Thr His Thr
565 570 575
Ser Val Asp Pro Asn Asn Thr Ile Asp Ala Thr Lys Ile Thr Gln Ile
580 585 590
Pro Ala Val Lys Ala Ser Gly Gly Thr Asp Tyr Gln Val Ile Lys Gly
595 600 605
Pro Gly Ser Thr Gly Gly Asp Leu Val Ser Leu Pro Thr Tyr Ala Lys
610 615 620
Ile Glu Ile Pro Leu Thr Ser Ser Ala Asp Gln Leu Tyr Glu Val Arg
625 630 635 640
Ile Arg Tyr Ala Ala Ala Glu Gln Lys Arg Ile Arg Ile Gly Lys Lys
645 650 655
Ser Arg Asp Phe Gln Trp Arg Tyr Ile Asp Tyr Asp Val Pro Ser Thr
660 665 670
Pro Tyr Ser Gly Gly Asn Leu Thr Tyr Asn Ala Phe Ser Tyr Lys Glu
675 680 685
Leu Gly Thr Val Gln Gly Val Leu Glu Leu Ser Ile Glu Arg Ile Asp
690 695 700
Pro Phe Asp Glu Pro Ile Ile Ile Asp Lys Ile Glu Phe Ile Pro Ile
705 710 715 720
Gln Gly Ser Val Glu Thr Tyr Glu Ala Asp Gln Asp Leu Glu Lys Ala
725 730 735
Arg Lys Ala Val Asn Ala Leu Phe Thr Ser Asp Ala Lys Asn Ala Leu
740 745 750
Gln Leu Glu Val Thr Asp Tyr Leu Val Asp Gln Ala Ala Lys Leu Val
755 760 765
Glu Cys Met Ser Asp Glu Ile Tyr Pro Gln Glu Lys Met Cys Leu Leu
770 775 780
Asp Gln Val Lys Val Ala Lys Arg Leu Ser Gln Ala Arg Asn Leu Leu
785 790 795 800
His His Gly Asp Phe Glu Ser Pro Asp Trp Phe Gly Glu Asn Gly Trp
805 810 815
Lys Thr Ser Asn His Val Ser Val Thr Ser Gly Asn Pro Ile Phe Lys
820 825 830
Gly Arg Tyr Leu Asn Met Pro Gly Ala Asp Asn Pro Gln Leu Ser Asp
835 840 845
Thr Val Phe Pro Thr Tyr Ala Tyr Gln Lys Ile Asp Glu Ser Lys Leu
850 855 860
Lys Pro Tyr Thr Arg Tyr Met Ile Arg Gly Phe Val Gly Asn Ser Lys
865 870 875 880
Asp Leu Glu Val Phe Ile Thr Arg Tyr Asp Lys Glu Val His Lys Asn
885 890 895
Met Asn Val Pro His Asp Ile Leu Pro Thr Asp Pro Cys Thr Gly Thr
900 905 910
Tyr Pro Leu Gly Glu Gly Pro Met Leu Thr Asn His Thr Ile Pro Gln
915 920 925
Asp Met Ser Cys Asp Pro Cys Asp Ala Gly Thr Val Met Lys Val Gln
930 935 940
Gln Thr Leu Val Lys Cys Glu Ala Pro His Ala Phe Thr Phe His Ile
945 950 955 960
Asp Thr Gly Glu Leu Asn Arg Asp Gln Asn Leu Gly Ile Trp Val Gly
965 970 975
Phe Lys Ile Gly Thr Thr Asp Gly Arg Ala Thr Phe Asp Asn Leu Glu
980 985 990
Val Ile Glu Ala Asn Pro Leu Thr Gly Glu Ala Leu Ala Arg Val Lys
995 1000 1005
Lys Arg Glu His Lys Trp Lys Gln Lys Trp Met Glu Lys Arg Thr
1010 1015 1020
Lys Ile Asp Lys Ala Val Gln Ile Ala Gln Gly Ala Ile Gln Ala
1025 1030 1035
Leu Phe Thr Asp Ser Asn Gln Asn Arg Leu Lys Pro Asp Ile Thr
1040 1045 1050
Leu Asn His Ile Leu Gln Val Glu Thr Glu Val Gln Lys Ile Pro
1055 1060 1065
Tyr Val Tyr Asn Ala Phe Leu Gln Gly Ala Leu Pro Pro Val Pro
1070 1075 1080
Gly Glu Thr Tyr Asp Ile Phe Gln Gln Leu Ser Ser Ala Val Ala
1085 1090 1095
Ile Ala Arg Ser Leu Tyr Glu Gln Arg Asn Val Leu Arg Asn Gly
1100 1105 1110
Asp Phe Met Ala Gly Leu Ser Asn Trp Arg Gly Ala Lys Gly Ala
1115 1120 1125
Ile Val Gln Lys Ile Gly Asn Ala Ser Val Leu Val Ile Ser Asp
1130 1135 1140
Trp Ser Ala Asn Leu Ser Gln Asp Val Pro Val Asn Pro Gly His
1145 1150 1155
Gly Tyr Ile Leu Arg Val Thr Ala Lys Lys Glu Gly Ser Gly Glu
1160 1165 1170
Gly Tyr Val Thr Ile Ser Asp Gly Thr Glu Asp Asn Thr Glu Thr
1175 1180 1185
Leu Lys Phe Thr Val Gly Glu Val Ala Thr Arg Leu Ala Gln Ser
1190 1195 1200
Asp Met His Ser Pro Met Gln Gly Arg Tyr Gln Glu Arg Asn Met
1205 1210 1215
Thr Asn Ala Pro Ser Glu Ala Tyr Gly Thr Asn Gly Tyr Thr Ser
1220 1225 1230
His Thr Met Thr Asn Tyr Pro Ser Asp Asn Asp Gly Thr Asn Ala
1235 1240 1245
Tyr Pro Gly Asn His Asn Arg Asn Tyr Gln Ser Glu Ser Phe Gly
1250 1255 1260
Ile Thr Pro Tyr Gly Asp Glu Thr Ser Met Met Asn Gly Pro Ser
1265 1270 1275
Asn Lys Tyr Glu Met Asn Ala Tyr Ala Gly Ser Thr Tyr Met Thr
1280 1285 1290
Asn Asn Gly Val Gly Cys Ser Cys Gly Cys Ser Thr Asn Ala Tyr
1295 1300 1305
Ala Gly Glu Asn Met Met Arg Asn Asp Ser Ser Asn Glu Asn Gly
1310 1315 1320
Val Gly Tyr Gly Cys Gly Cys Arg Thr His Pro Asn Asn Arg Thr
1325 1330 1335
Asp Ser Tyr Pro Met Thr Ala His Gly Ser Ser Leu Ser Gly Tyr
1340 1345 1350
Val Thr Lys Thr Phe Glu Ile Phe Pro Glu Thr Asn Arg Val Cys
1355 1360 1365
Ile Glu Ile Gly Glu Thr Ser Gly Thr Phe Lys Val Glu Ser Ile
1370 1375 1380
Glu Leu Ile Arg Met Asp Cys Glu
1385 1390
<210> 116
<211> 720
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 116
Met Asn Gln Tyr Tyr Asn Asn Asn Glu Tyr Glu Ile Leu Asp Asn Gly
1 5 10 15
Gly Arg Gly Tyr Gln Pro Arg Tyr Pro Leu Ala Gln Ala Pro Gly Ser
20 25 30
Glu Phe Gln Gln Met Asn Tyr Lys Asp Trp Met Asp Arg Tyr Thr Gly
35 40 45
Glu Glu Gln Ala Gly Gly Leu Ile Arg Ala Glu Ile Asp Ala Asn Thr
50 55 60
Ala Leu Thr Ala Thr Phe Gly Val Ala Trp Ala Val Leu Gly Val Ser
65 70 75 80
Asn Pro Val Gly Ser Ala Ile Ala Gly Ile Leu Asn Val Val Thr Pro
85 90 95
Ile Ile Phe Asn Gln Val Asp Pro Asn Ser Pro Leu Lys Val Trp Glu
100 105 110
Ser Met Ile Gly Tyr Ala Gln Ala Leu Ile Lys Lys Glu Leu Thr Glu
115 120 125
Thr Val Arg Asn Leu Ala Leu Asp His Ile Asp Arg Ile Asn Asp Tyr
130 135 140
Gln Ile Asp Tyr Lys Asn Lys Ala Arg Ile Trp Glu Gly Asn Pro Thr
145 150 155 160
Pro Gly Asn Ala Ser Gln Leu Leu Asp Ala Phe Lys Asn Leu Lys Lys
165 170 175
Ser Cys Gln Asp Ala Met Pro His Leu Asp Thr Arg Gly Tyr Glu Thr
180 185 190
Ile Leu Leu Pro Leu Tyr Ala Gln Ala Ala Asn Ile His Leu Ile Val
195 200 205
Leu Arg Asp Gly Leu Met His Gly Arg Ser Trp Gly Met Thr Asn Glu
210 215 220
Glu Tyr Glu Asp Leu Tyr Ser Gly Phe Phe Gly Phe Lys Asn Arg Ile
225 230 235 240
Ala Thr Tyr Thr Asn Tyr Cys Thr Asn Thr Phe Asn Gln Gly Ala Thr
245 250 255
Thr Asn Tyr Thr Ala Asp Met Glu Asp Cys Ala Lys Tyr Pro Trp Val
260 265 270
Arg Tyr Asn Gln Asp Glu Phe Tyr Ser Gly Phe Phe Gly Pro Glu Asn
275 280 285
Ser Cys Arg Gln Ser Gln Thr Thr Asn Ala Tyr Pro Gln Pro Lys Ser
290 295 300
Gly Phe Asn Gln Pro Leu Asn Leu Arg Asn Ser Arg Gln Ser Pro Gly
305 310 315 320
Glu Tyr Arg Asp Val Glu Ala Trp Asn Leu Arg Asn Glu Phe Ile Ser
325 330 335
Ser Met Thr Ile Gln Val Leu Asp Thr Val Ala Leu Trp Pro Thr Tyr
340 345 350
Asp Pro Arg Val Tyr Thr Ser Ala Val Lys Thr Glu Phe Thr Arg Glu
355 360 365
Ile Tyr Thr Ser Ile Arg Gly Thr Thr Tyr Arg Ser Asp Pro Ala Gln
370 375 380
Asn Thr Met Ser Ala Ile Glu Ala Arg Met Ile Arg Pro Pro His Leu
385 390 395 400
Phe Glu Trp Pro Gln Thr Met Thr Phe Phe Phe Glu Asp Val Asn Val
405 410 415
Arg Tyr Thr Trp Val Gly Asn Ser Trp Thr Lys Gly Gln Ala Leu Ile
420 425 430
Gly Leu Lys Thr Glu Ser Ala Arg Thr Leu Asp Thr Asn Met Ile Thr
435 440 445
Ala Leu Gln Gly Leu Thr Asp Glu Tyr Glu Tyr Tyr His Ile Pro Leu
450 455 460
Asp Val Ser Thr Arg Gln Lys Asp Ile Thr Asn Ile Arg Thr Lys Gln
465 470 475 480
Trp Phe Glu Pro Arg Glu Phe Ile Phe Tyr Arg Ser Gly Gly Gln Gln
485 490 495
Val Leu Ala Leu Gly Thr Ile Glu Glu Asp Ile Pro Gly Tyr Ser Val
500 505 510
Tyr Leu Val Asn Asp Tyr Ile Tyr Thr Pro Ser Ile Arg Gln Asn Thr
515 520 525
Phe Pro Pro Ile Glu Val Pro Glu Gly Thr Pro Pro Pro Ser His Arg
530 535 540
Leu Ser Trp Val Lys Phe Glu Pro Val Arg Asp Asn Ala Thr Thr Phe
545 550 555 560
Val Asp Pro Lys Gln Ile Gly Ala Thr Ile Phe Gly Trp Thr His Thr
565 570 575
Ser Val Asp Pro Asn Asn Thr Ile Asp Ala Thr Lys Ile Thr Gln Ile
580 585 590
Pro Ala Val Lys Ala Ser Gly Gly Thr Asp Tyr Gln Val Ile Lys Gly
595 600 605
Pro Gly Ser Thr Gly Gly Asp Leu Val Ser Leu Pro Thr Tyr Ala Lys
610 615 620
Ile Glu Ile Pro Leu Thr Ser Ser Ala Asp Gln Leu Tyr Glu Val Arg
625 630 635 640
Ile Arg Tyr Ala Ala Ala Glu Gln Lys Arg Ile Arg Ile Gly Lys Lys
645 650 655
Ser Arg Asp Phe Gln Trp Arg Tyr Ile Asp Tyr Asp Val Pro Ser Thr
660 665 670
Pro Tyr Ser Gly Gly Asn Leu Thr Tyr Asn Ala Phe Ser Tyr Lys Glu
675 680 685
Leu Gly Thr Val Gln Gly Val Leu Glu Leu Ser Ile Glu Arg Ile Asp
690 695 700
Pro Phe Asp Glu Pro Ile Ile Ile Asp Lys Ile Glu Phe Ile Pro Ile
705 710 715 720
<210> 117
<211> 864
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 117
Met Pro Asn Glu Met Asn Ser Lys Asn Asn Met Asn Thr Thr Arg Ile
1 5 10 15
Asp Asp Lys Asp Leu Leu Asp Ser Lys Gly Arg Glu Cys Gln Pro Arg
20 25 30
Tyr Pro Ser Ala Glu Ala Pro Asp Val Met Asn Pro Ile Glu Lys Ile
35 40 45
Tyr Thr Arg Gly Gln Ser Ala Asn Ile Ser Asn Ala Thr Ser Thr Thr
50 55 60
Ile Asp Val Thr Ser Gln Ile Leu Arg Gly Ser Lys Lys Leu Ala Gly
65 70 75 80
Leu Ile Ile Lys Val Leu Ile Lys Ile Leu Trp Pro Lys Gly Thr Gln
85 90 95
Gln Glu Trp Glu Asn Phe Met Asp Ala Val Glu Gln Leu Ile His Glu
100 105 110
Lys Leu Asp Thr Tyr Ala Arg Asn Lys Ala Thr Ala Asp Leu Glu Gly
115 120 125
Leu Gln Arg Val Leu Ser Glu Tyr Ile Asp Lys Leu Thr Ile Tyr Thr
130 135 140
Asp Asp Pro Asn Ala Ala His Ala Gln Ser Leu Arg Thr Gln Val Ile
145 150 155 160
Thr Thr Asp Asn Leu Phe Glu Tyr Ser Met Pro Ser Phe Ala Ile Arg
165 170 175
Asp Tyr Glu Met Gln Leu Leu Thr Val Tyr Ala Gln Ala Ala Asn Leu
180 185 190
His Leu Ala Phe Leu Glu Asp Ile Val Arg Phe Gly Asp Glu Trp Gly
195 200 205
Phe Thr Pro Ala Glu Ile Ala Asp Phe His Lys Ser Met Lys Glu Arg
210 215 220
Thr Asp Glu Tyr Thr Asn His Cys Val Ser Thr Tyr Asn Lys Gly Leu
225 230 235 240
Glu Lys Ala Lys Leu Leu Thr Ala Asp Leu Cys Asn His Ser Thr Tyr
245 250 255
Pro Trp Thr Arg Tyr Asn Gln Gly Trp Arg Lys Glu Glu Thr Pro Asn
260 265 270
Cys Ser Leu Gln Gln Ile Asp Ser Cys Glu Gln Pro Leu Asn Glu Ser
275 280 285
Asp Glu Gln Gly Thr Pro Tyr Met Gly Pro Glu Arg Arg Ser Ile Asp
290 295 300
Glu Tyr Gln Lys Leu Glu Asn Trp Asn Leu Tyr Asn Ala Tyr Arg Arg
305 310 315 320
Asp Met Thr Leu Met Val Leu Asp Ile Val Ser Leu Trp Pro Thr Tyr
325 330 335
Asp Pro Lys Leu Tyr Pro Thr Ser Tyr Gly Val Thr Ser Glu Leu Thr
340 345 350
Arg Glu Leu Tyr Thr Asp Ile Arg Gly Thr Thr Tyr Arg Ser Asp Glu
355 360 365
Ser Gln Asn Ser Leu Asp Ala Ile Glu Gly Arg Met Ile Pro Gln Pro
370 375 380
Ser Leu Phe Arg Trp Leu Phe Gly Ile Ser Ile Lys Met Asn Gln Phe
385 390 395 400
Ile Gly Thr Gly Gly Ser Asp Asp Trp Thr Ser Gly Asn Leu Met Leu
405 410 415
Gly Met Arg Ile Met Trp Arg Lys Thr Leu Thr Glu Gln Trp Glu Gly
420 425 430
Leu Tyr Phe Gly Gly Gly Gly Asn Leu Tyr Ile Asn Leu Asn Pro Leu
435 440 445
Asp Tyr Ala Glu Gly Ile Thr His Val Asn Thr Arg Gln Trp Phe Glu
450 455 460
Pro Arg Trp Phe Glu Phe Tyr Ser Gly Gly Tyr Asp Thr Glu Leu Lys
465 470 475 480
Phe Gln Asn Lys Cys Gly Thr Ile Glu Glu Lys Ser Pro Phe Trp Ala
485 490 495
Ala Gly Pro Tyr Asn Tyr Ile Tyr Gln Pro Gly Val Arg Thr Ser Gln
500 505 510
Ile Pro Glu His Arg Leu Ser Trp Met Thr Tyr Glu Pro Val Arg Glu
515 520 525
Asn Ala Pro Phe Val Tyr Pro Gln Tyr Lys Gln Leu Gly Ala Val Ala
530 535 540
Leu Gly Trp Thr Ser Asn Ser Val Asp Gln Lys Asn Ala Ile Asp Leu
545 550 555 560
Asp Lys Ile Thr Ser Leu Pro Ala Val Lys Gly Asn Glu Ile Arg Gly
565 570 575
Pro Gly Ser Val Val Arg Gly Pro Gly Ser Thr Gly Gly Asn Leu Val
580 585 590
Leu Leu Asn Pro Thr Gly Glu Val Ser Ile Thr Val Ser Gln Pro Ile
595 600 605
Leu Asn Lys Pro Gly Ala Gly Tyr Tyr His Val Arg Ile Arg Tyr Ala
610 615 620
Ala Ile Ala Asn Gly Lys Leu Asn Val Lys Arg Ser Val Asn Ser Ser
625 630 635 640
Thr Gln Glu Ser Val Thr Tyr Asp Tyr Lys Gln Thr Thr Ala Arg Asp
645 650 655
Leu Thr Tyr Ser Ser Phe Gln Tyr Leu Glu Val Tyr Asp Phe Asn Pro
660 665 670
Val Thr Thr Ala Ser Gln Phe Glu Val Leu Leu Thr Asn Glu Ser Gly
675 680 685
Gly Pro Ile Tyr Ile Asp Lys Ile Glu Phe Ile Pro Phe Gln Gly Thr
690 695 700
Thr Gly Pro Glu Ala Pro Gly Ile Pro Ala Asp Ile Tyr Tyr Thr Ile
705 710 715 720
Ser Pro Ala Ile Asp Glu Asn Tyr Tyr Thr Ile Gly Gly Ser Thr Ile
725 730 735
Trp Leu Ser Leu Lys Glu Ser Phe Ser Asn Pro Pro Arg Asp Gly Gln
740 745 750
Trp Arg Phe Val Tyr Asp Thr Asn Lys Lys Ala Tyr Gln Ile Gly Thr
755 760 765
Gln Leu Asn Glu Asp Pro Ala Gly Gly Leu Phe Ala Val Leu Thr Ser
770 775 780
Thr Glu Pro Asn Asn Asn Leu Thr Met Tyr Pro Asn Leu Ser Leu Asp
785 790 795 800
Gln Gln Tyr Trp Phe Val Lys Asp Ala Gly Asp Gly Tyr Val Thr Leu
805 810 815
Glu Asn Tyr Gly Tyr Pro Gly Tyr Val Val Asn Val Pro Arg Val Tyr
820 825 830
Gln Gly Ala Ile Leu Thr Leu Ser Pro Phe Ile Gly Ser Met Asn His
835 840 845
Lys Phe Lys Leu Thr Lys Leu Ser Thr Asn Asp Glu Val Ile Leu Gly
850 855 860
<210> 118
<211> 701
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 118
Met Pro Asn Glu Met Asn Ser Lys Asn Asn Met Asn Thr Thr Arg Ile
1 5 10 15
Asp Asp Lys Asp Leu Leu Asp Ser Lys Gly Arg Glu Cys Gln Pro Arg
20 25 30
Tyr Pro Ser Ala Glu Ala Pro Asp Val Met Asn Pro Ile Glu Lys Ile
35 40 45
Tyr Thr Arg Gly Gln Ser Ala Asn Ile Ser Asn Ala Thr Ser Thr Thr
50 55 60
Ile Asp Val Thr Ser Gln Ile Leu Arg Gly Ser Lys Lys Leu Ala Gly
65 70 75 80
Leu Ile Ile Lys Val Leu Ile Lys Ile Leu Trp Pro Lys Gly Thr Gln
85 90 95
Gln Glu Trp Glu Asn Phe Met Asp Ala Val Glu Gln Leu Ile His Glu
100 105 110
Lys Leu Asp Thr Tyr Ala Arg Asn Lys Ala Thr Ala Asp Leu Glu Gly
115 120 125
Leu Gln Arg Val Leu Ser Glu Tyr Ile Asp Lys Leu Thr Ile Tyr Thr
130 135 140
Asp Asp Pro Asn Ala Ala His Ala Gln Ser Leu Arg Thr Gln Val Ile
145 150 155 160
Thr Thr Asp Asn Leu Phe Glu Tyr Ser Met Pro Ser Phe Ala Ile Arg
165 170 175
Asp Tyr Glu Met Gln Leu Leu Thr Val Tyr Ala Gln Ala Ala Asn Leu
180 185 190
His Leu Ala Phe Leu Glu Asp Ile Val Arg Phe Gly Asp Glu Trp Gly
195 200 205
Phe Thr Pro Ala Glu Ile Ala Asp Phe His Lys Ser Met Lys Glu Arg
210 215 220
Thr Asp Glu Tyr Thr Asn His Cys Val Ser Thr Tyr Asn Lys Gly Leu
225 230 235 240
Glu Lys Ala Lys Leu Leu Thr Ala Asp Leu Cys Asn His Ser Thr Tyr
245 250 255
Pro Trp Thr Arg Tyr Asn Gln Gly Trp Arg Lys Glu Glu Thr Pro Asn
260 265 270
Cys Ser Leu Gln Gln Ile Asp Ser Cys Glu Gln Pro Leu Asn Glu Ser
275 280 285
Asp Glu Gln Gly Thr Pro Tyr Met Gly Pro Glu Arg Arg Ser Ile Asp
290 295 300
Glu Tyr Gln Lys Leu Glu Asn Trp Asn Leu Tyr Asn Ala Tyr Arg Arg
305 310 315 320
Asp Met Thr Leu Met Val Leu Asp Ile Val Ser Leu Trp Pro Thr Tyr
325 330 335
Asp Pro Lys Leu Tyr Pro Thr Ser Tyr Gly Val Thr Ser Glu Leu Thr
340 345 350
Arg Glu Leu Tyr Thr Asp Ile Arg Gly Thr Thr Tyr Arg Ser Asp Glu
355 360 365
Ser Gln Asn Ser Leu Asp Ala Ile Glu Gly Arg Met Ile Pro Gln Pro
370 375 380
Ser Leu Phe Arg Trp Leu Phe Gly Ile Ser Ile Lys Met Asn Gln Phe
385 390 395 400
Ile Gly Thr Gly Gly Ser Asp Asp Trp Thr Ser Gly Asn Leu Met Leu
405 410 415
Gly Met Arg Ile Met Trp Arg Lys Thr Leu Thr Glu Gln Trp Glu Gly
420 425 430
Leu Tyr Phe Gly Gly Gly Gly Asn Leu Tyr Ile Asn Leu Asn Pro Leu
435 440 445
Asp Tyr Ala Glu Gly Ile Thr His Val Asn Thr Arg Gln Trp Phe Glu
450 455 460
Pro Arg Trp Phe Glu Phe Tyr Ser Gly Gly Tyr Asp Thr Glu Leu Lys
465 470 475 480
Phe Gln Asn Lys Cys Gly Thr Ile Glu Glu Lys Ser Pro Phe Trp Ala
485 490 495
Ala Gly Pro Tyr Asn Tyr Ile Tyr Gln Pro Gly Val Arg Thr Ser Gln
500 505 510
Ile Pro Glu His Arg Leu Ser Trp Met Thr Tyr Glu Pro Val Arg Glu
515 520 525
Asn Ala Pro Phe Val Tyr Pro Gln Tyr Lys Gln Leu Gly Ala Val Ala
530 535 540
Leu Gly Trp Thr Ser Asn Ser Val Asp Gln Lys Asn Ala Ile Asp Leu
545 550 555 560
Asp Lys Ile Thr Ser Leu Pro Ala Val Lys Gly Asn Glu Ile Arg Gly
565 570 575
Pro Gly Ser Val Val Arg Gly Pro Gly Ser Thr Gly Gly Asn Leu Val
580 585 590
Leu Leu Asn Pro Thr Gly Glu Val Ser Ile Thr Val Ser Gln Pro Ile
595 600 605
Leu Asn Lys Pro Gly Ala Gly Tyr Tyr His Val Arg Ile Arg Tyr Ala
610 615 620
Ala Ile Ala Asn Gly Lys Leu Asn Val Lys Arg Ser Val Asn Ser Ser
625 630 635 640
Thr Gln Glu Ser Val Thr Tyr Asp Tyr Lys Gln Thr Thr Ala Arg Asp
645 650 655
Leu Thr Tyr Ser Ser Phe Gln Tyr Leu Glu Val Tyr Asp Phe Asn Pro
660 665 670
Val Thr Thr Ala Ser Gln Phe Glu Val Leu Leu Thr Asn Glu Ser Gly
675 680 685
Gly Pro Ile Tyr Ile Asp Lys Ile Glu Phe Ile Pro Phe
690 695 700
<210> 119
<211> 1195
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 119
Met Asn His Asn Asn Asn Asn Asn Glu Tyr Glu Ile Ile Asp Ser His
1 5 10 15
Thr Ser Ser Tyr Pro Ser Asn Arg Asn Ser Asn His Ser Arg Tyr Pro
20 25 30
Tyr Thr Asn Asn Pro Asn Gln Pro Leu Ser Asn Thr Asn Tyr Lys Asp
35 40 45
Trp Leu Thr Met Cys Gln Gly Asn Gln Pro Tyr Arg Asp Lys Pro Gln
50 55 60
Thr Leu Val Asp Thr Gly Val Ala Ile Ser Thr Gly Leu Ile Ile Ile
65 70 75 80
Gly Thr Ile Met Thr Gly Phe Pro Ala Leu Thr Pro Trp Gly Ile Gly
85 90 95
Ile Val Ser Phe Gly Thr Leu Leu Pro Val Leu Trp Pro Ser Gly Glu
100 105 110
Thr Asp Lys Lys Val Trp Ile Asp Phe Met Asn Tyr Gly Gly Thr Ile
115 120 125
Phe Lys Pro Leu Leu Asn Glu Gln Lys Asp Gln Ala Gln Arg Ile Leu
130 135 140
Gln Gly Phe His Leu Pro Ile Asn Ser Tyr Gln Asn Gly Ile Thr Thr
145 150 155 160
Trp Gln Asn Leu Arg Lys Ser His Ile Pro Gly Thr Pro Pro Ser Ser
165 170 175
Ala Leu Arg Glu Ala Ala Arg Val Ala Arg Thr Arg Ile Asp Ile Ala
180 185 190
Asn Ser Ala Leu Ser Arg Ile Ser Glu Ile Gln Ser Ala Gln Phe Gly
195 200 205
Ala Ala Ile Leu Pro Phe Tyr Ala Gln Ala Ala Thr Leu His Leu Asn
210 215 220
Leu Leu Ser Gln Ala Val Gln Phe Ala Asp Gln Leu Lys Ala Asp Ala
225 230 235 240
Glu Asn Gln Pro Thr Gln Asp Ala Gly Thr Ser Asp Glu Tyr His Arg
245 250 255
Leu Leu Lys Gln Tyr Thr Gln Gln Tyr Thr Asp Tyr Cys Val Lys Thr
260 265 270
Tyr Gln Glu Gly Leu Thr Thr Leu Lys Asn Ala Pro Asp Met Thr Trp
275 280 285
Asn Ile Tyr Asn Thr Tyr Arg Arg Asp Met Thr Leu Leu Val Leu Asp
290 295 300
Tyr Ile Ala Leu Phe Pro Asn Phe Asp Ile Arg Gln Tyr Pro Thr Gly
305 310 315 320
Thr His Ser Glu Leu Thr Arg Glu Ile Tyr Thr Asp Ala Leu Ile Lys
325 330 335
Glu Gln Pro Asn Pro Pro Ile Pro Ile Pro Val Ala Gln Arg Glu Thr
340 345 350
Glu Leu Thr Arg Pro Pro His Leu Phe Ser Arg Leu Asn Arg Leu Leu
355 360 365
Leu Phe Ser Asn Thr Asp Pro Ser Pro Pro Ala Leu Ser Ala Val Gln
370 375 380
Asn Ser Phe Thr Tyr Thr Asn Asn Asn Ile Val Gln Tyr Ser Gln Ile
385 390 395 400
Tyr Arg Leu Ile Asn Ser Pro Glu Pro Thr Gly Ser Thr Gln Gln Ile
405 410 415
Ile Ile Ser Asn Thr Asp Ile Tyr Arg Ile Glu Leu Thr Leu Arg Gln
420 425 430
Asn Tyr Phe Ile Thr His Asn Ile Asn Phe Tyr Thr Leu Asp Gly Ile
435 440 445
Thr Arg Arg Tyr Asn Ser Gly Ser Thr Pro Glu Thr Pro Ile Ile Ile
450 455 460
Asn Phe Gln Leu Pro Glu Phe Pro Ala Asp Lys Ile Pro Pro Tyr Gly
465 470 475 480
Leu Gly Thr Lys Tyr Ser His Ile Leu Ser Tyr Met Lys Ile Ser Pro
485 490 495
Asn Arg His Asn Ile Val Leu Pro Asn Glu Arg His Leu Ser Phe Ala
500 505 510
Trp Thr His Thr Ser Val Asn Phe Asp Asn Glu Ile Ser Asn Gln Ile
515 520 525
Val Thr Gln Ile Pro Ala Val Lys Ala Arg Ser Leu Asp Thr Ser Ser
530 535 540
Ser Val Ile Ser Gly Pro Gly His Thr Gly Gly Asp Leu Val Leu Phe
545 550 555 560
Arg Ser Arg Met Glu Phe Gln Cys Phe Val Pro Thr Gln Thr Lys Tyr
565 570 575
Lys Ile Arg Met Arg Tyr Val Ala Tyr Ser Pro Asn Asn Ser Ile Ser
580 585 590
Leu Thr Leu Asn Ile Met Gly Gln Val Asn Tyr Gly Leu Arg Ala Leu
595 600 605
Asn Ile Lys Ser Thr Val Ala Ser Glu Lys Asp Thr Val Asn Pro Lys
610 615 620
Tyr Asn Asp Phe Gln Tyr Leu Tyr Phe Tyr Thr Gln Asp Pro Asn Glu
625 630 635 640
Ile Ala Ile Val Asn Leu Tyr Ser Leu Thr Asn Met Thr Leu Gln Ser
645 650 655
Asn Asn Phe Ala Gly Asn Thr Leu Val Ile Asp Lys Ile Glu Phe Leu
660 665 670
Pro Val Thr Gln Thr Leu Glu Glu Asn Leu Glu Thr Gln Lys Leu Glu
675 680 685
Glu Ile Gln Gln Thr Val Asn Thr Phe Phe Thr Asn His Thr Lys Asn
690 695 700
Ile Leu Asn Ile Glu Thr Thr Asp Tyr Glu Ile Asp Gln Thr Ala Ile
705 710 715 720
Leu Val Glu Ser Leu Ser Glu Glu Gln Tyr Pro Gln Glu Lys Met Ile
725 730 735
Leu Leu Asp Thr Met Lys Gln Ala Lys Gln Phe Ser Gln Ser Arg Asn
740 745 750
Leu Leu Gln Asn Gly Asp Phe Ser Ser Phe Ile Gly Trp Thr Ile Asn
755 760 765
Asn Asp Leu Thr Leu Gln Thr Asn Asn Pro Ile Phe Lys Gly Gln Tyr
770 775 780
Leu His Ile Leu Gly Ala Arg Thr Thr Glu Ile Asp Asn Thr Ile Phe
785 790 795 800
Pro Thr Tyr Val Tyr Gln Lys Ile Asp Glu Ser Lys Leu Lys Pro Tyr
805 810 815
Thr Arg Tyr Val Val Arg Gly Cys Ile Ser Asn Ser Lys Asp Leu Glu
820 825 830
Leu Phe Val Thr Arg Tyr Asn Lys Glu Ile His Lys Ile Leu Asn Ile
835 840 845
Pro Asn Asp Ser Ser His Met Asn Ala Glu Ile Gln Thr Asn Ser Tyr
850 855 860
Glu Ala Gln Ser Tyr Ser Phe Arg Asn Glu Asn Asn Asn Leu Thr Asn
865 870 875 880
Gln Gln Ser Pro Tyr Pro Thr Thr Ser Asn His Ser Gly Tyr Ser Glu
885 890 895
Ser Thr Leu Ile Ser Ser Asn Phe Glu Asn Glu Asn Ala Phe Ser Phe
900 905 910
Ser Ile Asp Thr Gly Asn Leu Asp Phe Asn Glu Asn Leu Gly Ile Gly
915 920 925
Ile Leu Phe Lys Ile Ser Asn Pro Asp Gly Tyr Ala Ala Leu Gly Asn
930 935 940
Leu Glu Val Ile Glu Glu Gln Pro Leu Thr Glu Glu Glu Val Asn Arg
945 950 955 960
Val Thr Glu Lys Glu Asn Arg Trp Lys Gln Val Arg Asn Asn Gln Gln
965 970 975
Lys Glu Thr Glu Gln Ala Tyr Tyr Gln Ala Gln Gln Ala Ile Asp Ala
980 985 990
Leu Phe Ile Asn Ser Gln His Gln Met Leu Lys Ile Glu Thr Thr Ile
995 1000 1005
Gln Asp Ile Val Val Ala Glu Thr Phe Ile Asp Lys Ile Pro Tyr
1010 1015 1020
Ile Tyr Asn Glu Trp Leu Pro Asn Glu Pro Gly Met Asn Tyr Thr
1025 1030 1035
Leu Tyr Gln Gly Leu Lys Gln Gln Ile Ser Gln Ala Tyr Met Leu
1040 1045 1050
Tyr Arg Asn Arg Asn Leu Ile Gln Asn Gly Asn Phe Ile Asn Ser
1055 1060 1065
Thr Thr Asn Trp Phe Thr Ser Pro Asp Ala Lys Ile Gln Gln Ile
1070 1075 1080
Asp Asn Thr Ser Ile Leu Val Ile Pro Asn Trp Ser Thr Gln Val
1085 1090 1095
Ser Gln Asn Ile Asn Leu Pro Gln Gln Asn His Gln Tyr Leu Leu
1100 1105 1110
Arg Val Ile Ala Lys Lys Glu Gly Ile Gly Asn Gly Tyr Val Lys
1115 1120 1125
Ile Ser Asp Cys Thr Lys His Val Glu Thr Leu Ile Phe Thr Ser
1130 1135 1140
Cys Asp Tyr Asn Asn Asn Ala Met Trp Asn Glu Pro Met Gly Tyr
1145 1150 1155
Val Thr Lys Thr Met His Ile Thr Pro His Thr Asn Gln Val Arg
1160 1165 1170
Ile Asp Ile Gly Glu Thr Glu Gly Thr Phe Lys Ile Asn Ser Val
1175 1180 1185
Glu Leu Ile Gly Ile Lys Asn
1190 1195
<210> 120
<211> 674
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 120
Met Asn His Asn Asn Asn Asn Asn Glu Tyr Glu Ile Ile Asp Ser His
1 5 10 15
Thr Ser Ser Tyr Pro Ser Asn Arg Asn Ser Asn His Ser Arg Tyr Pro
20 25 30
Tyr Thr Asn Asn Pro Asn Gln Pro Leu Ser Asn Thr Asn Tyr Lys Asp
35 40 45
Trp Leu Thr Met Cys Gln Gly Asn Gln Pro Tyr Arg Asp Lys Pro Gln
50 55 60
Thr Leu Val Asp Thr Gly Val Ala Ile Ser Thr Gly Leu Ile Ile Ile
65 70 75 80
Gly Thr Ile Met Thr Gly Phe Pro Ala Leu Thr Pro Trp Gly Ile Gly
85 90 95
Ile Val Ser Phe Gly Thr Leu Leu Pro Val Leu Trp Pro Ser Gly Glu
100 105 110
Thr Asp Lys Lys Val Trp Ile Asp Phe Met Asn Tyr Gly Gly Thr Ile
115 120 125
Phe Lys Pro Leu Leu Asn Glu Gln Lys Asp Gln Ala Gln Arg Ile Leu
130 135 140
Gln Gly Phe His Leu Pro Ile Asn Ser Tyr Gln Asn Gly Ile Thr Thr
145 150 155 160
Trp Gln Asn Leu Arg Lys Ser His Ile Pro Gly Thr Pro Pro Ser Ser
165 170 175
Ala Leu Arg Glu Ala Ala Arg Val Ala Arg Thr Arg Ile Asp Ile Ala
180 185 190
Asn Ser Ala Leu Ser Arg Ile Ser Glu Ile Gln Ser Ala Gln Phe Gly
195 200 205
Ala Ala Ile Leu Pro Phe Tyr Ala Gln Ala Ala Thr Leu His Leu Asn
210 215 220
Leu Leu Ser Gln Ala Val Gln Phe Ala Asp Gln Leu Lys Ala Asp Ala
225 230 235 240
Glu Asn Gln Pro Thr Gln Asp Ala Gly Thr Ser Asp Glu Tyr His Arg
245 250 255
Leu Leu Lys Gln Tyr Thr Gln Gln Tyr Thr Asp Tyr Cys Val Lys Thr
260 265 270
Tyr Gln Glu Gly Leu Thr Thr Leu Lys Asn Ala Pro Asp Met Thr Trp
275 280 285
Asn Ile Tyr Asn Thr Tyr Arg Arg Asp Met Thr Leu Leu Val Leu Asp
290 295 300
Tyr Ile Ala Leu Phe Pro Asn Phe Asp Ile Arg Gln Tyr Pro Thr Gly
305 310 315 320
Thr His Ser Glu Leu Thr Arg Glu Ile Tyr Thr Asp Ala Leu Ile Lys
325 330 335
Glu Gln Pro Asn Pro Pro Ile Pro Ile Pro Val Ala Gln Arg Glu Thr
340 345 350
Glu Leu Thr Arg Pro Pro His Leu Phe Ser Arg Leu Asn Arg Leu Leu
355 360 365
Leu Phe Ser Asn Thr Asp Pro Ser Pro Pro Ala Leu Ser Ala Val Gln
370 375 380
Asn Ser Phe Thr Tyr Thr Asn Asn Asn Ile Val Gln Tyr Ser Gln Ile
385 390 395 400
Tyr Arg Leu Ile Asn Ser Pro Glu Pro Thr Gly Ser Thr Gln Gln Ile
405 410 415
Ile Ile Ser Asn Thr Asp Ile Tyr Arg Ile Glu Leu Thr Leu Arg Gln
420 425 430
Asn Tyr Phe Ile Thr His Asn Ile Asn Phe Tyr Thr Leu Asp Gly Ile
435 440 445
Thr Arg Arg Tyr Asn Ser Gly Ser Thr Pro Glu Thr Pro Ile Ile Ile
450 455 460
Asn Phe Gln Leu Pro Glu Phe Pro Ala Asp Lys Ile Pro Pro Tyr Gly
465 470 475 480
Leu Gly Thr Lys Tyr Ser His Ile Leu Ser Tyr Met Lys Ile Ser Pro
485 490 495
Asn Arg His Asn Ile Val Leu Pro Asn Glu Arg His Leu Ser Phe Ala
500 505 510
Trp Thr His Thr Ser Val Asn Phe Asp Asn Glu Ile Ser Asn Gln Ile
515 520 525
Val Thr Gln Ile Pro Ala Val Lys Ala Arg Ser Leu Asp Thr Ser Ser
530 535 540
Ser Val Ile Ser Gly Pro Gly His Thr Gly Gly Asp Leu Val Leu Phe
545 550 555 560
Arg Ser Arg Met Glu Phe Gln Cys Phe Val Pro Thr Gln Thr Lys Tyr
565 570 575
Lys Ile Arg Met Arg Tyr Val Ala Tyr Ser Pro Asn Asn Ser Ile Ser
580 585 590
Leu Thr Leu Asn Ile Met Gly Gln Val Asn Tyr Gly Leu Arg Ala Leu
595 600 605
Asn Ile Lys Ser Thr Val Ala Ser Glu Lys Asp Thr Val Asn Pro Lys
610 615 620
Tyr Asn Asp Phe Gln Tyr Leu Tyr Phe Tyr Thr Gln Asp Pro Asn Glu
625 630 635 640
Ile Ala Ile Val Asn Leu Tyr Ser Leu Thr Asn Met Thr Leu Gln Ser
645 650 655
Asn Asn Phe Ala Gly Asn Thr Leu Val Ile Asp Lys Ile Glu Phe Leu
660 665 670
Pro Val
<210> 121
<211> 654
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 121
Met Asn Ser Tyr Gln Asn Thr Asn Glu Tyr Glu Ile Leu Asp Gly Ser
1 5 10 15
Pro Asn Asn Thr Asn Met Ser Asn Arg Tyr Pro Phe Ala Lys Asp Pro
20 25 30
Asn Ile Phe Pro Ile Asn Leu Asp Ala Cys Gln Gly Arg Pro Trp Gln
35 40 45
Asp Thr Trp Glu Ser Val Ser Asp Ile Val Thr Ile Gly Thr Tyr Leu
50 55 60
Ile Gln Phe Leu Leu Glu Pro Gly Ile Gly Gly Ile Pro Val Ile Leu
65 70 75 80
Ser Ile Ile Asn Lys Leu Ile Pro Ser Ser Gly Gln Ser Val Ala Ala
85 90 95
Leu Ser Ile Cys Asp Leu Leu Ser Ile Ile Arg Lys Glu Val Asp Glu
100 105 110
Ser Val Leu Ser Asp Gly Val Ala Asp Phe Lys Gly Glu Met Thr Ala
115 120 125
Tyr Arg Asp Tyr Tyr Leu Pro Tyr Leu Glu Asp Trp Leu Thr Asp Lys
130 135 140
Ser Asn Pro Glu Lys Leu Ala Asp Val Val Lys Gln Phe Gln Ala Arg
145 150 155 160
Glu Glu Asp Phe Thr Lys Leu Leu Ala Gly Ser Leu Ser Arg Gln Lys
165 170 175
Ala Glu Ile Leu Leu Leu Pro Thr Tyr Val Gln Ala Ala Asn Val His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Val Lys Tyr Lys Lys Glu Trp Gly Leu
195 200 205
Val Cys Pro Pro Leu Tyr Pro Gly Ser Gly Arg Thr Asp Cys Asn Glu
210 215 220
Arg Leu Lys Ala Lys Ile Lys Glu Tyr Thr Asn Tyr Cys Val Glu Trp
225 230 235 240
Tyr Asn Lys Gly Leu Asp Gln Ile Lys Gln Ala Gly Thr Ser Thr Glu
245 250 255
Thr Trp Leu Lys Phe Asn Lys Phe Arg Arg Glu Met Thr Leu Ala Val
260 265 270
Leu Asp Val Ile Ala Ile Phe Pro Thr Tyr Asp Phe Glu Lys Tyr Pro
275 280 285
Leu Ala Thr Asn Val Glu Leu Thr Arg Glu Val Tyr Thr Asp Pro Val
290 295 300
Gly Tyr Ser Gly Ser Ser Ile Arg Trp Glu Arg Thr Phe Pro Asn Ala
305 310 315 320
Phe Asn Thr Leu Glu Ala Asn Gly Thr Arg Gly Pro Gly Leu Val Thr
325 330 335
Trp Leu Glu Val Leu Gly Ile Tyr Asn His Asp Phe Gln Asp Ile Thr
340 345 350
Val Tyr Phe Ser Gly Trp Val Gly Ser Arg His Ser Glu Asp Tyr Thr
355 360 365
Arg Gly Asn Gly Thr Phe Ser Arg Ile Ser Gly Thr Thr Ser Asn Asp
370 375 380
Ile Arg Asn Gln Thr Phe Tyr Ser Arg Asp Val Phe Lys Ile Asp Ser
385 390 395 400
Leu Gly Ile Tyr Glu Thr Arg Ala Glu Leu Gly Tyr Thr Arg Gln Arg
405 410 415
Phe Cys Val Ser Arg Ala Val Phe Ser Ser Thr Leu Tyr Asn Tyr Val
420 425 430
Tyr Asp Ala Arg Asn Asn Gly Gln Gly Arg Met Thr Ile Glu Ser Met
435 440 445
Leu Pro Gly Ile Lys Asn Pro Ile Pro Ser Ser Gln Asp Tyr Ser His
450 455 460
Arg Leu Ser Asn Ala Ala Cys Val Gln Phe Asn Asp Ser Arg Ile Asn
465 470 475 480
Val Tyr Gly Trp Thr His Thr Ser Met Thr Lys Asn Asn Pro Ile Tyr
485 490 495
Thr Asp Lys Ile Ser Gln Ile Pro Ala Val Lys Ala Phe Ala Leu Glu
500 505 510
Ser Gly Ala Tyr Val Ser Arg Gly Pro Gly His Thr Gly Gly Asp Val
515 520 525
Val Thr Leu Pro Asn Arg Gly Arg Leu Lys Ile Arg Leu Thr Ser Ala
530 535 540
Pro Thr Asn Lys Asn Tyr Arg Val Arg Val Arg Tyr Ala Thr Ser Tyr
545 550 555 560
Gly Ala Asp Leu Met Val Gln Arg Trp Ser Pro Ser Gly Asn His Ala
565 570 575
Ser Gly Tyr Phe Ser Gly Ser Pro Thr Gly Ser Tyr Pro Thr Phe Gly
580 585 590
Tyr Met Asn Thr Leu Val Thr Thr Phe Asn Gln Ser Gly Val Glu Ile
595 600 605
Ile Ile Glu Asn Arg His Tyr Ser Asn Ile Ile Ile Asp Lys Ile Glu
610 615 620
Phe Ile Pro Asp Asp Ile Thr Thr Leu Glu Tyr Glu Gly Glu Arg Asp
625 630 635 640
Leu Glu Lys Thr Lys Asn Ala Val Asn Asp Leu Phe Thr Asn
645 650
<210> 122
<211> 628
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 122
Met Asn Ser Tyr Gln Asn Thr Asn Glu Tyr Glu Ile Leu Asp Gly Ser
1 5 10 15
Pro Asn Asn Thr Asn Met Ser Asn Arg Tyr Pro Phe Ala Lys Asp Pro
20 25 30
Asn Ile Phe Pro Ile Asn Leu Asp Ala Cys Gln Gly Arg Pro Trp Gln
35 40 45
Asp Thr Trp Glu Ser Val Ser Asp Ile Val Thr Ile Gly Thr Tyr Leu
50 55 60
Ile Gln Phe Leu Leu Glu Pro Gly Ile Gly Gly Ile Pro Val Ile Leu
65 70 75 80
Ser Ile Ile Asn Lys Leu Ile Pro Ser Ser Gly Gln Ser Val Ala Ala
85 90 95
Leu Ser Ile Cys Asp Leu Leu Ser Ile Ile Arg Lys Glu Val Asp Glu
100 105 110
Ser Val Leu Ser Asp Gly Val Ala Asp Phe Lys Gly Glu Met Thr Ala
115 120 125
Tyr Arg Asp Tyr Tyr Leu Pro Tyr Leu Glu Asp Trp Leu Thr Asp Lys
130 135 140
Ser Asn Pro Glu Lys Leu Ala Asp Val Val Lys Gln Phe Gln Ala Arg
145 150 155 160
Glu Glu Asp Phe Thr Lys Leu Leu Ala Gly Ser Leu Ser Arg Gln Lys
165 170 175
Ala Glu Ile Leu Leu Leu Pro Thr Tyr Val Gln Ala Ala Asn Val His
180 185 190
Leu Leu Leu Leu Arg Asp Ala Val Lys Tyr Lys Lys Glu Trp Gly Leu
195 200 205
Val Cys Pro Pro Leu Tyr Pro Gly Ser Gly Arg Thr Asp Cys Asn Glu
210 215 220
Arg Leu Lys Ala Lys Ile Lys Glu Tyr Thr Asn Tyr Cys Val Glu Trp
225 230 235 240
Tyr Asn Lys Gly Leu Asp Gln Ile Lys Gln Ala Gly Thr Ser Thr Glu
245 250 255
Thr Trp Leu Lys Phe Asn Lys Phe Arg Arg Glu Met Thr Leu Ala Val
260 265 270
Leu Asp Val Ile Ala Ile Phe Pro Thr Tyr Asp Phe Glu Lys Tyr Pro
275 280 285
Leu Ala Thr Asn Val Glu Leu Thr Arg Glu Val Tyr Thr Asp Pro Val
290 295 300
Gly Tyr Ser Gly Ser Ser Ile Arg Trp Glu Arg Thr Phe Pro Asn Ala
305 310 315 320
Phe Asn Thr Leu Glu Ala Asn Gly Thr Arg Gly Pro Gly Leu Val Thr
325 330 335
Trp Leu Glu Val Leu Gly Ile Tyr Asn His Asp Phe Gln Asp Ile Thr
340 345 350
Val Tyr Phe Ser Gly Trp Val Gly Ser Arg His Ser Glu Asp Tyr Thr
355 360 365
Arg Gly Asn Gly Thr Phe Ser Arg Ile Ser Gly Thr Thr Ser Asn Asp
370 375 380
Ile Arg Asn Gln Thr Phe Tyr Ser Arg Asp Val Phe Lys Ile Asp Ser
385 390 395 400
Leu Gly Ile Tyr Glu Thr Arg Ala Glu Leu Gly Tyr Thr Arg Gln Arg
405 410 415
Phe Cys Val Ser Arg Ala Val Phe Ser Ser Thr Leu Tyr Asn Tyr Val
420 425 430
Tyr Asp Ala Arg Asn Asn Gly Gln Gly Arg Met Thr Ile Glu Ser Met
435 440 445
Leu Pro Gly Ile Lys Asn Pro Ile Pro Ser Ser Gln Asp Tyr Ser His
450 455 460
Arg Leu Ser Asn Ala Ala Cys Val Gln Phe Asn Asp Ser Arg Ile Asn
465 470 475 480
Val Tyr Gly Trp Thr His Thr Ser Met Thr Lys Asn Asn Pro Ile Tyr
485 490 495
Thr Asp Lys Ile Ser Gln Ile Pro Ala Val Lys Ala Phe Ala Leu Glu
500 505 510
Ser Gly Ala Tyr Val Ser Arg Gly Pro Gly His Thr Gly Gly Asp Val
515 520 525
Val Thr Leu Pro Asn Arg Gly Arg Leu Lys Ile Arg Leu Thr Ser Ala
530 535 540
Pro Thr Asn Lys Asn Tyr Arg Val Arg Val Arg Tyr Ala Thr Ser Tyr
545 550 555 560
Gly Ala Asp Leu Met Val Gln Arg Trp Ser Pro Ser Gly Asn His Ala
565 570 575
Ser Gly Tyr Phe Ser Gly Ser Pro Thr Gly Ser Tyr Pro Thr Phe Gly
580 585 590
Tyr Met Asn Thr Leu Val Thr Thr Phe Asn Gln Ser Gly Val Glu Ile
595 600 605
Ile Ile Glu Asn Arg His Tyr Ser Asn Ile Ile Ile Asp Lys Ile Glu
610 615 620
Phe Ile Pro Asp
625
<210> 123
<211> 317
<212> PRT
<213> Unknown
<220>
<223> Unknown organism from environmental sample
<400> 123
Met Lys Asn Lys Ile Ile Leu Tyr Gly Val Val Thr Ser Thr Leu Leu
1 5 10 15
Ala Gly Gly Pro Phe Ser Asn Asn Ala Ser Ala Ser Glu Asn Gln Gly
20 25 30
Met Thr Gly Asn Asn Ile Gly Val Val Ser Ser Asn Ile Lys Ala Ala
35 40 45
Glu Gln Asn Lys Asn Ser Ala Tyr Gln Asp Ile Asp Glu Arg Val Lys
50 55 60
Lys Met Leu Ile Ser Ala Ala Phe Asp Gly Thr Glu Tyr Arg Tyr His
65 70 75 80
His Ile Lys Asp Thr Glu Leu Asn Gly Asn Ile Ile Glu Gly Ser Met
85 90 95
Ile Glu Asn Ala Gln Ile Leu Thr Ile Gly Glu Asn Ile Leu Glu Asn
100 105 110
Asn Leu Gly His Glu Val Thr Leu Phe Thr Ser Gly Tyr Glu His Glu
115 120 125
Phe Lys Glu Met Thr Ser Thr Thr Asn Thr Ser Gly Trp Thr Phe Gly
130 135 140
Tyr Asn Tyr Asn Ala Thr Leu Ser Val Met Met Val Ser Ala Ser Gln
145 150 155 160
Ser Phe Ser Val Asp Tyr Asn Met Ser Thr Ser Asn Thr Thr Glu Lys
165 170 175
Ser Gln Val Arg Lys Phe Thr Val Pro Pro Gln Gly Phe Pro Val Pro
180 185 190
Ala Gly Lys Lys Tyr Lys Val Glu Tyr Val Phe Glu Lys Ile Thr Ile
195 200 205
Ser Gly Arg Asn Arg Leu Asn Ala Asp Leu Tyr Gly Asp Ala Thr Tyr
210 215 220
Tyr Tyr Asn Asn Leu Pro Met Ser Pro Gln Leu Leu Tyr Ser Ala Met
225 230 235 240
Asn Phe Ala Ser Asp Thr Gln Gly Phe Glu Arg Val Ile Arg Asp Asn
245 250 255
Ala Glu Gly Asn Asp Arg Phe Gly Ile Arg Ala Thr Gly Glu Gly Arg
260 265 270
Phe Ser Thr Glu Phe Gly Thr Arg Leu Tyr Ala Lys Ile Thr Asp Ile
275 280 285
Thr Asp Ser Lys Lys Pro Val Thr Ile Glu Thr Lys Lys Ile Pro Val
290 295 300
Asn Phe Glu Thr Leu Ser Val Ala Thr Lys Ile Ile Glu
305 310 315
<210> 124
<211> 291
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 124
Met Ser Glu Asn Gln Gly Met Thr Gly Asn Asn Ile Gly Val Val Ser
1 5 10 15
Ser Asn Ile Lys Ala Ala Glu Gln Asn Lys Asn Ser Ala Tyr Gln Asp
20 25 30
Ile Asp Glu Arg Val Lys Lys Met Leu Ile Ser Ala Ala Phe Asp Gly
35 40 45
Thr Glu Tyr Arg Tyr His His Ile Lys Asp Thr Glu Leu Asn Gly Asn
50 55 60
Ile Ile Glu Gly Ser Met Ile Glu Asn Ala Gln Ile Leu Thr Ile Gly
65 70 75 80
Glu Asn Ile Leu Glu Asn Asn Leu Gly His Glu Val Thr Leu Phe Thr
85 90 95
Ser Gly Tyr Glu His Glu Phe Lys Glu Met Thr Ser Thr Thr Asn Thr
100 105 110
Ser Gly Trp Thr Phe Gly Tyr Asn Tyr Asn Ala Thr Leu Ser Val Met
115 120 125
Met Val Ser Ala Ser Gln Ser Phe Ser Val Asp Tyr Asn Met Ser Thr
130 135 140
Ser Asn Thr Thr Glu Lys Ser Gln Val Arg Lys Phe Thr Val Pro Pro
145 150 155 160
Gln Gly Phe Pro Val Pro Ala Gly Lys Lys Tyr Lys Val Glu Tyr Val
165 170 175
Phe Glu Lys Ile Thr Ile Ser Gly Arg Asn Arg Leu Asn Ala Asp Leu
180 185 190
Tyr Gly Asp Ala Thr Tyr Tyr Tyr Asn Asn Leu Pro Met Ser Pro Gln
195 200 205
Leu Leu Tyr Ser Ala Met Asn Phe Ala Ser Asp Thr Gln Gly Phe Glu
210 215 220
Arg Val Ile Arg Asp Asn Ala Glu Gly Asn Asp Arg Phe Gly Ile Arg
225 230 235 240
Ala Thr Gly Glu Gly Arg Phe Ser Thr Glu Phe Gly Thr Arg Leu Tyr
245 250 255
Ala Lys Ile Thr Asp Ile Thr Asp Ser Lys Lys Pro Val Thr Ile Glu
260 265 270
Thr Lys Lys Ile Pro Val Asn Phe Glu Thr Leu Ser Val Ala Thr Lys
275 280 285
Ile Ile Glu
290
<210> 125
<211> 900
<212> PRT
<213> Unknown
<220>
<223> Unknown organism from environmental sample
<400> 125
Met Asp Cys Asn Leu Gln Ser Gln Gln Asn Ile Pro Tyr Asn Val Leu
1 5 10 15
Ala Ile Pro Val Ser Asn Val Asn Ser Leu Thr Asp Thr Ile Gly Asp
20 25 30
Leu Lys Lys Ala Trp Glu Glu Phe Gln Lys Thr Gly Ser Phe Ser Leu
35 40 45
Thr Ala Leu Gln Gln Gly Phe Asn Ala Ala Gln Gly Gly Thr Phe Asn
50 55 60
Tyr Leu Thr Leu Leu Gln Ser Gly Ile Ser Leu Ala Gly Ser Phe Val
65 70 75 80
Pro Gly Gly Thr Phe Val Ala Pro Ile Ile Asn Met Val Ile Gly Trp
85 90 95
Leu Trp Pro His Lys Asn Lys Asn Ala Asp Thr Glu Asn Leu Ile Asn
100 105 110
Leu Ile Asp Ser Glu Ile Gln Lys Gln Leu Asn Lys Ala Leu Leu Asp
115 120 125
Gln Asp Arg Gln Asp Trp Ile Ser Tyr Leu Glu Ser Ile Phe Asp Ser
130 135 140
Ser Asn Asn Leu Asn Gly Ala Ile Ile Asp Ala Gln Trp Ser Gly Thr
145 150 155 160
Val Asn Thr Thr Asn Arg Thr Leu Arg Asn Pro Thr Glu Ser Asp Tyr
165 170 175
Leu Asn Val Val Thr Asn Phe Ile Ala Ala Asp Gly Asp Ile Ser Ser
180 185 190
Asn Glu His His Ile Met Asn Gly Asn Phe Asp Val Ala Ala Ala Pro
195 200 205
Tyr Phe Val Ile Gly Ala Thr Ala Arg Phe Ala Thr Met Gln Ser Tyr
210 215 220
Ile Lys Phe Cys Asn Ala Trp Ile Asp Lys Val Gly Leu Ser Asp Ser
225 230 235 240
Gln Leu Thr Thr Gln Lys Ala Asn Leu Asp Arg Thr Lys Gln Thr Met
245 250 255
Arg Asn Ala Ile Leu Ser Tyr Thr Gln Gln Val Met Lys Val Phe Lys
260 265 270
Asp Pro Lys Asn Met Pro Thr Ile Asp Thr Asn Lys Phe Ser Val Asp
275 280 285
Thr Tyr Asn Val Tyr Val Lys Gly Met Thr Leu Asn Val Leu Asp Ile
290 295 300
Val Ala Ile Trp Pro Ser Leu Tyr Pro Asp Asp Tyr Thr Ser Gln Thr
305 310 315 320
Ser Leu Glu Gln Thr Arg Val Thr Phe Ser Asn Met Val Gly Gln Glu
325 330 335
Glu Gly Thr Asp Gly Thr Val Thr Ile Tyr Asn Thr Phe Asp Ser Asp
340 345 350
Ser Tyr Arg His Lys Ala Ile Pro Asn Asn Asp Val Asn Leu Leu Ser
355 360 365
Tyr Phe Pro Asp Glu Leu Gln Asn Val Gln Leu Ala Leu Tyr Thr Pro
370 375 380
Ser Lys Lys Asp Ser Gly Tyr Thr Tyr Pro Tyr Ala Phe Ile Ser Asn
385 390 395 400
Tyr Gln Tyr Ser Arg Tyr Gln Tyr Gly Asp Asn Ala Pro Glu Gly Pro
405 410 415
Leu Ser Thr Leu Ser Ala Pro Ile Gln Ser Ile Asn Ala Ala Thr Gln
420 425 430
Asn Thr Lys Tyr Leu Asp Gly Glu Thr Ile Asn Gly Ile Gly Ala Asn
435 440 445
Leu Pro Gly Tyr Cys Thr Ser Asp Cys Ser Pro Thr Gln Gln Pro Phe
450 455 460
Ser Cys Thr Ser Thr Val Asn Gln Phe Lys Gln Ser Cys Asn Pro Thr
465 470 475 480
Asp Thr Asn Gln Lys Ile Asn Ala Leu Tyr Ala Phe Thr Gln Thr Asn
485 490 495
Val Lys Gly Asn Gln Gly Lys Leu Gly Leu Leu Ala Ser His Val Pro
500 505 510
Tyr Asp Leu Asn Pro Lys Asn Ile Phe Gly Glu Leu Asp Pro Asp Thr
515 520 525
Asn Asn Val Ile Leu Lys Gly Ile Pro Ala Glu Lys Gly Tyr Phe Ser
530 535 540
Asn Asn Thr Arg Pro Asn Ile Val Lys Glu Trp Ile Asn Gly Ala Asn
545 550 555 560
Ala Val Pro Leu Asp Ser Gly Asn Thr Leu Phe Met Thr Ala Thr Asn
565 570 575
Val Thr Ala Thr Gln Tyr Lys Ile Arg Ile Arg Tyr Ala Asn Pro Asn
580 585 590
Ser Asn Thr Gln Ile Gly Val Gln Ile Ser Gln Asn Gly Ser Ile Ile
595 600 605
Ser Asn Ser Asn Leu Thr Leu Tyr Ser Thr Thr Asp Met Asn Asn Thr
610 615 620
Leu Pro Leu Asn Val Tyr Val Thr Gly Glu Asn Gly Asn Tyr Thr Leu
625 630 635 640
Leu Asp Leu Tyr Ser Thr Thr Asn Val Leu Ser Thr Gly Asp Ile Thr
645 650 655
Leu Lys Leu Thr Gly Gly Asp Gln Lys Ile Phe Ile Asp Arg Ile Glu
660 665 670
Phe Val Pro Thr Met Pro Val Pro Gly Asn Thr Asn Asn Asn Asn Gly
675 680 685
Asp Asn Gly Asn Asn Asn Pro Val His His Val Cys Ala Ile Ala Gly
690 695 700
Thr Gln Gln Ser Cys Ser Gly Pro Pro Lys Phe Glu Gln Val Ser Asp
705 710 715 720
Leu Glu Lys Ile Thr Thr Gln Val Tyr Met Leu Phe Lys Ser Ser Pro
725 730 735
Tyr Glu Glu Leu Ala Pro Glu Val Ser Ser Tyr Gln Ile Ser Gln Val
740 745 750
Ala Leu Lys Val Met Ala Leu Ser Asp Glu Leu Phe Cys Lys Glu Lys
755 760 765
Ser Leu Leu Arg Lys Leu Val Asn Lys Ala Lys Gln Leu Leu Glu Ala
770 775 780
Ser Asn Leu Leu Val Gly Gly Asn Phe Glu Thr Thr Gln Asn Trp Val
785 790 795 800
Leu Gly Thr Asn Ala Tyr Ile Asn Tyr Asp Ser Phe Leu Phe Asn Gly
805 810 815
Asn Tyr Leu Ser Leu Gln Pro Ala Ser Gly Phe Phe Thr Ser Tyr Ala
820 825 830
Tyr Gln Lys Ile Asp Glu Ser Thr Leu Lys Pro Tyr Thr Arg Tyr Lys
835 840 845
Val Ser Gly Phe Ile Gly Gln Ser Asn Gln Val Glu Leu Ile Ile Ser
850 855 860
Arg Tyr Gly Lys Glu Ile Asp Lys Ile Leu Asn Val Pro Tyr Ala Gly
865 870 875 880
Pro Leu Pro Ile Thr Ala Asn Ala Leu Ile Thr Cys Cys Ala Pro Glu
885 890 895
Asn Arg Pro Met
900
<210> 126
<211> 676
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 126
Met Asp Cys Asn Leu Gln Ser Gln Gln Asn Ile Pro Tyr Asn Val Leu
1 5 10 15
Ala Ile Pro Val Ser Asn Val Asn Ser Leu Thr Asp Thr Ile Gly Asp
20 25 30
Leu Lys Lys Ala Trp Glu Glu Phe Gln Lys Thr Gly Ser Phe Ser Leu
35 40 45
Thr Ala Leu Gln Gln Gly Phe Asn Ala Ala Gln Gly Gly Thr Phe Asn
50 55 60
Tyr Leu Thr Leu Leu Gln Ser Gly Ile Ser Leu Ala Gly Ser Phe Val
65 70 75 80
Pro Gly Gly Thr Phe Val Ala Pro Ile Ile Asn Met Val Ile Gly Trp
85 90 95
Leu Trp Pro His Lys Asn Lys Asn Ala Asp Thr Glu Asn Leu Ile Asn
100 105 110
Leu Ile Asp Ser Glu Ile Gln Lys Gln Leu Asn Lys Ala Leu Leu Asp
115 120 125
Gln Asp Arg Gln Asp Trp Ile Ser Tyr Leu Glu Ser Ile Phe Asp Ser
130 135 140
Ser Asn Asn Leu Asn Gly Ala Ile Ile Asp Ala Gln Trp Ser Gly Thr
145 150 155 160
Val Asn Thr Thr Asn Arg Thr Leu Arg Asn Pro Thr Glu Ser Asp Tyr
165 170 175
Leu Asn Val Val Thr Asn Phe Ile Ala Ala Asp Gly Asp Ile Ser Ser
180 185 190
Asn Glu His His Ile Met Asn Gly Asn Phe Asp Val Ala Ala Ala Pro
195 200 205
Tyr Phe Val Ile Gly Ala Thr Ala Arg Phe Ala Thr Met Gln Ser Tyr
210 215 220
Ile Lys Phe Cys Asn Ala Trp Ile Asp Lys Val Gly Leu Ser Asp Ser
225 230 235 240
Gln Leu Thr Thr Gln Lys Ala Asn Leu Asp Arg Thr Lys Gln Thr Met
245 250 255
Arg Asn Ala Ile Leu Ser Tyr Thr Gln Gln Val Met Lys Val Phe Lys
260 265 270
Asp Pro Lys Asn Met Pro Thr Ile Asp Thr Asn Lys Phe Ser Val Asp
275 280 285
Thr Tyr Asn Val Tyr Val Lys Gly Met Thr Leu Asn Val Leu Asp Ile
290 295 300
Val Ala Ile Trp Pro Ser Leu Tyr Pro Asp Asp Tyr Thr Ser Gln Thr
305 310 315 320
Ser Leu Glu Gln Thr Arg Val Thr Phe Ser Asn Met Val Gly Gln Glu
325 330 335
Glu Gly Thr Asp Gly Thr Val Thr Ile Tyr Asn Thr Phe Asp Ser Asp
340 345 350
Ser Tyr Arg His Lys Ala Ile Pro Asn Asn Asp Val Asn Leu Leu Ser
355 360 365
Tyr Phe Pro Asp Glu Leu Gln Asn Val Gln Leu Ala Leu Tyr Thr Pro
370 375 380
Ser Lys Lys Asp Ser Gly Tyr Thr Tyr Pro Tyr Ala Phe Ile Ser Asn
385 390 395 400
Tyr Gln Tyr Ser Arg Tyr Gln Tyr Gly Asp Asn Ala Pro Glu Gly Pro
405 410 415
Leu Ser Thr Leu Ser Ala Pro Ile Gln Ser Ile Asn Ala Ala Thr Gln
420 425 430
Asn Thr Lys Tyr Leu Asp Gly Glu Thr Ile Asn Gly Ile Gly Ala Asn
435 440 445
Leu Pro Gly Tyr Cys Thr Ser Asp Cys Ser Pro Thr Gln Gln Pro Phe
450 455 460
Ser Cys Thr Ser Thr Val Asn Gln Phe Lys Gln Ser Cys Asn Pro Thr
465 470 475 480
Asp Thr Asn Gln Lys Ile Asn Ala Leu Tyr Ala Phe Thr Gln Thr Asn
485 490 495
Val Lys Gly Asn Gln Gly Lys Leu Gly Leu Leu Ala Ser His Val Pro
500 505 510
Tyr Asp Leu Asn Pro Lys Asn Ile Phe Gly Glu Leu Asp Pro Asp Thr
515 520 525
Asn Asn Val Ile Leu Lys Gly Ile Pro Ala Glu Lys Gly Tyr Phe Ser
530 535 540
Asn Asn Thr Arg Pro Asn Ile Val Lys Glu Trp Ile Asn Gly Ala Asn
545 550 555 560
Ala Val Pro Leu Asp Ser Gly Asn Thr Leu Phe Met Thr Ala Thr Asn
565 570 575
Val Thr Ala Thr Gln Tyr Lys Ile Arg Ile Arg Tyr Ala Asn Pro Asn
580 585 590
Ser Asn Thr Gln Ile Gly Val Gln Ile Ser Gln Asn Gly Ser Ile Ile
595 600 605
Ser Asn Ser Asn Leu Thr Leu Tyr Ser Thr Thr Asp Met Asn Asn Thr
610 615 620
Leu Pro Leu Asn Val Tyr Val Thr Gly Glu Asn Gly Asn Tyr Thr Leu
625 630 635 640
Leu Asp Leu Tyr Ser Thr Thr Asn Val Leu Ser Thr Gly Asp Ile Thr
645 650 655
Leu Lys Leu Thr Gly Gly Asp Gln Lys Ile Phe Ile Asp Arg Ile Glu
660 665 670
Phe Val Pro Thr
675
<210> 127
<211> 662
<212> PRT
<213> Unknown
<220>
<223> Unknown organism from environmental sample
<400> 127
Met Gly Gly Leu Leu Met Asn Gln Tyr Tyr Asn Asn Asn Glu Tyr Glu
1 5 10 15
Ile Leu Asp Asn Gly Val Gln Cys Cys Ser Pro Arg Tyr Pro Leu Ala
20 25 30
Gln Ala Pro Gly Ser Glu Trp Gln Asn Met Asn Tyr Gln Asp Val Met
35 40 45
Asn Thr Ser Val Gly Arg Glu Tyr Gln Ala Val Gln Gln Val Asn Val
50 55 60
Gly Glu Ala Val Gly Ala Ala Leu Gly Ile Leu Thr Thr Ile Leu Lys
65 70 75 80
Ala Val Asn Pro Thr Leu Gly Thr Ala Ala Gly Val Ile Ser Ser Ile
85 90 95
Phe Gly Phe Leu Trp Lys Arg Phe Gly Thr Asp Pro Gln Ala Gln Trp
100 105 110
Lys Gln Phe Met Glu Ala Val Glu Tyr Leu Val Ser Gln Lys Ile Thr
115 120 125
Asp Ala Val Arg Ser Lys Ala Val Ser Glu Leu Glu Gly Val Gln Arg
130 135 140
Ala Val Glu Leu Tyr Gln Glu Ala Ala Asn Ala Trp Asn Met Asn Pro
145 150 155 160
Asp Asp Ala Ala Ala Lys Glu Arg Ile Arg Arg Gln Tyr Thr Ser Thr
165 170 175
Asn Thr Val Ile Glu Tyr Ala Met Pro Ser Phe Arg Val Gly Ser Phe
180 185 190
Glu Val Pro Leu Leu Thr Val Tyr Ala Gln Ala Ala Asn Leu His Leu
195 200 205
Leu Leu Leu Arg Asp Ala Val Lys Phe Gly Ala Gly Trp Gly Phe Ser
210 215 220
Pro Thr Glu Val Glu Asp Asn Tyr Thr Arg Leu Gln Ala Arg Thr Ala
225 230 235 240
Glu Tyr Thr Asn His Cys Thr Asn Thr Tyr Asp Lys Gly Leu Lys Gln
245 250 255
Ala Tyr Asp Leu Ala Pro Asn Pro Thr Asp Tyr Asn Lys Tyr Pro Tyr
260 265 270
Leu Asn Pro Tyr Ser Lys Asn Pro Ile Tyr Gly Lys Tyr Tyr Thr Ala
275 280 285
Pro Val Asp Trp Asn Leu Phe Asn Asp Phe Arg Arg Asp Met Thr Leu
290 295 300
Met Val Leu Asp Ile Val Ala Val Trp Pro Thr Tyr Asn Pro Arg Leu
305 310 315 320
Tyr Asn Asn Pro Asn Gly Val Gln Ile Gln Leu Ser Arg Glu Val Tyr
325 330 335
Ser Thr Val Tyr Gly Arg Gly Trp Ser Asn Asn Ser Thr Val Asp Ala
340 345 350
Ile Glu Ser Thr Leu Val Arg Pro Pro His Leu Val Thr Glu Leu Thr
355 360 365
Lys Leu Thr Phe Asp Glu Arg Asn Leu Tyr Glu Ala Glu Thr Val Pro
370 375 380
Val Ala Phe Arg Lys Val Thr Asn Thr Leu His Tyr Val Gly Ser Ser
385 390 395 400
Thr Thr Trp Glu Gln Ser Phe Ser Ala Thr Pro Val Gly Ser Leu Lys
405 410 415
Ala Val His Asn Val Leu Ala Thr Asp Ile Gly Asn Leu Ala Leu Ser
420 425 430
Leu Gly Ala Val Pro Leu Gly Phe Ser Phe Tyr Asn Lys Asn Asp Gln
435 440 445
His Ile Thr Thr Val Gly Tyr Ser Gly Gly Trp Trp Asn Gly Ile Pro
450 455 460
Lys Asp Glu Gly Ser Asn Gln Asn Ser His His Leu Ser Tyr Val Ala
465 470 475 480
Ala Leu Glu Thr Gln Thr Thr Ala Gly Trp Phe Pro Thr Tyr Pro Val
485 490 495
Pro Leu Leu Gly Glu Trp Gly Phe Gly Trp Gln His Asn Ser Leu Lys
500 505 510
Tyr Thr Asn Arg Leu Val Thr Asn Gln Ile Asn Gln Leu Pro Ala Val
515 520 525
Lys Ala Ser Ala Leu Val Asn His Ala Arg Val Ser Lys Gly Thr Thr
530 535 540
Gly Thr Gly Gly Asn Ile Ile Glu Leu Val Asn Pro Asn Cys Gly Phe
545 550 555 560
Ile Phe Asn Phe Tyr Ala Glu Pro Glu Asn Pro Gln Gly Gln Ala Phe
565 570 575
Asn Ile Arg Ile Arg Tyr Ser Ala Thr Val Pro Gly Lys Leu Asn Ile
580 585 590
Arg His Leu Glu Ser Gly Lys Glu Phe Met Asn Thr Pro Arg Asp Phe
595 600 605
Lys Gly Thr Thr Thr Gly Thr Asn Trp Ile Pro Lys Phe Gln Glu Tyr
610 615 620
Gln Tyr Leu Asp Leu Gly Pro Ile Val Leu Arg Thr Gly Met Asn Tyr
625 630 635 640
Gly Phe Lys Ile Glu Leu Ala Ser Gly Gln Leu Val Ser Ile Asp Lys
645 650 655
Ile Glu Phe Leu Pro Phe
660
<210> 128
<211> 657
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 128
Met Asn Gln Tyr Tyr Asn Asn Asn Glu Tyr Glu Ile Leu Asp Asn Gly
1 5 10 15
Val Gln Cys Cys Ser Pro Arg Tyr Pro Leu Ala Gln Ala Pro Gly Ser
20 25 30
Glu Trp Gln Asn Met Asn Tyr Gln Asp Val Met Asn Thr Ser Val Gly
35 40 45
Arg Glu Tyr Gln Ala Val Gln Gln Val Asn Val Gly Glu Ala Val Gly
50 55 60
Ala Ala Leu Gly Ile Leu Thr Thr Ile Leu Lys Ala Val Asn Pro Thr
65 70 75 80
Leu Gly Thr Ala Ala Gly Val Ile Ser Ser Ile Phe Gly Phe Leu Trp
85 90 95
Lys Arg Phe Gly Thr Asp Pro Gln Ala Gln Trp Lys Gln Phe Met Glu
100 105 110
Ala Val Glu Tyr Leu Val Ser Gln Lys Ile Thr Asp Ala Val Arg Ser
115 120 125
Lys Ala Val Ser Glu Leu Glu Gly Val Gln Arg Ala Val Glu Leu Tyr
130 135 140
Gln Glu Ala Ala Asn Ala Trp Asn Met Asn Pro Asp Asp Ala Ala Ala
145 150 155 160
Lys Glu Arg Ile Arg Arg Gln Tyr Thr Ser Thr Asn Thr Val Ile Glu
165 170 175
Tyr Ala Met Pro Ser Phe Arg Val Gly Ser Phe Glu Val Pro Leu Leu
180 185 190
Thr Val Tyr Ala Gln Ala Ala Asn Leu His Leu Leu Leu Leu Arg Asp
195 200 205
Ala Val Lys Phe Gly Ala Gly Trp Gly Phe Ser Pro Thr Glu Val Glu
210 215 220
Asp Asn Tyr Thr Arg Leu Gln Ala Arg Thr Ala Glu Tyr Thr Asn His
225 230 235 240
Cys Thr Asn Thr Tyr Asp Lys Gly Leu Lys Gln Ala Tyr Asp Leu Ala
245 250 255
Pro Asn Pro Thr Asp Tyr Asn Lys Tyr Pro Tyr Leu Asn Pro Tyr Ser
260 265 270
Lys Asn Pro Ile Tyr Gly Lys Tyr Tyr Thr Ala Pro Val Asp Trp Asn
275 280 285
Leu Phe Asn Asp Phe Arg Arg Asp Met Thr Leu Met Val Leu Asp Ile
290 295 300
Val Ala Val Trp Pro Thr Tyr Asn Pro Arg Leu Tyr Asn Asn Pro Asn
305 310 315 320
Gly Val Gln Ile Gln Leu Ser Arg Glu Val Tyr Ser Thr Val Tyr Gly
325 330 335
Arg Gly Trp Ser Asn Asn Ser Thr Val Asp Ala Ile Glu Ser Thr Leu
340 345 350
Val Arg Pro Pro His Leu Val Thr Glu Leu Thr Lys Leu Thr Phe Asp
355 360 365
Glu Arg Asn Leu Tyr Glu Ala Glu Thr Val Pro Val Ala Phe Arg Lys
370 375 380
Val Thr Asn Thr Leu His Tyr Val Gly Ser Ser Thr Thr Trp Glu Gln
385 390 395 400
Ser Phe Ser Ala Thr Pro Val Gly Ser Leu Lys Ala Val His Asn Val
405 410 415
Leu Ala Thr Asp Ile Gly Asn Leu Ala Leu Ser Leu Gly Ala Val Pro
420 425 430
Leu Gly Phe Ser Phe Tyr Asn Lys Asn Asp Gln His Ile Thr Thr Val
435 440 445
Gly Tyr Ser Gly Gly Trp Trp Asn Gly Ile Pro Lys Asp Glu Gly Ser
450 455 460
Asn Gln Asn Ser His His Leu Ser Tyr Val Ala Ala Leu Glu Thr Gln
465 470 475 480
Thr Thr Ala Gly Trp Phe Pro Thr Tyr Pro Val Pro Leu Leu Gly Glu
485 490 495
Trp Gly Phe Gly Trp Gln His Asn Ser Leu Lys Tyr Thr Asn Arg Leu
500 505 510
Val Thr Asn Gln Ile Asn Gln Leu Pro Ala Val Lys Ala Ser Ala Leu
515 520 525
Val Asn His Ala Arg Val Ser Lys Gly Thr Thr Gly Thr Gly Gly Asn
530 535 540
Ile Ile Glu Leu Val Asn Pro Asn Cys Gly Phe Ile Phe Asn Phe Tyr
545 550 555 560
Ala Glu Pro Glu Asn Pro Gln Gly Gln Ala Phe Asn Ile Arg Ile Arg
565 570 575
Tyr Ser Ala Thr Val Pro Gly Lys Leu Asn Ile Arg His Leu Glu Ser
580 585 590
Gly Lys Glu Phe Met Asn Thr Pro Arg Asp Phe Lys Gly Thr Thr Thr
595 600 605
Gly Thr Asn Trp Ile Pro Lys Phe Gln Glu Tyr Gln Tyr Leu Asp Leu
610 615 620
Gly Pro Ile Val Leu Arg Thr Gly Met Asn Tyr Gly Phe Lys Ile Glu
625 630 635 640
Leu Ala Ser Gly Gln Leu Val Ser Ile Asp Lys Ile Glu Phe Leu Pro
645 650 655
Phe
<210> 129
<211> 1018
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 129
Met Asn Gln Asn Tyr Asn Asp Glu Tyr Glu Ile Met Asn Thr Gly Val
1 5 10 15
Met Gly Tyr Gln Ser Arg Tyr Pro Leu Ala Lys Thr Pro Gly Ser Glu
20 25 30
Ile Gln Gln Met Asn Tyr Lys Asp Trp Met Asp Met Tyr Thr His Gly
35 40 45
Glu Ser Gly Glu Thr Phe Ile Ser Ala Arg Asp Gly Val Ile Ile Gly
50 55 60
Thr Gly Ile Gly Trp Ala Ile Leu Gly Phe Val Pro Val Leu Gly Pro
65 70 75 80
Gly Leu Ser Ala Ile Ser Gly Ile Leu Asn Val Leu Leu Pro Ile Leu
85 90 95
Trp Pro Glu Asp Ala Val Thr Ala Glu Ser Gln Val Ser Trp Asp Lys
100 105 110
Leu Met Arg Thr Ala Glu Glu Ile Ala Asn Lys Ala Ile Asp Ala Gln
115 120 125
Val Arg Ala Glu Ala Ile Thr Glu Leu Gln Gly Ile Gln Glu Ala Ile
130 135 140
Gln Leu Tyr His Gly Ala Val Cys Asp Trp Lys Lys Ala Pro Thr Asn
145 150 155 160
Val Arg Leu Gln Glu Thr Val Arg Thr Gln Phe Ile Ala Thr Asn Thr
165 170 175
Val Ile Phe Asn Arg Met Pro Ser Phe Arg Val Gln Gly Phe Glu Thr
180 185 190
Ser Leu Leu Ser Thr Tyr Ala Gln Ala Ala Asn Leu His Leu Ala Phe
195 200 205
Leu Arg Asp Gly Val Gln Phe Gly Thr Glu Trp Gly Leu Asp Pro Thr
210 215 220
Thr Val Asp Arg Tyr Tyr Gly Tyr Leu Thr Ala Asn Thr Ala Ser Tyr
225 230 235 240
Thr Asn Tyr Cys Val Arg Trp Tyr Asp Lys Gly Leu Ser Asp Leu Arg
245 250 255
Thr Thr Asn Asn Trp Asn Lys Tyr Asn Asn Phe Arg Arg Asp Met Thr
260 265 270
Ile Met Ala Met Asp Leu Val Ser Leu Trp Pro Thr Tyr Asp Pro Lys
275 280 285
Arg Tyr Pro Thr Phe Thr Lys Ser Gln Leu Thr Arg Thr Val Tyr Thr
290 295 300
Pro Trp Ile Gly His Ile Lys Asn Arg Trp Met Gly His Pro Gln Glu
305 310 315 320
Val Pro Phe Ser Glu Ile Glu Ser Asn Leu Ile Asp Ile Pro Arg Leu
325 330 335
Phe Ala Trp Leu Arg Glu Leu Ile Val Tyr Ser Val Asp Pro Gln Arg
340 345 350
Arg Gly Tyr Ser Thr Leu Val Cys Gly Arg Lys Gln Arg Phe Gln Asn
355 360 365
Thr Leu Ser Asn Asn Leu Trp Asp Gly Gly Met Lys Gly Ser His Gly
370 375 380
Glu Glu Pro Pro Gln Thr Leu Thr Ile Pro Ser Pro Gln Ser Lys Asp
385 390 395 400
Asp Ile Trp Lys Ile Asn Thr Thr Ser Thr Leu Met Ala Gln Gly Pro
405 410 415
Glu Asn Thr Gln Ile Lys Gly Trp Asp Phe Ser Phe Thr Asn Ser Gln
420 425 430
Thr Gln Thr Ile Ser Thr Ile Tyr Gln Gly Gln Glu Ser Leu Val Tyr
435 440 445
Ala Val Ser Gly Leu Pro Cys Arg Ser Ser Tyr Asn Leu Cys Asp Pro
450 455 460
Cys Asp Pro Asp Cys Lys Val Asp Ile Leu Thr Pro Gly Asp Pro Cys
465 470 475 480
Asn Asp Lys Leu Leu Tyr Ser His Arg Phe Ala Tyr Leu Gly Ala Gly
485 490 495
Ile Asp Pro Tyr Met Gly Tyr Pro Gly Arg Ala Gly Gly Tyr Gly Tyr
500 505 510
Tyr Ser Tyr Gly Trp Thr His Ile Ser Ala Asp Ala Asn Asn Leu Ile
515 520 525
Asp Thr Glu Lys Ile Thr Gln Ile Pro Ala Val Lys Ala Ser Gly Gly
530 535 540
Thr Asp Tyr Gln Val Ile Arg Gly Pro Gly Ser Thr Gly Gly Asp Leu
545 550 555 560
Val Ser Leu Ala Thr Asp Ala Arg Ile Glu Leu Lys Leu Thr Ala Ser
565 570 575
Ile Ala Lys Tyr Tyr Gln Met Arg Ile Arg Tyr Ala Ser Pro Val Gln
580 585 590
Thr Arg Leu Arg Leu Glu Thr Tyr Gln Asp Asp Ile Ser Gly Gly Pro
595 600 605
Val Asn Tyr Asp Ile Pro Ser Thr Pro Tyr Ser Gly Asn Gln Asn Glu
610 615 620
Leu Ser Tyr Asn Ala Phe Gly Tyr Lys His Val Met Asn Val Tyr Ile
625 630 635 640
Ser Leu Thr Gly Thr Lys Ile Ser Ile Gln Arg Val Asp Pro Phe Asp
645 650 655
Tyr Pro Phe Ile Ile Asp Lys Ile Glu Phe Ile Pro Ile Glu Gly Ser
660 665 670
Leu Glu Ala Tyr Gln Ala Asp Gln Asp Leu Glu Lys Ala Arg Lys Lys
675 680 685
Val Asn Ala Leu Phe Met Ser Asp Ala Lys Asp Val Leu Lys Leu Asn
690 695 700
Val Thr Asp Tyr Ala Leu Asp Gln Ala Ala Asn Leu Val Glu Cys Leu
705 710 715 720
Ser Asp Glu Phe Cys Ala Gln Glu Lys Met Ile Leu Leu Asp Gln Val
725 730 735
Lys Phe Ala Lys Arg Leu Ser Gln Ser Arg Asn Leu Leu Asn Tyr Gly
740 745 750
Asp Phe Glu Ser Pro Tyr Trp Ser Gly Glu Asn Gly Trp Lys Thr Ser
755 760 765
Pro His Ile His Val Ala Ser Gly Asn Pro Ile Phe Lys Gly His Tyr
770 775 780
Leu His Met Pro Gly Ala Asn Gln Pro Gln Thr Ser Gly Thr Val Tyr
785 790 795 800
Pro Thr Tyr Ile Tyr Gln Lys Val Asp Glu Ser Lys Leu Lys Pro Tyr
805 810 815
Thr Arg Tyr His Val Arg Gly Phe Val Gly Asn Ser Lys Asp Leu Glu
820 825 830
Leu Leu Val Glu Arg Tyr Gly Lys Glu Ile His Val Glu Met Asp Val
835 840 845
Pro Asn Asp Ile Arg Tyr Thr Leu Pro Met Asn Glu Cys Gly Gly Phe
850 855 860
Glu Arg Cys Ser His Glu Pro His His Thr Cys Thr Cys Lys Asp Thr
865 870 875 880
Ala Arg Val Asp Thr Gly Cys Gln Cys Gln Asp Lys Met Asn Arg Met
885 890 895
Thr Thr Gly Val Tyr Thr Asn Ile Pro Thr Ala Ser Glu Met Tyr Pro
900 905 910
Asn Gly Tyr Leu Val His Lys Ser Cys Gly Cys Lys Asp Pro His Val
915 920 925
Phe Ser Phe His Ile Asp Thr Gly Cys Val Asp Leu Lys Glu Asn Leu
930 935 940
Gly Leu Leu Phe Ala Phe Lys Ile Ser Ser Thr Asp Gly Val Ala Asn
945 950 955 960
Val Asp Asn Leu Glu Ile Ile Glu Gly Gln Pro Leu Thr Gly Glu Ala
965 970 975
Leu Ala Arg Val Lys Lys Arg Glu His Lys Trp Lys Glu Thr Met Lys
980 985 990
Gln Lys Arg Cys Lys Thr Asp Glu Ala Val Gln Ala Val Gln Thr Ala
995 1000 1005
Ile Asn Ala Leu Phe Thr Asn Ala Gln Tyr
1010 1015
<210> 130
<211> 669
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 130
Met Asn Gln Asn Tyr Asn Asp Glu Tyr Glu Ile Met Asn Thr Gly Val
1 5 10 15
Met Gly Tyr Gln Ser Arg Tyr Pro Leu Ala Lys Thr Pro Gly Ser Glu
20 25 30
Ile Gln Gln Met Asn Tyr Lys Asp Trp Met Asp Met Tyr Thr His Gly
35 40 45
Glu Ser Gly Glu Thr Phe Ile Ser Ala Arg Asp Gly Val Ile Ile Gly
50 55 60
Thr Gly Ile Gly Trp Ala Ile Leu Gly Phe Val Pro Val Leu Gly Pro
65 70 75 80
Gly Leu Ser Ala Ile Ser Gly Ile Leu Asn Val Leu Leu Pro Ile Leu
85 90 95
Trp Pro Glu Asp Ala Val Thr Ala Glu Ser Gln Val Ser Trp Asp Lys
100 105 110
Leu Met Arg Thr Ala Glu Glu Ile Ala Asn Lys Ala Ile Asp Ala Gln
115 120 125
Val Arg Ala Glu Ala Ile Thr Glu Leu Gln Gly Ile Gln Glu Ala Ile
130 135 140
Gln Leu Tyr His Gly Ala Val Cys Asp Trp Lys Lys Ala Pro Thr Asn
145 150 155 160
Val Arg Leu Gln Glu Thr Val Arg Thr Gln Phe Ile Ala Thr Asn Thr
165 170 175
Val Ile Phe Asn Arg Met Pro Ser Phe Arg Val Gln Gly Phe Glu Thr
180 185 190
Ser Leu Leu Ser Thr Tyr Ala Gln Ala Ala Asn Leu His Leu Ala Phe
195 200 205
Leu Arg Asp Gly Val Gln Phe Gly Thr Glu Trp Gly Leu Asp Pro Thr
210 215 220
Thr Val Asp Arg Tyr Tyr Gly Tyr Leu Thr Ala Asn Thr Ala Ser Tyr
225 230 235 240
Thr Asn Tyr Cys Val Arg Trp Tyr Asp Lys Gly Leu Ser Asp Leu Arg
245 250 255
Thr Thr Asn Asn Trp Asn Lys Tyr Asn Asn Phe Arg Arg Asp Met Thr
260 265 270
Ile Met Ala Met Asp Leu Val Ser Leu Trp Pro Thr Tyr Asp Pro Lys
275 280 285
Arg Tyr Pro Thr Phe Thr Lys Ser Gln Leu Thr Arg Thr Val Tyr Thr
290 295 300
Pro Trp Ile Gly His Ile Lys Asn Arg Trp Met Gly His Pro Gln Glu
305 310 315 320
Val Pro Phe Ser Glu Ile Glu Ser Asn Leu Ile Asp Ile Pro Arg Leu
325 330 335
Phe Ala Trp Leu Arg Glu Leu Ile Val Tyr Ser Val Asp Pro Gln Arg
340 345 350
Arg Gly Tyr Ser Thr Leu Val Cys Gly Arg Lys Gln Arg Phe Gln Asn
355 360 365
Thr Leu Ser Asn Asn Leu Trp Asp Gly Gly Met Lys Gly Ser His Gly
370 375 380
Glu Glu Pro Pro Gln Thr Leu Thr Ile Pro Ser Pro Gln Ser Lys Asp
385 390 395 400
Asp Ile Trp Lys Ile Asn Thr Thr Ser Thr Leu Met Ala Gln Gly Pro
405 410 415
Glu Asn Thr Gln Ile Lys Gly Trp Asp Phe Ser Phe Thr Asn Ser Gln
420 425 430
Thr Gln Thr Ile Ser Thr Ile Tyr Gln Gly Gln Glu Ser Leu Val Tyr
435 440 445
Ala Val Ser Gly Leu Pro Cys Arg Ser Ser Tyr Asn Leu Cys Asp Pro
450 455 460
Cys Asp Pro Asp Cys Lys Val Asp Ile Leu Thr Pro Gly Asp Pro Cys
465 470 475 480
Asn Asp Lys Leu Leu Tyr Ser His Arg Phe Ala Tyr Leu Gly Ala Gly
485 490 495
Ile Asp Pro Tyr Met Gly Tyr Pro Gly Arg Ala Gly Gly Tyr Gly Tyr
500 505 510
Tyr Ser Tyr Gly Trp Thr His Ile Ser Ala Asp Ala Asn Asn Leu Ile
515 520 525
Asp Thr Glu Lys Ile Thr Gln Ile Pro Ala Val Lys Ala Ser Gly Gly
530 535 540
Thr Asp Tyr Gln Val Ile Arg Gly Pro Gly Ser Thr Gly Gly Asp Leu
545 550 555 560
Val Ser Leu Ala Thr Asp Ala Arg Ile Glu Leu Lys Leu Thr Ala Ser
565 570 575
Ile Ala Lys Tyr Tyr Gln Met Arg Ile Arg Tyr Ala Ser Pro Val Gln
580 585 590
Thr Arg Leu Arg Leu Glu Thr Tyr Gln Asp Asp Ile Ser Gly Gly Pro
595 600 605
Val Asn Tyr Asp Ile Pro Ser Thr Pro Tyr Ser Gly Asn Gln Asn Glu
610 615 620
Leu Ser Tyr Asn Ala Phe Gly Tyr Lys His Val Met Asn Val Tyr Ile
625 630 635 640
Ser Leu Thr Gly Thr Lys Ile Ser Ile Gln Arg Val Asp Pro Phe Asp
645 650 655
Tyr Pro Phe Ile Ile Asp Lys Ile Glu Phe Ile Pro Ile
660 665
<210> 131
<211> 1018
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 131
Met Asn Gln Asn Tyr Asn Asp Glu Tyr Glu Ile Met Asn Thr Gly Val
1 5 10 15
Met Gly Tyr Gln Ser Arg Tyr Pro Leu Ala Lys Thr Pro Gly Ser Glu
20 25 30
Ile Gln Gln Met Asn Tyr Lys Asp Trp Met Asp Met Tyr Thr His Gly
35 40 45
Glu Ser Gly Glu Thr Phe Ile Ser Ala Arg Asp Gly Val Ile Ile Gly
50 55 60
Thr Gly Ile Gly Trp Ala Ile Leu Gly Phe Val Pro Val Leu Gly Pro
65 70 75 80
Gly Leu Ser Ala Ile Ser Gly Ile Leu Asn Val Leu Leu Pro Ile Leu
85 90 95
Trp Pro Glu Asp Ala Val Thr Ala Glu Ser Gln Val Ser Trp Asp Lys
100 105 110
Leu Met Arg Thr Ala Glu Glu Ile Ala Asn Lys Ala Ile Asp Ala Gln
115 120 125
Val Arg Ala Glu Ala Ile Thr Glu Leu Gln Gly Ile Gln Glu Ala Ile
130 135 140
Gln Leu Tyr His Gly Ala Val Cys Asp Trp Lys Lys Ala Pro Thr Asn
145 150 155 160
Val Arg Leu Gln Glu Thr Val Arg Thr Gln Phe Ile Ala Thr Asn Thr
165 170 175
Val Ile Phe Asn Arg Met Pro Ser Phe Arg Val Gln Gly Phe Glu Thr
180 185 190
Ser Leu Leu Ser Thr Tyr Ala Gln Ala Ala Asn Leu His Leu Ala Phe
195 200 205
Leu Arg Asp Gly Val Gln Phe Gly Thr Glu Trp Gly Leu Asp Pro Thr
210 215 220
Thr Val Asp Arg Tyr Tyr Gly Tyr Leu Thr Ala Asn Thr Ala Ser Tyr
225 230 235 240
Thr Asn Tyr Cys Val Arg Trp Tyr Asp Lys Gly Leu Ser Asp Leu Arg
245 250 255
Thr Thr Asn Asn Trp Asn Lys Tyr Asn Asn Phe Arg Arg Asp Met Thr
260 265 270
Ile Met Ala Met Asp Leu Val Ser Leu Trp Pro Thr Tyr Asp Pro Lys
275 280 285
Arg Tyr Pro Thr Phe Thr Lys Ser Gln Leu Thr Arg Thr Val Tyr Thr
290 295 300
Pro Trp Ile Gly His Ile Lys Asn Arg Trp Met Gly His Pro Gln Glu
305 310 315 320
Val Pro Phe Ser Glu Ile Glu Ser Asn Leu Ile Asp Ile Pro Arg Leu
325 330 335
Phe Ala Trp Leu Arg Glu Leu Ile Val Tyr Ser Val Asp Pro Gln Arg
340 345 350
Arg Gly Tyr Ser Thr Leu Val Cys Gly Arg Lys Gln Arg Phe Gln Asn
355 360 365
Thr Leu Ser Asn Asn Leu Trp Asp Gly Gly Met Lys Gly Ser His Gly
370 375 380
Glu Glu Pro Pro Gln Thr Leu Thr Ile Pro Ser Pro Gln Ser Lys Asp
385 390 395 400
Asp Ile Trp Lys Ile Asn Thr Thr Ser Thr Leu Met Ala Gln Gly Pro
405 410 415
Glu Asn Thr Gln Ile Lys Gly Trp Asp Phe Ser Phe Thr Asn Ser Gln
420 425 430
Thr Gln Thr Ile Ser Thr Ile Tyr Gln Gly Gln Glu Ser Leu Val Tyr
435 440 445
Ala Val Ser Gly Leu Pro Cys Arg Ser Ser Tyr Asn Leu Cys Asp Pro
450 455 460
Cys Asp Pro Asp Cys Lys Val Asp Ile Leu Thr Pro Gly Asp Pro Cys
465 470 475 480
Asn Asp Lys Leu Leu Tyr Ser His Arg Phe Ala Tyr Leu Gly Ala Gly
485 490 495
Ile Asp Pro Tyr Met Gly Tyr Pro Gly Arg Ala Gly Gly Tyr Gly Tyr
500 505 510
Tyr Ser Tyr Gly Trp Thr His Ile Ser Ala Asp Ala Asn Asn Leu Ile
515 520 525
Asp Thr Glu Lys Ile Thr Gln Ile Pro Ala Val Lys Ala Ser Gly Gly
530 535 540
Thr Asp Tyr Gln Val Ile Arg Gly Pro Gly Ser Thr Gly Gly Asp Leu
545 550 555 560
Val Ser Leu Ala Thr Asp Ala Arg Ile Glu Leu Lys Leu Thr Ala Ser
565 570 575
Ile Ala Lys Tyr Tyr Gln Met Arg Ile Arg Tyr Ala Ser Pro Val Gln
580 585 590
Thr Arg Leu Arg Leu Glu Thr Tyr Gln Asp Asp Ile Ser Gly Gly Pro
595 600 605
Val Asn Tyr Asp Ile Pro Ser Thr Pro Tyr Ser Gly Asn Gln Asn Glu
610 615 620
Leu Ser Tyr Asn Ala Phe Gly Tyr Lys His Val Met Asn Val Tyr Ile
625 630 635 640
Ser Leu Thr Gly Thr Lys Ile Ser Ile Gln Arg Val Asp Pro Phe Asp
645 650 655
Tyr Pro Phe Ile Ile Asp Lys Ile Glu Phe Ile Pro Ile Glu Gly Ser
660 665 670
Leu Glu Ala Tyr Gln Ala Asp Gln Asp Leu Glu Lys Ala Arg Lys Lys
675 680 685
Val Asn Ala Leu Phe Met Ser Asp Ala Lys Asp Val Leu Lys Leu Asn
690 695 700
Val Thr Asp Tyr Ala Leu Asp Gln Ala Ala Asn Leu Val Glu Cys Leu
705 710 715 720
Ser Asp Glu Phe Cys Ala Gln Glu Lys Ile Ile Leu Leu Asp Gln Val
725 730 735
Lys Phe Ala Lys Arg Leu Ser Gln Ser Arg Asn Leu Leu Asn Tyr Gly
740 745 750
Asp Phe Glu Ser Pro Tyr Trp Ser Gly Glu Asn Gly Trp Lys Thr Ser
755 760 765
Pro His Ile His Val Ala Ser Gly Asn Pro Ile Phe Lys Gly His Tyr
770 775 780
Leu His Met Pro Gly Ala Asn Gln Pro Gln Thr Ser Gly Thr Val Tyr
785 790 795 800
Pro Thr Tyr Ile Tyr Gln Lys Val Asp Glu Ser Lys Leu Lys Pro Tyr
805 810 815
Thr Arg Tyr His Val Arg Gly Phe Val Gly Asn Ser Lys Asp Leu Glu
820 825 830
Leu Leu Val Glu Arg Tyr Gly Lys Glu Ile His Val Glu Met Asp Val
835 840 845
Pro Asn Asp Ile Arg Tyr Thr Leu Pro Met Asn Glu Cys Gly Gly Phe
850 855 860
Glu Arg Cys Ser His Glu Pro His His Thr Cys Thr Cys Lys Asp Thr
865 870 875 880
Ala Arg Val Asp Thr Gly Cys Gln Cys Gln Asp Lys Met Asn Arg Met
885 890 895
Thr Thr Gly Val Tyr Thr Asn Ile Pro Thr Ala Ser Glu Met Tyr Pro
900 905 910
Asn Gly Tyr Leu Val His Lys Ser Cys Gly Cys Lys Asp Pro His Val
915 920 925
Phe Ser Phe His Ile Asp Thr Gly Cys Val Asp Leu Lys Glu Asn Leu
930 935 940
Gly Leu Leu Phe Ala Phe Lys Ile Ser Ser Thr Asp Gly Val Ala Asn
945 950 955 960
Val Asp Asn Leu Glu Ile Ile Glu Gly Gln Pro Leu Thr Gly Glu Ala
965 970 975
Leu Ala Arg Val Lys Lys Arg Glu His Lys Trp Lys Glu Thr Met Lys
980 985 990
Gln Lys Arg Cys Lys Thr Asp Glu Ala Val Gln Ala Val Gln Thr Ala
995 1000 1005
Ile Asn Ala Leu Phe Thr Asn Ala Gln Tyr
1010 1015
<210> 132
<211> 641
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 132
Met Lys Gly Arg Arg Pro Met Gly Asn Met Asp Ser Lys Ser Val Met
1 5 10 15
Glu Lys Lys Glu Gln Lys Ser Gln Tyr Leu Asp Ile Arg Asp Ile Cys
20 25 30
Ser Ile Asn Gly Ser Ala Lys Phe Asp Pro Asn Thr Asn Ile Thr Thr
35 40 45
Leu Thr Glu Ala Ile Asn Ser Gln Ala Gly Ala Ile Ala Gly Lys Thr
50 55 60
Ala Phe Asp Met Arg His Asp Phe Thr Leu Val Ala Asp Ile Tyr Leu
65 70 75 80
Gly Ser Lys Ser Ser Gly Ala Asp Gly Ile Ala Ile Ala Phe His Arg
85 90 95
Gly Ser Ile Gly Phe Ile Gly Gln Met Gly Gly Gly Leu Gly Ile Leu
100 105 110
Gly Ala Pro Asn Gly Ile Gly Phe Glu Ile Asp Thr Tyr Trp Lys Ala
115 120 125
Ser Ser Asp Glu Thr Gly Asp Ser Phe Gly His Gly Gln Met Asn Gly
130 135 140
Pro His Ala Gly Phe Val Ser Thr Asn Arg Asn Thr Ser Tyr Leu Thr
145 150 155 160
Ala Leu Ala Pro Met Gln Arg Ile Pro Ala Pro Asn Ser Lys Trp Arg
165 170 175
Ile Leu Thr Ile Asn Trp Asp Ala Arg Asn Asn Lys Leu Thr Ala Arg
180 185 190
Leu Gln Glu Lys Asn Asn Asp Asp Thr Thr Arg Ser Ala Ser Pro Arg
195 200 205
Tyr Gln Thr Trp Glu Leu Leu Asn Pro Ala Phe Asp Leu Asn Gln Lys
210 215 220
Tyr Thr Phe Val Ile Gly Ser Ala Thr Gly Ala Ala Asn Asn Lys His
225 230 235 240
Gln Ile Gly Val Thr Leu Phe Glu Ala Tyr Phe Thr Lys Pro Thr Ile
245 250 255
Glu Ala Asn Pro Val Asp Ile Glu Ile Gly Thr Ala Phe Asp Pro Leu
260 265 270
Lys Tyr Glu Leu Ile Gly Leu Lys Ala Thr Asp Glu Ile Asp Gly Asp
275 280 285
Ile Thr Asp Lys Ile Thr Val Glu Phe Asn Asn Val Asp Thr Ser Lys
290 295 300
Pro Gly Ala Tyr Arg Val Thr Tyr Lys Val Leu Asn Ser Tyr Glu Glu
305 310 315 320
Ser Asp Glu Lys Ile Ile Glu Val Val Val Tyr Thr Lys Pro Thr Ile
325 330 335
Ala Ala His Asp Val Ala Ile Lys Lys Asp Leu Ala Phe Asp Pro Leu
340 345 350
Asp Tyr Glu Pro Ile Glu Leu Lys Ala Thr Asp Pro Ile Asp Gly Asp
355 360 365
Ile Thr Asp Lys Ile Thr Ile Glu Ser Asn Asn Ala Asp Thr Ser Lys
370 375 380
Pro Gly Lys Tyr His Val Thr Tyr Lys Val Leu Asn Ser Tyr Glu Lys
385 390 395 400
Ser Asp Glu Lys Thr Ile Glu Ile Thr Val Tyr Thr Lys Pro Thr Leu
405 410 415
Glu Ala Tyr Asp Val Glu Ile Lys Lys Asp Thr Val Phe Asp Pro Leu
420 425 430
Asn Tyr Glu Pro Ile Gly Leu Lys Ala Thr Asp Pro Ile Asp Gly Asp
435 440 445
Ile Thr Asp Lys Ile Thr Val Glu Ser Asn Asn Val Asp Thr Ser Lys
450 455 460
Pro Gly Glu Tyr His Val Thr Tyr Lys Val Leu Asn Ser Tyr Glu Glu
465 470 475 480
Ser Asp Glu Lys Thr Ile Thr Val Met Val Pro Val Ile Asp Asp Gly
485 490 495
Trp Glu Asn Gly Asp Pro Asn Gly Trp Lys Phe Phe Ser Gly Glu Ser
500 505 510
Ile Thr Leu Glu Glu Asp Glu Glu Asn Ala Leu Asn Gly Lys Trp Val
515 520 525
Phe Tyr Ser Asp Lys His Val Ala Ile Tyr Lys Gln Val Glu Leu Glu
530 535 540
Asn Asn Ile Ser Tyr Gln Ile Thr Val Tyr Val Lys Pro Glu Asp Glu
545 550 555 560
Gly Thr Val Ala Asn His Ile Val Lys Val Ser Leu Lys Pro Asp Pro
565 570 575
Ala Ser Gly Glu Ser Gln Glu Ile Leu Asn Thr Arg Leu Leu Gly Ala
580 585 590
Glu Gln Ile Gln Lys Gly Tyr Arg Lys Leu Thr Ser Asn Pro Phe Ile
595 600 605
Pro Thr Thr Ile Glu Pro Tyr Lys Lys Pro Ile Ile Val Ile Glu Asn
610 615 620
Phe Leu Val Gly Trp Ile Gly Gly Ile Lys Ile Ile Val Glu Pro Thr
625 630 635 640
Lys
<210> 133
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 133
Met Asp Ser Lys Ser Val Met Glu Lys Lys Glu Gln Lys Ser Gln Tyr
1 5 10 15
Leu Asp Ile Arg Asp Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Phe Asp Met Arg His Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Gln Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Thr Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Pro
145 150 155 160
Ala Pro Asn Ser Lys Trp Arg Ile Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Asn Asn Asp Asp Thr
180 185 190
Thr Arg Ser Ala Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Lys Tyr Glu Leu Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
275 280 285
Asn Asn Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Leu Asn Ser Tyr Glu Glu Ser Asp Glu Lys Ile Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Ala Ala His Asp Val Ala Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asp Tyr Glu Pro Ile Glu Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Ile Glu Ser
355 360 365
Asn Asn Ala Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Leu Asn Ser Tyr Glu Lys Ser Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Leu Glu Ala Tyr Asp Val Glu Ile Lys Lys
405 410 415
Asp Thr Val Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asn Asn Val Asp Thr Ser Lys Pro Gly Glu Tyr His Val Thr Tyr Lys
450 455 460
Val Leu Asn Ser Tyr Glu Glu Ser Asp Glu Lys Thr Ile Thr Val Met
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Asn Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Ser Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ser Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Glu Asn Asn Ile Ser Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala Asn His Ile Val Lys
545 550 555 560
Val Ser Leu Lys Pro Asp Pro Ala Ser Gly Glu Ser Gln Glu Ile Leu
565 570 575
Asn Thr Arg Leu Leu Gly Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Asn Pro Phe Ile Pro Thr Thr Ile Glu Pro Tyr Lys Lys
595 600 605
Pro Ile Ile Val Ile Glu Asn Phe Leu Val Gly Trp Ile Gly Gly Ile
610 615 620
Lys Ile Ile Val Glu Pro Thr Lys
625 630
<210> 134
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 134
Leu Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Ala
145 150 155 160
Pro Pro Asn Gly Gln Trp Arg Val Leu Thr Ile Lys Trp Asp Ala Arg
165 170 175
Asp Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 135
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 135
Met Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Ala
145 150 155 160
Pro Pro Asn Gly Gln Trp Arg Val Leu Thr Ile Lys Trp Asp Ala Arg
165 170 175
Asp Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 136
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 136
Leu Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Ser Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Ala
145 150 155 160
Pro Pro Asn Gly Gln Trp Arg Val Leu Thr Ile Lys Trp Asp Ala Arg
165 170 175
Asp Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 137
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 137
Met Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Ser Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Ala
145 150 155 160
Pro Pro Asn Gly Gln Trp Arg Val Leu Thr Ile Lys Trp Asp Ala Arg
165 170 175
Asp Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 138
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 138
Met Asn Ser Lys Ser Thr Ile Glu Lys Glu Val Gln Lys Ser Gln Tyr
1 5 10 15
Leu Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg His Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Asn Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Ser
145 150 155 160
Thr Pro Asn Gly Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Ser Ser Ala Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Ala
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Glu Glu Ser Asp Glu Lys Thr Ile Glu Val Ile
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ile Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Val Thr Val Glu Ser
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Ala Tyr Lys
370 375 380
Val Val Asn Ser Tyr Glu Lys Ser Asp Glu Lys Thr Ile Glu Val Val
385 390 395 400
Val Tyr Thr Lys Pro Thr Leu Glu Ala His Asp Val Glu Val Lys Lys
405 410 415
Asp Thr Val Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asp Asn Val Asp Thr Ser Lys Pro Gly Glu Tyr His Val Thr Tyr Lys
450 455 460
Val Val Asn Ser Tyr Glu Glu Ser Asp Glu Lys Ser Ile Leu Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Lys Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Ser Ile Thr Leu Asp Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ser Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Glu Asn Asn Thr Thr Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
545 550 555 560
Ala Ser Leu Lys Pro Asn Pro Ser Gly Pro Asp Ser Glu Glu Leu Leu
565 570 575
Asn Glu Arg Leu Ile Asn Gly Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Glu Pro Tyr Lys Lys
595 600 605
Pro Leu Ile Val Ile Glu Asn Tyr Leu Ala Gly Trp Ile Gly Gly Ile
610 615 620
Lys Ile Ile Val Glu Pro Thr Lys
625 630
<210> 139
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 139
Met Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asp
130 135 140
Arg Lys Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Ala
145 150 155 160
Pro Pro Asn Gly Gln Trp Arg Val Leu Thr Ile Lys Trp Asp Ala Arg
165 170 175
Asp Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 140
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 140
Met Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asp
130 135 140
Arg Lys Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Ala
145 150 155 160
Pro Pro Asn Gly Gln Trp Arg Val Leu Thr Ile Lys Trp Asp Ala Arg
165 170 175
Asp Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 141
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 141
Leu Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Pro
145 150 155 160
Ala Pro Asn Asp Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Ala Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Lys Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 142
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 142
Met Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Pro
145 150 155 160
Ala Pro Asn Asp Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Ala Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Lys Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 143
<211> 472
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 143
Leu Asn Ser Lys Ser Ile Thr Glu Lys Glu Ile Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Lys Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Glu Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Ser Asn Asp Ala Ser
180 185 190
Thr Ser Thr Pro Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Glu His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Ser Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Tyr
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Lys Tyr Lys
290 295 300
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
305 310 315 320
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
325 330 335
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
340 345 350
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
355 360 365
Tyr Lys Gln Val Glu Leu Lys Asn Asn Ile Pro Tyr Gln Ile Thr Val
370 375 380
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
385 390 395 400
Val Ser Phe Lys Ser Asp Ser Ala Gly Pro Glu Asn Glu Glu Val Ile
405 410 415
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
420 425 430
Leu Thr Ser Met Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
435 440 445
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
450 455 460
Arg Ile Ile Val Glu Pro Thr Lys
465 470
<210> 144
<211> 472
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 144
Met Asn Ser Lys Ser Ile Thr Glu Lys Glu Ile Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Lys Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Glu Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Ser Asn Asp Ala Ser
180 185 190
Thr Ser Thr Pro Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Glu His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Ser Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Tyr
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Lys Tyr Lys
290 295 300
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
305 310 315 320
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
325 330 335
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
340 345 350
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
355 360 365
Tyr Lys Gln Val Glu Leu Lys Asn Asn Ile Pro Tyr Gln Ile Thr Val
370 375 380
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
385 390 395 400
Val Ser Phe Lys Ser Asp Ser Ala Gly Pro Glu Asn Glu Glu Val Ile
405 410 415
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
420 425 430
Leu Thr Ser Met Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
435 440 445
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
450 455 460
Arg Ile Ile Val Glu Pro Thr Lys
465 470
<210> 145
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 145
Leu Asn Ser Lys Ser Ile Ile Glu Lys Gly Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Val Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Asn Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Ser Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Ala Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Ser Asn Asp Ala Ser
180 185 190
Thr Ser Thr Pro Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ser Phe Asp Leu Asn Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Ile Ala Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Ala Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Ile
385 390 395 400
Val Tyr Thr Lys Pro Ser Ile Val Ala His Asp Val Glu Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asn Asp Val Asp Thr Ser Ile Pro Gly Ala Tyr Ser Val Lys Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Ile Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Pro Glu Asn Glu Glu Val Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Met Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 146
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 146
Met Asn Ser Lys Ser Ile Ile Glu Lys Gly Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Val Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Asn Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Ser Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Ala Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Ser Asn Asp Ala Ser
180 185 190
Thr Ser Thr Pro Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ser Phe Asp Leu Asn Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Ile Ala Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Ala Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Ile
385 390 395 400
Val Tyr Thr Lys Pro Ser Ile Val Ala His Asp Val Glu Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asn Asp Val Asp Thr Ser Ile Pro Gly Ala Tyr Ser Val Lys Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Ile Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Pro Glu Asn Glu Glu Val Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Met Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 147
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 147
Leu Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asp
130 135 140
Arg Lys Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Ala
145 150 155 160
Pro Pro Asn Gly Gln Trp Arg Val Leu Thr Ile Lys Trp Asp Ala Arg
165 170 175
Asp Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Lys Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 148
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 148
Met Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asp
130 135 140
Arg Lys Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Ala
145 150 155 160
Pro Pro Asn Gly Gln Trp Arg Val Leu Thr Ile Lys Trp Asp Ala Arg
165 170 175
Asp Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Lys Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 149
<211> 641
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 149
Met Lys Gly Arg Arg Pro Met Gly Asn Met Asp Ser Lys Ser Val Met
1 5 10 15
Glu Lys Lys Glu Gln Lys Ser Gln Tyr Leu Asp Ile Arg Asp Ile Cys
20 25 30
Ser Ile Asn Gly Ser Ala Lys Phe Asp Pro Asn Thr Asn Ile Thr Thr
35 40 45
Leu Thr Glu Ala Ile Asn Ser Gln Ala Gly Ala Ile Ala Gly Lys Thr
50 55 60
Ala Phe Asp Met Arg His Asp Phe Thr Leu Val Ala Asp Ile Tyr Leu
65 70 75 80
Gly Ser Lys Ser Ser Gly Ala Asp Gly Ile Ala Ile Ala Phe His Arg
85 90 95
Gly Ser Ile Gly Phe Ile Gly Gln Met Gly Gly Gly Leu Gly Ile Leu
100 105 110
Gly Ala Pro Asn Gly Ile Gly Phe Glu Ile Asp Thr Tyr Trp Lys Ala
115 120 125
Ser Ser Asp Glu Thr Gly Asp Ser Phe Gly His Gly Gln Met Asn Gly
130 135 140
Pro His Ala Gly Phe Val Ser Thr Asn Arg Asn Thr Ser Tyr Leu Thr
145 150 155 160
Ala Leu Ala Pro Met Gln Arg Ile Pro Ala Pro Asn Ser Lys Trp Arg
165 170 175
Ile Leu Thr Ile Asn Trp Asp Ala Arg Asn Asn Lys Leu Thr Ala Arg
180 185 190
Leu Gln Glu Lys Asn Asn Asp Asp Thr Thr Arg Ser Ala Ser Pro Arg
195 200 205
Tyr Gln Thr Trp Glu Leu Leu Asn Pro Ala Phe Asp Leu Asn Gln Lys
210 215 220
Tyr Thr Phe Val Ile Gly Ser Ala Thr Gly Ala Ala Asn Asn Lys His
225 230 235 240
Gln Ile Gly Val Thr Leu Phe Glu Ala Tyr Phe Thr Lys Pro Thr Ile
245 250 255
Glu Ala Asn Pro Val Asp Ile Glu Ile Gly Thr Ala Phe Asp Pro Leu
260 265 270
Lys Tyr Glu Leu Ile Gly Leu Lys Ala Thr Asp Glu Ile Asp Gly Asp
275 280 285
Ile Thr Asp Lys Ile Thr Val Glu Phe Asn Asn Val Asp Thr Ser Lys
290 295 300
Pro Gly Ala Tyr Arg Val Thr Tyr Lys Val Leu Asn Ser Tyr Glu Glu
305 310 315 320
Ser Asp Glu Lys Thr Ile Glu Val Val Val Tyr Thr Lys Pro Thr Ile
325 330 335
Ala Ala His Asp Val Ala Ile Lys Lys Asp Leu Ala Phe Asp Pro Leu
340 345 350
Asp Tyr Glu Pro Ile Glu Leu Lys Ala Thr Asp Pro Ile Asp Gly Asp
355 360 365
Ile Thr Asp Lys Ile Thr Ile Glu Ser Asn Asn Ala Asp Thr Ser Lys
370 375 380
Pro Gly Lys Tyr His Val Thr Tyr Lys Val Leu Asn Ser Tyr Glu Lys
385 390 395 400
Ser Asp Glu Lys Thr Ile Glu Ile Thr Val Tyr Thr Lys Pro Thr Leu
405 410 415
Glu Ala Tyr Asp Val Glu Ile Lys Lys Asp Thr Val Phe Asp Pro Leu
420 425 430
Asn Tyr Glu Pro Ile Gly Leu Lys Ala Thr Asp Pro Ile Asp Gly Asp
435 440 445
Ile Thr Asp Lys Ile Thr Val Glu Ser Asn Asn Val Asp Thr Ser Lys
450 455 460
Pro Gly Glu Tyr His Val Thr Tyr Lys Val Leu Asn Ser Tyr Glu Glu
465 470 475 480
Ser Asp Glu Lys Thr Ile Thr Val Met Val Pro Val Ile Asp Asp Gly
485 490 495
Trp Glu Asn Gly Asp Pro Asn Gly Trp Lys Phe Phe Ser Gly Glu Ser
500 505 510
Ile Thr Leu Glu Glu Asp Glu Glu Asn Ala Leu Asn Gly Lys Trp Val
515 520 525
Phe Tyr Ser Asp Lys His Val Ala Ile Tyr Lys Gln Val Glu Leu Glu
530 535 540
Asn Asn Ile Ser Tyr Gln Ile Thr Val Tyr Val Lys Pro Glu Asp Glu
545 550 555 560
Gly Thr Val Ala Asn His Ile Val Lys Val Ser Leu Lys Pro Asp Pro
565 570 575
Ala Ser Gly Glu Ser Gln Glu Ile Leu Asn Thr Arg Leu Leu Gly Ala
580 585 590
Glu Gln Ile Gln Lys Gly Tyr Arg Lys Leu Thr Ser Asn Pro Phe Ile
595 600 605
Pro Thr Thr Ile Glu Pro Tyr Lys Lys Pro Ile Ile Val Ile Glu Asn
610 615 620
Phe Leu Ala Gly Trp Ile Gly Gly Ile Lys Ile Ile Val Glu Pro Thr
625 630 635 640
Lys
<210> 150
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 150
Met Asp Ser Lys Ser Val Met Glu Lys Lys Glu Gln Lys Ser Gln Tyr
1 5 10 15
Leu Asp Ile Arg Asp Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Phe Asp Met Arg His Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Gln Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Thr Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Pro
145 150 155 160
Ala Pro Asn Ser Lys Trp Arg Ile Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Asn Asn Asp Asp Thr
180 185 190
Thr Arg Ser Ala Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Lys Tyr Glu Leu Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
275 280 285
Asn Asn Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Leu Asn Ser Tyr Glu Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Ala Ala His Asp Val Ala Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asp Tyr Glu Pro Ile Glu Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Ile Glu Ser
355 360 365
Asn Asn Ala Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Leu Asn Ser Tyr Glu Lys Ser Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Leu Glu Ala Tyr Asp Val Glu Ile Lys Lys
405 410 415
Asp Thr Val Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asn Asn Val Asp Thr Ser Lys Pro Gly Glu Tyr His Val Thr Tyr Lys
450 455 460
Val Leu Asn Ser Tyr Glu Glu Ser Asp Glu Lys Thr Ile Thr Val Met
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Asn Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Ser Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ser Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Glu Asn Asn Ile Ser Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala Asn His Ile Val Lys
545 550 555 560
Val Ser Leu Lys Pro Asp Pro Ala Ser Gly Glu Ser Gln Glu Ile Leu
565 570 575
Asn Thr Arg Leu Leu Gly Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Asn Pro Phe Ile Pro Thr Thr Ile Glu Pro Tyr Lys Lys
595 600 605
Pro Ile Ile Val Ile Glu Asn Phe Leu Ala Gly Trp Ile Gly Gly Ile
610 615 620
Lys Ile Ile Val Glu Pro Thr Lys
625 630
<210> 151
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 151
Leu Asn Ser Lys Ser Ile Ile Glu Lys Glu Val Gln Glu Ser Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Lys Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Asn Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Ala Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Ile Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Lys Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Gln Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Pro Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Ile Ala Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Ala Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala His Asp Val Ala Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Ser Val Glu Ser
435 440 445
Asn Asn Val Asp Thr Ser Lys Pro Gly Glu Tyr Gln Val Thr Tyr Lys
450 455 460
Val Ile Asn Ser Tyr Gly Glu Phe Asp Asp Lys Thr Ile Thr Ile Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Gln Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Phe Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Ile Lys Pro Glu Asp Glu Glu Thr Val Ala His His Asn Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Ser Gln Glu Ser Glu Glu Ile Leu
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Ala Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 152
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 152
Met Asn Ser Lys Ser Ile Ile Glu Lys Glu Val Gln Glu Ser Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Lys Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Asn Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Ala Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Ile Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Lys Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Gln Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Pro Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Ile Ala Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Ala Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala His Asp Val Ala Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Ser Val Glu Ser
435 440 445
Asn Asn Val Asp Thr Ser Lys Pro Gly Glu Tyr Gln Val Thr Tyr Lys
450 455 460
Val Ile Asn Ser Tyr Gly Glu Phe Asp Asp Lys Thr Ile Thr Ile Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Gln Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Phe Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Ile Lys Pro Glu Asp Glu Glu Thr Val Ala His His Asn Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Ser Gln Glu Ser Glu Glu Ile Leu
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Ala Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 153
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 153
Leu Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Pro
145 150 155 160
Ala Pro Asn Asp Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Ala Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 154
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 154
Met Asn Ser Lys Ser Ile Ile Glu Asn Glu Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Pro
145 150 155 160
Ala Pro Asn Asp Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Ser Leu Gln Glu Lys Asn Asn Asp Ala Ser
180 185 190
Thr Arg Ser Ala Thr Pro Arg Tyr Gln Ala Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Val Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Ala Ile Lys Lys
325 330 335
Gly Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Thr Ile Val Ala Gln Asp Val Lys Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Phe
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Arg Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Glu Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 155
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 155
Met Asn Ser Lys Ser Thr Ile Glu Lys Glu Val Gln Lys Ser Gln Tyr
1 5 10 15
Leu Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg His Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Phe Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Thr Phe His Arg Gly Ser Ile Gly Phe Ile Gly Asn Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Ser Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Pro His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Arg Ile Pro
145 150 155 160
Ala Pro Asn Gly Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Ser Asn Asp Ala Ser
180 185 190
Thr Ser Ser Pro Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Val Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Ile Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Ile
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Gly Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Glu Glu Ser Asp Glu Lys Thr Ile Glu Val Ile
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Val Val Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Ile Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Val Thr Val Glu Ser
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Ala Tyr Lys
370 375 380
Val Val Asn Ser Tyr Glu Lys Ser Asp Glu Lys Thr Ile Glu Val Val
385 390 395 400
Val Tyr Thr Lys Pro Thr Leu Glu Ala His Asp Val Glu Ile Lys Lys
405 410 415
Asp Thr Val Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asp Asn Val Asp Thr Ser Lys Pro Gly Glu Tyr His Val Thr Tyr Lys
450 455 460
Val Val Asn Ser Tyr Glu Glu Ser Asp Glu Lys Ser Ile Leu Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Lys Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Ser Ile Thr Leu Asp Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Ser Ser Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Glu Asn Asn Thr Thr Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
545 550 555 560
Ala Ser Leu Lys Pro Asp Pro Ser Gly Pro Asp Ser Glu Glu Leu Leu
565 570 575
Asn Glu Arg Leu Ile Asn Gly Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Glu Pro Tyr Lys Lys
595 600 605
Pro Leu Ile Val Ile Glu Asn Tyr Leu Ala Gly Trp Ile Gly Gly Ile
610 615 620
Lys Ile Ile Val Glu Pro Thr Lys
625 630
<210> 156
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 156
Leu Asn Ser Lys Ser Ile Ile Glu Lys Gly Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Val Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Asn Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Ala Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Ser Asn Asp Ala Ser
180 185 190
Thr Ser Thr Pro Ser Pro Ser Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Ile Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Ile Ala Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Ala Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Thr
385 390 395 400
Val Tyr Thr Lys Pro Ser Ile Val Ala His Asp Val Glu Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Lys Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Cys Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Ile Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Ser Pro Glu Ser Glu Glu Val Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Arg Lys Gly Tyr Arg Gln
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Ile Glu Pro Thr Lys
625 630
<210> 157
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 157
Met Asn Ser Lys Ser Ile Ile Glu Lys Gly Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Val Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Asn Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Ala Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Ser Asn Asp Ala Ser
180 185 190
Thr Ser Thr Pro Ser Pro Ser Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Ile Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Ile Ala Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Ala Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Thr
385 390 395 400
Val Tyr Thr Lys Pro Ser Ile Val Ala His Asp Val Glu Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Lys Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Cys Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Ile Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Ser Pro Glu Ser Glu Glu Val Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Arg Lys Gly Tyr Arg Gln
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Ile Glu Pro Thr Lys
625 630
<210> 158
<211> 632
<212> PRT
<213> Unknown
<220>
<223> Bacillus sp. obtained from an environmental sample
<400> 158
Leu Asn Ser Lys Ser Ile Ile Glu Lys Gly Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Gln Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Ala Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Ser Asn Asp Ala Ser
180 185 190
Thr Arg Thr Pro Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Ile Ala Ile Lys Lys
325 330 335
Asp Leu Thr Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Ile Thr Tyr Lys
370 375 380
Val Val Asn Ser Tyr Glu Lys Ile Glu Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Ser Ile Glu Ala Tyr Asp Val Glu Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Lys Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Lys Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Ala Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Val Ile Val Glu Pro Thr Lys
625 630
<210> 159
<211> 632
<212> PRT
<213> Artificial sequence
<220>
<223> variant of native sequence
<400> 159
Met Asn Ser Lys Ser Ile Ile Glu Lys Gly Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Gln Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Ala Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Ser Asn Asp Ala Ser
180 185 190
Thr Arg Thr Pro Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Ile Ala Ile Lys Lys
325 330 335
Asp Leu Thr Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Ile Thr Tyr Lys
370 375 380
Val Val Asn Ser Tyr Glu Lys Ile Glu Glu Lys Thr Ile Glu Ile Thr
385 390 395 400
Val Tyr Thr Lys Pro Ser Ile Glu Ala Tyr Asp Val Glu Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Lys Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Glu Asp Glu Glu Asn
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Thr Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Gln Glu Ser Glu Glu Ile Ile
565 570 575
Asn Lys Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Ala Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Val Ile Val Glu Pro Thr Lys
625 630
<210> 160
<211> 632
<212> PRT
<213> Bacillus thuringiensis
<400> 160
Leu Asn Ser Lys Ser Ile Ile Glu Lys Gly Val Gln Glu Asn Gln Tyr
1 5 10 15
Ile Asp Ile Arg Asn Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp
20 25 30
Pro Asn Thr Asn Ile Thr Thr Leu Thr Glu Ala Ile Asn Ser Gln Ala
35 40 45
Gly Ala Ile Ala Gly Lys Thr Ala Leu Asp Met Arg Arg Asp Phe Thr
50 55 60
Leu Val Ala Asp Ile Tyr Leu Gly Ser Lys Ser Ser Gly Ala Asp Gly
65 70 75 80
Ile Ala Ile Ala Phe His Arg Gly Ser Ile Gly Phe Ile Gly Thr Met
85 90 95
Gly Gly Gly Leu Gly Ile Leu Gly Ala Pro Asn Gly Ile Gly Phe Glu
100 105 110
Ile Asp Thr Tyr Trp Lys Ala Thr Ser Asp Glu Thr Gly Asp Ser Phe
115 120 125
Gly His Gly Gln Met Asn Gly Ala His Ala Gly Phe Val Ser Thr Asn
130 135 140
Arg Asn Ala Ser Tyr Leu Thr Ala Leu Ala Pro Met Gln Lys Ile Pro
145 150 155 160
Ala Pro Asn Asn Lys Trp Arg Val Leu Thr Ile Asn Trp Asp Ala Arg
165 170 175
Asn Asn Lys Leu Thr Ala Arg Leu Gln Glu Lys Ser Asn Asp Ala Ser
180 185 190
Thr Ser Thr Pro Ser Pro Arg Tyr Gln Thr Trp Glu Leu Leu Asn Pro
195 200 205
Ala Phe Asp Leu Asn Gln Lys Tyr Thr Phe Ile Ile Gly Ser Ala Thr
210 215 220
Gly Ala Ala Asn Asn Lys His Gln Ile Gly Val Thr Leu Phe Glu Ala
225 230 235 240
Tyr Phe Thr Lys Pro Thr Ile Glu Ala Asn Pro Val Asp Ile Glu Leu
245 250 255
Gly Thr Ala Phe Asp Pro Leu Asn His Glu Pro Ile Gly Leu Lys Ala
260 265 270
Thr Asp Glu Val Asp Gly Asp Ile Thr Lys Asp Ile Thr Val Glu Phe
275 280 285
Asn Asp Ile Asp Thr Ser Lys Pro Gly Ala Tyr Arg Val Thr Tyr Lys
290 295 300
Val Val Asn Ser Tyr Gly Glu Ser Asp Glu Lys Thr Ile Glu Val Val
305 310 315 320
Val Tyr Thr Lys Pro Thr Ile Thr Ala His Asp Ile Thr Ile Lys Lys
325 330 335
Asp Leu Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
340 345 350
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Ala Val Lys Phe
355 360 365
Asn Asn Val Asp Thr Ser Lys Pro Gly Lys Tyr His Val Thr Tyr Lys
370 375 380
Val Ile Asn Ser Tyr Glu Lys Ile Asp Glu Lys Thr Ile Glu Val Thr
385 390 395 400
Val Tyr Thr Lys Pro Ser Ile Val Ala His Asp Val Glu Ile Lys Lys
405 410 415
Asp Thr Ala Phe Asp Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala
420 425 430
Thr Asp Pro Ile Asp Gly Asp Ile Thr Asp Lys Ile Thr Val Glu Ser
435 440 445
Asn Asp Val Asp Thr Ser Lys Pro Gly Ala Tyr Ser Val Lys Tyr Lys
450 455 460
Val Val Asn Asn Tyr Glu Glu Ser Asp Glu Lys Thr Ile Ala Val Thr
465 470 475 480
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro Thr Gly Trp
485 490 495
Lys Phe Phe Ser Gly Glu Thr Ile Thr Leu Glu Asp Asp Glu Glu His
500 505 510
Ala Leu Asn Gly Lys Trp Val Phe Tyr Ala Asp Lys His Val Ala Ile
515 520 525
Tyr Lys Gln Val Glu Leu Lys Asn Asn Ile Pro Tyr Gln Ile Thr Val
530 535 540
Tyr Val Lys Pro Glu Asp Glu Gly Thr Val Ala His His Ile Val Lys
545 550 555 560
Val Ser Phe Lys Ser Asp Ser Ala Gly Pro Glu Ser Glu Glu Val Ile
565 570 575
Asn Glu Arg Leu Ile Asp Ala Glu Gln Ile Gln Lys Gly Tyr Arg Lys
580 585 590
Leu Thr Ser Ile Pro Phe Thr Pro Thr Thr Ile Val Pro Asn Lys Lys
595 600 605
Pro Val Ile Ile Val Glu Asn Phe Leu Pro Gly Trp Ile Gly Gly Val
610 615 620
Arg Ile Ile Val Glu Pro Thr Lys
625 630
<210> 161
<211> 15
<212> PRT
<213> Artificial sequence
<220>
<223> consensus sequence
<400> 161
Ile Cys Ser Ile Asn Gly Ser Ala Lys Phe Asp Pro Asn Thr Asn
1 5 10 15
<210> 162
<211> 12
<212> PRT
<213> Artificial sequence
<220>
<223> consensus sequence
<400> 162
Asn Ser Gln Ala Gly Ala Ile Ala Gly Lys Thr Ala
1 5 10
<210> 163
<211> 10
<212> PRT
<213> Artificial sequence
<220>
<223> consensus sequence
<400> 163
Ile Gly Ser Ala Thr Gly Ala Ala Asn Asn
1 5 10
<210> 164
<211> 13
<212> PRT
<213> Artificial sequence
<220>
<223> consensus sequence
<400> 164
Pro Leu Asn Tyr Glu Pro Ile Gly Leu Lys Ala Thr Asp
1 5 10
<210> 165
<211> 13
<212> PRT
<213> Artificial sequence
<220>
<223> consensus sequence
<400> 165
Val Pro Val Ile Asp Asp Gly Trp Glu Asn Gly Asp Pro
1 5 10
<210> 166
<211> 13
<212> PRT
<213> Artificial sequence
<220>
<223> consensus sequence
<400> 166
Glu Asp Glu Glu Asn Ala Leu Asn Gly Lys Trp Val Phe
1 5 10
<210> 167
<211> 11
<212> PRT
<213> Artificial sequence
<220>
<223> consensus sequence
<400> 167
Asp Lys His Val Ala Ile Tyr Lys Gln Val Glu
1 5 10
Claims (23)
- (a) 서열식별번호 (SEQ ID NO): 15, 9, 93, 17, 7, 25, 32, 71, 또는 94에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 90% 퍼센트 서열 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드; 또는
(b) 서열식별번호: 15, 9, 93, 17, 7, 25, 32, 71, 또는 94에 제시된 아미노산 서열을 포함하는 폴리펩티드
를 포함하는, 살충 활성을 갖는 재조합 폴리펩티드. - 제1항에 있어서, 이종 아미노산 서열을 더 포함하는 폴리펩티드.
- 제1항 또는 제2항의 폴리펩티드를 포함하는 조성물.
- (a) 서열식별번호: 15, 9, 93, 17, 7, 25, 32, 71, 또는 94에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 90% 퍼센트 서열 동일성; 또는
(b) 서열식별번호: 15, 9, 93, 17, 7, 25, 32, 71, 또는 94에 제시된 아미노산 서열
을 포함하는 아미노산 서열을 코딩하며, 재조합 핵산 분자가 상기 폴리펩티드를 코딩하는 천연 발생 서열이 아닌 것인 재조합 핵산 분자. - 제4항에 있어서, 상기 핵산 분자가 식물에서의 발현용으로 설계된 합성 서열인 재조합 핵산 분자.
- 제4항 또는 제5항에 있어서, 상기 핵산 분자가 식물 세포에서의 발현을 지정할 수 있는 프로모터에 작동적으로 연결된 것인 재조합 핵산 분자.
- 제4항 내지 제6항 중 어느 한 항에 있어서, 상기 핵산 분자가 박테리아에서의 발현을 지정할 수 있는 프로모터에 작동적으로 연결된 것인 재조합 핵산 분자.
- 제4항 내지 제7항 중 어느 한 항의 재조합 핵산 분자를 포함하는 숙주 세포.
- 제8항에 있어서, 박테리아 숙주 세포인 숙주 세포.
- 서열식별번호: 15, 9, 93, 17, 7, 25, 32, 71, 또는 94에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 90% 퍼센트 서열 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열을 포함하는 재조합 핵산 분자에 작동적으로 연결된 식물 세포에서의 발현을 유도하는 프로모터를 포함하는 DNA 구축물.
- 제10항에 있어서, 상기 뉴클레오티드 서열이 식물에서의 발현용으로 설계된 합성 DNA 서열인 DNA 구축물.
- 제10항의 DNA 구축물을 포함하는 벡터.
- 제12항 또는 제13항의 DNA 구축물을 포함하는 숙주 세포.
- 제13항의 숙주 세포를 포함하는 조성물.
- 제14항에 있어서, 분말, 더스트, 펠릿, 과립, 분무제, 에멀젼, 콜로이드, 및 용액으로 이루어진 군으로부터 선택되는 조성물.
- 제15항에 있어서, 약 1% 내지 약 99 중량%의 상기 폴리펩티드를 포함하는 조성물.
- 해충 집단을 살충-유효량의 제3항 또는 제14항의 조성물과 접촉시키는 것을 포함하는, 해충 집단을 방제하는 방법.
- 제8항 또는 제13항의 숙주 세포를 폴리펩티드를 코딩하는 핵산 분자가 발현되는 조건 하에서 배양하는 것을 포함하는, 살충 활성을 갖는 폴리펩티드를 제조하는 방법.
- 살충 활성을 갖는 단백질을 코딩하는 뉴클레오티드 서열을 포함하는 DNA 구축물이 그의 게놈 내로 안정하게 혼입된 식물이며,
여기서 상기 뉴클레오티드 서열은
(a) 서열식별번호: 15, 9, 93, 17, 7, 25, 32, 71, 또는 94 중 어느 하나의 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열; 또는
(b) 서열식별번호: 15, 9, 93, 17, 7, 25, 32, 71, 또는 94에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 90% 퍼센트 서열 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열
을 포함하는 것인, 식물. - 제19항의 식물의 트랜스제닉 종자.
- 살충 폴리펩티드를 코딩하는 뉴클레오티드 서열을 식물 또는 그의 세포에서 발현시키는 것을 포함하는, 곤충 해충으로부터 식물을 보호하는 방법이며,
여기서 상기 뉴클레오티드 서열은
(a) 서열식별번호: 15, 9, 93, 17, 7, 25, 32, 71, 또는 94 중 어느 하나의 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열; 또는
(b) 서열식별번호: 15, 9, 93, 17, 7, 25, 32, 71, 또는 94에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 90% 퍼센트 서열 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열
을 포함하는 것인, 곤충 해충으로부터 식물을 보호하는 방법. - 제21항에 있어서, 상기 식물이 인시목 또는 딱정벌레목 해충 또는 반시목 해충에 대한 살충성을 갖는 살충 폴리펩티드를 생산하는 것인 방법.
- 살충 폴리펩티드를 코딩하는 뉴클레오티드 서열에 작동적으로 연결된 식물에서의 발현을 유도하는 프로모터를 포함하는 DNA 구축물이 그의 게놈 내로 안정하게 혼입된 식물 또는 그의 종자를 재배지에서 성장시키는 것을 포함하는, 식물에서의 수율을 증가시키는 방법이며, 여기서 상기 뉴클레오티드 서열은
(a) 서열식별번호: 15, 9, 93, 17, 7, 25, 32, 71, 또는 94 중 어느 하나의 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열; 또는
(b) 서열식별번호: 15, 9, 93, 17, 7, 25, 32, 71, 또는 94에 제시된 서열로 이루어진 군으로부터 선택되는 아미노산 서열과 적어도 90% 퍼센트 서열 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드를 코딩하는 뉴클레오티드 서열
을 포함하는 것인, 식물에서의 수율을 증가시키는 방법.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462095524P | 2014-12-22 | 2014-12-22 | |
US62/095,524 | 2014-12-22 | ||
PCT/US2015/066314 WO2016106066A2 (en) | 2014-12-22 | 2015-12-17 | Pesticidal genes and methods of use |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20170101247A true KR20170101247A (ko) | 2017-09-05 |
Family
ID=55135528
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020177020203A KR20170101247A (ko) | 2014-12-22 | 2015-12-17 | 살충 유전자 및 사용 방법 |
Country Status (13)
Country | Link |
---|---|
US (3) | US10480006B2 (ko) |
EP (1) | EP3237438A2 (ko) |
KR (1) | KR20170101247A (ko) |
CN (1) | CN107438617A (ko) |
AR (2) | AR103212A1 (ko) |
AU (1) | AU2015369927A1 (ko) |
BR (1) | BR112017013633A2 (ko) |
CA (2) | CA2970788A1 (ko) |
CL (1) | CL2017001658A1 (ko) |
MX (1) | MX2017008367A (ko) |
PH (1) | PH12017501148A1 (ko) |
RU (2) | RU2020121273A (ko) |
WO (1) | WO2016106066A2 (ko) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111148837A (zh) * | 2018-09-03 | 2020-05-12 | 先正达参股股份有限公司 | 用于控制植物有害生物的组合物和方法 |
Family Cites Families (40)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4196265A (en) | 1977-06-15 | 1980-04-01 | The Wistar Institute | Method of producing antibodies |
US5380831A (en) * | 1986-04-04 | 1995-01-10 | Mycogen Plant Science, Inc. | Synthetic insecticidal crystal protein gene |
US4853331A (en) | 1985-08-16 | 1989-08-01 | Mycogen Corporation | Cloning and expression of Bacillus thuringiensis toxin gene toxic to beetles of the order Coleoptera |
US5039523A (en) | 1988-10-27 | 1991-08-13 | Mycogen Corporation | Novel Bacillus thuringiensis isolate denoted B.t. PS81F, active against lepidopteran pests, and a gene encoding a lepidopteran-active toxin |
US5023179A (en) | 1988-11-14 | 1991-06-11 | Eric Lam | Promoter enhancer element for gene expression in plant roots |
US5110732A (en) | 1989-03-14 | 1992-05-05 | The Rockefeller University | Selective gene expression in plants |
US5240842A (en) | 1989-07-11 | 1993-08-31 | Biotechnology Research And Development Corporation | Aerosol beam microinjector |
EP0548197B1 (en) | 1990-09-10 | 1998-06-03 | Advanced Technolgies (Cambridge) Limited | Plant parasitic nematode control |
CA2051562C (en) | 1990-10-12 | 2003-12-02 | Jewel M. Payne | Bacillus thuringiensis isolates active against dipteran pests |
US5459252A (en) | 1991-01-31 | 1995-10-17 | North Carolina State University | Root specific gene promoter |
JPH06511152A (ja) | 1991-10-04 | 1994-12-15 | ノースカロライナ ステイト ユニバーシティー | 病原体耐性トランスジェニック植物 |
US5428148A (en) | 1992-04-24 | 1995-06-27 | Beckman Instruments, Inc. | N4 - acylated cytidinyl compounds useful in oligonucleotide synthesis |
US5401836A (en) | 1992-07-16 | 1995-03-28 | Pioneer Hi-Bre International, Inc. | Brassica regulatory sequence for root-specific or root-abundant gene expression |
US5849870A (en) | 1993-03-25 | 1998-12-15 | Novartis Finance Corporation | Pesticidal proteins and strains |
EP1471145A2 (en) | 1993-03-25 | 2004-10-27 | Syngenta Participations AG | Pesticipal proteins and strains |
US5633363A (en) | 1994-06-03 | 1997-05-27 | Iowa State University, Research Foundation In | Root preferential promoter |
US5837876A (en) | 1995-07-28 | 1998-11-17 | North Carolina State University | Root cortex specific gene promoter |
US6063756A (en) | 1996-09-24 | 2000-05-16 | Monsanto Company | Bacillus thuringiensis cryET33 and cryET34 compositions and uses therefor |
CA2286284A1 (en) | 1997-04-03 | 1998-10-08 | Novartis Ag | Plant pest control |
CA2332628C (en) | 1998-08-20 | 2005-04-12 | Pioneer Hi-Bred International, Inc. | Seed-preferred promoters |
AU5788299A (en) | 1998-08-28 | 2000-03-21 | Pioneer Hi-Bred International, Inc. | Seed-preferred promoters from (end) genes |
US6468523B1 (en) | 1998-11-02 | 2002-10-22 | Monsanto Technology Llc | Polypeptide compositions toxic to diabrotic insects, and methods of use |
ATE538205T1 (de) | 1999-11-29 | 2012-01-15 | Midwest Oilseeds Inc | Verfahren, medien und vorrichtung zur einführung von molekülen in pflanzenzellen und bakterien mittels aerosolstrahlen |
US7629504B2 (en) | 2003-12-22 | 2009-12-08 | Pioneer Hi-Bred International, Inc. | Bacillus thuringiensis cry9 nucleic acids |
US7022899B2 (en) | 2004-01-30 | 2006-04-04 | Pioneer Hi-Bred International, Inc. | Soybean variety XB53J04 |
GB0508993D0 (en) | 2005-05-03 | 2005-06-08 | Syngenta Participations Ag | Pesticidal compositions |
WO2007147029A2 (en) * | 2006-06-14 | 2007-12-21 | Athenix Corporation | Axmi-031, axmi-039, axmi-040 and axmi-049, a family of delta-endotoxin genes and methods for their use |
US20080072494A1 (en) * | 2006-09-07 | 2008-03-27 | Stoner Richard J | Micronutrient elicitor for treating nematodes in field crops |
WO2009099602A1 (en) * | 2008-02-04 | 2009-08-13 | Massachusetts Institute Of Technology | Selection of nucleic acids by solution hybridization to oligonucleotide baits |
CA2729294C (en) | 2008-06-25 | 2018-08-14 | Athenix Corp. | Toxin genes and methods for their use |
US8318900B2 (en) | 2009-02-27 | 2012-11-27 | Athenix Corp. | Pesticidal proteins and methods for their use |
CA2785195A1 (en) * | 2009-12-21 | 2011-07-14 | Pioneer Hi-Bred International, Inc. | Novel bacillus thuringiensis gene with coleopteran activity |
CA2790023A1 (en) * | 2010-02-18 | 2011-08-25 | Athenix Corp. | Axmi218, axmi219, axmi220, axmi226, axmi227, axmi228, axmi229, axmi230, and axmi231 delta-endotoxin genes and methods for their use |
US8878007B2 (en) | 2011-03-10 | 2014-11-04 | Pioneer Hi Bred International Inc | Bacillus thuringiensis gene with lepidopteran activity |
US9139843B2 (en) * | 2011-05-24 | 2015-09-22 | Pioneer Hi Bred International Inc | Endotoxins having nematocidal activity and methods of use thereof |
MX346439B (es) | 2011-08-22 | 2017-03-17 | Bayer Cropscience Nv | Métodos y medios para modificar un genoma vegetal. |
EP2822963A2 (en) * | 2012-03-09 | 2015-01-14 | Vestaron Corporation | Toxic peptide production, peptide expression in plants and combinations of cysteine rich peptides |
CN102816816B (zh) * | 2012-08-29 | 2014-01-29 | 西安交通大学 | 一种球形芽孢杆菌伴胞晶体蛋白制备方法及其应用 |
CN110172466A (zh) * | 2013-03-07 | 2019-08-27 | 巴斯夫农业解决方案种子美国有限责任公司 | 毒素基因及其使用方法 |
CN103675296B (zh) * | 2013-12-05 | 2015-08-12 | 北京大学 | 与转基因植物外源Bt蛋白相互作用的人类蛋白ANKHD1 |
-
2015
- 2015-12-17 RU RU2020121273A patent/RU2020121273A/ru unknown
- 2015-12-17 EP EP15823874.1A patent/EP3237438A2/en not_active Withdrawn
- 2015-12-17 MX MX2017008367A patent/MX2017008367A/es unknown
- 2015-12-17 CN CN201580074711.6A patent/CN107438617A/zh active Pending
- 2015-12-17 US US14/972,757 patent/US10480006B2/en not_active Expired - Fee Related
- 2015-12-17 KR KR1020177020203A patent/KR20170101247A/ko not_active Application Discontinuation
- 2015-12-17 AU AU2015369927A patent/AU2015369927A1/en not_active Abandoned
- 2015-12-17 WO PCT/US2015/066314 patent/WO2016106066A2/en active Application Filing
- 2015-12-17 RU RU2017126167A patent/RU2727665C2/ru active
- 2015-12-17 BR BR112017013633A patent/BR112017013633A2/pt not_active IP Right Cessation
- 2015-12-17 CA CA2970788A patent/CA2970788A1/en not_active Abandoned
- 2015-12-17 CA CA3103519A patent/CA3103519A1/en not_active Abandoned
- 2015-12-21 AR ARP150104220A patent/AR103212A1/es unknown
-
2017
- 2017-06-19 PH PH12017501148A patent/PH12017501148A1/en unknown
- 2017-06-22 CL CL2017001658A patent/CL2017001658A1/es unknown
-
2019
- 2019-09-18 US US16/574,976 patent/US20200002720A1/en not_active Abandoned
-
2020
- 2020-06-25 US US16/912,253 patent/US20200318133A1/en not_active Abandoned
-
2021
- 2021-05-10 AR ARP210101271A patent/AR122047A2/es unknown
Also Published As
Publication number | Publication date |
---|---|
US10480006B2 (en) | 2019-11-19 |
WO2016106066A2 (en) | 2016-06-30 |
CL2017001658A1 (es) | 2018-03-16 |
US20160177333A1 (en) | 2016-06-23 |
EP3237438A2 (en) | 2017-11-01 |
MX2017008367A (es) | 2017-09-15 |
CA3103519A1 (en) | 2016-06-30 |
CA2970788A1 (en) | 2016-06-30 |
AU2015369927A1 (en) | 2017-07-06 |
US20200318133A1 (en) | 2020-10-08 |
RU2727665C2 (ru) | 2020-07-22 |
BR112017013633A2 (pt) | 2018-03-13 |
RU2020121273A (ru) | 2020-11-03 |
US20200002720A1 (en) | 2020-01-02 |
RU2017126167A3 (ko) | 2019-07-26 |
AR103212A1 (es) | 2017-04-26 |
AR122047A2 (es) | 2022-08-10 |
PH12017501148A1 (en) | 2017-11-27 |
RU2017126167A (ru) | 2019-01-24 |
CN107438617A (zh) | 2017-12-05 |
WO2016106066A3 (en) | 2016-08-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10421976B2 (en) | Pesticidal genes and methods of use | |
RU2738424C2 (ru) | Пестицидные гены и способы их применения | |
US11898153B2 (en) | Pesticidal genes and methods of use | |
ES2943259T3 (es) | Genes de plaguicidas y métodos de uso | |
WO2018140859A2 (en) | Pesticidal genes and methods of use | |
US20220272974A1 (en) | Pesticidal genes and methods of use | |
US20200318133A1 (en) | Pesticidal genes and methods of use | |
WO2019204584A1 (en) | Pesticidal proteins and methods of use | |
EP3728606A1 (en) | Pesticidal genes and methods of use | |
BR112019021380A2 (pt) | genes pesticidas e métodos de uso | |
EP3661950A2 (en) | Pesticidal genes and methods of use |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
E902 | Notification of reason for refusal | ||
E601 | Decision to refuse application |