CA2410153C - A novel bacillus thuringiensis strain, crystal gene and crystal protein and uses thereof - Google Patents
A novel bacillus thuringiensis strain, crystal gene and crystal protein and uses thereof Download PDFInfo
- Publication number
- CA2410153C CA2410153C CA2410153A CA2410153A CA2410153C CA 2410153 C CA2410153 C CA 2410153C CA 2410153 A CA2410153 A CA 2410153A CA 2410153 A CA2410153 A CA 2410153A CA 2410153 C CA2410153 C CA 2410153C
- Authority
- CA
- Canada
- Prior art keywords
- ser
- leu
- amino acid
- ala
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 87
- 241000193388 Bacillus thuringiensis Species 0.000 title claims abstract description 82
- 229940097012 bacillus thuringiensis Drugs 0.000 title claims abstract description 54
- 101710151559 Crystal protein Proteins 0.000 title claims abstract description 51
- 239000013078 crystal Substances 0.000 title abstract description 15
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 11
- 230000036541 health Effects 0.000 claims abstract description 6
- 210000004027 cell Anatomy 0.000 claims description 84
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 83
- 102000004169 proteins and genes Human genes 0.000 claims description 57
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 46
- 229920001184 polypeptide Polymers 0.000 claims description 43
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 43
- 239000002773 nucleotide Substances 0.000 claims description 42
- 125000003729 nucleotide group Chemical group 0.000 claims description 42
- 108020004414 DNA Proteins 0.000 claims description 40
- 206010028980 Neoplasm Diseases 0.000 claims description 39
- 201000011510 cancer Diseases 0.000 claims description 39
- 238000000034 method Methods 0.000 claims description 32
- 108090000631 Trypsin Proteins 0.000 claims description 24
- 102000004142 Trypsin Human genes 0.000 claims description 24
- 230000001472 cytotoxic effect Effects 0.000 claims description 24
- 239000012588 trypsin Substances 0.000 claims description 24
- 231100000433 cytotoxic Toxicity 0.000 claims description 23
- 150000007523 nucleic acids Chemical class 0.000 claims description 23
- 102000039446 nucleic acids Human genes 0.000 claims description 22
- 108020004707 nucleic acids Proteins 0.000 claims description 22
- 238000009396 hybridization Methods 0.000 claims description 18
- 231100000135 cytotoxicity Toxicity 0.000 claims description 16
- 230000003013 cytotoxicity Effects 0.000 claims description 15
- 239000013598 vector Substances 0.000 claims description 12
- 210000001744 T-lymphocyte Anatomy 0.000 claims description 9
- 238000000338 in vitro Methods 0.000 claims description 7
- 230000000295 complement effect Effects 0.000 claims description 4
- 244000005700 microbiome Species 0.000 claims description 4
- 230000002934 lysing effect Effects 0.000 claims description 3
- 238000005406 washing Methods 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 2
- 238000010188 recombinant method Methods 0.000 claims description 2
- 238000010998 test method Methods 0.000 claims description 2
- 239000003053 toxin Substances 0.000 abstract description 6
- 231100000765 toxin Toxicity 0.000 abstract description 6
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 abstract description 3
- 239000002246 antineoplastic agent Substances 0.000 abstract description 2
- 229940024606 amino acid Drugs 0.000 description 124
- 235000001014 amino acid Nutrition 0.000 description 123
- 150000001413 amino acids Chemical class 0.000 description 97
- 235000018102 proteins Nutrition 0.000 description 53
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 28
- 239000013612 plasmid Substances 0.000 description 22
- 239000004475 Arginine Substances 0.000 description 19
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 19
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 18
- 235000004279 alanine Nutrition 0.000 description 18
- 239000012634 fragment Substances 0.000 description 18
- 108010087924 alanylproline Proteins 0.000 description 17
- 108010020532 tyrosyl-proline Proteins 0.000 description 17
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 16
- 239000004471 Glycine Substances 0.000 description 14
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 14
- 241000880493 Leptailurus serval Species 0.000 description 14
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 14
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 14
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 14
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 13
- 108010005233 alanylglutamic acid Proteins 0.000 description 13
- 108010061238 threonyl-glycine Proteins 0.000 description 13
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 12
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 12
- 239000000523 sample Substances 0.000 description 12
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 11
- 108010078144 glutaminyl-glycine Proteins 0.000 description 11
- 108010050848 glycylleucine Proteins 0.000 description 11
- 108010017391 lysylvaline Proteins 0.000 description 11
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 11
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 10
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 10
- 108091034117 Oligonucleotide Proteins 0.000 description 10
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 10
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 10
- 238000003556 assay Methods 0.000 description 10
- 108010077515 glycylproline Proteins 0.000 description 10
- 108010034529 leucyl-lysine Proteins 0.000 description 10
- 239000000243 solution Substances 0.000 description 10
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 9
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 9
- 239000004472 Lysine Substances 0.000 description 9
- 108010047495 alanylglycine Proteins 0.000 description 9
- 238000006467 substitution reaction Methods 0.000 description 9
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 8
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 8
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 8
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 8
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 8
- 230000027455 binding Effects 0.000 description 8
- 239000002158 endotoxin Substances 0.000 description 8
- 108010018006 histidylserine Proteins 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- 231100000419 toxicity Toxicity 0.000 description 8
- 230000001988 toxicity Effects 0.000 description 8
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 7
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 7
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 7
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 7
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 7
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 7
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 7
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 7
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 7
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 7
- 108020004511 Recombinant DNA Proteins 0.000 description 7
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 7
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 7
- 239000004473 Threonine Substances 0.000 description 7
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 7
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 7
- 108010070783 alanyltyrosine Proteins 0.000 description 7
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 7
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 7
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 7
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 7
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 7
- 229960000310 isoleucine Drugs 0.000 description 7
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 239000005720 sucrose Substances 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 6
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 6
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 6
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 6
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 6
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 6
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 6
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 6
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 6
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 6
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 6
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 6
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 6
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 6
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 6
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 6
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 6
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 6
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 6
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 6
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 6
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 6
- GQNZIAGMRXOFJX-GUBZILKMSA-N Cys-Val-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O GQNZIAGMRXOFJX-GUBZILKMSA-N 0.000 description 6
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 6
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 6
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 6
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 6
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 6
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 6
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 6
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 6
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 6
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 6
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 6
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 6
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 6
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 6
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 6
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 6
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 6
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 6
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 6
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 6
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 6
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 6
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 6
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 6
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 6
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 6
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 6
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 6
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 6
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 6
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 6
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 6
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 6
- 108010066427 N-valyltryptophan Proteins 0.000 description 6
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 6
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 6
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 6
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 6
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 6
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 6
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 6
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 6
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 6
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 6
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 6
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 6
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 6
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 6
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 6
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 6
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 229930006000 Sucrose Natural products 0.000 description 6
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 6
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 6
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 6
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 6
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 6
- SVGAWGVHFIYAEE-JSGCOSHPSA-N Trp-Gly-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 SVGAWGVHFIYAEE-JSGCOSHPSA-N 0.000 description 6
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 6
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 6
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 6
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 6
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 6
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 6
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 6
- RSGHLMMKXJGCMK-JYJNAYRXSA-N Val-Met-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N RSGHLMMKXJGCMK-JYJNAYRXSA-N 0.000 description 6
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 6
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 6
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 6
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 6
- 108010060035 arginylproline Proteins 0.000 description 6
- 229960001230 asparagine Drugs 0.000 description 6
- 235000009582 asparagine Nutrition 0.000 description 6
- 108010068265 aspartyltyrosine Proteins 0.000 description 6
- 238000012512 characterization method Methods 0.000 description 6
- 230000000445 cytocidal effect Effects 0.000 description 6
- 230000000120 cytopathologic effect Effects 0.000 description 6
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 6
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 230000014509 gene expression Effects 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 239000012528 membrane Substances 0.000 description 6
- 108010077112 prolyl-proline Proteins 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 108700012359 toxins Proteins 0.000 description 6
- 229960004799 tryptophan Drugs 0.000 description 6
- 239000004474 valine Substances 0.000 description 6
- 108010027345 wheylin-1 peptide Proteins 0.000 description 6
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 5
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 5
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 5
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 5
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 5
- 235000014469 Bacillus subtilis Nutrition 0.000 description 5
- 241001147758 Bacillus thuringiensis serovar kurstaki Species 0.000 description 5
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 5
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 5
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 5
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 5
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 5
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 5
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 5
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 5
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 5
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 5
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 5
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 5
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 5
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 5
- DJJBHQHOZLUBCN-WDSOQIARSA-N Met-Lys-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DJJBHQHOZLUBCN-WDSOQIARSA-N 0.000 description 5
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 5
- SQPZCTBSLIIMBL-BPUTZDHNSA-N Met-Trp-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SQPZCTBSLIIMBL-BPUTZDHNSA-N 0.000 description 5
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 5
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 5
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 5
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 5
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 5
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 5
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 5
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 5
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 5
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 5
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 5
- 108010093581 aspartyl-proline Proteins 0.000 description 5
- 230000000749 insecticidal effect Effects 0.000 description 5
- 108010000761 leucylarginine Proteins 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 229930182817 methionine Natural products 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 239000013605 shuttle vector Substances 0.000 description 5
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 4
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 4
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 4
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 4
- WUGMRIBZSVSJNP-UFBFGSQYSA-N Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UFBFGSQYSA-N 0.000 description 4
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 4
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 4
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 4
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 4
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 4
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 4
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 4
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 4
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 4
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 4
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 4
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 4
- 241000282693 Cercopithecidae Species 0.000 description 4
- 108010067770 Endopeptidase K Proteins 0.000 description 4
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 4
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 4
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 4
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 4
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 4
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 4
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 4
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 4
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 4
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 4
- RWHRUZORDWZESH-ZQINRCPSSA-N Ile-Trp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RWHRUZORDWZESH-ZQINRCPSSA-N 0.000 description 4
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 4
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 4
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 4
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 4
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 4
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 4
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 4
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 4
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 4
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 4
- 102000035195 Peptidases Human genes 0.000 description 4
- 108091005804 Peptidases Proteins 0.000 description 4
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 4
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 4
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 4
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 4
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 4
- 239000004365 Protease Substances 0.000 description 4
- 101100173636 Rattus norvegicus Fhl2 gene Proteins 0.000 description 4
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 4
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 4
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 4
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 4
- RQXDSYQXBCRXBT-GUBZILKMSA-N Ser-Met-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RQXDSYQXBCRXBT-GUBZILKMSA-N 0.000 description 4
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 4
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 4
- 238000002105 Southern blotting Methods 0.000 description 4
- 108091081024 Start codon Proteins 0.000 description 4
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 4
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 4
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 4
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 4
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 4
- STTVVMWQKDOKAM-YESZJQIVSA-N Tyr-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O STTVVMWQKDOKAM-YESZJQIVSA-N 0.000 description 4
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 4
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 4
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 4
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 239000011543 agarose gel Substances 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 4
- 239000006285 cell suspension Substances 0.000 description 4
- 238000000432 density-gradient centrifugation Methods 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- 229930195712 glutamate Natural products 0.000 description 4
- 210000003000 inclusion body Anatomy 0.000 description 4
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 4
- 210000004379 membrane Anatomy 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 4
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 4
- 235000019419 proteases Nutrition 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 3
- BJNUAWGXPSHQMJ-DCAQKATOSA-N Arg-Gln-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O BJNUAWGXPSHQMJ-DCAQKATOSA-N 0.000 description 3
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 3
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 3
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 3
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 3
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 3
- DMRUJUFCPVHHKP-UHFFFAOYSA-N Asn-Met-Asn-Gln Chemical compound NC(=O)CC(N)C(=O)NC(CCSC)C(=O)NC(CC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O DMRUJUFCPVHHKP-UHFFFAOYSA-N 0.000 description 3
- KEUNWIXNKVWCFL-FXQIFTODSA-N Asn-Met-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O KEUNWIXNKVWCFL-FXQIFTODSA-N 0.000 description 3
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 3
- 244000063299 Bacillus subtilis Species 0.000 description 3
- 241000905451 Bacillus thuringiensis serovar higo Species 0.000 description 3
- 108090000317 Chymotrypsin Proteins 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 241001522878 Escherichia coli B Species 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 3
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 3
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 3
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 3
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 3
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 3
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 3
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 3
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 3
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 3
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 3
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 3
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 3
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 3
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 3
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 3
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 3
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 3
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 3
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 3
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 3
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 3
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 3
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 3
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 3
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 3
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 3
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 3
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 3
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 229940041514 candida albicans extract Drugs 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 229960002376 chymotrypsin Drugs 0.000 description 3
- 231100000673 dose–response relationship Toxicity 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 3
- 230000002949 hemolytic effect Effects 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 208000032839 leukemia Diseases 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 210000004072 lung Anatomy 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 230000008018 melting Effects 0.000 description 3
- 238000002844 melting Methods 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 238000002135 phase contrast microscopy Methods 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010090894 prolylleucine Proteins 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 235000017550 sodium carbonate Nutrition 0.000 description 3
- 229910000029 sodium carbonate Inorganic materials 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 239000012137 tryptone Substances 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- 210000004291 uterus Anatomy 0.000 description 3
- 239000012138 yeast extract Substances 0.000 description 3
- GVJHHUAWPYXKBD-UHFFFAOYSA-N (±)-α-Tocopherol Chemical compound OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 2
- 108020004465 16S ribosomal RNA Proteins 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 2
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 2
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- CFGHCPUPFHWMCM-FDARSICLSA-N Arg-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N CFGHCPUPFHWMCM-FDARSICLSA-N 0.000 description 2
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 2
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 2
- VGRHZPNRCLAHQA-IMJSIDKUSA-N Asp-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O VGRHZPNRCLAHQA-IMJSIDKUSA-N 0.000 description 2
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- 101000664393 Bacillus thuringiensis subsp. finitimus S-layer protein Proteins 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 101000583086 Bunodosoma granuliferum Delta-actitoxin-Bgr2b Proteins 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- ULGZDMOVFRHVEP-RWJQBGPGSA-N Erythromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=O)[C@H](C)C[C@@](C)(O)[C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 ULGZDMOVFRHVEP-RWJQBGPGSA-N 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 2
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 2
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 2
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 2
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 2
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 239000007995 HEPES buffer Substances 0.000 description 2
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 2
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 2
- 244000141359 Malus pumila Species 0.000 description 2
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 239000000020 Nitrocellulose Substances 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 2
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 2
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 2
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 2
- 239000006146 Roswell Park Memorial Institute medium Substances 0.000 description 2
- -1 SAWANO Proteins 0.000 description 2
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 2
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- 238000003917 TEM image Methods 0.000 description 2
- 241001454293 Tetranychus urticae Species 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- ARKBYVBCEOWRNR-UBHSHLNASA-N Trp-Ser-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O ARKBYVBCEOWRNR-UBHSHLNASA-N 0.000 description 2
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 2
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 2
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 2
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 2
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 2
- PLXQRTXVLZUNMU-RNXOBYDBSA-N Tyr-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N PLXQRTXVLZUNMU-RNXOBYDBSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 2
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 2
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 239000008186 active pharmaceutical agent Substances 0.000 description 2
- 208000009956 adenocarcinoma Diseases 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 229940009098 aspartate Drugs 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000002051 biphasic effect Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 201000010881 cervical cancer Diseases 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 210000001072 colon Anatomy 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 231100000409 cytocidal Toxicity 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 210000002950 fibroblast Anatomy 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- BJRNKVDFDLYUGJ-RMPHRYRLSA-N hydroquinone O-beta-D-glucopyranoside Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=CC=C(O)C=C1 BJRNKVDFDLYUGJ-RMPHRYRLSA-N 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 210000003734 kidney Anatomy 0.000 description 2
- 230000000974 larvacidal effect Effects 0.000 description 2
- 239000007791 liquid phase Substances 0.000 description 2
- 239000012160 loading buffer Substances 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 210000001161 mammalian embryo Anatomy 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 108010034507 methionyltryptophan Proteins 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 229920001220 nitrocellulos Polymers 0.000 description 2
- 231100000252 nontoxic Toxicity 0.000 description 2
- 230000003000 nontoxic effect Effects 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- 238000012809 post-inoculation Methods 0.000 description 2
- 230000002797 proteolythic effect Effects 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- QRXMUCSWCMTJGU-UHFFFAOYSA-L (5-bromo-4-chloro-1h-indol-3-yl) phosphate Chemical compound C1=C(Br)C(Cl)=C2C(OP([O-])(=O)[O-])=CNC2=C1 QRXMUCSWCMTJGU-UHFFFAOYSA-L 0.000 description 1
- IJROUPFSQHMOBK-UHFFFAOYSA-N 1-methyl-3-nitro-1-nitrosoguanidine 2-nitrosoguanidine Chemical compound NC(=N)NN=O.O=NN(C)C(=N)N[N+]([O-])=O IJROUPFSQHMOBK-UHFFFAOYSA-N 0.000 description 1
- AZKSAVLVSZKNRD-UHFFFAOYSA-M 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide Chemical compound [Br-].S1C(C)=C(C)N=C1[N+]1=NC(C=2C=CC=CC=2)=NN1C1=CC=CC=C1 AZKSAVLVSZKNRD-UHFFFAOYSA-M 0.000 description 1
- WJXXQCLWWFUNNY-UHFFFAOYSA-N 3-hydroxybutan-2-one;1h-indole Chemical compound CC(O)C(C)=O.C1=CC=C2NC=CC2=C1 WJXXQCLWWFUNNY-UHFFFAOYSA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 241000238876 Acari Species 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- UBTKNYUAMYRMKE-GOPGUHFVSA-N Ala-Trp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N UBTKNYUAMYRMKE-GOPGUHFVSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 240000008025 Alternanthera ficoidea Species 0.000 description 1
- 241000143060 Americamysis bahia Species 0.000 description 1
- 102000009091 Amyloidogenic Proteins Human genes 0.000 description 1
- 108010048112 Amyloidogenic Proteins Proteins 0.000 description 1
- 241001414900 Anopheles stephensi Species 0.000 description 1
- 241000239223 Arachnida Species 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- 241000238421 Arthropoda Species 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 1
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- FIRWLDUOFOULCA-XIRDDKMYSA-N Asp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N FIRWLDUOFOULCA-XIRDDKMYSA-N 0.000 description 1
- ZQFZEBRNAMXXJV-KKUMJFAQSA-N Asp-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O ZQFZEBRNAMXXJV-KKUMJFAQSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 208000035404 Autolysis Diseases 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000193744 Bacillus amyloliquefaciens Species 0.000 description 1
- 241000193365 Bacillus thuringiensis serovar israelensis Species 0.000 description 1
- 241001050388 Bacillus thuringiensis serovar shandongiensis Species 0.000 description 1
- 101000762002 Bacillus thuringiensis subsp. fukuokaensis Pesticidal crystal protein Cry20Aa Proteins 0.000 description 1
- 108010023063 Bacto-peptone Proteins 0.000 description 1
- 208000010667 Carcinoma of liver and intrahepatic biliary tract Diseases 0.000 description 1
- 102100035882 Catalase Human genes 0.000 description 1
- 108010053835 Catalase Proteins 0.000 description 1
- 206010057248 Cell death Diseases 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- RFSUNEUAIZKAJO-VRPWFDPXSA-N D-Fructose Natural products OC[C@H]1OC(O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-VRPWFDPXSA-N 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 241000255925 Diptera Species 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241001529716 Entomophaga Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 206010016334 Feeling hot Diseases 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- 102000013382 Gelatinases Human genes 0.000 description 1
- 108010026132 Gelatinases Proteins 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- 239000004366 Glucose oxidase Substances 0.000 description 1
- 108010015776 Glucose oxidase Proteins 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- 229920002527 Glycogen Polymers 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 206010073069 Hepatic cancer Diseases 0.000 description 1
- 101000829958 Homo sapiens N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase Proteins 0.000 description 1
- 101000578353 Homo sapiens Nodal modulator 2 Proteins 0.000 description 1
- 101000964478 Homo sapiens Zinc finger and BTB domain-containing protein 17 Proteins 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 239000007836 KH2PO4 Substances 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- SRBFZHDQGSBBOR-OWMBCFKOSA-N L-ribopyranose Chemical compound O[C@H]1COC(O)[C@@H](O)[C@H]1O SRBFZHDQGSBBOR-OWMBCFKOSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 1
- DUKURNFHYQXCJG-UHFFFAOYSA-N Lewis A pentasaccharide Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(C)=O)C(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)OC1CO DUKURNFHYQXCJG-UHFFFAOYSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 241000316144 Macrodon ancylodon Species 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- TZHFJXDKXGZHEN-IHRRRGAJSA-N Met-His-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O TZHFJXDKXGZHEN-IHRRRGAJSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- 240000000249 Morus alba Species 0.000 description 1
- 235000008708 Morus alba Nutrition 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- BACYUWVYYTXETD-UHFFFAOYSA-N N-Lauroylsarcosine Chemical compound CCCCCCCCCCCC(=O)N(C)CC(O)=O BACYUWVYYTXETD-UHFFFAOYSA-N 0.000 description 1
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 description 1
- 102100023315 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase Human genes 0.000 description 1
- 229910002651 NO3 Inorganic materials 0.000 description 1
- 206010029260 Neuroblastoma Diseases 0.000 description 1
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 description 1
- 102100027967 Nodal modulator 2 Human genes 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- DKXNBNKWCZZMJT-UHFFFAOYSA-N O4-alpha-D-Mannopyranosyl-D-mannose Natural products O=CC(O)C(O)C(C(O)CO)OC1OC(CO)C(O)C(O)C1O DKXNBNKWCZZMJT-UHFFFAOYSA-N 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 1
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 1
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- 208000002151 Pleural effusion Diseases 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- NGFMICBWJRZIBI-JZRPKSSGSA-N Salicin Natural products O([C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@H](CO)O1)c1c(CO)cccc1 NGFMICBWJRZIBI-JZRPKSSGSA-N 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 1
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 1
- 241001454295 Tetranychidae Species 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- IUFQHOCOKQIOMC-XIRDDKMYSA-N Trp-Asn-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N IUFQHOCOKQIOMC-XIRDDKMYSA-N 0.000 description 1
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 1
- IJRXQJVGFBSKIV-ZFWWWQNUSA-N Trp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N IJRXQJVGFBSKIV-ZFWWWQNUSA-N 0.000 description 1
- HJXWDGGIORSQQF-WDSOQIARSA-N Trp-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N HJXWDGGIORSQQF-WDSOQIARSA-N 0.000 description 1
- 239000006035 Tryptophane Substances 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 1
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- 108010046334 Urease Proteins 0.000 description 1
- 208000002495 Uterine Neoplasms Diseases 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 229930003427 Vitamin E Natural products 0.000 description 1
- 102100040761 Zinc finger and BTB domain-containing protein 17 Human genes 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- OWVSQLKZFJFIEL-BCCCVJHXSA-N [[(5r)-5-[5-[(e)-3-[6-[[2-[[(3s,5r,8r,9s,10s,12r,13s,17r)-12,14-dihydroxy-10,13-dimethyl-17-(5-oxo-2h-furan-3-yl)-1,2,3,4,5,6,7,8,9,11,12,15,16,17-tetradecahydrocyclopenta[a]phenanthren-3-yl]oxy]acetyl]amino]hexanoylamino]prop-1-enyl]-2,4-dioxopyrimidin-1 Chemical compound O([C@@H]1C[C@H]2CC[C@H]3C4(O)CC[C@@H]([C@]4([C@@H](C[C@@H]3[C@@]2(C)CC1)O)C)C=1COC(=O)C=1)CC(=O)NCCCCCC(=O)NC\C=C\C(C(NC1=O)=O)=CN1[C@H]1CCC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 OWVSQLKZFJFIEL-BCCCVJHXSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 210000000588 acetabulum Anatomy 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- NGFMICBWJRZIBI-UHFFFAOYSA-N alpha-salicin Natural products OC1C(O)C(O)C(CO)OC1OC1=CC=CC=C1CO NGFMICBWJRZIBI-UHFFFAOYSA-N 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 210000001691 amnion Anatomy 0.000 description 1
- 230000001093 anti-cancer Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 229960000271 arbutin Drugs 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 235000015278 beef Nutrition 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 230000022534 cell killing Effects 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 210000003679 cervix uteri Anatomy 0.000 description 1
- 239000002962 chemical mutagen Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000012411 cloning technique Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 101150086784 cry gene Proteins 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 229960000633 dextran sulfate Drugs 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 229910000397 disodium phosphate Inorganic materials 0.000 description 1
- 235000019800 disodium phosphate Nutrition 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 210000004696 endometrium Anatomy 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 210000003743 erythrocyte Anatomy 0.000 description 1
- 229960003276 erythromycin Drugs 0.000 description 1
- 229940093496 esculin Drugs 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 210000003414 extremity Anatomy 0.000 description 1
- 210000003754 fetus Anatomy 0.000 description 1
- 210000003953 foreskin Anatomy 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- WIGCFUFOHFEKBI-UHFFFAOYSA-N gamma-tocopherol Natural products CC(C)CCCC(C)CCCC(C)CCCC1CCC2C(C)C(O)C(C)C(C)C2O1 WIGCFUFOHFEKBI-UHFFFAOYSA-N 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000035784 germination Effects 0.000 description 1
- 229940050410 gluconate Drugs 0.000 description 1
- 229940116332 glucose oxidase Drugs 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 229940096919 glycogen Drugs 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 229940127121 immunoconjugate Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 125000000741 isoleucyl group Chemical group [H]N([H])C(C(C([H])([H])[H])C([H])([H])C([H])([H])[H])C(=O)O* 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 210000001985 kidney epithelial cell Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 201000002250 liver carcinoma Diseases 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 229960002160 maltose Drugs 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 210000002050 maxilla Anatomy 0.000 description 1
- HAWPXGHAZFHHAD-UHFFFAOYSA-N mechlorethamine Chemical class ClCCN(C)CCCl HAWPXGHAZFHHAD-UHFFFAOYSA-N 0.000 description 1
- 229960004961 mechlorethamine Drugs 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- MBABOKRGFJTBAE-UHFFFAOYSA-N methyl methanesulfonate Chemical compound COS(C)(=O)=O MBABOKRGFJTBAE-UHFFFAOYSA-N 0.000 description 1
- 238000001000 micrograph Methods 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 238000009343 monoculture Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 229950006780 n-acetylglucosamine Drugs 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000002420 orchard Substances 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 229960003104 ornithine Drugs 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- BJRNKVDFDLYUGJ-UHFFFAOYSA-N p-hydroxyphenyl beta-D-alloside Natural products OC1C(O)C(O)C(CO)OC1OC1=CC=C(O)C=C1 BJRNKVDFDLYUGJ-UHFFFAOYSA-N 0.000 description 1
- 230000003071 parasitic effect Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000000361 pesticidal effect Effects 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229940093429 polyethylene glycol 6000 Drugs 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 238000011533 pre-incubation Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 229920002477 rna polymer Polymers 0.000 description 1
- 102200082402 rs751610198 Human genes 0.000 description 1
- NGFMICBWJRZIBI-UJPOAAIJSA-N salicin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=CC=CC=C1CO NGFMICBWJRZIBI-UJPOAAIJSA-N 0.000 description 1
- 229940120668 salicin Drugs 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 239000012488 sample solution Substances 0.000 description 1
- 108700004121 sarkosyl Proteins 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000028043 self proteolysis Effects 0.000 description 1
- 238000013207 serial dilution Methods 0.000 description 1
- 210000002460 smooth muscle Anatomy 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 235000011008 sodium phosphates Nutrition 0.000 description 1
- JUJBNYBVVQSIOU-UHFFFAOYSA-M sodium;4-[2-(4-iodophenyl)-3-(4-nitrophenyl)tetrazol-2-ium-5-yl]benzene-1,3-disulfonate Chemical compound [Na+].C1=CC([N+](=O)[O-])=CC=C1N1[N+](C=2C=CC(I)=CC=2)=NC(C=2C(=CC(=CC=2)S([O-])(=O)=O)S([O-])(=O)=O)=N1 JUJBNYBVVQSIOU-UHFFFAOYSA-M 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 230000007928 solubilization Effects 0.000 description 1
- 238000005063 solubilization Methods 0.000 description 1
- 229960002920 sorbitol Drugs 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 230000028070 sporulation Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000002782 sympathoadrenal effect Effects 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 210000003954 umbilical cord Anatomy 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 206010046766 uterine cancer Diseases 0.000 description 1
- 235000019165 vitamin E Nutrition 0.000 description 1
- 229940046009 vitamin E Drugs 0.000 description 1
- 239000011709 vitamin E Substances 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/32—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
- C07K14/325—Bacillus thuringiensis crystal peptides, i.e. delta-endotoxins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/20—Bacteria; Culture media therefor
- C12N1/205—Bacterial isolates
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/01—Bacteria or Actinomycetales ; using bacteria or Actinomycetales
- C12R2001/07—Bacillus
- C12R2001/075—Bacillus thuringiensis
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Medicinal Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Tropical Medicine & Parasitology (AREA)
- Virology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Gastroenterology & Hepatology (AREA)
- Crystallography & Structural Chemistry (AREA)
- General Engineering & Computer Science (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
A novel Bacillus thuringiensis strain deposited at the International Depository Authority of Health Canada in Winnipeg under accession number IDAC010201-5, its crystal gene having the sequence SEQ ID NO: 1 and crystal protein encoded by same having the sequence SEQ ID NO: 2 and uses thereof. More specifically, the present invention is concerned with a novel Bacillus thuringiensis, novel cry31 protoxin and toxin, nucleotide sequences encoding same and anti-cancer therapeutic applications for the toxin.
Description
TITLE OF THE INVENTION
A NOVEL BACILLUS THURINGIENSIS STRAIN, CRYSTAL GENE AND
CRYSTAL PROTEIN AND USES THEREOF
FIELD OF THE INVENTION
[0001] The present invention relates to a novel Bacillus thuringiensis strain, crystal gene and crystal protein and uses thereof. More specifically, the present invention is concerned with a novel Bacillus thuringiensis, novel Cry3l protoxin and toxin, nucleotide sequences encoding same and anti-cancer therapeutic applications for the toxin.
BACKGROUND OF THE INVENTION
A NOVEL BACILLUS THURINGIENSIS STRAIN, CRYSTAL GENE AND
CRYSTAL PROTEIN AND USES THEREOF
FIELD OF THE INVENTION
[0001] The present invention relates to a novel Bacillus thuringiensis strain, crystal gene and crystal protein and uses thereof. More specifically, the present invention is concerned with a novel Bacillus thuringiensis, novel Cry3l protoxin and toxin, nucleotide sequences encoding same and anti-cancer therapeutic applications for the toxin.
BACKGROUND OF THE INVENTION
[0002] Bacillus thuringiensis has been known for years for coding for 6-endotoxin crystal proteins. A large variety of endotoxins have been described and characterized, many of them having reported insecticidal activities. Most of these have molecular weights in the range of 130-140KDa and 65-80KDa (Schnepf et al., 1998). Recently, a novel endotoxin protein has been identified and designated Cry3l Aal (also called parasporin) (Mizuki et al., 2000). It is an 81 KDa protein encoded by a 2169 bp gene that has been characterized as having a selective activity as a human Leukemic Cell-Recognizing Protein (Mizuki et al., (1999) and (2000)). No other member of this novel family of endotoxin has yet been reported.
[0003] It is therefore an object of the present invention to provide a new bacillus thuringiensis strain expressing a new member of this novel family of b-endoxins displaying advantageous cytotoxicity against human cancer cells.
SUMMARY OF THE INVENTION
SUMMARY OF THE INVENTION
[0004] More specifically, in accordance with the present invention, there is provided a novel Bacillus thuringiensis strain, named M15, a novel 83-kDa crystal protein 6-endotoxin assigned the designation Cry31Aa2 by the Bacillus thuringiensis Pesticide Crystal Protein Nomenclature Committee and displaying cytotoxicity against certain human cancer cells.
[0005] According to a first aspect of the present invention, there is also provided a biologically pure culture of a microorganism strain comprising all of the identifying characteristics of a Bacillus thuringiensis strain deposited at the International Depository Authority of Health Canada in Winnipeg under accession number IDAC010201-5, or a mutant thereof isolated from said strain, said mutant comprising all of said biochemical characteristics and producing a crystal protein having cytotoxicity against human cancer cells..
[0006] According to a second aspect of the present invention, there is also provided an isolated nucleic acid molecule comprising:
(a) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:2;
(b) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:8;
(c) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:14;
(d) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:13, with the proviso that said nucleotide sequence does not encode the amino acid sequence at positions 232 to 723 of SEQ ID NO:18;
(e) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:15, with the proviso that said nucleotide sequence does not encode the amino acid sequence at positions 232 to 723 of SEQ ID NO:18;
(f) a nucleotide sequence comprising the nucleotide sequence set forth in SEQ ID NO:1 or 9;
(g) a nucleotide sequence encoding a crystal protein cytotoxic against at least one human cancer cell, the crystal protein having at least 94%
identity with the complete amino acid sequence in SEQ ID NO:2;
(h) a nucleotide sequence encoding a crystal protein cytotoxic against at least one human cancer cell, the crystal protein having at least 97%
identity with the complete amino acid sequence in SEQ ID NO:8;
(i) a nucleotide sequence completely complementary to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), (g) and (h); or Q) a nucleotide sequence encoding a crystal protein cytotoxic against at least one cancer cell, which hybridizes under high stringency conditions to the nucleotide sequence in (i), wherein the high stringency conditions comprise hybridization in 5X SSC, 5X Denhardt's solution, 1% SDS
and 100 pg/ml of carrier DNA at 65 C followed by washing in 0.2X SSC, 0.1%
SDS at 65 C.
(a) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:2;
(b) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:8;
(c) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:14;
(d) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:13, with the proviso that said nucleotide sequence does not encode the amino acid sequence at positions 232 to 723 of SEQ ID NO:18;
(e) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:15, with the proviso that said nucleotide sequence does not encode the amino acid sequence at positions 232 to 723 of SEQ ID NO:18;
(f) a nucleotide sequence comprising the nucleotide sequence set forth in SEQ ID NO:1 or 9;
(g) a nucleotide sequence encoding a crystal protein cytotoxic against at least one human cancer cell, the crystal protein having at least 94%
identity with the complete amino acid sequence in SEQ ID NO:2;
(h) a nucleotide sequence encoding a crystal protein cytotoxic against at least one human cancer cell, the crystal protein having at least 97%
identity with the complete amino acid sequence in SEQ ID NO:8;
(i) a nucleotide sequence completely complementary to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), (g) and (h); or Q) a nucleotide sequence encoding a crystal protein cytotoxic against at least one cancer cell, which hybridizes under high stringency conditions to the nucleotide sequence in (i), wherein the high stringency conditions comprise hybridization in 5X SSC, 5X Denhardt's solution, 1% SDS
and 100 pg/ml of carrier DNA at 65 C followed by washing in 0.2X SSC, 0.1%
SDS at 65 C.
[0007] According to another aspect of the present invention, there is also provided an isolated polypeptide comprising:
(a) an amino acid sequence as set forth in SEQ ID NO:2;
(b) an amino acid sequence set forth in SEQ ID NO:8;
(c) an amino acid sequence as set forth in SEQ ID NO:14;
(d) an amino acid sequence as set forth in SEQ ID NO:13, with the proviso that said amino acid sequence is not the sequence at positions 232 to 723 of SEQ ID NO:18;
(e) an amino acid sequence as set forth in SEQ ID NO:15, with the proviso that said amino acid sequence is not the sequence at positions 232 to 723 of SEQ ID NO:18 (f) a crystal protein having at least 94% identity with the complete amino acid sequence of SEQ ID NO:2, wherein the protein becomes cytotoxic upon treatment with trypsin; or (g) a crystal protein cytotoxic against at least one cancer cell and having at least 97% identity with the complete amino acid sequence of SEQ ID NO:8.
(a) an amino acid sequence as set forth in SEQ ID NO:2;
(b) an amino acid sequence set forth in SEQ ID NO:8;
(c) an amino acid sequence as set forth in SEQ ID NO:14;
(d) an amino acid sequence as set forth in SEQ ID NO:13, with the proviso that said amino acid sequence is not the sequence at positions 232 to 723 of SEQ ID NO:18;
(e) an amino acid sequence as set forth in SEQ ID NO:15, with the proviso that said amino acid sequence is not the sequence at positions 232 to 723 of SEQ ID NO:18 (f) a crystal protein having at least 94% identity with the complete amino acid sequence of SEQ ID NO:2, wherein the protein becomes cytotoxic upon treatment with trypsin; or (g) a crystal protein cytotoxic against at least one cancer cell and having at least 97% identity with the complete amino acid sequence of SEQ ID NO:8.
[0008] According to another aspect of the present invention, there is also provided a recombinant vector comprising an isolated nucleic acid molecule of the present invention, a recombinant host cell same, a method for making same comprising inserting such isolated nucleic acid molecule in a vector.
[0009] According to another aspect of the present invention, there is also provided a recombinant method for producing a cytotoxic polypeptide, comprising culturing the host cell under conditions such that the polypeptide is expressed and recovering the polypeptide.
[0010] According to another aspect of the present invention, there is also provided an isolated antibody that binds specifically to a polypeptide of the present invention.
[0010a] In an embodiment, the antibody specifically binds to:
(a) an amino acid sequence as set forth in SEQ ID NO:2;
(b) an amino acid sequence as set forth in SEQ ID NO:8;
(c) a crystal protein having at least 94% identity with the complete amino acid sequence of SEQ ID NO:2, wherein said protein becomes cytotoxic upon treatment with trypsin; or (d) a crystal protein cytotoxic against at least one human cancer cell and having at least 97% identity with the complete amino acid sequence of SEQ ID NO:8.
[0010a] In an embodiment, the antibody specifically binds to:
(a) an amino acid sequence as set forth in SEQ ID NO:2;
(b) an amino acid sequence as set forth in SEQ ID NO:8;
(c) a crystal protein having at least 94% identity with the complete amino acid sequence of SEQ ID NO:2, wherein said protein becomes cytotoxic upon treatment with trypsin; or (d) a crystal protein cytotoxic against at least one human cancer cell and having at least 97% identity with the complete amino acid sequence of SEQ ID NO:8.
[0011] According to another aspect of the present invention, there is also provided a method of modulating the level of cry31Aa2 active protein in a cell comprising a modulation of the level or activity of the sequence SEQ ID
NO:8.
NO:8.
[0012] According to another aspect of the present invention, there is also provided an in vitro method of using a polypeptide of the present invention for lysing a human cancer cell which according a specific embodiment of the present invention is a HELA, TCS, HL-60, Jurkat, or Hep-G2 cell.
[0013] According to another aspect of the present invention, there is also provided an in vitro method of testing the cytotoxicity of a polypeptide of the present invention against a candidate cancer cell comprising determining the EC50 of the polypeptide on the candidate cell, wherein the polypeptide is characterized as possessing cytotoxicity against the candidate cell if the of the polypeptide against the candidate cell is measurably lower than that against a normal T cell.
5a [0014] According to another aspect of the present invention, there is also provided an in vitro method for lysing a human cancer cell comprising applying a cytotoxic amount of a polypeptide of the present invention on a human cancer cell.
5a [0014] According to another aspect of the present invention, there is also provided an in vitro method for lysing a human cancer cell comprising applying a cytotoxic amount of a polypeptide of the present invention on a human cancer cell.
[0015] According to another aspect of the present invention, there is also provided a method for obtaining a cytotoxic polypeptide comprising cleaving a polypeptide of the present invention with a protease able to cleave between a residue R and a residue I. In an embodiment the residue R and the Residue I is an arginine at position 250 and an isoleucine at position 251 of SEQ ID NO:2. In a specific embodiment, the protease is trypsin.
[0016] In order to provide a clear and consistent understanding of terms used in the present description, a number of definitions are provided hereinbelow.
[0017] Unless defined otherwise, the scientific and technological terms and nomenclature used herein have the same meaning as commonly understood by a person of ordinary skill to which this invention pertains.
Generally, the procedures for molecular biology methods and the like are common methods used in the art. Such standard techniques can be found in reference manuals such as for example Sambrook et al. (1989, Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.) and Ausubel et al. (1994, Current Protocols in Molecular Biology, Wiley, New York).
Generally, the procedures for molecular biology methods and the like are common methods used in the art. Such standard techniques can be found in reference manuals such as for example Sambrook et al. (1989, Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.) and Ausubel et al. (1994, Current Protocols in Molecular Biology, Wiley, New York).
[0018] The terminology "human cancer cell" as used herein refers to cells associated with at least one type of cancer. Without limiting the generality of this definition, this terminology includes the following cells and corresponding tissues, namely acetabulum: HT-1080; amnion: WISH; B-cells: NAGL-1;
blood: J-111, IM-9, jurkat; bone: HOS, MG-63, MEG-01; bone marrow: A549;
MEG-01; FS-1; brain: SF126, U-251, MG, Becker, Marcus, T98G, SK-MG-1, ONS-76, KNS, B2-17, no. 10, no. 11, KALS-1, KINGS-1, KS-1, KNS-81-FD, NMC-G1, GB-1, AM-38, YH-13; colon: WiDr, LoVo, CCD 841, CCD-33, Caco-2; embryonic limb: Miz-1; epidermoid: A-431; whole fetus: HE-1; foreskin:
FLOW7000, Hs68, liver: Chang Liver, Alexander cells, HC, Hep-G2; lung:
MRC-5, MRC-9, HFL1, WI-38, Flow 2000, KNS-62; lymph node: GAK;
lymphoblastoid: Namalwa; maxilla: Raji; melanoma: G-361, A2058;
neuroblastoma: KP-N; ovary: RMG, RKN, RTSG, RMUG; peripheral blast:
MTA; peripheral blood: RPMI 8226, HL-60, CCRF-SB, EB-3, RPMI 788, NC-37, MOLT-4, KU812, CCRF-CEM, CMK, NOMO, NKM-1, MEG-A2, TMD5, KA13;
pleural effusion: U-937; prostate: DU145, CEACAM-1; sympatho-adrenal cell:
IMR-32, NB-1; umbilical cord: HUV-EC-C; uterine cervix: Ca Ski, HeLa, SKG, BOKU; uterine endometrium : SNG; uterus: SKN, NJG, SAWANO, TCS, UtSMC. The terminology "cancer cell" also refers herein to cells associated with non-human forms of cancer including Vero, COS-7 and NIH3T3 cells.
blood: J-111, IM-9, jurkat; bone: HOS, MG-63, MEG-01; bone marrow: A549;
MEG-01; FS-1; brain: SF126, U-251, MG, Becker, Marcus, T98G, SK-MG-1, ONS-76, KNS, B2-17, no. 10, no. 11, KALS-1, KINGS-1, KS-1, KNS-81-FD, NMC-G1, GB-1, AM-38, YH-13; colon: WiDr, LoVo, CCD 841, CCD-33, Caco-2; embryonic limb: Miz-1; epidermoid: A-431; whole fetus: HE-1; foreskin:
FLOW7000, Hs68, liver: Chang Liver, Alexander cells, HC, Hep-G2; lung:
MRC-5, MRC-9, HFL1, WI-38, Flow 2000, KNS-62; lymph node: GAK;
lymphoblastoid: Namalwa; maxilla: Raji; melanoma: G-361, A2058;
neuroblastoma: KP-N; ovary: RMG, RKN, RTSG, RMUG; peripheral blast:
MTA; peripheral blood: RPMI 8226, HL-60, CCRF-SB, EB-3, RPMI 788, NC-37, MOLT-4, KU812, CCRF-CEM, CMK, NOMO, NKM-1, MEG-A2, TMD5, KA13;
pleural effusion: U-937; prostate: DU145, CEACAM-1; sympatho-adrenal cell:
IMR-32, NB-1; umbilical cord: HUV-EC-C; uterine cervix: Ca Ski, HeLa, SKG, BOKU; uterine endometrium : SNG; uterus: SKN, NJG, SAWANO, TCS, UtSMC. The terminology "cancer cell" also refers herein to cells associated with non-human forms of cancer including Vero, COS-7 and NIH3T3 cells.
[0019] As used herein, a "biologically pure" strain is intended to mean the strain separated from materials with which it is normally associated in nature. Note that a strain associated with other strains, or with compounds or materials that it is not normally found with in nature, is still defined as "biologically pure." A monoculture of a particular strain is, of course, "biologically pure."
Nucleotides [0020] Nucleotide sequences of the present invention are presented herein by single strand, in the 5' to 3' direction, from left to right, using the one letter nucleotide symbols as commonly used in the art and in accordance with the recommendations of the IUPAC-IUB Biochemical Nomenclature Commission.
Nucleotides [0020] Nucleotide sequences of the present invention are presented herein by single strand, in the 5' to 3' direction, from left to right, using the one letter nucleotide symbols as commonly used in the art and in accordance with the recommendations of the IUPAC-IUB Biochemical Nomenclature Commission.
[0021] The present description refers to a number of routinely used recombinant DNA (rDNA) technology terms. Nevertheless, definitions of selected examples of such rDNA terms are provided for clarity and consistency.
[0022] As used herein, "nucleic acid molecule", refers to a polymer of nucleotides. Non-limiting examples thereof include DNA (e.g. genomic DNA, cDNA), RNA molecules (e.g. mRNA) and chimeras thereof. The nucleic acid molecule can be obtained by cloning techniques or synthesized. DNA can be double-stranded or single-stranded (coding strand or non-coding strand [antisense]).
Protein expression [0023] Prokaryotic expressions are useful for the preparation of large quantities of the Cry31Aa2 protoxine and toxine encoded by the cry3lAa2 DNA sequence. These proteins can be purified according to standard protocols that take advantage of the intrinsic properties thereof, such as size and charge (e.g. SDS gel electrophoresis, gel filtration, centrifugation, ion exchange chromatography...). In addition, the protein of interest can be purified via affinity chromatography using polyclonal or monoclonal antibodies.
The purified protein can be used for therapeutic applications in accordance with the methods and uses of the present invention.
Mutations, mutants and variants [0024] As commonly known, a "mutation" is a detectable change in the genetic material which can be transmitted to a daughter cell. As well known, a mutation can be, for example, a detectable change in one or more deoxyribonucleotides. For example, nucleotides can be added, deleted, substituted for, inverted, or transposed to a new position. Spontaneous mutations and experimentally induced mutations exist. A mutant polypeptide can be encoded from this mutant nucleic acid molecule.
Protein expression [0023] Prokaryotic expressions are useful for the preparation of large quantities of the Cry31Aa2 protoxine and toxine encoded by the cry3lAa2 DNA sequence. These proteins can be purified according to standard protocols that take advantage of the intrinsic properties thereof, such as size and charge (e.g. SDS gel electrophoresis, gel filtration, centrifugation, ion exchange chromatography...). In addition, the protein of interest can be purified via affinity chromatography using polyclonal or monoclonal antibodies.
The purified protein can be used for therapeutic applications in accordance with the methods and uses of the present invention.
Mutations, mutants and variants [0024] As commonly known, a "mutation" is a detectable change in the genetic material which can be transmitted to a daughter cell. As well known, a mutation can be, for example, a detectable change in one or more deoxyribonucleotides. For example, nucleotides can be added, deleted, substituted for, inverted, or transposed to a new position. Spontaneous mutations and experimentally induced mutations exist. A mutant polypeptide can be encoded from this mutant nucleic acid molecule.
[0025] As used herein, a "mutant" of the novel strain of Bacillus thuringiensis of the present invention namely the M15 strain deposited under access no, IDAC010201-5 may or may not have the same identifying biological characteristics of the M15 strain, as long as the mutant produces a crystal protein that is cytotoxic against human cancer cells.
Illustrative examples of suitable methods for preparing mutants and variants of the inventive microorganism strain include, but are not limited to:
mutagenesis by irradiation with ultraviolet light or X-rays, or by treatment with a chemical mutagen such as nitrosoguanidine (N-methyl-N'-nitro-N-nitrosoguanidine), methylmethanesulfonate, nitrogen mustard and the like; gene integration techniques, such as those mediated by insertional elements or transposons or by homologous recombination of transforming linear or circular DNA molecules;
and transduction mediated by bacteriophages such as P1. These methods are well known in the art and are described, for example, in J. H. Miller, Experiments in Molecular Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1972); J. H. Miller, A Short Course in Bacterial Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1992); M.
Singer and P. Berg, Genes & Genomes, University Science Books, Mill Valley, CA (1991); J. Sambrook, E. F. Fritsch and T. Maniatis, Molecular Cloning: A
Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989); P. B. Kaufman et al., Handbook of Molecular and Cellular Methods in Biology and Medicine, CRC Press, Boca Raton, FL (1995);
Methods in Plant Molecular Biology and Biotechnology, B. R. Glick and J. E.
Thompson, eds., CRC Press, Boca Raton, FL (1993); and P. F. Smith-Keary, Molecular Genetics of Escherichia coli, The Guilford Press, New York, N.Y.
(1989).
Illustrative examples of suitable methods for preparing mutants and variants of the inventive microorganism strain include, but are not limited to:
mutagenesis by irradiation with ultraviolet light or X-rays, or by treatment with a chemical mutagen such as nitrosoguanidine (N-methyl-N'-nitro-N-nitrosoguanidine), methylmethanesulfonate, nitrogen mustard and the like; gene integration techniques, such as those mediated by insertional elements or transposons or by homologous recombination of transforming linear or circular DNA molecules;
and transduction mediated by bacteriophages such as P1. These methods are well known in the art and are described, for example, in J. H. Miller, Experiments in Molecular Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1972); J. H. Miller, A Short Course in Bacterial Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1992); M.
Singer and P. Berg, Genes & Genomes, University Science Books, Mill Valley, CA (1991); J. Sambrook, E. F. Fritsch and T. Maniatis, Molecular Cloning: A
Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989); P. B. Kaufman et al., Handbook of Molecular and Cellular Methods in Biology and Medicine, CRC Press, Boca Raton, FL (1995);
Methods in Plant Molecular Biology and Biotechnology, B. R. Glick and J. E.
Thompson, eds., CRC Press, Boca Raton, FL (1993); and P. F. Smith-Keary, Molecular Genetics of Escherichia coli, The Guilford Press, New York, N.Y.
(1989).
[0026] Mutated strains derived from the M15 strain using known methods are then preferably selected or screened for improved cytotoxic crystal proteins production potential or for other desirable properties related to their utility in expressing crystal proteins that are cytotoxic to human cancer cells. In a specific embodiment of the mutagenesis and screening approach to strain improvement, mutagenized cells are selected on the basis of their cytopathic effects or cytocidal activity on target cells and their spectrum of action.
[0027] The term "variant" refers herein to a protein or nucleic acid molecule which is substantially similar in structure and biological activity to the protein or nucleic acid of the present invention and includes a cry3lAa2 nucleic sequence or the protein encoded by same having one or more mutations that does not affect its cytotoxic activity. In particular, a variant of the nucleotide or polypeptide sequence of the active portion of Cry31Aa2 possesses the ability to lyse human cancer cells including HeLa, TCS, HL-60, Jurkat and Hep-G2. The methods for determining whether a nucleotide or polypeptide sequence constitutes a variant of Cry31Aa2 include conducting an EC50 assay on a cancer cell against which the Cry31Aa2 active toxin displays cytotoxicity.
[0028] The functional derivatives of the present invention can be synthesized chemically or produced through recombinant DNA technology. All these methods are well known in the art.
Hybridization [0029] "Nucleic acid hybridization" refers generally to the hybridization of two single-stranded nucleic acid molecules having complementary base sequences, which under appropriate conditions will form a thermodynamically favored double-stranded structure. Examples of hybridization conditions can be found in the two laboratory manuals referred above (Sambrook et al., 1989, supra, and Ausubel et al., 1989, supra) and are commonly known in the art. In the case of an hybridization to a nitrocellulose filter, as for example in the well known Southern blotting procedure, a nitrocellulose filter can be incubated overnight at 65 C with a labeled probe in a solution containing 50% formamide, high salt (5 x SSC or 5 x SSPE), 5 x Denhardt's solution, 1% SDS, and 100 pg/mI denatured carrier DNA (e.g.
salmon sperm DNA). The non-specifically binding probe can then be washed off the filter by several washes in 0.2 x SSC/0.1 % SDS at a temperature which is selected in view of the desired stringency: room temperature (low stringency), 42 C (moderate stringency) or 65 C (high stringency). The selected temperature is based on the melting temperature (Tm) of the DNA
hybrid. Of course, RNA-DNA hybrids can also be formed and detected. In such cases, the conditions of hybridization and washing can be adapted according to well known methods by the person of ordinary skill. Stringent conditions will be preferably used (Sambrook et a!.,1989, supra). In most hybridizations, a 1%
mismatching of bases will lower the melting temperature by 1-1.5 C (Sambrook et a!.,1989, supra). Consequently, nucleotide sequences sharing 98%
nucleotide identities with the cry3lAa2 gene encoding the trypsin-activated portion of Cry31Aa2 will still hybridize with the cry3lAa2 gene when the melting temperature is lowered by 3 C, respective to the most stringent conditions for hybridization between two identical cry3lAa2 sequences. The term "high stringency" conditions refer herein to the conditions required for the hybridization of nucleotide sequences sharing at least 97% nucleotide identities with the cry3lAa2 gene encoding the trypsin-activated portion of Cry31Aa2.
Hybridization [0029] "Nucleic acid hybridization" refers generally to the hybridization of two single-stranded nucleic acid molecules having complementary base sequences, which under appropriate conditions will form a thermodynamically favored double-stranded structure. Examples of hybridization conditions can be found in the two laboratory manuals referred above (Sambrook et al., 1989, supra, and Ausubel et al., 1989, supra) and are commonly known in the art. In the case of an hybridization to a nitrocellulose filter, as for example in the well known Southern blotting procedure, a nitrocellulose filter can be incubated overnight at 65 C with a labeled probe in a solution containing 50% formamide, high salt (5 x SSC or 5 x SSPE), 5 x Denhardt's solution, 1% SDS, and 100 pg/mI denatured carrier DNA (e.g.
salmon sperm DNA). The non-specifically binding probe can then be washed off the filter by several washes in 0.2 x SSC/0.1 % SDS at a temperature which is selected in view of the desired stringency: room temperature (low stringency), 42 C (moderate stringency) or 65 C (high stringency). The selected temperature is based on the melting temperature (Tm) of the DNA
hybrid. Of course, RNA-DNA hybrids can also be formed and detected. In such cases, the conditions of hybridization and washing can be adapted according to well known methods by the person of ordinary skill. Stringent conditions will be preferably used (Sambrook et a!.,1989, supra). In most hybridizations, a 1%
mismatching of bases will lower the melting temperature by 1-1.5 C (Sambrook et a!.,1989, supra). Consequently, nucleotide sequences sharing 98%
nucleotide identities with the cry3lAa2 gene encoding the trypsin-activated portion of Cry31Aa2 will still hybridize with the cry3lAa2 gene when the melting temperature is lowered by 3 C, respective to the most stringent conditions for hybridization between two identical cry3lAa2 sequences. The term "high stringency" conditions refer herein to the conditions required for the hybridization of nucleotide sequences sharing at least 97% nucleotide identities with the cry3lAa2 gene encoding the trypsin-activated portion of Cry31Aa2.
[0030] As used herein, "chemical derivatives" is meant to cover additional chemical moieties not normally part of the subject matter of the invention. Such moieties could affect the physico-chemical characteristic of the derivative (e.g. solubility, absorption, half life, decrease of toxicity and the like).
Such moieties are exemplified in Remington's Pharmaceutical Sciences (1980).
Methods of coupling these chemical-physical moieties to a polypeptide or nucleic acid sequence are well known in the art.
Recombinant vectors [00311 The term "recombinant DNA" as known in the art refers to a DNA molecule resulting from the joining of DNA segments. This is often referred to as genetic engineering. The same is true for "recombinant nucleic acid".
[0032] The term "vector" is commonly known in the art and defines a plasmid DNA, phage DNA, viral DNA and the like, which can serve as a DNA
vehicle into which DNA of the present invention can be cloned. Numerous types of vectors exist and are well known in the art.
[0033] The term "expression" defines the process by which a gene is transcribed into mRNA (transcription), the mRNA is then being translated (translation) into one polypeptide (or protein) or more.
[0034] The terminology "expression vector" defines a vector or vehicle as described above but designed to enable the expression of an inserted sequence following transformation into a host. The cloned gene (inserted sequence) is usually placed under the control of control element sequences such as promoter sequences. The placing of a cloned gene under such control sequences is often referred to as being operably linked to control elements or sequences.
[0035] Operably linked sequences may also include two segments that are transcribed onto the same RNA transcript. Thus, two sequences, such as a promoter and a "reporter sequence" are operably linked if transcription commencing in the promoter will produce an RNA transcript of the reporter sequence. In order to be "operably linked" it is not necessary that two sequences be immediately adjacent to one another.
[0036] Expression control sequences will vary depending on whether the vector is designed to express the operably linked gene in a prokaryotic or eukaryotic host or both (shuttle vectors) and can additionally contain transcriptional elements such as enhancer elements, termination sequences, tissue-specificity elements, and/or translational initiation and termination sites.
[0037] The DNA construct can be a vector comprising a promoter that is operably linked to an oligonucleotide sequence of the present invention, which is in turn, operably linked to a heterologous gene, such as the gene for the luciferase reporter molecule. "Promoter" refers to a DNA regulatory region capable of binding directly or indirectly to RNA polymerase in a cell and initiating transcription of a downstream (3' direction) coding sequence. For purposes of the present invention, the promoter is bound at its 3' terminus by the transcription initiation site and extends upstream (5' direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background. Within the promoter will be found a transcription initiation site (conveniently defined by mapping with S1 nuclease), as well as protein binding domains (consensus sequences) responsible for the binding of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" boxes and "CCAT" boxes. Prokaryotic promoters contain -10 and -35 consensus sequences, which serve to initiate transcription and the transcript products contain Shine-Dalgarno sequences, which serve as ribosome binding sequences during translation initiation.
Recombinant host cell [0038] A host cell or indicator cell has been "transfected" by exogenous or heterologous DNA (e.g. a DNA construct) when such DNA has been introduced inside the cell. The transfecting DNA may or may not be integrated (covalently linked) into chromosomal DNA making up the genome of the cell. In prokaryotes, yeast, and mammalian cells for example, the transfecting DNA may be maintained on an episomal element such as a plasmid. With respect to eukaryotic cells, a stably transfected cell is one in which the transfecting DNA has become integrated into a chromosome so that it is inherited by daughter cells through chromosome replication. This stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones comprised of a population of daughter cells containing the transfecting DNA. Transfection methods are well known in the art (Sambrook et al., 1989, supra; Ausubel et aL, 1994 supra).
Method for identifying other cancer cells against which Cry3l Aa2 is c otoxic [0039] In addition to the EC50 assay described herein, other assays may be used to determine the effects of Cry31 Aa2 or other proteins encompassed by the present invention on human cancer cells. In particular, these effects may be observed by photonic microscopy. Furthermore, assays for detecting cytopathic effects can also be used for this purpose.
[0040] Other objects, advantages and features of the present invention will become more apparent upon reading of the following non-restrictive description of preferred embodiments thereof, given by way of example only with reference to the accompanying drawings.
[0041] The present invention seeks to meet these needs and other needs.
[0042] The present description refers to a number of documents, the content of which is herein incorporated by reference in their entirety.
BRIEF DESCRIPTION OF THE DRAWINGS
In the appended drawings:
[0043] Figure 1 illustrates in panel A) a phase-contrast micrograph of a lysed culture of Bacillus thuringiensis strain M15; in panel B, a transmission electron micrograph of Bacillus thuringiensis strain M15 containing a spore and a tightly bound parasporal inclusion;
[0044] Figure 2 shows a SDS-PAGE analysis of the parasporal inclusion protein(s) of B. thuringiensis strain M15;
[0045] Figure 3 is the nucleotide sequence of the translated portion of the Cry3lAa2 gene (SEQ ID NO: 1);
[0046] Figure 4 is the deduced amino acid sequence of the Cry39Aa2 gene (SEQ ID NO: 2);
[0047] Figure 5 shows a comparison of the deduced amino acid sequences of Cry3lAa2 (SEQ ID NO:2) and Cry3lAal (SEQ ID NO:18). The capital letters and dotted lines under the amino acid sequence of Cry31Aa2 (SEQ ID NO:2) correspond to the difference and alignment gaps between the Cry3lAa2 (SEQ ID NO:2) and Cry3lAal (SEQ ID NO:18). The asterisks under the Cry3lAa2 sequence indicate the identities between Cry3lAa2 (SEQ ID
NO:2) and Cry31Aal (SEQ ID NO:18);
[0048] Figure 6 shows a restriction map of the recombinant plasmid pYCP31A2 containing the Cry31Aa2 gene;
[0049] Figure 7 shows a transmission electron micrograph of a B.
thuringiensis Cr{ B transformant expressing the Cry3lAa2 gene. S: spore; P:
parasporal inclusion; Magnification : 20,000 X;
[0050] Figure 8 shows a SDS-PAGE analysis of the parasporal inclusion protein from a B. thuringiensis transformant expressing the crystal protein gene Cry31 Aa2;
[0051] Figure 9 shows the nucleotide sequence (SEQ ID NO:16) and deduced amino acid sequence (SEQ ID NO:2) of the Cry31Aa2 gene along with features thereof; and [0052] Figure 10 shows the nucleotide sequence of the translated portion of the Cry31Aal gene (SEQ ID NO:17).
DESCRIPTION OF THE PREFERRED EMBODIMENT
Isolation of strain, morphological and biochemical characteristics [0053] A Bacillus thuringiensis strain was isolated from dead two-spotted spider mites (Tetranychus urticae Koch; Arthropoda: Arachnida: Tetranychidae) and named M15. The mites, parasitic on apple leaves, were collected in an apple orchard located in Frelighsburgh, Quebec, Canada. They were homogenized in 3 ml of phosphate-buffered saline (PBS) (NaCl 8 g, KCI 0.2 g, Na2HPO4 1.44 g, KH2PO4 0.24 g I-1). The homogenized solution was incubated for 16 hr at room temp and heated at 78 C for 15 min. Afterwards, the homogenate was plated on 2YT agar medium (Bacto Tryptone 16 g, Bacto Yeast Extract 10 g, NaCl 5 g, Agar 18 g I-1), and incubated for 24 hr at 30 C.
All colonies with a morphology similar to B. thuringiensis were streaked on T3 agar medium (Bacto Tryptone 3 g, Bacto Tryptose 2 g, Bacto Yeast Extract 1.5 g, MnC12 0.005 g, 0.05M Sodium phosphate, pH6.7, Agar 18 g I-1) and incubated at 30 C for 48 hr. The cultures were examined by phase-contrast microscopy (Carl Zeiss Canada Ltd., Toronto, Ontario, Canada) for the presence of spores and crystals. B. thuringiensis M15 was deposited on 29 January 2001 in the International Depository Authority of Health Canada in Winnipeg under the Budapest Treaty (Bureau of Microbiology, Health Canada, 1015 Arlington Street, Winnipeg, Manitoba, R3E 3R2) under accession no.
I DAC010201-5.
[0054] The M15 strain was characterized for its ability to ferment specific carbon sources, and for the production, utilization and reduction of specific compounds (see Table 1 below). The biochemical characteristics of B.
thuringiensis strain M15, obtained using the API 50CH and API 20E kits as recommended by the manufacturer (bioMerieux, St-Laurent, Quebec, Canada), were different from those of three controls, B. thuringiensis var. kurstaki HD-1, -var. israelensis HD-500 and -var. higo BT205. B. thuringiensis var. kurstaki HD-1 and -var. israelensis HD-500 were obtained from "Laboratoire des bacteries entomopathogenes", Institut Pasteur (Paris, France). B. thuringiensis var.
higo BT205 was in the Agriculture Canada collection (Jung et at., 1998). The M15 strain is further characterized in Jung, 2001.
Table 1. The biochemical profile of B. thuringiensis M15 and selected control strains Tests B. thuringiensis B. thuringiensis B. thuringiensis B. thuringiensis var. kurstaki HD-1 var. israelensis HD-500 var. higo BT M 15 Fermentatio n of Glycerol + + +
D-Arabinose - - - -L-Arabinose - - - -Ribose + + + +
D-Xylose - - - -L-Xylose - - - -D-Galactose - - - -D-Glucose + + + +
D-Fructose + + + +
D-Mannose - - - -L-Sorbose - - - -Inositol - - - -D-Mannitol - - - -D-Sorbitol - - - -N- + + + +
Acetylglucosa mine Arbutin + + + -Esculin + +
Salicin + - + +
D- + + + -Cellobiose D-Maltose + + + +
Lactose - - - -Melibiose - - - -Sucrose - - - -Trehalose + + + +
Starch - + + -Glycogen + + + -Gluconate + + -Production of Galactosida se Arginine + + + -dihydrolase Ornithine - - - -decarboxyla se Urease + - + +
Tryptophane - - - -deaminase Gelatinase + + + +
Oxidase + + + +
Catalase + + + +
Indole - - - -Acetoin + + + +
Citrate + - - -utilization Nitrate + - - -reduction +, -, and indicate positive, negative, and weak reactions, respectively.
Microscopic characterization of Cry31 Aa2 parasporal inclusion bodies [0055] The parasporal inclusion bodies produced by a sporulated culture of B. thuringiensis strain M15 appear roughly spherical when observed under phase-contrast microscopy (Fig. 1 A) and are tightly coupled to the spores even in lysed cultures. Further analysis under the transmission electron microscope (TEM), however, reveals that the parasporal inclusion body has a polygonal shape (Fig. 1B). The TEM observation was conducted after the B.
thuringiensis strain M15 was incubated for 5 days at 309C in T3 medium and the samples ultra-thinly sectioned according to Beveridge et al. (1994).
Arrows show the roughly spherical parasporal inclusions tightly bound to the white ovoid spores. In this figure, "S" and "P" denote spore and parasporal inclusion, respectively. Magnification used is of 25,000 X.
SDS-PAGE analysis and N-terminal sequencing of the native parasporal inclusion protein [0056] The B. thuringiensis strain M15 was grown in T3 medium for 5 days at 30 C on a rotary shaker to allow crystal protein production. Spores and crystals were separated from each other in the tightly bound parasporal duplexes using an ultrasonic processor model VC 130 (Sonics & Materials, Inc., Newtown, CT, USA) and purified by sucrose density gradient centrifugation as described elsewhere (Thomas and Ellar, 1983). Twenty microliters of the crystal suspension were added to 3 volumes of gel loading buffer (4% SDS, 20% glycerol, 125 mM Tris-HCI, 10% 2-mercaptoethanol, pH 6.8) in a 1.5-ml microtube, incubated at 90 C for 7 min and centrifuged for 2 min to remove unsolubilized materials. Thirty microliters of the supernatant were loaded on top of 10% SDS-polyacrylamide gels. Discontinuous sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) was performed according to Laemmli and Favre (1973).
[0057] Figure 2 shows the B. thuringiensis strain M15 parasporal inclusion purified by sucrose density gradient centrifugation as subjected to a 10% SDS-PAGE electrophoresis (lane 4); the crude extracts of the fully lysed B. thuringiensis var. kurstaki HD-1 subjected to electrophoresis on the same gel (lane 3) as a control; and high molecular (lane 1) and low molecular masses (lane 2) of standard protein markers on the left. At least two major bands of approximately 86- and 79-KDa in size were revealed. They were transferred to a polyvinylidene difluoride (PVDF) membrane (Bio-RadTM), excised and subjected to a pulsed liquid phase sequencer for determination of N-terminal amino acid sequence.
[0058] The N-terminal amino acid sequence of the crystal protein from B.
thuringiensis strain M15 was determined as follows. The purified parasporal crystal was added into O.1 N NaOH-3M HEPES solution and solubilized in 10 volumes of gel loading buffer by incubating in boiling water for 5 min. The crystal protein was separated on 10 % SDS-PAGE and transferred to a polyvinylidene difluoride (PVDF) membrane (Bio-Rad, Mississauga, Ontario, Canada). The crystal protein band stained with Coomassie brilliant blueTM R-250 (Bio-Rad) was excised and subjected to a pulsed liquid phase sequencer model 473A (Applied Biosystems, Foster City, CA, USA) at the Regional Sequencing Facility (Centre de recherche du Centre hospitalier de I'Universite Laval, Quebec, Canada).
[0059] The N-terminal sequence analysis revealed that both polypeptides (86- and 79-KDa) shared identical 20-amino acids residues.
These were Met, Asp, Pro, Phe, Ser, Asn, Tyr, Ser, Glu, Gin, Lys, Tyr, Pro, Asp, Ser, Asn, Asn, Asn, Gin and Glu (SEQ ID NO: 3).
Southern hybridization and gene cloning [0060] An 18-mer oligonucleotide sequence, referred to as M15-M, was deduced from a middle portion of the N-terminal amino acid sequence (Glu, Gin, Lys, Tyr, Pro, Asp (SEQ ID NO: 4)) of the 86-kDa crystal protein. The M15-M oligonucleotide was labeled by the Digoxigenin (DIG) oligonucleotide 3'-end labeling kit containing DIG-11-ddUTP (Roche, Laval, Quebec, Canada) as recommended by the manufacturer. The labeled oligonucleotide was precipitated with 0.1 volume of 4M LiCI and 2.5 volumes of ice-cold ethanol, and transferred at -70 C for 30 min. The reaction was centrifuged at 16,000g for 15 min at 4 C. The washed pellet was resuspended in nuclease-free water, and stored at -20 C until use.
[0061] The M 15-M generated had the following sequence: 5'-GARCARAARTAYCCNGAY-3' (SEQ ID NO: 5).
[0062] B. thuringiensis strain M15 was grown in Luria-Bertani (LB) medium (Bacto Tryptone 10 g, Bacto Yeast Extract 5 g, NaCl 5 g I-1) at 30 C
for 16 hr on a rotary shaker. Plasmid DNA was isolated using the alkaline extraction method as described elsewhere (Birnboim and Doly, 1979) with the following modifications. LysozymeTM (Sigma-Aldrich Canada Ltd., Oakville, Ontario, Canada) was added at a concentration of 2 mg.ml-1 and the cell suspension was incubated at 37 C for 1 hr.
[0063] The plasmid DNA was then purified with Wizard TM Plus SV
minipreps DNA purification system following the manufacturer's recommendation (Promega, Nepean, Ontario, Canada). Three samples of the plasmid were then digested with Hindlll, HindlIl/EcoRl and EcoRl (Gibco BRL), respectively, electrophoresed on a 0.7% agarose gel and transferred onto a NytranTM nylon membrane (Schleicher & Schuell, Keene, New Hampshire, USA) by the method of Southern (1975). They were then probed with the DIG-labeled 18-mer M15-M oligonucleotide.
[0064] This Southern blot hybridization was performed using the DIG-labeled oligonucleotides with the standard hybridization solution (5X SSC, 1%
blocking reagent (Roche), 0.1% N-lauroylsarcosine, 0.02% SDS) for 13 hr at 39 C. After hybridization, the membrane was washed twice for 15 min each in 4X wash solution (4X SSC, 0.1% SDS) at 39 C. Following the washes, detection of signals on the membrane was performed with the color-substrate solution containing NBT (4-Nitroblue tetrazolium chloride, Roche) and BCIP (5-Bromo-4-chloro-3-indolyl-phosphate, Roche) as recommended by the manufacturer. After hybridization and post-hybridization washes at 39 C, the M15-M probe strongly hybridized to an 8-kb Hindlll, a 2.6-kb Hindlll/EcoRl, and a 2.6-kb EcoRl fragment.
[0065] The purified B. thuringiensis M15 plasmid DNA was digested with Hindlll and ligated with the Hind lll-digested SAP (Shrimp Alkaline Phosphatase, Roche)-treated pBluescriptTM II KS(+) (Stratagene, La Jolla, California, USA). After ligation, the recombinant DNA was transformed into E.
coli DH5a (Gibco BRL, Burlington, Ontario, Canada). Preparation of E. coli DH5a competent cells and transformation were done as described (Sambrook et al., 1989).
[0066] The transformants were grown on LB agar plates containing 100 g ml-1 ampicillin (Sigma-Aldrich Canada Ltd.) and 40 g ml-1 X-Gal (5'-Bromo-4-chloro-3-indolyl-p-D-galactopyranoside, Sigma-Aldrich Canada Ltd.) at 37 C. White colonies were toothpicks-transferred to 1 ml of fresh LB media supplemented with 100 .tg ml-1 ampicillin, and incubated overnight at 37 C.
[0067] The recombinant DNA were then isolated by the cracking procedure (Sambrook et al., 1989) and electrophoresed on 0.7% agarose gel to assess the size of the undigested recombinant plasmids.
[0068] The three recombinant plasmids with the highest molecular weight were selected and digested with Hindlll. They were designated pYCH27, pYCH40 and pYCH217, respectively. All three plasmids contained an 8-kb Hindlll insert. In addition, pYCH27 and pYCH40 also contained a 0.75-kb and a 1.9-kb Hindlll fragment, respectively. They were then electrophoresed on a 0.7% agarose gel, transferred onto a NytranTM nylon membrane by the method of Southern (1975) and probed with the M15-M oligonucleotide. The M15-M probe hybridized to the 8-kb Hindlll fragments in pYCH27, pYCH40 and pYCH217 as revealed by Southern blot hybridization.
[0069] The 8-kb Hindlll fragments from pYCH27, pYCH40 and pYCH217 were doubly digested with Hindlll/EcoRl, electrophoresed on agarose gel, Southern transferred, and hybridized with the M15-M probe. For each of the three recombinant plasmids, a single 2.6-kb fragment was detected (data not shown). This confirms that this 2.6-kb fragment is the same as the one in the EcoRl-digested plasmid DNA of strain M15.
[0070] The 8-kb Hindlll insert was excised from recombinant plasmid pYCH217, digested with various restriction enzymes [EcoRl, Bglll (Gibco BRL), Dral, Sphl (Amersham Pharmacia Biotech)], and a restriction map constructed.
The 8-kb Hindlll fragment contains a 3.4-kb HindlII/EcoRl, a 2.6-kb EcoRl/EcoRl, a 1.4-kb EcoRl/EcoRl and a 0.6-kb EcoRl/Hindlll fragment. The open reading frame of a crystal protein gene, cry3lAa2, is indicated by a gray arrow. The hatched box indicates the region homologous to the DIG-labeled 18-mer M15-M oligonucleotide probe. Selected subfragment sizes are indicated below the structural map.
[0071] To identify the region homologous to the M15-M probe, the recombinant plasmid pYCH217 was doubly digested with Hindlll/EcoRl, and the resulting fragments were subcloned into EcoRl-digested pBluescriptTM II
KS(+). After ligation, four subclones were obtained to give the recombinant plasmids pYC12S, pYC22S, pYC30S, and pYC31S. Plasmids pYC12S and pYC30S contained a 1.4-kb and a 2.6-kb insert, respectively, while pYC22S
and pYC31 S both harbored a 2.6-kb insert along with a 0.6-kb and a 1.4-kb fragment, respectively. Only the 2.6-kb EcoRl/EcoRl fragment from subclones, pYC22S, pYC30S and pYC31S hybridized with the M15-M probe. To further localize the region of hybridization of the M15-M probe in the 2.6-kb EcoRl/EcoRl fragment, the recombinant plasmid pYC30S was digested with EcoRl, EcoRl/Dral, EcoRI/Sphl, and EcoRl/BgllI, respectively, and then hybridized with the M15-M probe. The M15-M probe detected a 2.6-kb EcoRl, a 0.6-kb Dral, a 1.6-kb EcoRl/Sphl, and a 0.85-kb EcoRl/BgIII fragment, respectively. It was thus determined that the region of hybridization of the M probe lied between the BgIII and Dral sites within the 2.6-kb EcoRl fragment.
Characterization of a new crystal protein gene, cry31Aa2 [0072] The nucleotide sequences of the 2.6-kb EcoRI/EcoRl, 1.4-kb EcoRl/EcoRl and 0.6-kb EcoRl/Hindlll fragments were determined. An open reading frame (ORF) of 2,226-bp in length that codes for a polypeptide of 742 amino acids with a predicted molecular mass of 83,068Da (Figures 3 and 4) was found. The start codon is not ATG but GTG. One potential promoter-like sequence in the 5' non-coding region (Lereclus et al., 1989; Baum and Malvar, 1995) shows a 13-bp spacing between the putative -10 and -35 sequences located 138-bp upstream from the start codon (GTG). The potential ribosome binding site (RBS) (GAAAGGTGG (SEQ ID NO: 6)) is located 7-bp upstream of the start codon (GTG) and is partially complementary to the 3' end (UCUUUCCUCC (SEQ ID NO: 7)) of B. subtilis 16S rRNA (McLaughlin et al., 1981; Moran et al., 1982). Both potential -35 and -10 boxes and a putative ribosome-binding site are underlined in figure 9. The calculated free energy of interaction (AG, 25 C) between the B. subtilis 16S rRNA and the putative ribosome binding site is -14.8 kcal-mol-1 (Tinoco et al., 1973). A terminal inverted repeat that could form a stem-and-loop secondary structure with a calculated energy (AG, 25 C) of -12.2 kcal=mol-1 (Tinoco et al., 1973) is located 112-bp downstream from the stop codon (TAA), which is marked with asterisks in figure 9, and may function as a transcription terminator (indicated by arrows). The 18-mer M15-M oligonucleotide sequence based on the N-terminal amino acid sequence (Glu, Gin, Lys, Tyr, Pro, Asp (SEQ ID NO: 4)) of the crystal protein is homologous to a region located 24-bp downstream from the start codon (GTG). The sequence of the DIG-labeled 18-mer oligonucleotide (M15-M) probe is indicated in bold capital letters in figure 9.
The crv3lAa2 gene expression in B. thuringgiensis Cry- B strain [0073] The 3.6-kb Hindlll/Sphl fragment containing the entire crystal protein gene was excised from the recombinant plasmid pYCH217, and then cloned into the E. coli-B. thuringiensis shuttle vector pHPS9 doubly digested with Hindlll/Sphl to yield recombinant plasmid pYCP31A2 (Figure 6). The E.
coli-B. thuringiensis shuttle vector pHPS9 (Haima, et al., 1990) was purchased from American Type Culture Collection (Manassas, VA, USA). To express the cloned cry3lAa2 crystal protein gene in the acrystalliferous B. thuringiensis strain Cry- B, the 3.6-kb Hindlll/Sphl fragment was cloned into the Hindlll/Sphl doubly-digested E. coli-B. thuringiensis shuttle vector pHPS9 to yield recombinant plasmid pYCP31A2 (Fig. 6).
[0074] The B. thuringiensis var. kurstaki HD-1 acrystalliferous Cry-B
strain ((Stahly et al., 1978) provided by the Bacillus Genetic Stock Center, The Ohio State University (Columbus, OH, USA)), was transformed with the cloned B. thuringiensis M15 crystal protein gene by electroporation as described by Vehmaanpera (1989) with the following modifications. Bacterial cells cultured in 200 ml of LB supplemented with 0.25 M sucrose and 0.05 M potassium phosphate, pH7.0 (LBSP) to an optical density of 1.0 at 600 nm were centrifuged, washed three times with ice-cold SHMG buffer (250 mM sucrose, 1 mM HEPES, 1 mM MgC12, 10% (v/v) glycerol, pH 7.0), and then resuspended in 1 ml of ice-cold SHMG buffer. The cell suspension was mixed with plasmid DNA at a final DNA concentration of 10 g ml-1 in a 0.2-cm electroporation cuvette (Bio-Rad), kept on ice for 30 min, and then electroporated by a Gene PulserTM model 1652076 (Bio-Rad) at 25 F, 2.5kV and 400Q with the pulse once. After electroporation, 3ml of LBSP supplemented with 10% (v/v) glycerol (LBSPG) were immediately added into the cuvette and incubated at 37 C for 2 hr with shaking.
[0075] The selected B. thuringiensis transformant was cultured in 250 ml of nutrient broth supplemented with 5 gg m1-1 of erythromycin (Sigma-Aldrich Canada Ltd.) and 5 g ml-1 of chloramphenicol (Sigma-Aldrich Canada Ltd.) at 37 C until cell autolysis was observed. The lysate was harvested and then washed twice with 10 mM EDTA (pH 8.0)-1 M NaCI-1 mM phenylmethylsulfonyl fluoride (Sigma-Aldrich Canada Ltd.).
[0076] The B. thuringiensis Cry- B transformant containing the B.
thuringiensis M15 parasporal crystal protein gene was incubated in nutrient broth (Bacto Beef Extract 3 g, Bacto Peptone 5 g I-1) at 30 C for 3 days to allow expression of the toxin gene and crystal formation. The presence of parasporal inclusions was examined by phase-contrast microscopy. When observed under a phase-contrast microscope, the B. thuringiensis transformants expressing the cry31 Aa2 gene contained, in addition to the spore, a roughly spherical inclusion, whereas no inclusions were found in the B.
thuringiensis transformant harboring the non-recombinant shuttle vector pHPS9 alone (data not shown). Under the transmission electron microscope (TEM), however, the parasporal inclusion body has a nearly perfect hexagonal shape (Fig. 7). Both inclusions in the transformant, spore and crystal, are separated from each other as opposed to what is found in B. thuringiensis strain M15 where they are tightly bound to each other.
[0077] The parasporal inclusion from a B. thuringiensis transformant was purified by sucrose density gradient centrifugation as described previously (Thomas and Ellar, 1983). It was then subjected to a 10% discontinuous sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SIDS-PAGE) (lane 3) as reported previously (Laemmli and Favre, 1973). High molecular (lane 1) and low molecular masses (lane 2) of standard protein markers are indicated on the left. The parasporal inclusion protein in the B. thuringiensis transformant is composed of a single major polypeptide of 83-kDa (Fig. 8).
Preparation of inclusion proteins, proteolytic processing, and toxin activation [0078] The spore-inclusion mixture was harvested from sporulated cultures and the inclusions were partially purified by a biphasic separation method described in Goodman (1967) using polyethylene glycol 6000 (Wako Pure Chemical, Osaka, Japan) and sodium dextran sulfate 500 (Sigma, St.
Louis, Mo.). Inclusions were further purified by sucrose density gradient centrifugation as described in Saitoh et al., (1998a). The purified inclusions were stored at 20 C until use.
[0079] Solubilization of purified inclusions was done in 50 mM Na2CO3 (pH 10.0) containing 1 mM EDTA and 10 mM dithiothreitol for 1 h at 37 C. After centrifugation at 20,000 x g for 5 min at 4 C to remove unsolubilized materials, the pH of the solution was adjusted to 8Ø
[0080] The native 83KDa protoxine displayed no cytocidal activity against cancer cells. This protein was therefore cleaved with three enzymes, namely trypsin, chymotrypsin and proteinase K to identify an active toxine.
The solubilized proteins (1.3 mg m11) were therefore treated with proteinase K
(final concentrations, 0.0003, 0.003, 0.03, and 0.3 mg m11), trypsin (0.03, 0.3, 3, and 30 mg m11), and chymotrypsin (0.03, 0.3, 3, and 30 mg m11) in 50 mM Na2CO3 (pH 10.0) for 1.5 h at 37 C. After protease treatment, phenylmethylsulfonyl fluoride (Wako Pure Chemical) was added to the solution to stop the proteolytic reaction, and the mixture was examined for both sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) profiles and cytopathic effect (CPE) on certain cancer cells including MOLT-4 and Hela. The CPE was monitored under a phase-contrast microscope for 24 h, and the degree of cytopathy was graded on the basis of the ratio of damaged cells as described in Mizuki et al., (1999).
One-dose assay, hemolytic assay, and dose-response study [0081] One-dose assays for cytotoxicity and hemolytic activity were carried out as described in Mizuki et al., (1999). Each well of a MicroTest plate received 90 pl of cell suspension containing 2 x 104 cells. After preincubation for 16 h at 37 C, 10 pI of the trypsin-activated sample solution (1.3 mg ml) was added to the well.
[0082] Thirteen human cells, two monkey cells and one mouse cell were used for dose-response studies. A hemolytic assay was done using human erythrocytes according to the method described in Saitoh et al., (1998b). Each well containing 90 1u1 of cell suspension (2 x 104 cells) received 10 N1 of trypsin-activated inclusion proteins which had been prepared in 10-fold serial dilutions in 50 mM Na2CO3 (pH 10.0) containing 10 mM DTT and 1 mM EDTA. Five wells were used for each dilution, and the test was repeated at least three times. The CPE was monitored under a phase-contrast microscope at appropriate intervals for 24 h postinoculation.
[0083] For assessment of the level of cytotoxicity, a cell proliferation test using an MTT [3-(4,5-dimethyl-2-thiazolyl)-2,5-diphenyl-2H tetrazolium bromide] assay as described in Behl (1992) and Heiss (1997) was conducted 24 h postinoculation by using a PremixTM WST-1 kit (Takara Co.). The average of absorbance in mock-inoculated negative controls was used as a blank value.
The arbitrary unit was defined on the basis of the relative value of absorbance at 450 nm to the blank (1.0). The 50% effective concentrations (EC50s) were deduced from the dose-response curves using a log-probit program. Table 2 below presents the EC50s.
[0084] The protein exhibited cytotoxicity against HeLa, TCS, HL-60, Jurkat and Hep-G2 cells when treated with trypsin. No cytocidal activity was induced after treatment with chymotrypsin or proteinase K on HeLa, MOLT-4 and Sawano cells. Without protease digestion, inclusion proteins showed no cytocidal toxicity. Trypsin cleaves the cry31 Aa2 protein after the arginine at position 250 of the full native protein sequence (SEQ ID NO: 2). The sequence of the trypsin-activated protein is designated SEQ ID NO: 8 and the corresponding nucleotide sequence is designated SEQ ID NO: 9.
[0085] Table 2 shows the results of one-dose assays of trypsin-activated Cry31 Aa2 as compared to those of trypsin-activated Cry3lAal against several species of cultured cells. The toxicity spectrum of the protein from the recombinant Cry-B was similar to that of the protein of the wild strain M15.
Both cloned proteins were highly or moderately cytocidal against HeLa, TCS, HL-60, Jurkat and Hep-G2 but were slightly toxic or nontoxic for normal T
cells and for Sawano, UtSMC, MOLT-4, A549, MRC-5, HC, Caco-2 and the non human cells tested.
Table 2 Effective concentration 50 of trypsin-activated cry31Aa2 as compared to that of activated cry31Aa 1 on various cells EC50(ug/ml) EC50(ug/ml) Organ Cell Cell type Cry3lAa2 Cry3lAal 1 Human Uterus HeLa Cervix cancer 0.30 0.23 2 Sawano Uterus cancer, adenocarcinoma >10 3 TCS Cervix cancer, keratinizing squamous 0.32 4 UtSMC Uterus normal smooth muscle >10 5 Blood MOLT-4 T cell leukaemia >10 1.06 6 HL-60 T cell leukaemia 0.05 7 Jurkat T cell leukaemia 0.02 8 T cell Normal T cell >10 9 Lung A549 Lung cancer >10 10 MRC-5 Lung normal fibroblast >10 11 Liver HC Normal hepatocyte cell >10 12 Hep-G2 Liver carcinoma hepatocellular 0.02 13 Colon Caco-2 fColon cancer, adenocarcinoma >10 14 Monkey Kidney Vero Monkey, kidney epithelial cell >10 15 COS-7 Monkey, kidney SV40 transformed cell >10 16 Mouse Embryo NIH3T3 Mouse, embryo fibroblast cell >10 Comparison of Cry3lAa2 and Cry31Aal [0001] As may be seen in Figure 5, the Cry3lAa2 amino acid sequence shares extensive homology with Cry31Aal except for a substitution of 26 amino acid residues and an addition of 19 contiguous codons in Cry31Aa2 (Figure 5). This 19 amino acid sequence is as follows SYQNMKTEIVNTDLPYNTN and is designated SEQ ID NO: 10 while the corresponding nucleotide sequence is designated SEQ ID NO: 11. The asterisks under the Cry3lAa2 sequence indicate the identities between Cry31Aa2 and Cry3lAal. The capital letters and dotted lines under the amino acid sequence of Cry3lAa2 in Figure 5, correspond to the difference and alignment gaps between the Cry31Aa2 and Cry3lAal proteins. The 83-KDa Cry3lAa2 protein exhibits 93.9% amino acid sequence identities with the Cry3lAal protein. The five conserved amino acid blocks of the Cry3lAa2 protein were especially identical to those of the Cry3lAal protein except for the substitution of a single lysine residue in the second conserved block of Cry3lAa2. The bold lines above the Cry3lAa2 sequence correspond to the five conserved amino acid blocks found in the amino acid sequence of Cry3lAal.
Both Cry3lAa2 and Cry3lAal show very low amino acid sequence homology to the known B. thuringiensis Cry and Cyt proteins (Mizuki et al., 2000). The trypsin-activated protein shares a 96.9% identity with the corresponding sequence of Cry3lAal.
[0002] Table 2 above shows that both Cry3lAa2 and Cry3lAal proteins display cytotoxicity against a number of human cancer cells. The cytocidal activity of the Cry3lAal was due to the cleavage by proteinase K and trypsin (Mizuki et al., 2000) while that of Cry3lAa2 was due to trypsin. The comparison of the amino acid sequence of Cry3lAal with that of Cry3lAa2 indicates which amino acids of the amino acid sequence of members of the Cry3l family can be substituted without abrogating this cytoxicity against human cancer cells.
Although certain of these substitutions surely provide for the specificity of each of Cry3lAa2 and Cry3lAal against specific human cancer cells, they each display a significant toxicity against a number of human cancer cells. Hence, it is submitted that the amino acids at positions 24, 37, 39, 51, 56, 59, 87, 97, 138, 158, 170, 251, 389, 444-446, 466, 481, 507, 510, 518, 551, 582, 637, 725 and 742 can be replaced by any other amino acid without abrogating the cytotoxicity of the protein that it constitutes against at least some cancer cells.
Sequences encompassing substitutions at these positions in the complete Cry31Aa2 protein sequence (SEQ ID NO:2) and in the trypsin-activated Cry3lAa2 protein sequence (SEQ ID NO:8) starting after the arginine at position 250 are within the scope of the present invention and are designated herein as SEQ ID NOs:12 and 13, respectively.
Such moieties are exemplified in Remington's Pharmaceutical Sciences (1980).
Methods of coupling these chemical-physical moieties to a polypeptide or nucleic acid sequence are well known in the art.
Recombinant vectors [00311 The term "recombinant DNA" as known in the art refers to a DNA molecule resulting from the joining of DNA segments. This is often referred to as genetic engineering. The same is true for "recombinant nucleic acid".
[0032] The term "vector" is commonly known in the art and defines a plasmid DNA, phage DNA, viral DNA and the like, which can serve as a DNA
vehicle into which DNA of the present invention can be cloned. Numerous types of vectors exist and are well known in the art.
[0033] The term "expression" defines the process by which a gene is transcribed into mRNA (transcription), the mRNA is then being translated (translation) into one polypeptide (or protein) or more.
[0034] The terminology "expression vector" defines a vector or vehicle as described above but designed to enable the expression of an inserted sequence following transformation into a host. The cloned gene (inserted sequence) is usually placed under the control of control element sequences such as promoter sequences. The placing of a cloned gene under such control sequences is often referred to as being operably linked to control elements or sequences.
[0035] Operably linked sequences may also include two segments that are transcribed onto the same RNA transcript. Thus, two sequences, such as a promoter and a "reporter sequence" are operably linked if transcription commencing in the promoter will produce an RNA transcript of the reporter sequence. In order to be "operably linked" it is not necessary that two sequences be immediately adjacent to one another.
[0036] Expression control sequences will vary depending on whether the vector is designed to express the operably linked gene in a prokaryotic or eukaryotic host or both (shuttle vectors) and can additionally contain transcriptional elements such as enhancer elements, termination sequences, tissue-specificity elements, and/or translational initiation and termination sites.
[0037] The DNA construct can be a vector comprising a promoter that is operably linked to an oligonucleotide sequence of the present invention, which is in turn, operably linked to a heterologous gene, such as the gene for the luciferase reporter molecule. "Promoter" refers to a DNA regulatory region capable of binding directly or indirectly to RNA polymerase in a cell and initiating transcription of a downstream (3' direction) coding sequence. For purposes of the present invention, the promoter is bound at its 3' terminus by the transcription initiation site and extends upstream (5' direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background. Within the promoter will be found a transcription initiation site (conveniently defined by mapping with S1 nuclease), as well as protein binding domains (consensus sequences) responsible for the binding of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" boxes and "CCAT" boxes. Prokaryotic promoters contain -10 and -35 consensus sequences, which serve to initiate transcription and the transcript products contain Shine-Dalgarno sequences, which serve as ribosome binding sequences during translation initiation.
Recombinant host cell [0038] A host cell or indicator cell has been "transfected" by exogenous or heterologous DNA (e.g. a DNA construct) when such DNA has been introduced inside the cell. The transfecting DNA may or may not be integrated (covalently linked) into chromosomal DNA making up the genome of the cell. In prokaryotes, yeast, and mammalian cells for example, the transfecting DNA may be maintained on an episomal element such as a plasmid. With respect to eukaryotic cells, a stably transfected cell is one in which the transfecting DNA has become integrated into a chromosome so that it is inherited by daughter cells through chromosome replication. This stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones comprised of a population of daughter cells containing the transfecting DNA. Transfection methods are well known in the art (Sambrook et al., 1989, supra; Ausubel et aL, 1994 supra).
Method for identifying other cancer cells against which Cry3l Aa2 is c otoxic [0039] In addition to the EC50 assay described herein, other assays may be used to determine the effects of Cry31 Aa2 or other proteins encompassed by the present invention on human cancer cells. In particular, these effects may be observed by photonic microscopy. Furthermore, assays for detecting cytopathic effects can also be used for this purpose.
[0040] Other objects, advantages and features of the present invention will become more apparent upon reading of the following non-restrictive description of preferred embodiments thereof, given by way of example only with reference to the accompanying drawings.
[0041] The present invention seeks to meet these needs and other needs.
[0042] The present description refers to a number of documents, the content of which is herein incorporated by reference in their entirety.
BRIEF DESCRIPTION OF THE DRAWINGS
In the appended drawings:
[0043] Figure 1 illustrates in panel A) a phase-contrast micrograph of a lysed culture of Bacillus thuringiensis strain M15; in panel B, a transmission electron micrograph of Bacillus thuringiensis strain M15 containing a spore and a tightly bound parasporal inclusion;
[0044] Figure 2 shows a SDS-PAGE analysis of the parasporal inclusion protein(s) of B. thuringiensis strain M15;
[0045] Figure 3 is the nucleotide sequence of the translated portion of the Cry3lAa2 gene (SEQ ID NO: 1);
[0046] Figure 4 is the deduced amino acid sequence of the Cry39Aa2 gene (SEQ ID NO: 2);
[0047] Figure 5 shows a comparison of the deduced amino acid sequences of Cry3lAa2 (SEQ ID NO:2) and Cry3lAal (SEQ ID NO:18). The capital letters and dotted lines under the amino acid sequence of Cry31Aa2 (SEQ ID NO:2) correspond to the difference and alignment gaps between the Cry3lAa2 (SEQ ID NO:2) and Cry3lAal (SEQ ID NO:18). The asterisks under the Cry3lAa2 sequence indicate the identities between Cry3lAa2 (SEQ ID
NO:2) and Cry31Aal (SEQ ID NO:18);
[0048] Figure 6 shows a restriction map of the recombinant plasmid pYCP31A2 containing the Cry31Aa2 gene;
[0049] Figure 7 shows a transmission electron micrograph of a B.
thuringiensis Cr{ B transformant expressing the Cry3lAa2 gene. S: spore; P:
parasporal inclusion; Magnification : 20,000 X;
[0050] Figure 8 shows a SDS-PAGE analysis of the parasporal inclusion protein from a B. thuringiensis transformant expressing the crystal protein gene Cry31 Aa2;
[0051] Figure 9 shows the nucleotide sequence (SEQ ID NO:16) and deduced amino acid sequence (SEQ ID NO:2) of the Cry31Aa2 gene along with features thereof; and [0052] Figure 10 shows the nucleotide sequence of the translated portion of the Cry31Aal gene (SEQ ID NO:17).
DESCRIPTION OF THE PREFERRED EMBODIMENT
Isolation of strain, morphological and biochemical characteristics [0053] A Bacillus thuringiensis strain was isolated from dead two-spotted spider mites (Tetranychus urticae Koch; Arthropoda: Arachnida: Tetranychidae) and named M15. The mites, parasitic on apple leaves, were collected in an apple orchard located in Frelighsburgh, Quebec, Canada. They were homogenized in 3 ml of phosphate-buffered saline (PBS) (NaCl 8 g, KCI 0.2 g, Na2HPO4 1.44 g, KH2PO4 0.24 g I-1). The homogenized solution was incubated for 16 hr at room temp and heated at 78 C for 15 min. Afterwards, the homogenate was plated on 2YT agar medium (Bacto Tryptone 16 g, Bacto Yeast Extract 10 g, NaCl 5 g, Agar 18 g I-1), and incubated for 24 hr at 30 C.
All colonies with a morphology similar to B. thuringiensis were streaked on T3 agar medium (Bacto Tryptone 3 g, Bacto Tryptose 2 g, Bacto Yeast Extract 1.5 g, MnC12 0.005 g, 0.05M Sodium phosphate, pH6.7, Agar 18 g I-1) and incubated at 30 C for 48 hr. The cultures were examined by phase-contrast microscopy (Carl Zeiss Canada Ltd., Toronto, Ontario, Canada) for the presence of spores and crystals. B. thuringiensis M15 was deposited on 29 January 2001 in the International Depository Authority of Health Canada in Winnipeg under the Budapest Treaty (Bureau of Microbiology, Health Canada, 1015 Arlington Street, Winnipeg, Manitoba, R3E 3R2) under accession no.
I DAC010201-5.
[0054] The M15 strain was characterized for its ability to ferment specific carbon sources, and for the production, utilization and reduction of specific compounds (see Table 1 below). The biochemical characteristics of B.
thuringiensis strain M15, obtained using the API 50CH and API 20E kits as recommended by the manufacturer (bioMerieux, St-Laurent, Quebec, Canada), were different from those of three controls, B. thuringiensis var. kurstaki HD-1, -var. israelensis HD-500 and -var. higo BT205. B. thuringiensis var. kurstaki HD-1 and -var. israelensis HD-500 were obtained from "Laboratoire des bacteries entomopathogenes", Institut Pasteur (Paris, France). B. thuringiensis var.
higo BT205 was in the Agriculture Canada collection (Jung et at., 1998). The M15 strain is further characterized in Jung, 2001.
Table 1. The biochemical profile of B. thuringiensis M15 and selected control strains Tests B. thuringiensis B. thuringiensis B. thuringiensis B. thuringiensis var. kurstaki HD-1 var. israelensis HD-500 var. higo BT M 15 Fermentatio n of Glycerol + + +
D-Arabinose - - - -L-Arabinose - - - -Ribose + + + +
D-Xylose - - - -L-Xylose - - - -D-Galactose - - - -D-Glucose + + + +
D-Fructose + + + +
D-Mannose - - - -L-Sorbose - - - -Inositol - - - -D-Mannitol - - - -D-Sorbitol - - - -N- + + + +
Acetylglucosa mine Arbutin + + + -Esculin + +
Salicin + - + +
D- + + + -Cellobiose D-Maltose + + + +
Lactose - - - -Melibiose - - - -Sucrose - - - -Trehalose + + + +
Starch - + + -Glycogen + + + -Gluconate + + -Production of Galactosida se Arginine + + + -dihydrolase Ornithine - - - -decarboxyla se Urease + - + +
Tryptophane - - - -deaminase Gelatinase + + + +
Oxidase + + + +
Catalase + + + +
Indole - - - -Acetoin + + + +
Citrate + - - -utilization Nitrate + - - -reduction +, -, and indicate positive, negative, and weak reactions, respectively.
Microscopic characterization of Cry31 Aa2 parasporal inclusion bodies [0055] The parasporal inclusion bodies produced by a sporulated culture of B. thuringiensis strain M15 appear roughly spherical when observed under phase-contrast microscopy (Fig. 1 A) and are tightly coupled to the spores even in lysed cultures. Further analysis under the transmission electron microscope (TEM), however, reveals that the parasporal inclusion body has a polygonal shape (Fig. 1B). The TEM observation was conducted after the B.
thuringiensis strain M15 was incubated for 5 days at 309C in T3 medium and the samples ultra-thinly sectioned according to Beveridge et al. (1994).
Arrows show the roughly spherical parasporal inclusions tightly bound to the white ovoid spores. In this figure, "S" and "P" denote spore and parasporal inclusion, respectively. Magnification used is of 25,000 X.
SDS-PAGE analysis and N-terminal sequencing of the native parasporal inclusion protein [0056] The B. thuringiensis strain M15 was grown in T3 medium for 5 days at 30 C on a rotary shaker to allow crystal protein production. Spores and crystals were separated from each other in the tightly bound parasporal duplexes using an ultrasonic processor model VC 130 (Sonics & Materials, Inc., Newtown, CT, USA) and purified by sucrose density gradient centrifugation as described elsewhere (Thomas and Ellar, 1983). Twenty microliters of the crystal suspension were added to 3 volumes of gel loading buffer (4% SDS, 20% glycerol, 125 mM Tris-HCI, 10% 2-mercaptoethanol, pH 6.8) in a 1.5-ml microtube, incubated at 90 C for 7 min and centrifuged for 2 min to remove unsolubilized materials. Thirty microliters of the supernatant were loaded on top of 10% SDS-polyacrylamide gels. Discontinuous sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) was performed according to Laemmli and Favre (1973).
[0057] Figure 2 shows the B. thuringiensis strain M15 parasporal inclusion purified by sucrose density gradient centrifugation as subjected to a 10% SDS-PAGE electrophoresis (lane 4); the crude extracts of the fully lysed B. thuringiensis var. kurstaki HD-1 subjected to electrophoresis on the same gel (lane 3) as a control; and high molecular (lane 1) and low molecular masses (lane 2) of standard protein markers on the left. At least two major bands of approximately 86- and 79-KDa in size were revealed. They were transferred to a polyvinylidene difluoride (PVDF) membrane (Bio-RadTM), excised and subjected to a pulsed liquid phase sequencer for determination of N-terminal amino acid sequence.
[0058] The N-terminal amino acid sequence of the crystal protein from B.
thuringiensis strain M15 was determined as follows. The purified parasporal crystal was added into O.1 N NaOH-3M HEPES solution and solubilized in 10 volumes of gel loading buffer by incubating in boiling water for 5 min. The crystal protein was separated on 10 % SDS-PAGE and transferred to a polyvinylidene difluoride (PVDF) membrane (Bio-Rad, Mississauga, Ontario, Canada). The crystal protein band stained with Coomassie brilliant blueTM R-250 (Bio-Rad) was excised and subjected to a pulsed liquid phase sequencer model 473A (Applied Biosystems, Foster City, CA, USA) at the Regional Sequencing Facility (Centre de recherche du Centre hospitalier de I'Universite Laval, Quebec, Canada).
[0059] The N-terminal sequence analysis revealed that both polypeptides (86- and 79-KDa) shared identical 20-amino acids residues.
These were Met, Asp, Pro, Phe, Ser, Asn, Tyr, Ser, Glu, Gin, Lys, Tyr, Pro, Asp, Ser, Asn, Asn, Asn, Gin and Glu (SEQ ID NO: 3).
Southern hybridization and gene cloning [0060] An 18-mer oligonucleotide sequence, referred to as M15-M, was deduced from a middle portion of the N-terminal amino acid sequence (Glu, Gin, Lys, Tyr, Pro, Asp (SEQ ID NO: 4)) of the 86-kDa crystal protein. The M15-M oligonucleotide was labeled by the Digoxigenin (DIG) oligonucleotide 3'-end labeling kit containing DIG-11-ddUTP (Roche, Laval, Quebec, Canada) as recommended by the manufacturer. The labeled oligonucleotide was precipitated with 0.1 volume of 4M LiCI and 2.5 volumes of ice-cold ethanol, and transferred at -70 C for 30 min. The reaction was centrifuged at 16,000g for 15 min at 4 C. The washed pellet was resuspended in nuclease-free water, and stored at -20 C until use.
[0061] The M 15-M generated had the following sequence: 5'-GARCARAARTAYCCNGAY-3' (SEQ ID NO: 5).
[0062] B. thuringiensis strain M15 was grown in Luria-Bertani (LB) medium (Bacto Tryptone 10 g, Bacto Yeast Extract 5 g, NaCl 5 g I-1) at 30 C
for 16 hr on a rotary shaker. Plasmid DNA was isolated using the alkaline extraction method as described elsewhere (Birnboim and Doly, 1979) with the following modifications. LysozymeTM (Sigma-Aldrich Canada Ltd., Oakville, Ontario, Canada) was added at a concentration of 2 mg.ml-1 and the cell suspension was incubated at 37 C for 1 hr.
[0063] The plasmid DNA was then purified with Wizard TM Plus SV
minipreps DNA purification system following the manufacturer's recommendation (Promega, Nepean, Ontario, Canada). Three samples of the plasmid were then digested with Hindlll, HindlIl/EcoRl and EcoRl (Gibco BRL), respectively, electrophoresed on a 0.7% agarose gel and transferred onto a NytranTM nylon membrane (Schleicher & Schuell, Keene, New Hampshire, USA) by the method of Southern (1975). They were then probed with the DIG-labeled 18-mer M15-M oligonucleotide.
[0064] This Southern blot hybridization was performed using the DIG-labeled oligonucleotides with the standard hybridization solution (5X SSC, 1%
blocking reagent (Roche), 0.1% N-lauroylsarcosine, 0.02% SDS) for 13 hr at 39 C. After hybridization, the membrane was washed twice for 15 min each in 4X wash solution (4X SSC, 0.1% SDS) at 39 C. Following the washes, detection of signals on the membrane was performed with the color-substrate solution containing NBT (4-Nitroblue tetrazolium chloride, Roche) and BCIP (5-Bromo-4-chloro-3-indolyl-phosphate, Roche) as recommended by the manufacturer. After hybridization and post-hybridization washes at 39 C, the M15-M probe strongly hybridized to an 8-kb Hindlll, a 2.6-kb Hindlll/EcoRl, and a 2.6-kb EcoRl fragment.
[0065] The purified B. thuringiensis M15 plasmid DNA was digested with Hindlll and ligated with the Hind lll-digested SAP (Shrimp Alkaline Phosphatase, Roche)-treated pBluescriptTM II KS(+) (Stratagene, La Jolla, California, USA). After ligation, the recombinant DNA was transformed into E.
coli DH5a (Gibco BRL, Burlington, Ontario, Canada). Preparation of E. coli DH5a competent cells and transformation were done as described (Sambrook et al., 1989).
[0066] The transformants were grown on LB agar plates containing 100 g ml-1 ampicillin (Sigma-Aldrich Canada Ltd.) and 40 g ml-1 X-Gal (5'-Bromo-4-chloro-3-indolyl-p-D-galactopyranoside, Sigma-Aldrich Canada Ltd.) at 37 C. White colonies were toothpicks-transferred to 1 ml of fresh LB media supplemented with 100 .tg ml-1 ampicillin, and incubated overnight at 37 C.
[0067] The recombinant DNA were then isolated by the cracking procedure (Sambrook et al., 1989) and electrophoresed on 0.7% agarose gel to assess the size of the undigested recombinant plasmids.
[0068] The three recombinant plasmids with the highest molecular weight were selected and digested with Hindlll. They were designated pYCH27, pYCH40 and pYCH217, respectively. All three plasmids contained an 8-kb Hindlll insert. In addition, pYCH27 and pYCH40 also contained a 0.75-kb and a 1.9-kb Hindlll fragment, respectively. They were then electrophoresed on a 0.7% agarose gel, transferred onto a NytranTM nylon membrane by the method of Southern (1975) and probed with the M15-M oligonucleotide. The M15-M probe hybridized to the 8-kb Hindlll fragments in pYCH27, pYCH40 and pYCH217 as revealed by Southern blot hybridization.
[0069] The 8-kb Hindlll fragments from pYCH27, pYCH40 and pYCH217 were doubly digested with Hindlll/EcoRl, electrophoresed on agarose gel, Southern transferred, and hybridized with the M15-M probe. For each of the three recombinant plasmids, a single 2.6-kb fragment was detected (data not shown). This confirms that this 2.6-kb fragment is the same as the one in the EcoRl-digested plasmid DNA of strain M15.
[0070] The 8-kb Hindlll insert was excised from recombinant plasmid pYCH217, digested with various restriction enzymes [EcoRl, Bglll (Gibco BRL), Dral, Sphl (Amersham Pharmacia Biotech)], and a restriction map constructed.
The 8-kb Hindlll fragment contains a 3.4-kb HindlII/EcoRl, a 2.6-kb EcoRl/EcoRl, a 1.4-kb EcoRl/EcoRl and a 0.6-kb EcoRl/Hindlll fragment. The open reading frame of a crystal protein gene, cry3lAa2, is indicated by a gray arrow. The hatched box indicates the region homologous to the DIG-labeled 18-mer M15-M oligonucleotide probe. Selected subfragment sizes are indicated below the structural map.
[0071] To identify the region homologous to the M15-M probe, the recombinant plasmid pYCH217 was doubly digested with Hindlll/EcoRl, and the resulting fragments were subcloned into EcoRl-digested pBluescriptTM II
KS(+). After ligation, four subclones were obtained to give the recombinant plasmids pYC12S, pYC22S, pYC30S, and pYC31S. Plasmids pYC12S and pYC30S contained a 1.4-kb and a 2.6-kb insert, respectively, while pYC22S
and pYC31 S both harbored a 2.6-kb insert along with a 0.6-kb and a 1.4-kb fragment, respectively. Only the 2.6-kb EcoRl/EcoRl fragment from subclones, pYC22S, pYC30S and pYC31S hybridized with the M15-M probe. To further localize the region of hybridization of the M15-M probe in the 2.6-kb EcoRl/EcoRl fragment, the recombinant plasmid pYC30S was digested with EcoRl, EcoRl/Dral, EcoRI/Sphl, and EcoRl/BgllI, respectively, and then hybridized with the M15-M probe. The M15-M probe detected a 2.6-kb EcoRl, a 0.6-kb Dral, a 1.6-kb EcoRl/Sphl, and a 0.85-kb EcoRl/BgIII fragment, respectively. It was thus determined that the region of hybridization of the M probe lied between the BgIII and Dral sites within the 2.6-kb EcoRl fragment.
Characterization of a new crystal protein gene, cry31Aa2 [0072] The nucleotide sequences of the 2.6-kb EcoRI/EcoRl, 1.4-kb EcoRl/EcoRl and 0.6-kb EcoRl/Hindlll fragments were determined. An open reading frame (ORF) of 2,226-bp in length that codes for a polypeptide of 742 amino acids with a predicted molecular mass of 83,068Da (Figures 3 and 4) was found. The start codon is not ATG but GTG. One potential promoter-like sequence in the 5' non-coding region (Lereclus et al., 1989; Baum and Malvar, 1995) shows a 13-bp spacing between the putative -10 and -35 sequences located 138-bp upstream from the start codon (GTG). The potential ribosome binding site (RBS) (GAAAGGTGG (SEQ ID NO: 6)) is located 7-bp upstream of the start codon (GTG) and is partially complementary to the 3' end (UCUUUCCUCC (SEQ ID NO: 7)) of B. subtilis 16S rRNA (McLaughlin et al., 1981; Moran et al., 1982). Both potential -35 and -10 boxes and a putative ribosome-binding site are underlined in figure 9. The calculated free energy of interaction (AG, 25 C) between the B. subtilis 16S rRNA and the putative ribosome binding site is -14.8 kcal-mol-1 (Tinoco et al., 1973). A terminal inverted repeat that could form a stem-and-loop secondary structure with a calculated energy (AG, 25 C) of -12.2 kcal=mol-1 (Tinoco et al., 1973) is located 112-bp downstream from the stop codon (TAA), which is marked with asterisks in figure 9, and may function as a transcription terminator (indicated by arrows). The 18-mer M15-M oligonucleotide sequence based on the N-terminal amino acid sequence (Glu, Gin, Lys, Tyr, Pro, Asp (SEQ ID NO: 4)) of the crystal protein is homologous to a region located 24-bp downstream from the start codon (GTG). The sequence of the DIG-labeled 18-mer oligonucleotide (M15-M) probe is indicated in bold capital letters in figure 9.
The crv3lAa2 gene expression in B. thuringgiensis Cry- B strain [0073] The 3.6-kb Hindlll/Sphl fragment containing the entire crystal protein gene was excised from the recombinant plasmid pYCH217, and then cloned into the E. coli-B. thuringiensis shuttle vector pHPS9 doubly digested with Hindlll/Sphl to yield recombinant plasmid pYCP31A2 (Figure 6). The E.
coli-B. thuringiensis shuttle vector pHPS9 (Haima, et al., 1990) was purchased from American Type Culture Collection (Manassas, VA, USA). To express the cloned cry3lAa2 crystal protein gene in the acrystalliferous B. thuringiensis strain Cry- B, the 3.6-kb Hindlll/Sphl fragment was cloned into the Hindlll/Sphl doubly-digested E. coli-B. thuringiensis shuttle vector pHPS9 to yield recombinant plasmid pYCP31A2 (Fig. 6).
[0074] The B. thuringiensis var. kurstaki HD-1 acrystalliferous Cry-B
strain ((Stahly et al., 1978) provided by the Bacillus Genetic Stock Center, The Ohio State University (Columbus, OH, USA)), was transformed with the cloned B. thuringiensis M15 crystal protein gene by electroporation as described by Vehmaanpera (1989) with the following modifications. Bacterial cells cultured in 200 ml of LB supplemented with 0.25 M sucrose and 0.05 M potassium phosphate, pH7.0 (LBSP) to an optical density of 1.0 at 600 nm were centrifuged, washed three times with ice-cold SHMG buffer (250 mM sucrose, 1 mM HEPES, 1 mM MgC12, 10% (v/v) glycerol, pH 7.0), and then resuspended in 1 ml of ice-cold SHMG buffer. The cell suspension was mixed with plasmid DNA at a final DNA concentration of 10 g ml-1 in a 0.2-cm electroporation cuvette (Bio-Rad), kept on ice for 30 min, and then electroporated by a Gene PulserTM model 1652076 (Bio-Rad) at 25 F, 2.5kV and 400Q with the pulse once. After electroporation, 3ml of LBSP supplemented with 10% (v/v) glycerol (LBSPG) were immediately added into the cuvette and incubated at 37 C for 2 hr with shaking.
[0075] The selected B. thuringiensis transformant was cultured in 250 ml of nutrient broth supplemented with 5 gg m1-1 of erythromycin (Sigma-Aldrich Canada Ltd.) and 5 g ml-1 of chloramphenicol (Sigma-Aldrich Canada Ltd.) at 37 C until cell autolysis was observed. The lysate was harvested and then washed twice with 10 mM EDTA (pH 8.0)-1 M NaCI-1 mM phenylmethylsulfonyl fluoride (Sigma-Aldrich Canada Ltd.).
[0076] The B. thuringiensis Cry- B transformant containing the B.
thuringiensis M15 parasporal crystal protein gene was incubated in nutrient broth (Bacto Beef Extract 3 g, Bacto Peptone 5 g I-1) at 30 C for 3 days to allow expression of the toxin gene and crystal formation. The presence of parasporal inclusions was examined by phase-contrast microscopy. When observed under a phase-contrast microscope, the B. thuringiensis transformants expressing the cry31 Aa2 gene contained, in addition to the spore, a roughly spherical inclusion, whereas no inclusions were found in the B.
thuringiensis transformant harboring the non-recombinant shuttle vector pHPS9 alone (data not shown). Under the transmission electron microscope (TEM), however, the parasporal inclusion body has a nearly perfect hexagonal shape (Fig. 7). Both inclusions in the transformant, spore and crystal, are separated from each other as opposed to what is found in B. thuringiensis strain M15 where they are tightly bound to each other.
[0077] The parasporal inclusion from a B. thuringiensis transformant was purified by sucrose density gradient centrifugation as described previously (Thomas and Ellar, 1983). It was then subjected to a 10% discontinuous sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SIDS-PAGE) (lane 3) as reported previously (Laemmli and Favre, 1973). High molecular (lane 1) and low molecular masses (lane 2) of standard protein markers are indicated on the left. The parasporal inclusion protein in the B. thuringiensis transformant is composed of a single major polypeptide of 83-kDa (Fig. 8).
Preparation of inclusion proteins, proteolytic processing, and toxin activation [0078] The spore-inclusion mixture was harvested from sporulated cultures and the inclusions were partially purified by a biphasic separation method described in Goodman (1967) using polyethylene glycol 6000 (Wako Pure Chemical, Osaka, Japan) and sodium dextran sulfate 500 (Sigma, St.
Louis, Mo.). Inclusions were further purified by sucrose density gradient centrifugation as described in Saitoh et al., (1998a). The purified inclusions were stored at 20 C until use.
[0079] Solubilization of purified inclusions was done in 50 mM Na2CO3 (pH 10.0) containing 1 mM EDTA and 10 mM dithiothreitol for 1 h at 37 C. After centrifugation at 20,000 x g for 5 min at 4 C to remove unsolubilized materials, the pH of the solution was adjusted to 8Ø
[0080] The native 83KDa protoxine displayed no cytocidal activity against cancer cells. This protein was therefore cleaved with three enzymes, namely trypsin, chymotrypsin and proteinase K to identify an active toxine.
The solubilized proteins (1.3 mg m11) were therefore treated with proteinase K
(final concentrations, 0.0003, 0.003, 0.03, and 0.3 mg m11), trypsin (0.03, 0.3, 3, and 30 mg m11), and chymotrypsin (0.03, 0.3, 3, and 30 mg m11) in 50 mM Na2CO3 (pH 10.0) for 1.5 h at 37 C. After protease treatment, phenylmethylsulfonyl fluoride (Wako Pure Chemical) was added to the solution to stop the proteolytic reaction, and the mixture was examined for both sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) profiles and cytopathic effect (CPE) on certain cancer cells including MOLT-4 and Hela. The CPE was monitored under a phase-contrast microscope for 24 h, and the degree of cytopathy was graded on the basis of the ratio of damaged cells as described in Mizuki et al., (1999).
One-dose assay, hemolytic assay, and dose-response study [0081] One-dose assays for cytotoxicity and hemolytic activity were carried out as described in Mizuki et al., (1999). Each well of a MicroTest plate received 90 pl of cell suspension containing 2 x 104 cells. After preincubation for 16 h at 37 C, 10 pI of the trypsin-activated sample solution (1.3 mg ml) was added to the well.
[0082] Thirteen human cells, two monkey cells and one mouse cell were used for dose-response studies. A hemolytic assay was done using human erythrocytes according to the method described in Saitoh et al., (1998b). Each well containing 90 1u1 of cell suspension (2 x 104 cells) received 10 N1 of trypsin-activated inclusion proteins which had been prepared in 10-fold serial dilutions in 50 mM Na2CO3 (pH 10.0) containing 10 mM DTT and 1 mM EDTA. Five wells were used for each dilution, and the test was repeated at least three times. The CPE was monitored under a phase-contrast microscope at appropriate intervals for 24 h postinoculation.
[0083] For assessment of the level of cytotoxicity, a cell proliferation test using an MTT [3-(4,5-dimethyl-2-thiazolyl)-2,5-diphenyl-2H tetrazolium bromide] assay as described in Behl (1992) and Heiss (1997) was conducted 24 h postinoculation by using a PremixTM WST-1 kit (Takara Co.). The average of absorbance in mock-inoculated negative controls was used as a blank value.
The arbitrary unit was defined on the basis of the relative value of absorbance at 450 nm to the blank (1.0). The 50% effective concentrations (EC50s) were deduced from the dose-response curves using a log-probit program. Table 2 below presents the EC50s.
[0084] The protein exhibited cytotoxicity against HeLa, TCS, HL-60, Jurkat and Hep-G2 cells when treated with trypsin. No cytocidal activity was induced after treatment with chymotrypsin or proteinase K on HeLa, MOLT-4 and Sawano cells. Without protease digestion, inclusion proteins showed no cytocidal toxicity. Trypsin cleaves the cry31 Aa2 protein after the arginine at position 250 of the full native protein sequence (SEQ ID NO: 2). The sequence of the trypsin-activated protein is designated SEQ ID NO: 8 and the corresponding nucleotide sequence is designated SEQ ID NO: 9.
[0085] Table 2 shows the results of one-dose assays of trypsin-activated Cry31 Aa2 as compared to those of trypsin-activated Cry3lAal against several species of cultured cells. The toxicity spectrum of the protein from the recombinant Cry-B was similar to that of the protein of the wild strain M15.
Both cloned proteins were highly or moderately cytocidal against HeLa, TCS, HL-60, Jurkat and Hep-G2 but were slightly toxic or nontoxic for normal T
cells and for Sawano, UtSMC, MOLT-4, A549, MRC-5, HC, Caco-2 and the non human cells tested.
Table 2 Effective concentration 50 of trypsin-activated cry31Aa2 as compared to that of activated cry31Aa 1 on various cells EC50(ug/ml) EC50(ug/ml) Organ Cell Cell type Cry3lAa2 Cry3lAal 1 Human Uterus HeLa Cervix cancer 0.30 0.23 2 Sawano Uterus cancer, adenocarcinoma >10 3 TCS Cervix cancer, keratinizing squamous 0.32 4 UtSMC Uterus normal smooth muscle >10 5 Blood MOLT-4 T cell leukaemia >10 1.06 6 HL-60 T cell leukaemia 0.05 7 Jurkat T cell leukaemia 0.02 8 T cell Normal T cell >10 9 Lung A549 Lung cancer >10 10 MRC-5 Lung normal fibroblast >10 11 Liver HC Normal hepatocyte cell >10 12 Hep-G2 Liver carcinoma hepatocellular 0.02 13 Colon Caco-2 fColon cancer, adenocarcinoma >10 14 Monkey Kidney Vero Monkey, kidney epithelial cell >10 15 COS-7 Monkey, kidney SV40 transformed cell >10 16 Mouse Embryo NIH3T3 Mouse, embryo fibroblast cell >10 Comparison of Cry3lAa2 and Cry31Aal [0001] As may be seen in Figure 5, the Cry3lAa2 amino acid sequence shares extensive homology with Cry31Aal except for a substitution of 26 amino acid residues and an addition of 19 contiguous codons in Cry31Aa2 (Figure 5). This 19 amino acid sequence is as follows SYQNMKTEIVNTDLPYNTN and is designated SEQ ID NO: 10 while the corresponding nucleotide sequence is designated SEQ ID NO: 11. The asterisks under the Cry3lAa2 sequence indicate the identities between Cry31Aa2 and Cry3lAal. The capital letters and dotted lines under the amino acid sequence of Cry3lAa2 in Figure 5, correspond to the difference and alignment gaps between the Cry31Aa2 and Cry3lAal proteins. The 83-KDa Cry3lAa2 protein exhibits 93.9% amino acid sequence identities with the Cry3lAal protein. The five conserved amino acid blocks of the Cry3lAa2 protein were especially identical to those of the Cry3lAal protein except for the substitution of a single lysine residue in the second conserved block of Cry3lAa2. The bold lines above the Cry3lAa2 sequence correspond to the five conserved amino acid blocks found in the amino acid sequence of Cry3lAal.
Both Cry3lAa2 and Cry3lAal show very low amino acid sequence homology to the known B. thuringiensis Cry and Cyt proteins (Mizuki et al., 2000). The trypsin-activated protein shares a 96.9% identity with the corresponding sequence of Cry3lAal.
[0002] Table 2 above shows that both Cry3lAa2 and Cry3lAal proteins display cytotoxicity against a number of human cancer cells. The cytocidal activity of the Cry3lAal was due to the cleavage by proteinase K and trypsin (Mizuki et al., 2000) while that of Cry3lAa2 was due to trypsin. The comparison of the amino acid sequence of Cry3lAal with that of Cry3lAa2 indicates which amino acids of the amino acid sequence of members of the Cry3l family can be substituted without abrogating this cytoxicity against human cancer cells.
Although certain of these substitutions surely provide for the specificity of each of Cry3lAa2 and Cry3lAal against specific human cancer cells, they each display a significant toxicity against a number of human cancer cells. Hence, it is submitted that the amino acids at positions 24, 37, 39, 51, 56, 59, 87, 97, 138, 158, 170, 251, 389, 444-446, 466, 481, 507, 510, 518, 551, 582, 637, 725 and 742 can be replaced by any other amino acid without abrogating the cytotoxicity of the protein that it constitutes against at least some cancer cells.
Sequences encompassing substitutions at these positions in the complete Cry31Aa2 protein sequence (SEQ ID NO:2) and in the trypsin-activated Cry3lAa2 protein sequence (SEQ ID NO:8) starting after the arginine at position 250 are within the scope of the present invention and are designated herein as SEQ ID NOs:12 and 13, respectively.
[0088] Certain substitutions are preferred however and correspond to either a substitution by an amino acid having similar chemical properties or, even more preferred, a substitution by the amino acid found at the corresponding position in Cry3lAal. Amino acids are categorized herein into 5 groups of amino acids according to their chemical properties, namely small nonpolar (i.e. C, P, A and T), small polar (i.e. S, G, D and N), large polar (i.e. E, 0, K and R), intermediate polarity (i.e. Y, H and W), aand large nonpolar (i.e. F, M, L, I and V) . Hence, the amino acid at position 24 of the Cry31 Aa2, is preferably a polar amino acid, most preferably a large polar amino acid and even more preferably glutamate or lysine. The amino acid at position 37 is preferably methionine or alanine. The amino acid at position 39 is preferably threonine or asparagine. The amino acid at position 51, is preferably a nonpolar amino acid, most preferably a small nonpolar amino acid and even more preferably alanine or threonine. The amino acid at position 56 is preferably proline or serine. The amino acid at position 59 is preferably an amino acid of intermediate polarity and most preferably tyrosine or tryptophan. The amino acid at position 87, is preferably a polar amino acid, most preferably a small polar amino acid and even more preferably asparagine or aspartate. The amino acid at position 97, is preferably a polar amino acid, most preferably a large polar amino acid and even most preferably arginine or lysine. The amino acid at position 138 is preferably a polar amino acid, most preferably a large polar amino acid and even more preferably glutamate or lysine. The amino acid at position 158 is preferably alanine or asparagine. The amino acid at position is preferably a polar amino acid, most preferably a small polar amino acid and even more preferably glycine or serine.
[0089] The amino acid at position 251 is preferably a nonpolar amino acid, most preferably a large nonpolar amino acid and even more preferably isoleucine or methionine. The amino acid at position 389 is preferably a polar amino acid, most preferably a large polar amino acid and even more preferably lysine or arginine. The amino acid at position 444 is preferably serine or histidine. The amino acid at position 445 is preferably a polar amino acid, most preferably a small polar amino acid and even more preferably glycine or serine.
The amino acid at position 446 is preferably glycine or proline. The amino acid at position 466 is preferably a polar amino acid, most preferably a large polar amino acid and even more preferably glutamine or arginine. The amino acid at position 481 is preferably an amino acid of intermediate polarity and most preferably tyrosine or tryptophan. The amino acid at position 507 is preferably alanine or leucine. The amino acid at position 510 is preferably glycine or histidine. The amino acid at position 518 is preferably a nonpolar amino acid and is preferably alanine or valine. The amino acid at position 551 is preferably a nonpolar amino acid, most preferably a small nonpolar amino acid and even more preferably alanine or proline. The amino acid at position 582 is preferably a nonpolar amino acid, most preferably a small nonpolar amino acid and even more preferably alanine or threonine. The amino acid at position 637 is preferably arginine or isoleucine. The amino acid at position 725 is preferably glycine or arginine. Finally, the amino acid at position 742 is preferably valine or serine. Sequences encompassing the most preferred substitutions listed above at these positions in the complete Cry3lAa2 protein sequence (SEQ ID NO:2) and in the trypsin-activated Cry3lAa2 protein sequence (SEQ ID NO:8) starting after the arginine at position 250 are within the scope of the present invention and are designated herein as SEQ ID NOs:14 and 15, respectively.
[0090] Sequences encompassing all the possible substitutions to the cry3lAa2 gene nucleotide sequence, the crystal protein and the trypsin-activated crystal protein derived from the crystal protein of the Bacillus thuringiensis M15 deposited under no. IDAC010201-5 as described above are within the scope of the present invention.
[0089] The amino acid at position 251 is preferably a nonpolar amino acid, most preferably a large nonpolar amino acid and even more preferably isoleucine or methionine. The amino acid at position 389 is preferably a polar amino acid, most preferably a large polar amino acid and even more preferably lysine or arginine. The amino acid at position 444 is preferably serine or histidine. The amino acid at position 445 is preferably a polar amino acid, most preferably a small polar amino acid and even more preferably glycine or serine.
The amino acid at position 446 is preferably glycine or proline. The amino acid at position 466 is preferably a polar amino acid, most preferably a large polar amino acid and even more preferably glutamine or arginine. The amino acid at position 481 is preferably an amino acid of intermediate polarity and most preferably tyrosine or tryptophan. The amino acid at position 507 is preferably alanine or leucine. The amino acid at position 510 is preferably glycine or histidine. The amino acid at position 518 is preferably a nonpolar amino acid and is preferably alanine or valine. The amino acid at position 551 is preferably a nonpolar amino acid, most preferably a small nonpolar amino acid and even more preferably alanine or proline. The amino acid at position 582 is preferably a nonpolar amino acid, most preferably a small nonpolar amino acid and even more preferably alanine or threonine. The amino acid at position 637 is preferably arginine or isoleucine. The amino acid at position 725 is preferably glycine or arginine. Finally, the amino acid at position 742 is preferably valine or serine. Sequences encompassing the most preferred substitutions listed above at these positions in the complete Cry3lAa2 protein sequence (SEQ ID NO:2) and in the trypsin-activated Cry3lAa2 protein sequence (SEQ ID NO:8) starting after the arginine at position 250 are within the scope of the present invention and are designated herein as SEQ ID NOs:14 and 15, respectively.
[0090] Sequences encompassing all the possible substitutions to the cry3lAa2 gene nucleotide sequence, the crystal protein and the trypsin-activated crystal protein derived from the crystal protein of the Bacillus thuringiensis M15 deposited under no. IDAC010201-5 as described above are within the scope of the present invention.
[0091] Although the present invention has been described hereinabove by way of preferred embodiments thereof, it can be modified, without departing from the spirit and nature of the subject invention as defined in the appended claims.
REFERENCES
1. Aronson, A. (1993). The two faces of Bacillus thuringiensis: insecticidal proteins and post-exponential survival. Mol. Microbiol. 7, 489-496.
2. Baum, J. A., and Malvar, T. (1995). Regulation of insecticidal crystal protein production in Bacillus thuringiensis. Mol. Micro biol. 18, 1-12.
3. Behl, C., J. Davis, G. M. Cole, and D. Schubert. 1992. Vitamin E protects nerve cells from amyloid protein toxicity. Biochem. Biophys. Res. Commun.
186:944-950.
4. Beveridge, T. J., Popkin, T. J., and Cole, R. M. (1994). Electron microscopy. In "Methods for General and Molecular Bacteriology"
(Gerhardt, P., R. G. E. Murray, W. A. Wood, and N. R. Krieg, Eds.), pp.
42-71. Washington, D.C. Am. Soc. for Microbiol.
5. Birnboim, H. C., and Doly, J. (1979). A rapid alkaline extraction procedure for screening recombinant plasmid DNA. Nucleic Acids Res. 7, 1513-1523.
6. Crickmore, N. 2001. Bacillus thuringiensis toxin gene nomenclature. Web site http://epunix. biols. susx. ac. uk/Home/Neii_Crickmore/Bt/index.html.
7. Crickmore, N., Zeigler, D. R., Feitelson, J., Schnepf, E., Van Rie, J., Lereclus, D., Baum, J., and Dean, D. H. (1998). Revision of the nomenclature for the Bacillus thuringiensis pesticidal crystal proteins.
Microbiol. Mol. Biol. Rev. 62, 807-813. de Barjac, H., and Frachon, E.
(1990). Classification of Bacillus thuringiensis strains. Entomophaga 35, 233-240.
8. Feitelson, J. S., Payne, J., and Kim, L. (1992). Bacillus thuringiensis:
insects and beyond. Bio/Technology 10, 271-275.
9. Gill, S. S., Cowles, E. A., and Pietrantonio, P. V. (1992). The mode of action of B. thuringiensis endotoxins. Annu. Rev. Entomol. 37, 615-636.
10. Goodman, N. S., R. J. Gottfried, and M. H. Rogoff. 1967. Biphasic system for separation of spores and crystals of Bacillus thuringiensis. J. Bacteriol.
94:485 11.Haima, P., Van Sinderen, D., Schotting, H., Bron, S., and Venema, G.
(1990). Development of a P-galactosidase a-complementation system for molecular cloning in Bacillus subtilis. Gene 86, 63-69.
12. Heiss, P., S. Bernatz, G. Bruchelt, and R. Senekowitsch-Schmidtke. 1997.
Cytotoxic effect of immunoconjugate composed of glucose-oxidase coupled to an anti-ganglioside (GD2) antibody on spheroids. Anticancer Res.
17:3177-3178 13. Hofte, H., and Whiteley, H. R. (1989). Insecticidal crystal proteins of B.
thuringiensis. Micro biol. Rev. 53, 242-255.
14. Jung, Y. C., Kim, S. U., Cote, J. -C., Lecadet, M. -M., and Chung, Y. S., and Bok, S. H. (1998). Characterization of a new Bacillus thuringiensis subsp. higo strain isolated from rice bran in Korea. J. lnvertebr. Pathol.
71, 95-96.
15. Kim, H. S. (2000). Comparative study of the frequency, flagellar serotype, crystal shape, toxicity, and cry gene contents of Bacillus thuringiensis from three environments. Curr. Microbiol. 41, 250-256.
16.Laemmli, U. K., and Favre, M. (1973). Maturation of the head of acteriophage T4. J. Mol. Biol. 80, 575-599.
17.Lecadet, M. -M., Frachon, E., Cosmao Dumanoir, V. et at. (1999).
Updating the H-antigen classification of Bacillus thuringiensis. J. App!.
Microbiol. 86, 660-672.
18. Lee, D. W., Akao, T., Yamashita, S., Katayama, H., Maeda, M., Saitoh, H., Mizuki, E., and Ohba, M. (2000). Noninsecticidal parasporal proteins of a Bacillus thuringiensis serovar shandongiensis isolate exhibit a preferential cytotoxicity against human leukemic T cells. Biochem. Biophys. Res.
Commun. 272, 218-223.
19. Lereclus, D., Bourgouin, C., Lecadet, M. -M., Klier, A., and Rapoport, G.
(1989). Role, structure, and molecular organization of the genes coding for the parasporal 6-endotoxins of B. thuringiensis. In "Regulation of Procaryotic Development: Structural and Functional Analysis of Bacterial Sporulation and Germination" (Smith, I., R. A. Slepecky, and P. Setlow, Eds.), pp. 255-276, Washington, D.C. Am. Soc. for Microbiol.
20. McLaughlin, J. R., Murray, C. L., and Rabinowitz, J. C. (1981). Unique features in the ribosome binding site sequence of the Gram-positive Staphylococcus aureus 13-lactamase gene. J. Biol. Chem. 256, 11283-11291.
21.Mizuki, E., Ohba, M., Akao, T., Yamashita, S., Saitoh, H., and Park, Y. S.
(1999). Unique activity associated with non-insecticidal Bacillus thuringiensis parasporal inclusions: in vitro cell-killing action on human cancer cells. J. App!. Micro biol. 86, 477-486.
22. Mizuki, E., Park, Y. S., Saitoh, H., Yamashita, S., Akao, T., Higuchi, K., and Ohba, M. (2000). Parasporin, a human leukemic cell-recognizing parasporal protein of Bacillus thuringiensis. Clin. Diag. Lab. Immunol. 7, 625-634.
23. Moran, C. P., Lang, N., Jr., LeGrice, S. F. J., Lee, G., Stephens, M., Sonenshein, A. L., Pero, J., and Losick, R. (1982). Nucleotide sequences that signal the initiation of transcription and translation in Bacillus subtilis.
Mol Gen. Gene. 186, 339-346.
24.Ohba, M. (1996). Bacillus thuringiensis populations naturally occuring on mulberry leaves: a possible source of the populations associated with silkworm-rearing insectaries. J. Appl. Bacteriol. 80, 56-64.
25.Ohba, M., and Aizawa, K. (1986). Insect toxicity of Bacillus thuringiensis isolated from soils of Japan. J. Invertebr. Pathol. 47, 12-20.
26. Roh, J. Y., Park, H. W., Jin, B. R., Kim, H. S., Yu, Y. M., and Kang, S.
K.
(1996). Characterization of novel non-toxic Bacillus thuringiensis isolates from Korea. Lett. Appl. Microbiol. 23, 249-252.
27. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989). "Molecular cloning.
A Laboratory manual," 2nd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York.
28.Saitoh, H., K. Higuchi, E. Mizuki, and M. Ohba. (1998a). Larvicidal toxicity of Japanese Bacillus thuringiensis against the mosquito Anopheles stephensi.
Med. Vet. Entomol. 12:98-102.
29.Saitoh, H., K. Higuchi, E. Mizuki, S.-H. Hwang, and M. Ohba. (1998b).
Characterization of mosquito larvicidal parasporal inclusions of a Bacillus thuringiensis serovar higo strain. J. Appl. Microbiol. 84:883-888 30.Schnepf, H. E., Wong, H. C., and Whiteley, H. R. (1985). The amino acid sequence of a crystal protein from Bacillus thuringiensis deduced from the DNA base sequence. J. Biol. Chem. 260, 6264-6272.
31. Southern, E. (1975). Detection of specific sequences among DNA
fragments separated by gel electrophoresis. J. Mol. Biol. 98, 503-517.
32.Stahly, D. P., Dingman, D. W., Bulla, L. A., Jr., and Aronson, A. I.
(1978).
Possible origin and function of the parasporal crystal in Bacillus thuringiensis. Biochem. Biophys. Res. Commun. 84, 581-588.
33. Thomas, W. E., and Ellar, D. J. (1983). B. thuringiensis var. israelensis crystal S-endotoxin: effects on insect and mammalian cells in vitro and in vivo. J. Cell. Sci. 60, 181-197.
34.Tinoco, I., Jr., Borer, P. N., Dengler, B., Levine, M. D., Uhlenbeck, 0.
C., Crothers, D. M., and Gralla, J. (1973). Improved estimation of secondary structure in ribonucleic acids. Nature New Biol. 246, 40-41.
REFERENCES
1. Aronson, A. (1993). The two faces of Bacillus thuringiensis: insecticidal proteins and post-exponential survival. Mol. Microbiol. 7, 489-496.
2. Baum, J. A., and Malvar, T. (1995). Regulation of insecticidal crystal protein production in Bacillus thuringiensis. Mol. Micro biol. 18, 1-12.
3. Behl, C., J. Davis, G. M. Cole, and D. Schubert. 1992. Vitamin E protects nerve cells from amyloid protein toxicity. Biochem. Biophys. Res. Commun.
186:944-950.
4. Beveridge, T. J., Popkin, T. J., and Cole, R. M. (1994). Electron microscopy. In "Methods for General and Molecular Bacteriology"
(Gerhardt, P., R. G. E. Murray, W. A. Wood, and N. R. Krieg, Eds.), pp.
42-71. Washington, D.C. Am. Soc. for Microbiol.
5. Birnboim, H. C., and Doly, J. (1979). A rapid alkaline extraction procedure for screening recombinant plasmid DNA. Nucleic Acids Res. 7, 1513-1523.
6. Crickmore, N. 2001. Bacillus thuringiensis toxin gene nomenclature. Web site http://epunix. biols. susx. ac. uk/Home/Neii_Crickmore/Bt/index.html.
7. Crickmore, N., Zeigler, D. R., Feitelson, J., Schnepf, E., Van Rie, J., Lereclus, D., Baum, J., and Dean, D. H. (1998). Revision of the nomenclature for the Bacillus thuringiensis pesticidal crystal proteins.
Microbiol. Mol. Biol. Rev. 62, 807-813. de Barjac, H., and Frachon, E.
(1990). Classification of Bacillus thuringiensis strains. Entomophaga 35, 233-240.
8. Feitelson, J. S., Payne, J., and Kim, L. (1992). Bacillus thuringiensis:
insects and beyond. Bio/Technology 10, 271-275.
9. Gill, S. S., Cowles, E. A., and Pietrantonio, P. V. (1992). The mode of action of B. thuringiensis endotoxins. Annu. Rev. Entomol. 37, 615-636.
10. Goodman, N. S., R. J. Gottfried, and M. H. Rogoff. 1967. Biphasic system for separation of spores and crystals of Bacillus thuringiensis. J. Bacteriol.
94:485 11.Haima, P., Van Sinderen, D., Schotting, H., Bron, S., and Venema, G.
(1990). Development of a P-galactosidase a-complementation system for molecular cloning in Bacillus subtilis. Gene 86, 63-69.
12. Heiss, P., S. Bernatz, G. Bruchelt, and R. Senekowitsch-Schmidtke. 1997.
Cytotoxic effect of immunoconjugate composed of glucose-oxidase coupled to an anti-ganglioside (GD2) antibody on spheroids. Anticancer Res.
17:3177-3178 13. Hofte, H., and Whiteley, H. R. (1989). Insecticidal crystal proteins of B.
thuringiensis. Micro biol. Rev. 53, 242-255.
14. Jung, Y. C., Kim, S. U., Cote, J. -C., Lecadet, M. -M., and Chung, Y. S., and Bok, S. H. (1998). Characterization of a new Bacillus thuringiensis subsp. higo strain isolated from rice bran in Korea. J. lnvertebr. Pathol.
71, 95-96.
15. Kim, H. S. (2000). Comparative study of the frequency, flagellar serotype, crystal shape, toxicity, and cry gene contents of Bacillus thuringiensis from three environments. Curr. Microbiol. 41, 250-256.
16.Laemmli, U. K., and Favre, M. (1973). Maturation of the head of acteriophage T4. J. Mol. Biol. 80, 575-599.
17.Lecadet, M. -M., Frachon, E., Cosmao Dumanoir, V. et at. (1999).
Updating the H-antigen classification of Bacillus thuringiensis. J. App!.
Microbiol. 86, 660-672.
18. Lee, D. W., Akao, T., Yamashita, S., Katayama, H., Maeda, M., Saitoh, H., Mizuki, E., and Ohba, M. (2000). Noninsecticidal parasporal proteins of a Bacillus thuringiensis serovar shandongiensis isolate exhibit a preferential cytotoxicity against human leukemic T cells. Biochem. Biophys. Res.
Commun. 272, 218-223.
19. Lereclus, D., Bourgouin, C., Lecadet, M. -M., Klier, A., and Rapoport, G.
(1989). Role, structure, and molecular organization of the genes coding for the parasporal 6-endotoxins of B. thuringiensis. In "Regulation of Procaryotic Development: Structural and Functional Analysis of Bacterial Sporulation and Germination" (Smith, I., R. A. Slepecky, and P. Setlow, Eds.), pp. 255-276, Washington, D.C. Am. Soc. for Microbiol.
20. McLaughlin, J. R., Murray, C. L., and Rabinowitz, J. C. (1981). Unique features in the ribosome binding site sequence of the Gram-positive Staphylococcus aureus 13-lactamase gene. J. Biol. Chem. 256, 11283-11291.
21.Mizuki, E., Ohba, M., Akao, T., Yamashita, S., Saitoh, H., and Park, Y. S.
(1999). Unique activity associated with non-insecticidal Bacillus thuringiensis parasporal inclusions: in vitro cell-killing action on human cancer cells. J. App!. Micro biol. 86, 477-486.
22. Mizuki, E., Park, Y. S., Saitoh, H., Yamashita, S., Akao, T., Higuchi, K., and Ohba, M. (2000). Parasporin, a human leukemic cell-recognizing parasporal protein of Bacillus thuringiensis. Clin. Diag. Lab. Immunol. 7, 625-634.
23. Moran, C. P., Lang, N., Jr., LeGrice, S. F. J., Lee, G., Stephens, M., Sonenshein, A. L., Pero, J., and Losick, R. (1982). Nucleotide sequences that signal the initiation of transcription and translation in Bacillus subtilis.
Mol Gen. Gene. 186, 339-346.
24.Ohba, M. (1996). Bacillus thuringiensis populations naturally occuring on mulberry leaves: a possible source of the populations associated with silkworm-rearing insectaries. J. Appl. Bacteriol. 80, 56-64.
25.Ohba, M., and Aizawa, K. (1986). Insect toxicity of Bacillus thuringiensis isolated from soils of Japan. J. Invertebr. Pathol. 47, 12-20.
26. Roh, J. Y., Park, H. W., Jin, B. R., Kim, H. S., Yu, Y. M., and Kang, S.
K.
(1996). Characterization of novel non-toxic Bacillus thuringiensis isolates from Korea. Lett. Appl. Microbiol. 23, 249-252.
27. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989). "Molecular cloning.
A Laboratory manual," 2nd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York.
28.Saitoh, H., K. Higuchi, E. Mizuki, and M. Ohba. (1998a). Larvicidal toxicity of Japanese Bacillus thuringiensis against the mosquito Anopheles stephensi.
Med. Vet. Entomol. 12:98-102.
29.Saitoh, H., K. Higuchi, E. Mizuki, S.-H. Hwang, and M. Ohba. (1998b).
Characterization of mosquito larvicidal parasporal inclusions of a Bacillus thuringiensis serovar higo strain. J. Appl. Microbiol. 84:883-888 30.Schnepf, H. E., Wong, H. C., and Whiteley, H. R. (1985). The amino acid sequence of a crystal protein from Bacillus thuringiensis deduced from the DNA base sequence. J. Biol. Chem. 260, 6264-6272.
31. Southern, E. (1975). Detection of specific sequences among DNA
fragments separated by gel electrophoresis. J. Mol. Biol. 98, 503-517.
32.Stahly, D. P., Dingman, D. W., Bulla, L. A., Jr., and Aronson, A. I.
(1978).
Possible origin and function of the parasporal crystal in Bacillus thuringiensis. Biochem. Biophys. Res. Commun. 84, 581-588.
33. Thomas, W. E., and Ellar, D. J. (1983). B. thuringiensis var. israelensis crystal S-endotoxin: effects on insect and mammalian cells in vitro and in vivo. J. Cell. Sci. 60, 181-197.
34.Tinoco, I., Jr., Borer, P. N., Dengler, B., Levine, M. D., Uhlenbeck, 0.
C., Crothers, D. M., and Gralla, J. (1973). Improved estimation of secondary structure in ribonucleic acids. Nature New Biol. 246, 40-41.
35.Vehmaanpera, J. (1989). Transformation of Bacillus amyloliquefaciens by electroporation. FEMS Microbiol. Lett. 61, 165-170.
36. Jung, Y.-C. (2001) Bacillus thuringiensis Strain M15, a Novel Autoagglutinable, Non-Serotypeable Strain-Cloning and Characterization of a Novel cry31A-type Crystal Protein Gene, cry3lAa2, and Two New Insertion Sequences, IS231M and -N. Departement de sciences biologiques, Faculte des arts et des sciences, Universite de Montreal.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: AGRICULTURE AGROALIMENTAIRE CANADA
(ii) TITLE OF INVENTION: A Novel Bacillus Thuringiensis Strain, Crystal Gene and Crystal Protein and Uses Thereof (iii) NUMBER OF SEQUENCES: 18 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Goudreau Gage Dubuc (B) STREET: Stock Exchange Tower suite 3400, 800 Place Victoria (C) CITY: Montreal (D) STATE: Quebec (E) COUNTRY: Canada (F) ZIP: H4Z 1E9 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: CA 2,410,153 (B) FILING DATE: 05-DEC-2002 (C) CLASSIFICATION: C12N-15/31 (viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Gauvreau, Julie (B) REGISTRATION NUMBER: 11,127 (C) REFERENCE/DOCKET NUMBER: 12292.5 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 514 397-7602 (B) TELEFAX: 514 397-4382 (2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2229 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 742 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
Met Asp Pro Phe Ser Asn Tyr Ser Glu Gln Lys Tyr Pro Asp Ser Asn Asn Asn Gln Glu Leu Ile Thr Glu Ser Ser Ser Phe Tyr Ser Asp Thr Thr Asn Glu Asn Met Lys Thr Tyr His Pro Ile Glu Gln Asp Ile Leu Lys Phe Ala Asn Gln Glu Phe Pro Asp Asn Tyr Tyr Gln His Ser Asp Val Ser Asn Ser Tyr Gln Asn Met Lys Thr Glu Ile Val Asn Thr Asp Leu Pro Tyr Asn Thr Asn Asn Ile Asn Ser Met Arg Asn Thr Leu Cys Arg Asp Leu Pro Pro Glu Thr Asn Met Ser Ile Tyr Asp Asn Leu Arg Ser Thr Val Thr Val Pro Ser Phe Ser Asn Gln Phe Asp Pro Ile Lys Phe Leu His Asp Ile Glu Ile Ala Ile Glu Thr Gly Ser Phe Ser Ala Leu Thr Gln Ser Asn Met Asn Gln Gly Gly Thr Asp Ile Ala Pro Met Leu Ile Ser Thr Phe Phe Lys Val Ala Gly Ser Leu Leu Pro Phe Pro Leu Ser Ser Leu Gly Ala Leu Ala Ser Phe Tyr Val Thr Asp Ser Gln Thr Gly Ala Met Ala Asn Leu Trp Arg Gln Met Val Asp Tyr Val Glu Lys Arg Ile Asp Ser Lys Ile Leu Asp Tyr His Asn Phe Ile Met Gly Ala Glu Leu Ala Ala Leu Asn Ala Ser Leu Lys Glu Tyr Ala Arg Val Val Lys Ile Phe Glu Asn Asp Met Asn Arg Ile Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Lys Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Ser Gly Pro Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Gln Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Trp Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Ala Ala Trp His Met Trp Ser Ser Asp Pro Ser Ala Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Ala Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Ala Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Arg Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Gly Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Val (2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
Met Asp Pro Phe Ser Asn Tyr Ser Glu Gln Lys Tyr Pro Asp Ser Asn Asn Asn Gln Glu (2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
Glu Gln Lys Tyr Pro Asp (2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (ix) FEATURE:
(A) NAME/KEY: misc feature (B) LOCATION: 15 (D) OTHER INFORMATION: /note= "N IS A, C, G, OR T"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
(2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
(2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: RNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus subtilis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
(2) INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 492 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
Ile Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Lys Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Ser Gly Pro Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Gln Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Trp Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Ala Ala Trp His Met Trp Ser Ser Asp Pro Ser Ala Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Ala Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Ala Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Arg Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Gly Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Val (2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1479 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
Ser Tyr Gln Asn Met Lys Thr Glu Ile Val Asn Thr Asp Leu Pro Tyr Asn Thr Asn (2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 57 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 742 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 24 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 37 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 39 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 51 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 56 (D) OTHER INFORMATION: /note= "Xaa=anyt amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 59 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 87 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 97 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 138 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 158 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 170 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 251 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 389 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 444 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 445 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 446 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 466 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE :
(A) NAME/KEY: Modified-site (B) LOCATION: 481 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 507 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 510 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 518 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 551 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 582 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 637 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 725 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 742 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
Met Asp Pro Phe Ser Asn Tyr Ser Glu Gln Lys Tyr Pro Asp Ser Asn Asn Asn Gln Glu Leu Ile Thr Xaa Ser Ser Ser Phe Tyr Ser Asp Thr Thr Asn Glu Asn Xaa Lys Xaa Tyr His Pro Ile Glu Gln Asp Ile Leu Lys Phe Xaa Asn Gln Glu Phe Xaa Asp Asn Xaa Tyr Gln His Ser Asp Val Ser Asn Ser Tyr Gln Asn Met Lys Thr Glu Ile Val Asn Thr Asp Leu Pro Tyr Asn Thr Asn Xaa Ile Asn Ser Met Arg Asn Thr Leu Cys Xaa Asp Leu Pro Pro Glu Thr Asn Met Ser Ile Tyr Asp Asn Leu Arg Ser Thr Val Thr Val Pro Ser Phe Ser Asn Gln Phe Asp Pro Ile Lys Phe Leu His Asp Ile Glu Ile Ala Ile Xaa Thr Gly Ser Phe Ser Ala Leu Thr Gln Ser Asn Met Asn Gln Gly Gly Thr Asp Ile Xaa Pro Met Leu Ile Ser Thr Phe Phe Lys Val Ala Xaa Ser Leu Leu Pro Phe Pro Leu Ser Ser Leu Gly Ala Leu Ala Ser Phe Tyr Val Thr Asp Ser Gln Thr Gly Ala Met Ala Asn Leu Trp Arg Gln Met Val Asp Tyr Val Glu Lys Arg Ile Asp Ser Lys Ile Leu Asp Tyr His Asn Phe Ile Met Gly Ala Glu Leu Ala Ala Leu Asn Ala Ser Leu Lys Glu Tyr Ala Arg Val Val Lys Ile Phe Glu Asn Asp Met Asn Arg Xaa Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Xaa Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Xaa Xaa Xaa Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Xaa Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Xaa Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Xaa Ala Trp Xaa Met Trp Ser Ser Asp Pro Ser Xaa Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Xaa Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Xaa Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Xaa Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Xaa Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Xaa (2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 492 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 1 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 139 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 194 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 195..196 (D) OTHER INFORMATION: /note= "Xaa can be any naturally occurring amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 216 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 231 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 257 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 260 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 268 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 301 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 332 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 387 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 475 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 492 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
Xaa Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Xaa Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Xaa Xaa Xaa Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Xaa Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Xaa Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Xaa Ala Trp Xaa Met Trp Ser Ser Asp Pro Ser Xaa Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Xaa Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Xaa Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Xaa Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Xaa Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Xaa (2) INFORMATION FOR SEQ ID NO:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 742 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 24 (D) OTHER INFORMATION: /note= "Xaa=glutamate or lysine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 37 (D) OTHER INFORMATION: /note= "Xaa= methionine or alanine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 39 (D) OTHER INFORMATION: /note= "Xaa=threonine or asparagine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 51 (D) OTHER INFORMATION: /note= "Xaa=alanine or threonine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 56 (D) OTHER INFORMATION: /note= "Xaa= proline or serine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 59 (D) OTHER INFORMATION: /note= "Xaa=tyrosine or tryptophan"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 87 (D) OTHER INFORMATION: /note= "Xaa= asparagine or aspartate"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 97 (D) OTHER INFORMATION: /note= "Xaa=arginine or lysine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 138 (D) OTHER INFORMATION: /note= "Xaa= glutamate or lysine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 158 (D) OTHER INFORMATION: /note= "Xaa= alanine or asparagine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 170 (D) OTHER INFORMATION: /note= "Xaa=glycine or serine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 251 (D) OTHER INFORMATION: /note= "Xaa= isoleucine or methionine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 389 (D) OTHER INFORMATION: /note= "Xaa= lysine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 444 (D) OTHER INFORMATION: /note= "Xaa= serine or histidine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 445 (D) OTHER INFORMATION: /note= "Xaa= glycine or serine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 446 (D) OTHER INFORMATION: /note= "Xaa= glycine or proline"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 466 (D) OTHER INFORMATION: /note= "Xaa= glutamine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 481 (D) OTHER INFORMATION: /note= "Xaa= tyrosine or tryptophan"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 507 (D) OTHER INFORMATION: /note= "Xaa= alanine or leucine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 510 (D) OTHER INFORMATION: /note= "Xaa= glycine or histidine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 518 (D) OTHER INFORMATION: /note= "Xaa= alanine or valine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 551 (D) OTHER INFORMATION: /note= "Xaa= alanine or proline"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 582 (D) OTHER INFORMATION: /note= "Xaa= alanine or threonine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 637 (D) OTHER INFORMATION: /note= "Xaa= arginine or isoleucine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 725 (D) OTHER INFORMATION: /note= "Xaa= glycine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 742 (D) OTHER INFORMATION: /note= "Xaa= valine or serine"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
Met Asp Pro Phe Ser Asn Tyr Ser Glu Gln Lys Tyr Pro Asp Ser Asn Asn Asn Gln Glu Leu Ile Thr Xaa Ser Ser Ser Phe Tyr Ser Asp Thr Thr Asn Glu Asn Xaa Lys Xaa Tyr His Pro Ile Glu Gln Asp Ile Leu Lys Phe Xaa Asn Gln Glu Phe Xaa Asp Asn Xaa Tyr Gln His Ser Asp Val Ser Asn Ser Tyr Gln Asn Met Lys Thr Glu Ile Val Asn Thr Asp Leu Pro Tyr Asn Thr Asn Xaa Ile Asn Ser Met Arg Asn Thr Leu Cys Xaa Asp Leu Pro Pro Glu Thr Asn Met Ser Ile Tyr Asp Asn Leu Arg Ser Thr Val Thr Val Pro Ser Phe Ser Asn Gln Phe Asp Pro Ile Lys Phe Leu His Asp Ile Glu Ile Ala Ile Xaa Thr Gly Ser Phe Ser Ala Leu Thr Gln Ser Asn Met Asn Gln Gly Gly Thr Asp Ile Xaa Pro Met Leu Ile Ser Thr Phe Phe Lys Val Ala Xaa Ser Leu Leu Pro Phe Pro Leu Ser Ser Leu Gly Ala Leu Ala Ser Phe Tyr Val Thr Asp Ser Gln Thr Gly Ala Met Ala Asn Leu Trp Arg Gln Met Val Asp Tyr Val Glu Lys Arg Ile Asp Ser Lys Ile Leu Asp Tyr His Asn Phe Ile Met Gly Ala Glu Leu Ala Ala Leu Asn Ala Ser Leu Lys Glu Tyr Ala Arg Val Val Lys Ile Phe Glu Asn Asp Met Asn Arg Xaa Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Xaa Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Xaa Xaa Xaa Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Xaa Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Xaa Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Xaa Ala Trp Xaa Met Trp Ser Ser Asp Pro Ser Xaa Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Xaa Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Xaa Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Xaa Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Xaa Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Xaa (2) INFORMATION FOR SEQ ID NO:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 492 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 1 (D) OTHER INFORMATION: /note= "Xaa= isoleucine or methionine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 139 (D) OTHER INFORMATION: /note= "Xaa= lysine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 194 (D) OTHER INFORMATION: /note= "Xaa= serine or histidine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 195 (D) OTHER INFORMATION: /note= "Xaa= glycine or serine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 196 (D) OTHER INFORMATION: /note= "Xaa= glycine or proline"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 216 (D) OTHER INFORMATION: /note= "Xaa= glutamine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 231 (D) OTHER INFORMATION: /note= "Xaa= tyrosine or tryptophan"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 257 (D) OTHER INFORMATION: /note= "Xaa= alanine or leucine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 260 (D) OTHER INFORMATION: /note= "Xaa= glycine or histidine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 268 (D) OTHER INFORMATION: /note= "xaa= alanine or valine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 301 (D) OTHER INFORMATION: /note= "Xaa= alanine or proline"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 332 (D) OTHER INFORMATION: /note= "Xaa= alanine or threonine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 387 (D) OTHER INFORMATION: /note= "Xaa= arginine or isoleucine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 475 (D) OTHER INFORMATION: /note= "Xaa= glycine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 492 (D) OTHER INFORMATION: /note= "Xaa= valine or serine"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
Xaa Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Xaa Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Xaa Xaa Xaa Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Xaa Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Xaa Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Xaa Ala Trp Xaa Met Trp Ser Ser Asp Pro Ser Xaa Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Xaa Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Xaa Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Xaa Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Xaa Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Xaa (2) INFORMATION FOR SEQ ID NO:16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3313 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
GCATTCCAAT TATAGGGATG CTCTTTTTTT GATIfTCTGGC CTATCCTTCT CATTTCATAG 2940 (2) INFORMATION FOR SEQ ID NO:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2172 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:
(2) INFORMATION FOR SEQ ID NO:18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 723 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
Met Asp Pro Phe Ser Asn Tyr Ser Glu Gln Lys Tyr Pro Asp Ser Asn Asn Asn Gln Glu Leu Ile Thr Lys Ser Ser Ser Phe Tyr Ser Asp Thr Thr Asn Glu Asn Ala Lys Asn Tyr His Pro Ile Glu Gln Asp Ile Leu Lys Phe Thr Asn Gln Glu Phe Ser Asp Asn His Tyr Gln His Ser Asp Val Ser Asn Asp Ile Asn Ser Met Arg Asn Thr Leu Cys Lys Asp Leu Pro Pro Glu Thr Asn Met Ser Ile Tyr Asp Asn Leu Arg Ser Thr Val Thr Val Pro Ser Phe Ser Asn Gln Phe Asp Pro Ile Lys Phe Leu His Asp Ile Glu Ile Ala Ile Gln Thr Gly Ser Phe Ser Ala Leu Thr Gln Ser Asn Met Asn Gin Gly Gly Thr Asp Ile Asn Pro Met Leu Ile Ser Thr Phe Phe Lys Val Ala Ser Ser Leu Leu Pro Phe Pro Leu Ser Ser Leu Gly Ala Leu Ala Ser Phe Tyr Val Thr Asp Ser Gln Thr Gly Ala Met Ala Asn Leu Trp Arg Gln Met Val Asp Tyr Val Glu Lys Arg Ile Asp Ser Lys Ile Leu Asp Tyr His Asn Phe Ile Met Gly Ala Glu Leu Ala Ala Leu Asn Ala Ser Leu Lys Glu Tyr Ala Arg Val Val Lys Ile Phe Glu Asn Asp Met Asn Arg Met Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Arg Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp His Ser Gly Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Arg Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Tyr Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Leu Ala Trp Gly Met Trp Ser Ser Asp Pro Ser Val Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Pro Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Thr Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Ile Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Arg Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Ser
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: AGRICULTURE AGROALIMENTAIRE CANADA
(ii) TITLE OF INVENTION: A Novel Bacillus Thuringiensis Strain, Crystal Gene and Crystal Protein and Uses Thereof (iii) NUMBER OF SEQUENCES: 18 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Goudreau Gage Dubuc (B) STREET: Stock Exchange Tower suite 3400, 800 Place Victoria (C) CITY: Montreal (D) STATE: Quebec (E) COUNTRY: Canada (F) ZIP: H4Z 1E9 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: CA 2,410,153 (B) FILING DATE: 05-DEC-2002 (C) CLASSIFICATION: C12N-15/31 (viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Gauvreau, Julie (B) REGISTRATION NUMBER: 11,127 (C) REFERENCE/DOCKET NUMBER: 12292.5 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 514 397-7602 (B) TELEFAX: 514 397-4382 (2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2229 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 742 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
Met Asp Pro Phe Ser Asn Tyr Ser Glu Gln Lys Tyr Pro Asp Ser Asn Asn Asn Gln Glu Leu Ile Thr Glu Ser Ser Ser Phe Tyr Ser Asp Thr Thr Asn Glu Asn Met Lys Thr Tyr His Pro Ile Glu Gln Asp Ile Leu Lys Phe Ala Asn Gln Glu Phe Pro Asp Asn Tyr Tyr Gln His Ser Asp Val Ser Asn Ser Tyr Gln Asn Met Lys Thr Glu Ile Val Asn Thr Asp Leu Pro Tyr Asn Thr Asn Asn Ile Asn Ser Met Arg Asn Thr Leu Cys Arg Asp Leu Pro Pro Glu Thr Asn Met Ser Ile Tyr Asp Asn Leu Arg Ser Thr Val Thr Val Pro Ser Phe Ser Asn Gln Phe Asp Pro Ile Lys Phe Leu His Asp Ile Glu Ile Ala Ile Glu Thr Gly Ser Phe Ser Ala Leu Thr Gln Ser Asn Met Asn Gln Gly Gly Thr Asp Ile Ala Pro Met Leu Ile Ser Thr Phe Phe Lys Val Ala Gly Ser Leu Leu Pro Phe Pro Leu Ser Ser Leu Gly Ala Leu Ala Ser Phe Tyr Val Thr Asp Ser Gln Thr Gly Ala Met Ala Asn Leu Trp Arg Gln Met Val Asp Tyr Val Glu Lys Arg Ile Asp Ser Lys Ile Leu Asp Tyr His Asn Phe Ile Met Gly Ala Glu Leu Ala Ala Leu Asn Ala Ser Leu Lys Glu Tyr Ala Arg Val Val Lys Ile Phe Glu Asn Asp Met Asn Arg Ile Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Lys Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Ser Gly Pro Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Gln Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Trp Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Ala Ala Trp His Met Trp Ser Ser Asp Pro Ser Ala Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Ala Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Ala Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Arg Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Gly Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Val (2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
Met Asp Pro Phe Ser Asn Tyr Ser Glu Gln Lys Tyr Pro Asp Ser Asn Asn Asn Gln Glu (2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
Glu Gln Lys Tyr Pro Asp (2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (ix) FEATURE:
(A) NAME/KEY: misc feature (B) LOCATION: 15 (D) OTHER INFORMATION: /note= "N IS A, C, G, OR T"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
(2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
(2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: RNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus subtilis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
(2) INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 492 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
Ile Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Lys Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Ser Gly Pro Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Gln Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Trp Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Ala Ala Trp His Met Trp Ser Ser Asp Pro Ser Ala Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Ala Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Ala Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Arg Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Gly Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Val (2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1479 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
Ser Tyr Gln Asn Met Lys Thr Glu Ile Val Asn Thr Asp Leu Pro Tyr Asn Thr Asn (2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 57 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 742 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 24 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 37 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 39 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 51 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 56 (D) OTHER INFORMATION: /note= "Xaa=anyt amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 59 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 87 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 97 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 138 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 158 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 170 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 251 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 389 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 444 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 445 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 446 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 466 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE :
(A) NAME/KEY: Modified-site (B) LOCATION: 481 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 507 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 510 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 518 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 551 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 582 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 637 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 725 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 742 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
Met Asp Pro Phe Ser Asn Tyr Ser Glu Gln Lys Tyr Pro Asp Ser Asn Asn Asn Gln Glu Leu Ile Thr Xaa Ser Ser Ser Phe Tyr Ser Asp Thr Thr Asn Glu Asn Xaa Lys Xaa Tyr His Pro Ile Glu Gln Asp Ile Leu Lys Phe Xaa Asn Gln Glu Phe Xaa Asp Asn Xaa Tyr Gln His Ser Asp Val Ser Asn Ser Tyr Gln Asn Met Lys Thr Glu Ile Val Asn Thr Asp Leu Pro Tyr Asn Thr Asn Xaa Ile Asn Ser Met Arg Asn Thr Leu Cys Xaa Asp Leu Pro Pro Glu Thr Asn Met Ser Ile Tyr Asp Asn Leu Arg Ser Thr Val Thr Val Pro Ser Phe Ser Asn Gln Phe Asp Pro Ile Lys Phe Leu His Asp Ile Glu Ile Ala Ile Xaa Thr Gly Ser Phe Ser Ala Leu Thr Gln Ser Asn Met Asn Gln Gly Gly Thr Asp Ile Xaa Pro Met Leu Ile Ser Thr Phe Phe Lys Val Ala Xaa Ser Leu Leu Pro Phe Pro Leu Ser Ser Leu Gly Ala Leu Ala Ser Phe Tyr Val Thr Asp Ser Gln Thr Gly Ala Met Ala Asn Leu Trp Arg Gln Met Val Asp Tyr Val Glu Lys Arg Ile Asp Ser Lys Ile Leu Asp Tyr His Asn Phe Ile Met Gly Ala Glu Leu Ala Ala Leu Asn Ala Ser Leu Lys Glu Tyr Ala Arg Val Val Lys Ile Phe Glu Asn Asp Met Asn Arg Xaa Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Xaa Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Xaa Xaa Xaa Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Xaa Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Xaa Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Xaa Ala Trp Xaa Met Trp Ser Ser Asp Pro Ser Xaa Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Xaa Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Xaa Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Xaa Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Xaa Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Xaa (2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 492 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 1 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 139 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 194 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 195..196 (D) OTHER INFORMATION: /note= "Xaa can be any naturally occurring amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 216 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 231 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 257 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 260 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 268 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 301 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 332 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 387 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 475 (D) OTHER INFORMATION: /note= "Xaa= any amino acid"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 492 (D) OTHER INFORMATION: /note= "Xaa=any amino acid"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
Xaa Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Xaa Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Xaa Xaa Xaa Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Xaa Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Xaa Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Xaa Ala Trp Xaa Met Trp Ser Ser Asp Pro Ser Xaa Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Xaa Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Xaa Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Xaa Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Xaa Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Xaa (2) INFORMATION FOR SEQ ID NO:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 742 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 24 (D) OTHER INFORMATION: /note= "Xaa=glutamate or lysine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 37 (D) OTHER INFORMATION: /note= "Xaa= methionine or alanine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 39 (D) OTHER INFORMATION: /note= "Xaa=threonine or asparagine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 51 (D) OTHER INFORMATION: /note= "Xaa=alanine or threonine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 56 (D) OTHER INFORMATION: /note= "Xaa= proline or serine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 59 (D) OTHER INFORMATION: /note= "Xaa=tyrosine or tryptophan"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 87 (D) OTHER INFORMATION: /note= "Xaa= asparagine or aspartate"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 97 (D) OTHER INFORMATION: /note= "Xaa=arginine or lysine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 138 (D) OTHER INFORMATION: /note= "Xaa= glutamate or lysine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 158 (D) OTHER INFORMATION: /note= "Xaa= alanine or asparagine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 170 (D) OTHER INFORMATION: /note= "Xaa=glycine or serine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 251 (D) OTHER INFORMATION: /note= "Xaa= isoleucine or methionine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 389 (D) OTHER INFORMATION: /note= "Xaa= lysine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 444 (D) OTHER INFORMATION: /note= "Xaa= serine or histidine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 445 (D) OTHER INFORMATION: /note= "Xaa= glycine or serine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 446 (D) OTHER INFORMATION: /note= "Xaa= glycine or proline"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 466 (D) OTHER INFORMATION: /note= "Xaa= glutamine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 481 (D) OTHER INFORMATION: /note= "Xaa= tyrosine or tryptophan"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 507 (D) OTHER INFORMATION: /note= "Xaa= alanine or leucine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 510 (D) OTHER INFORMATION: /note= "Xaa= glycine or histidine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 518 (D) OTHER INFORMATION: /note= "Xaa= alanine or valine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 551 (D) OTHER INFORMATION: /note= "Xaa= alanine or proline"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 582 (D) OTHER INFORMATION: /note= "Xaa= alanine or threonine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 637 (D) OTHER INFORMATION: /note= "Xaa= arginine or isoleucine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 725 (D) OTHER INFORMATION: /note= "Xaa= glycine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 742 (D) OTHER INFORMATION: /note= "Xaa= valine or serine"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
Met Asp Pro Phe Ser Asn Tyr Ser Glu Gln Lys Tyr Pro Asp Ser Asn Asn Asn Gln Glu Leu Ile Thr Xaa Ser Ser Ser Phe Tyr Ser Asp Thr Thr Asn Glu Asn Xaa Lys Xaa Tyr His Pro Ile Glu Gln Asp Ile Leu Lys Phe Xaa Asn Gln Glu Phe Xaa Asp Asn Xaa Tyr Gln His Ser Asp Val Ser Asn Ser Tyr Gln Asn Met Lys Thr Glu Ile Val Asn Thr Asp Leu Pro Tyr Asn Thr Asn Xaa Ile Asn Ser Met Arg Asn Thr Leu Cys Xaa Asp Leu Pro Pro Glu Thr Asn Met Ser Ile Tyr Asp Asn Leu Arg Ser Thr Val Thr Val Pro Ser Phe Ser Asn Gln Phe Asp Pro Ile Lys Phe Leu His Asp Ile Glu Ile Ala Ile Xaa Thr Gly Ser Phe Ser Ala Leu Thr Gln Ser Asn Met Asn Gln Gly Gly Thr Asp Ile Xaa Pro Met Leu Ile Ser Thr Phe Phe Lys Val Ala Xaa Ser Leu Leu Pro Phe Pro Leu Ser Ser Leu Gly Ala Leu Ala Ser Phe Tyr Val Thr Asp Ser Gln Thr Gly Ala Met Ala Asn Leu Trp Arg Gln Met Val Asp Tyr Val Glu Lys Arg Ile Asp Ser Lys Ile Leu Asp Tyr His Asn Phe Ile Met Gly Ala Glu Leu Ala Ala Leu Asn Ala Ser Leu Lys Glu Tyr Ala Arg Val Val Lys Ile Phe Glu Asn Asp Met Asn Arg Xaa Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Xaa Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Xaa Xaa Xaa Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Xaa Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Xaa Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Xaa Ala Trp Xaa Met Trp Ser Ser Asp Pro Ser Xaa Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Xaa Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Xaa Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Xaa Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Xaa Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Xaa (2) INFORMATION FOR SEQ ID NO:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 492 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 1 (D) OTHER INFORMATION: /note= "Xaa= isoleucine or methionine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 139 (D) OTHER INFORMATION: /note= "Xaa= lysine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 194 (D) OTHER INFORMATION: /note= "Xaa= serine or histidine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 195 (D) OTHER INFORMATION: /note= "Xaa= glycine or serine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 196 (D) OTHER INFORMATION: /note= "Xaa= glycine or proline"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 216 (D) OTHER INFORMATION: /note= "Xaa= glutamine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 231 (D) OTHER INFORMATION: /note= "Xaa= tyrosine or tryptophan"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 257 (D) OTHER INFORMATION: /note= "Xaa= alanine or leucine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 260 (D) OTHER INFORMATION: /note= "Xaa= glycine or histidine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 268 (D) OTHER INFORMATION: /note= "xaa= alanine or valine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 301 (D) OTHER INFORMATION: /note= "Xaa= alanine or proline"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 332 (D) OTHER INFORMATION: /note= "Xaa= alanine or threonine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 387 (D) OTHER INFORMATION: /note= "Xaa= arginine or isoleucine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 475 (D) OTHER INFORMATION: /note= "Xaa= glycine or arginine"
(ix) FEATURE:
(A) NAME/KEY: Modified-site (B) LOCATION: 492 (D) OTHER INFORMATION: /note= "Xaa= valine or serine"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
Xaa Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Xaa Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp Xaa Xaa Xaa Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Xaa Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Xaa Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Xaa Ala Trp Xaa Met Trp Ser Ser Asp Pro Ser Xaa Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Xaa Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Xaa Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Xaa Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Xaa Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Xaa (2) INFORMATION FOR SEQ ID NO:16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3313 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
GCATTCCAAT TATAGGGATG CTCTTTTTTT GATIfTCTGGC CTATCCTTCT CATTTCATAG 2940 (2) INFORMATION FOR SEQ ID NO:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2172 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:
(2) INFORMATION FOR SEQ ID NO:18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 723 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (vi) ORIGINAL SOURCE:
(A) ORGANISM: Bacillus thuringiensis (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
Met Asp Pro Phe Ser Asn Tyr Ser Glu Gln Lys Tyr Pro Asp Ser Asn Asn Asn Gln Glu Leu Ile Thr Lys Ser Ser Ser Phe Tyr Ser Asp Thr Thr Asn Glu Asn Ala Lys Asn Tyr His Pro Ile Glu Gln Asp Ile Leu Lys Phe Thr Asn Gln Glu Phe Ser Asp Asn His Tyr Gln His Ser Asp Val Ser Asn Asp Ile Asn Ser Met Arg Asn Thr Leu Cys Lys Asp Leu Pro Pro Glu Thr Asn Met Ser Ile Tyr Asp Asn Leu Arg Ser Thr Val Thr Val Pro Ser Phe Ser Asn Gln Phe Asp Pro Ile Lys Phe Leu His Asp Ile Glu Ile Ala Ile Gln Thr Gly Ser Phe Ser Ala Leu Thr Gln Ser Asn Met Asn Gin Gly Gly Thr Asp Ile Asn Pro Met Leu Ile Ser Thr Phe Phe Lys Val Ala Ser Ser Leu Leu Pro Phe Pro Leu Ser Ser Leu Gly Ala Leu Ala Ser Phe Tyr Val Thr Asp Ser Gln Thr Gly Ala Met Ala Asn Leu Trp Arg Gln Met Val Asp Tyr Val Glu Lys Arg Ile Asp Ser Lys Ile Leu Asp Tyr His Asn Phe Ile Met Gly Ala Glu Leu Ala Ala Leu Asn Ala Ser Leu Lys Glu Tyr Ala Arg Val Val Lys Ile Phe Glu Asn Asp Met Asn Arg Met Ala Glu Pro Pro Ser Thr Gly Val Ile Thr Gln Phe Arg Ile Leu Asn Asp Asn Phe Ile Lys Tyr Ile Ala Lys Leu Gln Phe Ser Thr Asn Gln Ser Asp Leu Gln Tyr Pro Val Leu Thr Leu Pro Leu Arg Ala Gln Ala Cys Val Met His Leu Met Leu Leu Lys Asp Ala Thr Thr Ser Val Trp Gly Gln Gln Ile Asp Ser Gln Gln Leu Asn Gly Tyr Lys Ala Glu Leu Ile Arg Leu Ile Lys Val Tyr Thr Asn Asp Val Asn Thr Thr Tyr Asn Gln Gly Leu Glu Leu Glu Lys Ala Lys Pro Leu Asn Tyr Ser Asp Pro Glu Glu Tyr Leu Gln Ala Gly Arg Pro Asp Ile Ser Val Leu Arg Ser Asn Phe Lys Glu Val Met Lys Trp Asn Arg Val Ala Lys Tyr Lys Arg Gly Met Ala Met Ser Ala Leu Ser Leu Ala Ala Leu Phe Pro Thr Phe Gly Pro Asn Tyr Pro Lys Gln Ala Leu Lys Val Val Gln Ser Arg Gln Ile Phe Ala Pro Val Ile Gly Ile Pro Gly Gly Ile Thr Ser Gln Asp His Ser Gly Thr Phe Gly Ser Met Arg Phe Asp Val Lys Thr Tyr Asp Gln Ile Asp Ala Leu Arg Arg Leu Met Glu Leu Tyr Ile Gln Pro Leu Lys Ser Ala Tyr Phe Tyr Ile Tyr Glu Ser Asp Trp Lys Val Arg Ala Thr Tyr Val Asn Asp Tyr Ile Gly Lys Arg Gly Ser Asn Thr Gly Leu Ala Trp Gly Met Trp Ser Ser Asp Pro Ser Val Ile Tyr Thr Ser Ala Leu Gly Ala Ala Gly Tyr Ala Pro Asn Val Val Gly Val Arg Tyr Ser His Gly Gly Ser Tyr Thr Lys Gly Met Ala Pro Pro Asn Thr Asn Ala Tyr Ala Pro Phe Glu Phe Lys Tyr Pro Gly Tyr Lys Leu His Ser Val Ser Ala Tyr Gly Leu Ser Lys Ala Pro Asp Thr Ala Asp Ser Val Met Phe Gly Phe Arg Pro Val Leu Leu Glu Asn Glu Ala Asn Gln Leu Leu Thr Asp Thr Ala Leu Gln Ile Pro Ala Glu Ile Gly Ile Thr Asp Val Val Pro Ala Phe Gly Arg Thr Glu Glu Pro Ile Asn Gly Gln Asp Ala Ile Ile Ile Trp Glu Ser Phe Thr Ser Gly Phe Gly Phe Thr Tyr Thr Val Asp Ser Pro Gln Lys Gln Lys Tyr Lys Ile Ile Tyr Arg Ile Ala Asn Asn Leu Ser Ala Ser Thr Val Ser Leu Thr Tyr Asn Asn Gln Thr Phe Phe Thr Asp Ile Leu Asn Thr Ser Leu Asp Pro Asn Gly Val Arg Gly Asn Tyr Gly Ser Tyr Thr Leu Val Glu Gly Pro Ile Ile Glu Phe Ser Gln Gly Thr Asn Ile Phe Lys Leu Arg Ser Gln Lys Gly Glu Phe Ala Ile Asp Ser Ile Ile Phe Ser Pro Val Ser
Claims (13)
1. A biologically pure culture of a microorganism strain comprising all of the biochemical characteristics of a Bacillus thuringiensis strain deposited at the International Depository Authority of Health Canada in Winnipeg under accession number IDAC010201-5, or a mutant thereof isolated from said strain, said mutant comprising all of said biochemical characteristics and producing a crystal protein having cytotoxicity against human cancer cells.
2. An isolated nucleic acid molecule comprising:
(a) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:2;
(b) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:8;
(c) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:14;
(d) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:13, with the proviso that said nucleotide sequence does not encode the amino acid sequence at positions 232 to 723 of SEQ ID NO:18;
(e) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:15, with the proviso that said nucleotide sequence does not encode the amino acid sequence at positions 232 to 723 of SEQ ID NO:18;
(f) a nucleotide sequence comprising the nucleotide sequence of SEQ ID NO:1 or 9;
(g) a nucleotide sequence encoding a crystal protein cytotoxic against at least one human cancer cell, said crystal protein having at least 94% identity with the complete amino acid sequence of SEQ ID NO:2;
(h) a nucleotide sequence encoding a crystal protein cytotoxic against at least one human cancer cell, said crystal protein having at least 97% identity with the complete amino acid sequence of SEQ ID NO:8;
(i) a nucleotide sequence completely complementary to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), (g) and (h);
or Q) a nucleotide sequence encoding a crystal protein cytotoxic against at least one human cancer cell, which hybridizes under high stringency conditions to the nucleotide sequence in (i) wherein the high stringency conditions comprise hybridization in 5X SSC, 5X Denhardt's solution, 1% SDS and 100 µg/ml of carrier DNA at 65°C followed by washing in 0.2X SSC, 0.1 % SDS at 65°C.
(a) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:2;
(b) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:8;
(c) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:14;
(d) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:13, with the proviso that said nucleotide sequence does not encode the amino acid sequence at positions 232 to 723 of SEQ ID NO:18;
(e) a nucleotide sequence encoding a polypeptide comprising the complete amino acid sequence of SEQ ID NO:15, with the proviso that said nucleotide sequence does not encode the amino acid sequence at positions 232 to 723 of SEQ ID NO:18;
(f) a nucleotide sequence comprising the nucleotide sequence of SEQ ID NO:1 or 9;
(g) a nucleotide sequence encoding a crystal protein cytotoxic against at least one human cancer cell, said crystal protein having at least 94% identity with the complete amino acid sequence of SEQ ID NO:2;
(h) a nucleotide sequence encoding a crystal protein cytotoxic against at least one human cancer cell, said crystal protein having at least 97% identity with the complete amino acid sequence of SEQ ID NO:8;
(i) a nucleotide sequence completely complementary to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), (g) and (h);
or Q) a nucleotide sequence encoding a crystal protein cytotoxic against at least one human cancer cell, which hybridizes under high stringency conditions to the nucleotide sequence in (i) wherein the high stringency conditions comprise hybridization in 5X SSC, 5X Denhardt's solution, 1% SDS and 100 µg/ml of carrier DNA at 65°C followed by washing in 0.2X SSC, 0.1 % SDS at 65°C.
3. An isolated polypeptide comprising:
(a) an amino acid sequence as set forth in SEQ ID NO:2;
(b) an amino acid sequence as set forth in SEQ ID NO:8;
(c) an amino acid sequence as set forth in SEQ ID NO:
14;
(d) an amino acid sequence as set forth in SEQ ID
NO:13, with the proviso that said amino acid sequence is not the sequence at positions 232 to 723 of SEQ ID NO:18;
(e) an amino acid sequence as set forth in SEQ ID
NO:15, with the proviso that said amino acid sequence is not the sequence at positions 232 to 723 of SEQ ID NO:18;
(f) a crystal protein having at least 94% identity with the complete amino acid sequence of SEQ ID NO:2, wherein said protein becomes cytotoxic upon treatment with trypsin; or (g) a crystal protein cytotoxic against at least one human cancer cell and having at least 97% identity with the complete amino acid sequence of SEQ ID NO:8.
(a) an amino acid sequence as set forth in SEQ ID NO:2;
(b) an amino acid sequence as set forth in SEQ ID NO:8;
(c) an amino acid sequence as set forth in SEQ ID NO:
14;
(d) an amino acid sequence as set forth in SEQ ID
NO:13, with the proviso that said amino acid sequence is not the sequence at positions 232 to 723 of SEQ ID NO:18;
(e) an amino acid sequence as set forth in SEQ ID
NO:15, with the proviso that said amino acid sequence is not the sequence at positions 232 to 723 of SEQ ID NO:18;
(f) a crystal protein having at least 94% identity with the complete amino acid sequence of SEQ ID NO:2, wherein said protein becomes cytotoxic upon treatment with trypsin; or (g) a crystal protein cytotoxic against at least one human cancer cell and having at least 97% identity with the complete amino acid sequence of SEQ ID NO:8.
4. A recombinant vector comprising the isolated nucleic acid molecule of claim 2.
5. A recombinant host cell comprising the vector of claim 4.
6. A method for making a recombinant vector comprising inserting the isolated nucleic acid molecule of claim 2 in a vector.
7. A recombinant method for producing the polypeptide of claim 3, comprising culturing the host cell of claim 5 under conditions such that said polypeptide is expressed and recovering said polypeptide.
8. An isolated antibody that binds specifically to:
(a) an amino acid sequence as set forth in SEQ ID NO:2;
(b) an amino acid sequence as set forth in SEQ ID NO:8;
(c) a crystal protein having at least 94% identity with the complete amino acid sequence of SEQ ID NO:2, wherein said protein becomes cytotoxic upon treatment with trypsin; or (d) a crystal protein cytotoxic against at least one human cancer cell and having at least 97% identity with the complete amino acid sequence of SEQ ID NO: 8.
(a) an amino acid sequence as set forth in SEQ ID NO:2;
(b) an amino acid sequence as set forth in SEQ ID NO:8;
(c) a crystal protein having at least 94% identity with the complete amino acid sequence of SEQ ID NO:2, wherein said protein becomes cytotoxic upon treatment with trypsin; or (d) a crystal protein cytotoxic against at least one human cancer cell and having at least 97% identity with the complete amino acid sequence of SEQ ID NO: 8.
9. An in vitro method of using the polypeptide of claim 3 for lysing a human cancer cell, comprising contacting the cell with a cytotoxic amount of the polypeptide.
10. The method of claim 9, wherein the cell is a HELA, TCS, HL-60, Jurkat, or Hep-G2 cell.
11. An in vitro method of testing the cytotoxicity of the polypeptide of claim 3 against a candidate cancer cell, comprising determining the EC50 of the polypeptide on the candidate cell, wherein the polypeptide is characterized as possessing cytotoxicity against the candidate cell if the of the polypeptide against the candidate cell is measurably lower than that against a normal T cell.
12. A method for obtaining a cytotoxic polypeptide, comprising contacting the polypeptide of claim 3 (a), (c) or (f) with trypsin.
13. A biologically pure culture of a microorganism strain comprising all of the biochemical characteristics of a Bacillus thuringiensis strain deposited at the International Depository Authority of Health Canada in Winnipeg under accession number IDAC010201-5.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA2410153A CA2410153C (en) | 2002-12-05 | 2002-12-05 | A novel bacillus thuringiensis strain, crystal gene and crystal protein and uses thereof |
US10/756,778 US7329733B2 (en) | 2002-12-05 | 2004-01-14 | Bacillus thuringiensis strain, crystal gene and crystal protein and uses thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA2410153A CA2410153C (en) | 2002-12-05 | 2002-12-05 | A novel bacillus thuringiensis strain, crystal gene and crystal protein and uses thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2410153A1 CA2410153A1 (en) | 2004-06-05 |
CA2410153C true CA2410153C (en) | 2013-04-23 |
Family
ID=32476876
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA2410153A Expired - Fee Related CA2410153C (en) | 2002-12-05 | 2002-12-05 | A novel bacillus thuringiensis strain, crystal gene and crystal protein and uses thereof |
Country Status (1)
Country | Link |
---|---|
CA (1) | CA2410153C (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9567381B2 (en) | 2012-03-09 | 2017-02-14 | Vestaron Corporation | Toxic peptide production, peptide expression in plants and combinations of cysteine rich peptides |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11692016B2 (en) | 2012-03-09 | 2023-07-04 | Vestaron Corporation | High gene expression yeast strain |
BR112019008023A2 (en) | 2016-10-21 | 2019-07-09 | Vestaron Corp | peptide, insecticide and / or nematicide protein, polynucleotide, vector, host cell, DNA construct, plant, or part thereof, and method of controlling a plague infection of a plant. |
-
2002
- 2002-12-05 CA CA2410153A patent/CA2410153C/en not_active Expired - Fee Related
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9567381B2 (en) | 2012-03-09 | 2017-02-14 | Vestaron Corporation | Toxic peptide production, peptide expression in plants and combinations of cysteine rich peptides |
Also Published As
Publication number | Publication date |
---|---|
CA2410153A1 (en) | 2004-06-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Buist et al. | Molecular cloning and nucleotide sequence of the gene encoding the major peptidoglycan hydrolase of Lactococcus lactis, a muramidase needed for cell separation | |
Dramsi et al. | Identification of four new members of the internalin multigene family of Listeria monocytogenes EGD | |
Inamine et al. | ComEA, a Bacillus subtilis integral membrane protein required for genetic transformation, is needed for both DNA binding and transport | |
Del� cluse et al. | Cloning and expression of a novel toxin gene from Bacillus thuringiensis subsp. jegathesan encoding a highly mosquitocidal protein | |
Koni et al. | Cloning and characterization of a novel Bacillus thuringiensis cytolytic delta-endotoxin | |
Robichon et al. | The rpoN (sigma54) gene from Listeria monocytogenes is involved in resistance to mesentericin Y105, an antibacterial peptide from Leuconostoc mesenteroides | |
Lereclus et al. | spbA locus ensures the segregational stability of pTH1030, a novel type of Gram‐positive replicon | |
Schubert et al. | P45, an extracellular 45 kDa protein of Listeria monocytogenes with similarity to protein p60 and exhibiting peptidoglycan lytic activity | |
Moriyama et al. | A gene (sleB) encoding a spore cortex-lytic enzyme from Bacillus subtilis and response of the enzyme to L-alanine-mediated germination | |
Pereg-Gerk et al. | A transcriptional regulator of the LuxR-UhpA family, FlcA, controls flocculation and wheat root surface colonization by Azospirillum brasilense Sp7 | |
Bourgouin et al. | A Bacillus thuringiensis subsp. israelensis gene encoding a 125-kilodalton larvicidal polypeptide is associated with inverted repeat sequences | |
Sibakov et al. | Secretion of TEM beta-lactamase with signal sequences isolated from the chromosome of Lactococcus lactis subsp. lactis | |
Jung et al. | Isolation and characterization of a novel Bacillus thuringiensis strain expressing a novel crystal protein with cytocidal activity against human cancer cells | |
Aronson et al. | Gene structure and precursor processing of a novel Bacillus subtilis spore coat protein | |
Klier et al. | Mating between Bacillus subtilis and Bacillus thuringiensis and transfer of cloned crystal genes | |
Poyart et al. | Conjugative transposition of Tn916-related elements from Enterococcus faecalis to Escherichia coli and Pseudomonas fluorescens | |
Zanen et al. | Proteomic dissection of potential signal recognition particle dependence in protein secretion by Bacillus subtilis | |
Maciver et al. | Cloning and sequencing of a serine proteinase gene from a thermophilic Bacillus species and its expression in Escherichia coli | |
Burger et al. | Identification of homologues to the pathogenicity factor Pat-1, a putative serine protease of Clavibacter michiganensis subsp. michiganensis | |
JP4504314B2 (en) | Novel Brevibacillus choshinensis and method for producing protein using the microorganism as a host | |
Adams et al. | Elucidation of the mechanism of CryIIIA overproduction in a mutagenized strain of Bacillus thuringiensis var. tenebrionis | |
CA2410153C (en) | A novel bacillus thuringiensis strain, crystal gene and crystal protein and uses thereof | |
Barloy et al. | Distribution of clostridial cry-like genes among Bacillus thuringiensis and Clostridium strains | |
US7329733B2 (en) | Bacillus thuringiensis strain, crystal gene and crystal protein and uses thereof | |
Uemori et al. | Parasporin-1Ab, a novel Bacillus thuringiensis cytotoxin preferentially active on human cancer cells in vitro |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKLA | Lapsed |
Effective date: 20161205 |